huoyijie

huoyijie huoyijie

Senior Programmer & 实习炼丹师

Achievements

Starred repositories

ViTAE-Transformer / DeepSolo

The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multi…

Python 247 35 Updated Aug 9, 2024

cv-small-snails / Awesome-Table-Recognition

A curated list of resources dedicated to table recognition

369 50 Updated Jan 28, 2024

MathamPollard / awesome-table-structure-recognition

A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.

133 6 Updated Sep 9, 2024

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,861 170 Updated Oct 4, 2024

AlibabaResearch / AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,450 173 Updated Sep 30, 2024

SCUT-DLVCLab / Document-AI-Recommendations

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

159 3 Updated Aug 29, 2024

entropy2333 / awesome-key-information-extraction

A curated list of papers about key information extraction.

78 7 Updated Aug 14, 2024

MVIG-SJTU / AlphaPose

Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

Python 8,010 1,970 Updated May 13, 2024

CMU-Perceptual-Computing-Lab / openpose

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

C++ 31,145 7,856 Updated Aug 3, 2024

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,278 831 Updated Oct 3, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-V…

Python 4,057 358 Updated Nov 1, 2024