A Unified Toolkit for Deep Learning Based Document Image Analysis

Python 4,823 465 Updated Aug 15, 2024

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 271,432 45,784 Updated Aug 7, 2024

Build resilient language agents as graphs.

Python 5,953 937 Updated Oct 4, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 8,649 542 Updated Oct 2, 2024

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 16,427 1,136 Updated Oct 2, 2024

Parse files for optimal RAG

Python 2,732 263 Updated Oct 3, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 8,833 560 Updated Apr 16, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 8,659 707 Updated Oct 4, 2024

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,224 248 Updated Jun 24, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 18,539 1,880 Updated Oct 3, 2024

A cloud-native vector database, storage for next generation AI applications

Go 29,762 2,854 Updated Sep 30, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,481 3,841 Updated Oct 2, 2024

Grok open release

Python 49,452 8,323 Updated Aug 30, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open source community will contribute to this project.

Python 11,291 1,008 Updated Oct 4, 2024

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 5,833 549 Updated Oct 5, 2024

Yuan 2.0 Large Language Model

Python 681 85 Updated Jul 11, 2024

Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.

Python 1,953 160 Updated Oct 4, 2024

🐢 Open-Source Evaluation & Testing for ML models & LLMs

Python 3,963 252 Updated Oct 3, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 54,475 5,624 Updated Aug 24, 2024

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,231 823 Updated Sep 30, 2024

Salesforce open-source LLMs with 8k sequence length.

Python 718 38 Updated Dec 20, 2023

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 11,010 760 Updated Oct 4, 2024

Chinese Legal LLaMA (LLaMA for the Chinese legal domain)

Python 835 113 Updated Aug 28, 2024

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,670 506 Updated Jul 18, 2024

Open Academic Research on Improving LLaMA to SOTA LLM

Python 1,591 101 Updated Aug 30, 2023

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,358 374 Updated Jul 16, 2023

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 48,822 4,742 Updated Sep 19, 2024

Examples and guides for using the OpenAI API

MDX 58,871 9,351 Updated Oct 4, 2024
Python 2,508 155 Updated Sep 24, 2024