-
Anytime.AI
- Santa Clara, CA
- http://academic.hugochan.net
- @chenyu_hugo
Stars
A Unified Toolkit for Deep Learning Based Document Image Analysis
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Build resilient language agents as graphs.
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Implementation of Nougat Neural Optical Understanding for Academic Documents
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
A cloud-native vector database, storage for next generation AI applications
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.
🐢 Open-Source Evaluation & Testing for ML models & LLMs
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Salesforce open-source LLMs with 8k sequence length.
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …
中文法律LLaMA (LLaMA for Chinese legel domain)
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Open Academic Research on Improving LLaMA to SOTA LLM
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Examples and guides for using the OpenAI API