Skip to content
View ftgreat's full-sized avatar

Block or report ftgreat

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Scalable data pre processing and curation toolkit for LLMs

Jupyter Notebook 571 78 Updated Nov 4, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,620 859 Updated Oct 31, 2024

We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.

Python 40 7 Updated Oct 27, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 950 67 Updated Nov 4, 2024

Emu Series: Generative Multimodal Models from BAAI

Python 1,656 86 Updated Sep 27, 2024

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,627 99 Updated Jun 1, 2023

An Extensible Deep Learning Library

Python 1,866 259 Updated Nov 5, 2024

Code implementation of synthetic continued pretraining

Python 54 4 Updated Oct 6, 2024

The official repo of INF-34B models trained by INF Technology.

Python 34 1 Updated Jul 25, 2024

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 3,672 277 Updated Oct 2, 2024

O1 Replication Journey: A Strategic Progress Report – Part I

1,197 29 Updated Oct 28, 2024

Visualization of MCTS algorithm applied to Tic-tac-toe.

JavaScript 203 11 Updated Aug 25, 2021

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 1 Updated Jul 16, 2024

Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"

Python 185 12 Updated Oct 16, 2024
1 Updated Sep 25, 2024

[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.

Python 189 22 Updated May 23, 2024

Trainable PyTorch framework for developing protein, RNA and complex models.

Python 261 33 Updated Jan 20, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,928 306 Updated Oct 22, 2024

Optimizing inference proxy for LLMs

Python 1,309 118 Updated Nov 4, 2024

A small repository demonstrating the use of Webdataset and Imagenet

Python 15 1 Updated Dec 19, 2023

terashuf shuffles multi-terabyte text files using limited memory

C++ 203 15 Updated Feb 5, 2023

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 7,905 963 Updated Nov 5, 2024

[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Python 332 29 Updated Sep 6, 2024

Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"

Python 292 23 Updated Dec 20, 2023

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 1,925 321 Updated Nov 4, 2024

ring-attention experiments

Python 95 10 Updated Oct 17, 2024

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 641 46 Updated Sep 27, 2024

Ring attention implementation with flash attention

Python 575 45 Updated Oct 30, 2024
Next