Stars
Scalable data preprocessing and curation toolkit for LLMs
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). It combines the best of RNN and transformer: great performance, fast inference,…
We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Emu Series: Generative Multimodal Models from BAAI
800,000 step-level correctness labels on LLM solutions to MATH problems
Code implementation of synthetic continued pretraining
The official repo of INF-34B models trained by INF Technology.
General technology for enabling AI capabilities w/ LLMs and MLLMs
O1 Replication Journey: A Strategic Progress Report – Part I
Visualization of MCTS algorithm applied to Tic-tac-toe.
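For orientation, here is a minimal, generic MCTS sketch in Python (not the repository's code): the game-state interface (legal_moves, play, is_terminal, result) is assumed, and the rollout reward is treated as being from the root player's perspective.

```python
import math
import random

class Node:
    """One node of the search tree: a game state plus visit statistics."""
    def __init__(self, state, parent=None):
        self.state, self.parent = state, parent
        self.children = {}                     # move -> Node
        self.visits, self.value = 0, 0.0
        self.untried = list(state.legal_moves())

def uct_select(node, c=1.41):
    # Pick the child maximizing the UCB1 score: exploitation + exploration.
    return max(node.children.values(),
               key=lambda ch: ch.value / ch.visits
                              + c * math.sqrt(math.log(node.visits) / ch.visits))

def mcts(root_state, iterations=1000):
    root = Node(root_state)
    for _ in range(iterations):
        node = root
        # 1. Selection: descend while the node is fully expanded and has children.
        while not node.untried and node.children:
            node = uct_select(node)
        # 2. Expansion: add one unexplored child.
        if node.untried:
            move = node.untried.pop()
            child = Node(node.state.play(move), parent=node)
            node.children[move] = child
            node = child
        # 3. Simulation: random rollout to a terminal state.
        state = node.state
        while not state.is_terminal():
            state = state.play(random.choice(list(state.legal_moves())))
        reward = state.result()   # assumed from the root player's perspective;
                                  # a two-player version would flip the sign per ply
        # 4. Backpropagation: update statistics along the selected path.
        while node is not None:
            node.visits += 1
            node.value += reward
            node = node.parent
    # Recommend the most-visited move at the root.
    return max(root.children.items(), key=lambda kv: kv[1].visits)[0]
```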
zeyugao / transformers
Forked from huggingface/transformers. 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Official repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"
[NeurIPS 2023] We use large language models as a commonsense world model and heuristic policy within Monte Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.
Trainable PyTorch framework for developing protein, RNA and complex models.
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
A small repository demonstrating the use of WebDataset with ImageNet
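A typical WebDataset pipeline looks roughly like the sketch below; the shard URL pattern is hypothetical, and the real repo builds its own ImageNet .tar shards.

```python
import webdataset as wds
from torchvision import transforms

# Hypothetical shard pattern; real ImageNet shards would be created separately.
urls = "imagenet-train-{000000..000146}.tar"

preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
])

dataset = (
    wds.WebDataset(urls)
    .shuffle(1000)                       # shuffle within an in-memory buffer
    .decode("pil")                       # decode image bytes into PIL images
    .to_tuple("jpg", "cls")              # (image, label) from each tar sample
    .map_tuple(preprocess, lambda y: y)  # resize/crop so samples batch cleanly
)

loader = wds.WebLoader(dataset, batch_size=64, num_workers=4)
```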
terashuf shuffles multi-terabyte text files using limited memory
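The core idea (shuffle fixed-size chunks in memory, spill them to disk, then interleave the chunks at random) can be sketched as below; this is a simplified illustration, not terashuf's actual code, and picking chunks uniformly in pass two only approximates a fair shuffle.

```python
import os
import random
import tempfile

def external_shuffle(in_path, out_path, max_lines_in_memory=1_000_000, seed=0):
    """Shuffle a huge line-oriented file using bounded memory (two-pass sketch)."""
    rng = random.Random(seed)
    chunk_paths = []
    # Pass 1: read fixed-size chunks, shuffle each in memory, spill to disk.
    with open(in_path) as f:
        while True:
            lines = [line for _, line in zip(range(max_lines_in_memory), f)]
            if not lines:
                break
            lines = [l if l.endswith("\n") else l + "\n" for l in lines]
            rng.shuffle(lines)
            tmp = tempfile.NamedTemporaryFile("w", delete=False)
            tmp.writelines(lines)
            tmp.close()
            chunk_paths.append(tmp.name)
    # Pass 2: repeatedly pick a random chunk and emit its next line.
    # (Weighting the choice by lines remaining per chunk would be closer to uniform.)
    readers = [open(p) for p in chunk_paths]
    with open(out_path, "w") as out:
        while readers:
            r = rng.choice(readers)
            line = r.readline()
            if line:
                out.write(line)
            else:
                r.close()
                readers.remove(r)
    for p in chunk_paths:
        os.remove(p)
```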
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
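A minimal training-loop sketch with Accelerate; the toy model and data are placeholders just to keep the example self-contained.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset
from accelerate import Accelerator

# Toy model and data so the sketch runs on its own.
model = torch.nn.Linear(16, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
dataset = TensorDataset(torch.randn(256, 16), torch.randint(0, 2, (256,)))
dataloader = DataLoader(dataset, batch_size=32)

accelerator = Accelerator()  # picks up device / distributed config automatically
model, optimizer, dataloader = accelerator.prepare(model, optimizer, dataloader)

for x, y in dataloader:
    optimizer.zero_grad()
    loss = torch.nn.functional.cross_entropy(model(x), y)
    accelerator.backward(loss)  # use instead of loss.backward() so AMP/DDP hooks apply
    optimizer.step()
```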
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization…
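A rough usage sketch following the library's quickstart pattern, assuming a Hopper/Ada GPU and that the DelayedScaling recipe defaults are acceptable:

```python
import torch
import transformer_engine.pytorch as te
from transformer_engine.common import recipe

# FP8 scaling recipe; HYBRID uses E4M3 for forward and E5M2 for backward tensors.
fp8_recipe = recipe.DelayedScaling(fp8_format=recipe.Format.HYBRID)

layer = te.Linear(1024, 1024, bias=True).cuda()
x = torch.randn(8, 1024, device="cuda", requires_grad=True)

# Matmuls inside this context run in FP8 on supported GPUs.
with te.fp8_autocast(enabled=True, fp8_recipe=fp8_recipe):
    y = layer(x)
y.sum().backward()
```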
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Ring attention implementation with flash attention
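The numerics behind this, accumulating attention over key/value blocks with an online softmax so no device ever holds the full sequence, can be sketched in a single process as follows; the actual ring communication and fused flash-attention kernels are omitted.

```python
import torch

def blockwise_attention(q, k, v, block_size=256):
    """Exact attention computed one key/value block at a time.

    Single-process sketch of the online-softmax accumulation that ring
    attention distributes around a ring of devices.
    """
    scale = q.shape[-1] ** -0.5
    out = torch.zeros_like(q)                                   # running output
    lse = torch.full(q.shape[:-1], float("-inf"),
                     device=q.device, dtype=q.dtype)            # running log-sum-exp
    for start in range(0, k.shape[-2], block_size):
        kb = k[..., start:start + block_size, :]
        vb = v[..., start:start + block_size, :]
        scores = q @ kb.transpose(-1, -2) * scale               # (..., n, b)
        blk_lse = torch.logsumexp(scores, dim=-1)               # (..., n)
        blk_out = torch.softmax(scores, dim=-1) @ vb            # (..., n, d)
        new_lse = torch.logaddexp(lse, blk_lse)
        # Rescale the running output and fold in this block's contribution.
        out = (out * torch.exp(lse - new_lse).unsqueeze(-1)
               + blk_out * torch.exp(blk_lse - new_lse).unsqueeze(-1))
        lse = new_lse
    return out

# Matches full softmax attention up to floating-point error.
q, k, v = (torch.randn(2, 1024, 64) for _ in range(3))
ref = torch.softmax(q @ k.transpose(-1, -2) * q.shape[-1] ** -0.5, dim=-1) @ v
assert torch.allclose(blockwise_attention(q, k, v), ref, atol=1e-5)
```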