- ModelCloud.ai
- Earth/Epoch 2.0
- https://modelcloud.ai
- @qubitium
Stars
Filament is a real-time physically based rendering engine for Android, iOS, Windows, Linux, macOS, and WebGL2
Efficient GPU support for LLM inference with x-bit quantization (e.g., FP6, FP5).
A throughput-oriented high-performance serving framework for LLMs
Aidan Bench attempts to measure <big_model_smell> in LLMs.
High performance AI inference stack. Built for production. @ziglang / @openxla / MLIR / @bazelbuild
Shiva library: Implementation in Rust of a parser and generator for documents of any type
Efficient Triton Kernels for LLM Training
An easy-to-use LLM quantization and inference toolkit based on the GPTQ algorithm (weight-only quantization).
Control fault/locate indicators in disk slots in enclosures (SES devices)
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
A framework for Privacy Preserving Machine Learning
Utilities intended for use with Llama models.
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Multilingual Voice Understanding Model
Simple Python library/structure to ablate features in LLMs that are supported by TransformerLens
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
NVIDIA Linux open GPU with P2P support
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
QQQ is an innovative and hardware-optimized W4A8 quantization solution for LLMs.
Create images of a given character in different poses
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
A massively parallel, high-level programming language
Tools for merging pretrained large language models.