Qubitium

Qubitium-ModelCloud Qubitium

Golang, Python, Kotlin, Swift. I prefer strongly typed languages and I do not worship PEP. @ModelCloudAi

39 followers · 54 following

ModelCloud.ai
Earth/Epoch 2.0
https://modelcloud.ai
@qubitium

Achievements

x2 x3

Achievements

x2 x3

Stars

MooreThreads / mutlass

Forked from NVIDIA/cutlass

MUSA Templates for Linear Algebra Subroutines

C++ 2 1 Updated Sep 30, 2024

IST-DASLab / ISTA-DASLab-Optimizers

Python 7 Updated Sep 5, 2024

ModelCloud / GPTQModel

Production ready LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Python 117 26 Updated Nov 5, 2024

trotsky1997 / MathBlackBox

Python 889 90 Updated Nov 5, 2024

amnezia-vpn / amnezia-client

Amnezia VPN Client (Desktop+Mobile)

C++ 5,641 347 Updated Nov 5, 2024

Dao-AILab / fast-hadamard-transform

Fast Hadamard transform in CUDA, with a PyTorch interface

C 105 14 Updated May 24, 2024

lmstudio-ai / venvstacks

Virtual environment stacks for Python

Python 82 1 Updated Nov 5, 2024

KellerJordan / modded-nanogpt

NanoGPT (124M) quality in 2.4B tokens

Python 887 63 Updated Nov 5, 2024

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 24,301 2,742 Updated Oct 2, 2024

facebookresearch / MobileLLM

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,097 60 Updated Oct 31, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 14,061 1,313 Updated Nov 5, 2024

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,119 205 Updated Nov 5, 2024

thu-ml / SageAttention

Quantized Attention that achieves speedups of 2.1x and 2.7x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.

Python 358 14 Updated Nov 3, 2024

TIGER-AI-Lab / MMLU-Pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

Python 121 21 Updated Oct 24, 2024

microsoft / BitNet

Official inference framework for 1-bit LLMs

C++ 10,799 732 Updated Oct 31, 2024

VikParuchuri / marker

Convert PDF to markdown quickly with high accuracy

Python 17,518 1,006 Updated Nov 5, 2024

OpenGVLab / EfficientQAT

EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Python 219 16 Updated Oct 8, 2024

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 1,625 65 Updated Nov 1, 2024

HazyResearch / lolcats

Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"

Python 164 17 Updated Oct 16, 2024

sam-paech / antislop-sampler

Python 223 22 Updated Oct 21, 2024

naklecha / llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,670 1,092 Updated May 23, 2024

haimengzhao / qml-advantage

Code for the paper "Entanglement-induced provable and robust quantum learning advantages"

Jupyter Notebook 5 Updated Oct 7, 2024

xjdr-alt / entropix

Entropy Based Sampling and Parallel CoT Decoding

Python 2,929 305 Updated Nov 5, 2024

microsoft / VPTQ

VPTQ, A Flexible and Extreme low-bit quantization algorithm

Python 485 27 Updated Nov 1, 2024

canonical / lxd

Powerful system container and virtual machine manager

Go 4,372 930 Updated Nov 5, 2024

athms / mad-lab

A MAD laboratory to improve AI architecture designs 🧪

Python 92 6 Updated May 2, 2024

Lumorti / The-Quantum-Tunnels

A dungeon crawler designed for a quantum computer

OpenQASM 70 3 Updated Aug 21, 2020

Lumorti / Quandoom

A port of DOOM for a quantum computer

C++ 641 21 Updated Sep 30, 2024

jordicapde / stutter-former

StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disfluencies attenuated or eliminated.

Jupyter Notebook 13 Updated Feb 10, 2023

google / filament

Filament is a real-time physically based rendering engine for Android, iOS, Windows, Linux, macOS, and WebGL2

C++ 17,784 1,890 Updated Nov 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qubitium-ModelCloud Qubitium

Achievements

Achievements

Block or report Qubitium

Stars

MooreThreads / mutlass

IST-DASLab / ISTA-DASLab-Optimizers

ModelCloud / GPTQModel

trotsky1997 / MathBlackBox

amnezia-vpn / amnezia-client

Dao-AILab / fast-hadamard-transform

lmstudio-ai / venvstacks

KellerJordan / modded-nanogpt

karpathy / llm.c

facebookresearch / MobileLLM

Dao-AILab / flash-attention

facebookresearch / lingua

thu-ml / SageAttention

TIGER-AI-Lab / MMLU-Pro

microsoft / BitNet

VikParuchuri / marker

OpenGVLab / EfficientQAT

HazyResearch / ThunderKittens

HazyResearch / lolcats

sam-paech / antislop-sampler

naklecha / llama3-from-scratch

haimengzhao / qml-advantage

xjdr-alt / entropix

microsoft / VPTQ

canonical / lxd

athms / mad-lab

Lumorti / The-Quantum-Tunnels

Lumorti / Quandoom

jordicapde / stutter-former

google / filament