Qubitium

Qubitium-ModelCloud Qubitium

Golang, Python, Kotlin, Swift. I prefer strongly typed languages and I do not worship PEP. @ModelCloudAi

38 followers · 52 following

ModelCloud.ai
Earth/Epoch 2.0
https://modelcloud.ai
@qubitium

Achievements

x2 x3

Achievements

x2 x3

Stars

thu-ml / SageAttention

Quantized Attention that achieves speedups of 2.1x and 2.7x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.

Python 327 12 Updated Oct 24, 2024

TIGER-AI-Lab / MMLU-Pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

Python 117 21 Updated Oct 24, 2024

microsoft / BitNet

Official inference framework for 1-bit LLMs

C++ 10,109 682 Updated Oct 25, 2024

VikParuchuri / marker

Convert PDF to markdown quickly with high accuracy

Python 17,338 991 Updated Oct 25, 2024

OpenGVLab / EfficientQAT

EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Python 212 15 Updated Oct 8, 2024

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 1,560 60 Updated Oct 28, 2024

HazyResearch / lolcats

Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"

Python 155 14 Updated Oct 16, 2024

sam-paech / antislop-sampler

Python 219 22 Updated Oct 21, 2024

naklecha / llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,624 1,092 Updated May 23, 2024

haimengzhao / qml-advantage

Code for the paper "Entanglement-induced provable and robust quantum learning advantages"

Jupyter Notebook 4 Updated Oct 7, 2024

xjdr-alt / entropix

Entropy Based Sampling and Parallel CoT Decoding

TypeScript 2,859 296 Updated Oct 28, 2024

microsoft / VPTQ

VPTQ, A Flexible and Extreme low-bit quantization algorithm

Python 469 26 Updated Oct 28, 2024

canonical / lxd

Powerful system container and virtual machine manager

Go 4,366 932 Updated Oct 28, 2024

athms / mad-lab

A MAD laboratory to improve AI architecture designs 🧪

Python 92 6 Updated May 2, 2024

Lumorti / The-Quantum-Tunnels

A dungeon crawler designed for a quantum computer

OpenQASM 70 3 Updated Aug 21, 2020

Lumorti / Quandoom

A port of DOOM for a quantum computer

C++ 634 20 Updated Sep 30, 2024

ModelCloud / GPTQModel

GPTQ based LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Python 112 26 Updated Oct 28, 2024

jordicapde / stutter-former

StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disfluencies attenuated or eliminated.

Jupyter Notebook 12 Updated Feb 10, 2023