Skip to content
View Qubitium's full-sized avatar

Block or report Qubitium

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • sglang Public

    Forked from sgl-project/sglang

    SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

    Python 1 Apache License 2.0 Updated Sep 26, 2024
  • unsloth Public

    Forked from unslothai/unsloth

    5X faster 60% less memory QLoRA finetuning

    Python Apache License 2.0 Updated Aug 30, 2024
  • vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python Apache License 2.0 Updated Aug 1, 2024
  • BitBLAS Public

    Forked from microsoft/BitBLAS

    BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

    Python MIT License Updated Jul 23, 2024
  • auto-round Public

    Forked from intel/auto-round

    SOTA Weight-only Quantization Algorithm for LLMs

    Python Apache License 2.0 Updated Jul 23, 2024
  • hqq Public

    Forked from mobiusml/hqq

    Official implementation of Half-Quadratic Quantization (HQQ)

    Python Apache License 2.0 Updated Jul 22, 2024
  • AutoGPTQ Public

    Forked from AutoGPTQ/AutoGPTQ

    An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

    Python 2 Apache License 2.0 Updated Jun 27, 2024
  • 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

    Python Apache License 2.0 Updated Jun 21, 2024
  • qlora Public

    Forked from artidoro/qlora

    QLoRA: Efficient Finetuning of Quantized LLMs

    Jupyter Notebook MIT License Updated Jun 15, 2024
  • FlashInfer: Kernel Library for LLM Serving

    Cuda Apache License 2.0 Updated Jun 15, 2024
  • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

    Python Apache License 2.0 Updated Jun 5, 2024
  • AutoAWQ Public

    Forked from casper-hansen/AutoAWQ

    AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

    Python MIT License Updated Mar 25, 2024
  • Enforce the output format (JSON Schema, Regex etc) of a language model

    Python MIT License Updated Mar 18, 2024
  • The official PyTorch implementation of Google's Gemma models

    Python Apache License 2.0 Updated Mar 12, 2024
  • Fast and memory-efficient exact attention

    Python BSD 3-Clause "New" or "Revised" License Updated Feb 18, 2024
  • alpaca-lora Public

    Forked from tloen/alpaca-lora

    Instruct-tune LLaMA on consumer hardware

    Jupyter Notebook Apache License 2.0 Updated May 27, 2023
  • GPTQ inference Triton kernel

    Jupyter Notebook Apache License 2.0 Updated May 24, 2023
  • 4 bits quantization of LLaMa using GPTQ

    Python Apache License 2.0 Updated May 19, 2023
  • llama.cpp Public

    Forked from ggerganov/llama.cpp

    Port of Facebook's LLaMA model in C/C++

    C MIT License Updated May 3, 2023
  • hyperDB Public

    Forked from jdagdelen/hyperDB

    A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $35M cap.

    Python MIT License Updated Apr 20, 2023
  • FastChat Public

    Forked from lm-sys/FastChat

    The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"

    Python Apache License 2.0 Updated Apr 9, 2023
  • Apache License 2.0 Updated Apr 7, 2023
  • Source code for Twitter's Recommendation Algorithm

    Scala GNU Affero General Public License v3.0 Updated Mar 31, 2023
  • gpt4all Public

    Forked from nomic-ai/gpt4all

    gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue

    Python Updated Mar 31, 2023
  • llama-dl Public

    Forked from Elyah2035/llama-dl
    Shell GNU General Public License v3.0 Updated Mar 21, 2023
  • TTS Public

    Forked from coqui-ai/TTS

    🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

    Python Mozilla Public License 2.0 Updated Oct 15, 2022
  • A Smart Ethernet Switch for Earth

    C++ Other Updated Jun 28, 2022
  • This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences from C4 using a tagged corruption model. The approach and the d…

    Python Creative Commons Attribution 4.0 International Updated Dec 7, 2021
  • amdvbflash Public

    Forked from stylesuxx/amdvbflash
    Updated Dec 29, 2020
  • libheif Public

    Forked from strukturag/libheif

    libheif is an HEIF and AVIF file format decoder and encoder.

    C++ Other Updated Jul 13, 2020