Skip to content
View toborac's full-sized avatar

Block or report toborac

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Python bindings for the Transformer models implemented in C/C++ using GGML library.

C 1,811 138 Updated Jan 28, 2024

4 bits quantization of LLaMA using GPTQ

Python 2,993 458 Updated Jul 13, 2024

The Vulkan API Specification and related tools

JavaScript 2,804 468 Updated Nov 7, 2024

General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for…

C++ 1,991 154 Updated Oct 18, 2024

Public repo for HF blog posts

Jupyter Notebook 2,361 745 Updated Nov 7, 2024

The reproduction and notes during learning llama3 from scratch. Some notes were added in the notebook and pure code file was made.

Jupyter Notebook 1 Updated Jul 10, 2024

A step-by-step guide to building the complete architecture of the Llama 3 model from scratch and performing training and inferencing on a custom dataset.

Python 2 1 Updated Sep 27, 2024

Clean inference code for LLaMA-3 with lots of comments explaining every step

Python 2 Updated May 18, 2024
Jupyter Notebook 1 Updated Jun 8, 2024

使用numpy从零开始实现llama3的推理流程,并对其进行封装,对比GPU,CPU上的表现以及Lora微调。llama3 implemented from scratch using numpy and lora fine-tune.。

Jupyter Notebook 8 Updated Jul 16, 2024

LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.

Jupyter Notebook 98 29 Updated Aug 23, 2024

从零实现一个 llama3 中文版

Jupyter Notebook 529 55 Updated Jun 12, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,679 1,093 Updated May 23, 2024

a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model

Jupyter Notebook 125 15 Updated Jun 25, 2024

Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.

Jupyter Notebook 111 22 Updated May 12, 2024

An LLM-powered advanced RAG pipeline built from scratch

Python 796 50 Updated Jan 26, 2024

仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理

Jupyter Notebook 1,446 207 Updated Aug 15, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 31,193 3,707 Updated Nov 6, 2024

汇总平时看到的好文章,前端技术 & 产品研发为主...

1,949 302 Updated Nov 7, 2024

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…

Jupyter Notebook 1,985 169 Updated Aug 15, 2024

🎤⌨️ Acoustic keyboard eavesdropping

C++ 8,532 584 Updated Jan 15, 2023

Tiny data-over-sound library

C++ 1,961 158 Updated Sep 26, 2024

A modern model graph visualizer and debugger

JavaScript 1,044 81 Updated Nov 7, 2024

MTEB: Massive Text Embedding Benchmark

Jupyter Notebook 1,931 267 Updated Nov 7, 2024

Easy usage of Rockchip's NPUs found in RK3588 and similar chips

Shell 93 5 Updated Jun 26, 2024

Easier usage of LLMs in Rockchip's NPU on SBCs like Orange Pi 5 and Radxa Rock 5 series

C 64 3 Updated Nov 7, 2024

Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.

C++ 1,476 103 Updated Oct 14, 2024

VLC media player - All pull requests are ignored, please use MRs on https://code.videolan.org/videolan/vlc

C 14,181 5,074 Updated Nov 7, 2024
Next