FlashInfer: Kernel Library for LLM Serving

Cuda 1,362 123 Updated Oct 27, 2024

SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation

Python 352 41 Updated Jun 24, 2024

Quantized Attention that achieves speedups of 2.1x and 2.7x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.

Python 327 12 Updated Oct 24, 2024

This project analyzes tennis players in a video to measure their speed, ball shot speed, and number of shots. It detects the players and the tennis ball using YOLO and also utilizes CNNs t…

Jupyter Notebook 436 143 Updated Aug 12, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,779 463 Updated Oct 28, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,798 163 Updated Oct 4, 2024

PLUTO: Push the Limit of Imitation Learning-based Planning for Autonomous Driving

Python 198 22 Updated Jul 15, 2024

Brand new TTS solution

Python 13,627 1,022 Updated Oct 25, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 5,703 472 Updated Oct 28, 2024

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Python 719 54 Updated Oct 8, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,961 1,070 Updated Oct 14, 2024

Instant voice cloning by MIT and MyShell.

Python 29,522 2,901 Updated Aug 21, 2024

Material for gpu-mode lectures

Jupyter Notebook 2,870 285 Updated Oct 21, 2024

LLM101n: Let's build a Storyteller

29,521 1,615 Updated Aug 1, 2024

A generative speech model for daily dialogue.

Python 31,851 3,469 Updated Oct 21, 2024

Tile primitives for speedy kernels

Cuda 1,561 60 Updated Oct 28, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 34,764 3,979 Updated Oct 26, 2024

My personal research experience

5,815 344 Updated Oct 24, 2024

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 7,036 530 Updated Aug 18, 2024

[ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric

Python 252 20 Updated Aug 15, 2024

Multiple Object Tracking as ID Prediction

Python 100 9 Updated Oct 25, 2024

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Python 510 58 Updated Oct 8, 2024

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python 1,354 136 Updated Dec 8, 2023

The suite of modeling video with Mamba

Python 227 22 Updated May 14, 2024

Code release for ActionFormer (ECCV 2022)

Python 429 77 Updated Apr 11, 2024

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 6,593 1,212 Updated Aug 13, 2024

Unofficial implementation of "TTNet: Real-time temporal and spatial video analysis of table tennis" (CVPR 2020)

Python 599 157 Updated Aug 2, 2024

RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.

Python 215 26 Updated Jul 14, 2024

Hiera: A fast, powerful, and simple hierarchical vision transformer.

Python 886 42 Updated Mar 2, 2024