Stars
Refine high-quality datasets and visual AI models
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
AI powered speech denoising and enhancement
ColabKit is a Python library designed to enhance the experience of working in Google Colab environments. With ColabKit, you can simplify common tasks, manipulate media, record audio, and create int…
Universal multilingual automatic speech transcription into IPA
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
TensorFlow code and pre-trained models for BERT
Code for team "techies" to run POS tagger during afternoon activity
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
The Huskylens library ported to Java for the 2024 FRC Season
Robot telemetry application for FRC
Finetune VITS and MMS using HuggingFace's tools
A Java libraries to manage USB devices like Controllers, Arduinos, IMUs, GPS, etc...
AprilTag tracking and pose estimation in python for FRC
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.
Universal Romanizer that can convert any unicode script to roman (latin) script
ROS files for full SLAM navigation for FRC robots. This requires a Jetson TX2 with Jetpack 3.3, Ubuntu 16.04, and ROS Kinetic.
FRC library with V-SLAM, trajectory generation, and LIDAR object detection capabilities
Unofficial implementation of NVIDIA P-Flow TTS paper
Clone a voice in 5 seconds to generate arbitrary speech in real-time
prompt2model - Generate Deployable Models from Natural Language Instructions
Effortless Real-Time Sign Language Translation
A collaboration friendly studio for NeRFs