JinhuaLiang
☪️ Roaming on the moon
  • London

Highlights

  • Pro


Making large AI models cheaper, faster and more accessible

Python 38,735 4,341 Updated Oct 16, 2024

Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"

Python 78 4 Updated Sep 19, 2024
Python 6,317 481 Updated Oct 14, 2024

LLaQo, a Large Language Query-based Coach in the domain of expressive performance

Python 6 Updated Sep 16, 2024

A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.

Python 11 5 Updated Oct 26, 2023

A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.

Python 1 Updated Oct 26, 2023

WaveGAN on GTZAN Music genre classification dataset

Jupyter Notebook 1 Updated Sep 2, 2024

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1 Updated Jul 22, 2024

OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 241 5 Updated Jul 9, 2024

[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

Python 390 22 Updated Jun 5, 2024

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Python 947 42 Updated Jan 17, 2024

Official implementation of Diffusion Autoencoders

Jupyter Notebook 849 127 Updated Sep 12, 2024

PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡

Python 4,180 647 Updated Aug 16, 2024

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Vue 6,097 429 Updated Oct 16, 2024

A family of diffusion models for text-to-audio generation.

Python 1,010 79 Updated Jul 3, 2024

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

Python 86 4 Updated Jul 24, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,241 51 Updated Aug 15, 2024

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Python 1,103 77 Updated Jun 15, 2024

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Python 2,211 247 Updated Jul 31, 2024

Annotated Flow Matching paper

Jupyter Notebook 113 4 Updated Sep 14, 2024

[NeurIPS 24] CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Jupyter Notebook 218 7 Updated Oct 14, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,882 2,133 Updated Aug 9, 2024

From Audio Encoders to Piano Judges: Benchmarking Performance Understanding for Solo Piano

Jupyter Notebook 8 Updated Sep 13, 2024

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Python 430 53 Updated Oct 11, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,820 2,460 Updated Oct 16, 2024

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Jupyter Notebook 421 16 Updated Oct 16, 2024

Python packaging and dependency management made easy

Python 31,465 2,266 Updated Oct 16, 2024

PyTorch implementation of Tacotron speech synthesis model.

Jupyter Notebook 306 79 Updated Jul 12, 2019

Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch

Python 547 45 Updated Sep 29, 2024

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch

Python 1,237 124 Updated May 3, 2024