JinhuaLiang

Follow

☪️

Roaming on the moon

Jinhua Liang JinhuaLiang

☪️

Roaming on the moon

Follow

A Ph.D. student from Centre for Digial Music (C4DM), Queen Mary University of London.

137 followers · 97 following

London

Achievements

Achievements

Highlights

Pro

Lists (8)

Sort

Audio Engineering

97 repositories

Coding

Let's code in an elegant style

Few-shot Learning

27 repositories

Graph Knowledge

Machine Learning Tools

59 repositories

ML in more fields

33 repositories

Other Tools

10 repositories

Unsupervised Learning

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 38,735 4,341 Updated Oct 16, 2024

Aria-K-Alethia / BigCodec

Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"

Python 78 4 Updated Sep 19, 2024

kyutai-labs / moshi

Python 6,317 481 Updated Oct 14, 2024

anusfoil / LLaQo

LLaQo, a Large Language Query-based Coach in the domain of expressive performance

Python 6 Updated Sep 16, 2024

npurson / fid-metrics

A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.

Python 11 5 Updated Oct 26, 2023

haoheliu / fid-metrics

Forked from npurson/fid-metrics

A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.

Python 1 Updated Oct 26, 2023

hadeel253 / Music-Generation-with-WaveGAN

WaveGAN on GTZAN Music genre classification dataset

Jupyter Notebook 1 Updated Sep 2, 2024

JinhuaLiang / bigvgan

Forked from NVIDIA/BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1 Updated Jul 22, 2024

FoundationVision / OmniTokenizer

OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 241 5 Updated Jul 9, 2024

researchmm / MM-Diffusion

[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

Python 390 22 Updated Jun 5, 2024

google-research / magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Python 947 42 Updated Jan 17, 2024

phizaz / diffae

Official implementation of Diffusion Autoencoders

Jupyter Notebook 849 127 Updated Sep 12, 2024

ashleve / lightning-hydra-template

PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡

Python 4,180 647 Updated Aug 16, 2024

ccfddl / ccf-deadlines

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Vue 6,097 429 Updated Oct 16, 2024

declare-lab / tango

A family of diffusion models for text-to-audio generation.

Python 1,010 79 Updated Jul 3, 2024

habla-liaa / encodecmae

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

Python 86 4 Updated Jul 24, 2024

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,241 51 Updated Aug 15, 2024

ali-vilab / MimicBrush

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Python 1,103 77 Updated Jun 15, 2024

IDEA-Research / DINO

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Python 2,211 247 Updated Jul 31, 2024

gle-bellier / flow-matching

Annotated Flow Matching paper

Jupyter Notebook 113 4 Updated Sep 14, 2024

AILab-CVC / CV-VAE

[NeurIPS 24] CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Jupyter Notebook 218 7 Updated Oct 14, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 21,882 2,133 Updated Aug 9, 2024

anusfoil / PianoJudges

From Audio Encoders to Piano Judges: Benchmarking Performance Understanding for Solo Piano

Jupyter Notebook 8 Updated Sep 13, 2024

facebookresearch / audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Python 430 53 Updated Oct 11, 2024

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,820 2,460 Updated Oct 16, 2024

bytedance / 1d-tokenizer

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Jupyter Notebook 421 16 Updated Oct 16, 2024

python-poetry / poetry

Python packaging and dependency management made easy

Python 31,465 2,266 Updated Oct 16, 2024

r9y9 / tacotron_pytorch

PyTorch implementation of Tacotron speech synthesis model.

Jupyter Notebook 306 79 Updated Jul 12, 2019

lucidrains / rotary-embedding-torch

Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch

Python 547 45 Updated Sep 29, 2024

lucidrains / video-diffusion-pytorch

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch

Python 1,237 124 Updated May 3, 2024