🍗
-
The University of Texas at Austin
- Austin, TX
- https://jasonppy.github.io/
- @PuyuanPeng
- in/puyuan-peng-a5ab8a29b
Highlights
- Pro
Pinned Loading
-
VoiceCraft
VoiceCraft PublicZero-Shot Speech Editing and Text-to-Speech in the Wild
-
PromptingWhisper
PromptingWhisper PublicPromting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation
-
syllable-discovery
syllable-discovery PublicSyllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model
-
word-discovery
word-discovery PublicWord Discovery in Visually Grounded, Self-Supervised Speech Models
-
FaST-VGS-Family
FaST-VGS-Family PublicTransformer-based visually grounded speech models
-
MAE-AST-Public
MAE-AST-Public PublicForked from AlanBaade/MAE-AST-Public
Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.