Source code for "Taming Visually Guided Sound Generation" (Oral at BMVC 2021)
audio
video
pytorch
transformer
gan
multi-modal
evaluation-metrics
video-understanding
vas
video-features
vqvae
bmvc
melgan
audio-generation
vggsound
Updated Jul 12, 2024 - Jupyter Notebook