Baselines for the Zero-Resources Speech Challenge using VisuallyGrounded Models of Spoken Language, 2021 edition
challenge deep-neural-networks pytorch representation-learning speech-processing weakly-supervised-learning multimodal-learning librispeech visually-grounded-speech spokencoco
-
Updated
Jun 1, 2021 - Python