Improve user experience and solve bugs #3

egaznep · 2023-12-23T11:12:38Z

This PR addresses multiple issues at once.
Anonymization:

A possible crash due to libespeak-ng is mitigated.
Documentation and type hints are provided for most high-level functions.
Anonymizer instantiation is decoupled from the challenge code. Contestants who want to use the STTTS pipeline with their own embedding mapper can now do so more easily.

Evaluation:

A possible crash with pretrained evaluation models is mitigated.

General:

An installation procedure, in the form of a Makefile, is now provided. It downloads and copies the pretrained models and sets up the conda environment. The instructions are in README.md.

- Each libespeak backend instantiation creates a copy of the library.
- For some reason, previous instances (one per utterance) are not garbage collected.
- After 1500 to 2000 ProsodyExtraction calls at the latest, the anonymization pipeline crashes.
- As a temporary solution, since ProsodyExtraction does not support `n_processes>1`, different instances can share the backend as long as the specs do not change. This also speeds up prosody extraction. A better fix could also allow parallel execution, but I couldn't see how today.
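The backend sharing described above can be sketched as a cache keyed by the backend specs. This is a minimal illustration, not the code from this PR: `DummyBackend` and `get_shared_backend` are hypothetical names standing in for the real libespeak-backed object and its factory.

```python
from functools import lru_cache


class DummyBackend:
    """Hypothetical stand-in for an expensive libespeak-backed object."""

    instances_created = 0

    def __init__(self, language: str, preserve_punctuation: bool):
        DummyBackend.instances_created += 1
        self.language = language
        self.preserve_punctuation = preserve_punctuation


@lru_cache(maxsize=None)
def get_shared_backend(language: str, preserve_punctuation: bool = True) -> DummyBackend:
    """Return one backend per unique spec; repeated calls reuse the same object."""
    return DummyBackend(language, preserve_punctuation)


# Every utterance asks for a backend, but only one is constructed per spec.
backends = [get_shared_backend("en-us") for _ in range(2000)]
```

Sharing one object this way is only safe while extraction runs in a single process, which matches the constraint stated above.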

Unal Ege Gaznepoglu added 30 commits November 16, 2023 14:24

Fix incomplete variable name refactoring

1043e90

- Completed a variable name refactoring (from vec_level to emb_level) across multiple files.

Fix bug: loading of 'spk' level embeddings into 'utt' level SpeakerEmbeddings object

24ff004

Allow relative paths for data_dir

8206441

Dependency injection for anonymizer loading

e7efc5a

- Now the anonymization pipeline is completely separated from the anonymizer object.
- Specify a BaseAnonymizer subclass in a config, then import that using !include syntax. Example is shown for GAN anonymizer.
- Add a Passthrough anonymizer
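As a rough sketch of the dependency injection idea, the pipeline below receives an anonymizer object instead of constructing one. Only the names `BaseAnonymizer` and the passthrough concept come from this PR. The method name `anonymize_embeddings`, the `Pipeline` class, and the list-based embeddings are assumptions made for illustration.

```python
from abc import ABC, abstractmethod
from typing import List, Sequence


class BaseAnonymizer(ABC):
    """Interface the pipeline depends on (the method name is an assumption)."""

    @abstractmethod
    def anonymize_embeddings(self, embeddings: Sequence[List[float]]) -> List[List[float]]:
        ...


class PassthroughAnonymizer(BaseAnonymizer):
    """Returns the speaker embeddings unchanged, useful as a baseline."""

    def anonymize_embeddings(self, embeddings):
        return [list(e) for e in embeddings]


class Pipeline:
    """Hypothetical pipeline that is handed its anonymizer from outside."""

    def __init__(self, anonymizer: BaseAnonymizer):
        self.anonymizer = anonymizer

    def run(self, embeddings):
        return self.anonymizer.anonymize_embeddings(embeddings)


result = Pipeline(PassthroughAnonymizer()).run([[0.1, 0.2], [0.3, 0.4]])
```

Any other `BaseAnonymizer` subclass can be swapped in without touching the pipeline code.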

Add utility to manipulate already existing VPC datasets

714b754

- Can convert the .scp files to absolute or relative.
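A conversion of this kind might look like the sketch below, assuming Kaldi-style `.scp` lines of the form `<utt_id> <path>`. The function name `convert_scp_paths` and its parameters are hypothetical and not the utility added in this commit. Entries that hold piped commands instead of plain paths are not handled.

```python
import os


def convert_scp_paths(lines, base_dir, to_absolute=True):
    """Rewrite the path column of '<utt_id> <path>' lines relative to base_dir."""
    converted = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip empty lines
        utt_id, path = line.split(maxsplit=1)
        if to_absolute:
            # Joining with an already absolute path leaves it unchanged.
            new_path = os.path.normpath(os.path.join(base_dir, path))
        else:
            new_path = os.path.relpath(path, start=base_dir)
        converted.append(f"{utt_id} {new_path}")
    return converted
```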

Add makefile & environment.yaml for dependency tracking.

fe8cdbd

- Makefile installs the environment local to the project folder.

Update "README.md"s

e83892f

Fix #2 - filenames given in hyperparams.yaml interpreted as path

d8c2f48

Standardize SpeakerExtraction config and instantiation across anonymization and evaluation pipelines

109525a

Add "incomplete config exception" to inform users about TODO entries

628e663

- Also fixed a duplicate `save_intermediate` entry in `anon_ims_sttts_pc.yaml`
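One way to surface unfilled entries is to walk the loaded config and raise a dedicated exception that lists them. The sketch below is an assumption about how such a check could work, not the implementation in this commit. `IncompleteConfigError`, `TODO_MARKER`, and the helper names are hypothetical.

```python
class IncompleteConfigError(Exception):
    """Raised when a config still contains entries the user must fill in."""


TODO_MARKER = "<TODO>"  # hypothetical sentinel, not the project's actual marker


def find_todo_entries(config, prefix=""):
    """Recursively collect the dotted keys whose value is still the marker."""
    missing = []
    for key, value in config.items():
        path = f"{prefix}{key}"
        if isinstance(value, dict):
            missing.extend(find_todo_entries(value, prefix=f"{path}."))
        elif value == TODO_MARKER:
            missing.append(path)
    return missing


def check_config(config):
    """Raise IncompleteConfigError if any entry is still unfilled."""
    missing = find_todo_entries(config)
    if missing:
        raise IncompleteConfigError(
            "Please fill in these config entries: " + ", ".join(missing)
        )
```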

Fix evaluation config not pointing to dataset .yaml

0ff5467

Fixes to the anonymizer classes

06c11ec

- BaseAnonymizer and its descendants are now dumpable
- Improved documentation
- The configs now use !PLACEHOLDER tag from HyperPyYAML instead of the custom class

Makefile now downloads and extracts the pretrained models for evaluation too

3e90223

Updates to the environment

77fd42f

Minor fixes to run_evaluation.sh

714f1e6

- Use absolute paths for `asr.sh` invocation
- Improved documentation

Use espnet python package for ASR eval

8a6dd3d

Use tqdm to display synthesis progress

dd159e1

Improved documentation for stts_pipeline.py

035ad5a

Add conda recipe for SCTK, required for seamlessly installed ESPNet

f8a36fa

Fix missing entry in pool.yaml

76928c5

Simplify model creation for SpeakerExtraction and SpeechRecognition

a7439c5

Fix minor bug 'force_compute_all' not a valid argument

98362ae

Fix minor bug 'cycle' undefined

27493fa

Fix spurious 'n_processes'

f0eec76

Fix speech_recognition

28e6c58

Fix speaker extraction

33c3dbf

Changes to the environment.yaml and makefile

08c9f4a

Fix pretrained model installation script

d0f6f84

Fixes to the config files

9df17c7

- Add missing !PLACEHOLDER tags
- Define two dataset configs, one for anonymization, one for evaluation (vctk all/common/diff requires separate treatment)

Unal Ege Gaznepoglu added 3 commits December 22, 2023 10:13

Switch to logging for print statements
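For readers unfamiliar with the pattern, a minimal sketch of replacing `print` calls with the standard `logging` module follows. The logger name `anonymization` and the `synthesize` function are assumptions for illustration and do not mirror the project's modules.

```python
import logging

logger = logging.getLogger("anonymization")  # logger name is an assumption


def synthesize(utterances):
    """Toy loop that reports progress through the logger rather than print."""
    for i, utt in enumerate(utterances, start=1):
        logger.debug("Synthesizing %s (%d/%d)", utt, i, len(utterances))
    logger.info("Finished %d utterances", len(utterances))
    return len(utterances)


# Configure output once at the entry point; library code only calls the logger.
logging.basicConfig(
    level=logging.INFO,
    format="%(asctime)s %(name)s %(levelname)s: %(message)s",
)
```

With this setup, verbosity can be changed in one place by adjusting the level instead of editing individual statements.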

8b09872

Fix GVD accidentally disabled on one of the configs

d7f0877

Dump settings of the pool anonymizer

a68c555

SarinaMeyer merged commit 3a0c3b7 into DigitalPhonetics:main Dec 23, 2023
