Code for vector quantizing speech features, including mel-spectrograms and phonetic posteriorgrams / bottleneck features (BNFs). This repo trains an independent module to vector quantize BNFs.
For usage in voice conversion, see here
- Install ffmpeg.
- Install Kaldi.
- Install PyKaldi.
- Install the required packages using the environment.yml file.
- Download the pretrained TDNN-F model, extract it, and set `PRETRAIN_ROOT` in `kaldi_scripts/extract_features_kaldi.sh` to the pretrained model directory.
- Acoustic Model: trained on LibriSpeech. Download the pretrained TDNN-F acoustic model here.
- You also need to set `KALDI_ROOT` and `PRETRAIN_ROOT` in `kaldi_scripts/extract_features_kaldi.sh` accordingly.
- Vector Quantization: ARCTIC and L2-ARCTIC; see here for the detailed training process.
All the pretrained models are available here (to be updated).
```
dataset_root
├── speaker 1
├── speaker 2
│   ├── wav    # contains all the wav files from speaker 2
│   └── kaldi  # Kaldi files (auto-generated after running kaldi_scripts)
.
.
└── speaker N
```
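As a quick sanity check before running feature extraction, a small helper like the following (hypothetical, not part of this repo) can verify that every speaker directory contains the expected `wav` subfolder; the `kaldi` folder is auto-generated later, so only `wav` is required up front:

```python
import os

def check_dataset_layout(dataset_root):
    """Return the speaker directories that are missing a 'wav' subfolder.

    Hypothetical helper, not part of this repo: it only checks for 'wav',
    since 'kaldi' is created automatically by the extraction scripts.
    """
    missing = []
    for speaker in sorted(os.listdir(dataset_root)):
        speaker_dir = os.path.join(dataset_root, speaker)
        if not os.path.isdir(speaker_dir):
            continue  # skip stray files at the dataset root
        if not os.path.isdir(os.path.join(speaker_dir, "wav")):
            missing.append(speaker)
    return missing
```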
- Use Kaldi to extract BNFs for each speaker (do this for all speakers):
```
./kaldi_scripts/extract_features_kaldi.sh /path/to/speaker
```
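Since the script must be run once per speaker, a small wrapper like this (hypothetical, assuming the dataset layout above) can build the per-speaker commands; by default it is a dry run that only returns the commands instead of executing them:

```python
import os
import subprocess

def extract_all_speakers(dataset_root, dry_run=True):
    """Build (and optionally run) the Kaldi extraction command per speaker.

    Hypothetical wrapper around ./kaldi_scripts/extract_features_kaldi.sh;
    with dry_run=True it only collects the commands without running anything.
    """
    commands = []
    for speaker in sorted(os.listdir(dataset_root)):
        speaker_dir = os.path.join(dataset_root, speaker)
        if not os.path.isdir(speaker_dir):
            continue  # ignore stray files at the dataset root
        cmd = ["./kaldi_scripts/extract_features_kaldi.sh", speaker_dir]
        commands.append(cmd)
        if not dry_run:
            subprocess.run(cmd, check=True)  # run Kaldi feature extraction
    return commands
```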
- Preprocessing:
```
python preprocess_bnfs.py path/to/dataset
python make_data_all.py  # edit the file to specify the dataset path
```
- Set the training parameters. See `conf/`.
- Train the VQ model:
```
./train.sh
```
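At inference time, vector quantization boils down to nearest-neighbor codebook lookup: each BNF frame is replaced by the closest vector in a learned codebook. A minimal NumPy sketch of that lookup (illustrative only; the actual model is configured under `conf/` and trained by `train.sh`):

```python
import numpy as np

def quantize(frames, codebook):
    """Replace each feature frame with its nearest codebook vector.

    frames:   (T, D) array of BNF frames
    codebook: (K, D) array of learned code vectors
    Returns (indices, quantized), where quantized[t] = codebook[indices[t]].
    Illustrative sketch only; the repo's trained VQ module does this step.
    """
    # Squared Euclidean distance between every frame and every code vector
    dists = ((frames[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    indices = dists.argmin(axis=1)     # (T,) index of nearest code per frame
    return indices, codebook[indices]  # (T,) indices, (T, D) quantized frames
```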