SpecAugment.py

A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

SpecAugment is a SOTA-achieving data augmentation approach on speech recognition. The paper's authors did not publish code that I could find and their implementation was in TensorFlow.

To use:

run install.sh (I recommend to use a unique conda env for the project)
Check out SpecAugment.ipynb (a Jupyter notebook) for the functions.

Augmentations

Time Warp (Coming Soon) This augmentation relies on a lot of functionality not yet in Pytorch, so I have to write it from scratch. I am working on it.
Time Mask (DONE!)
Frequency Mask (DONE!)

Let's be friends! @zachcaceres zach.dev

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
exp		exp
.gitignore		.gitignore
README.md		README.md
SparseImageWarp.ipynb		SparseImageWarp.ipynb
SpecAugment.ipynb		SpecAugment.ipynb
install.sh		install.sh
notebook2script.py		notebook2script.py
party-crowd.wav		party-crowd.wav

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SpecAugment.py

A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

To use:

Augmentations

About

Releases

Packages

Languages

Kelvinson/spec_augment

Folders and files

Latest commit

History

Repository files navigation

SpecAugment.py

A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

To use:

Augmentations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages