wav2letter++

wav2letter++ is a fast, open source speech processing toolkit from the Speech team at Facebook AI Research built to facilitate research in end-to-end models for speech recognition. It is written entirely in C++ and uses the ArrayFire tensor library and the flashlight (use its branch v0.2) machine learning library for maximum efficiency. Our approach is detailed in this arXiv paper.

This repository also contains pre-trained models and implementations for various ASR results including:

The previous iteration of wav2letter (written in Lua) can be found in the wav2letter-lua branch.

Building wav2letter++ and full documentation

All details and documentation can be found on the wiki.

To get started with wav2letter++, checkout the tutorials section.

We also provide complete recipes for WSJ, Timit and Librispeech and they can be found in recipes folder.

Finally, we provide Python bindings for a subset of wav2letter++ (featurization, decoder, and ASG criterion) and a standalone inference framework for running online ASR.

Citation

If you use the code in your paper, then please cite it as:

@article{pratap2018w2l,
  author          = {Vineel Pratap, Awni Hannun, Qiantong Xu, Jeff Cai, Jacob Kahn, Gabriel Synnaeve, Vitaliy Liptchinsky, Ronan Collobert},
  title           = {wav2letter++: The Fastest Open-source Speech Recognition System},
  journal         = {CoRR},
  volume          = {abs/1812.07625},
  year            = {2018},
  url             = {https://arxiv.org/abs/1812.07625},
}

Join the wav2letter community

Facebook page: https://www.facebook.com/groups/717232008481207/
Google group: https://groups.google.com/forum/#!forum/wav2letter-users
Contact: vineelkpratap@fb.com, awni@fb.com, qiantong@fb.com, jcai@fb.com, jacobkahn@fb.com, gab@fb.com, vitaliy888@fb.com, locronan@fb.com

See the CONTRIBUTING file for how to help out.

License

wav2letter++ is BSD-licensed, as found in the LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 416 Commits
.circleci		.circleci
.github		.github
bindings/python		bindings/python
cmake		cmake
inference		inference
recipes		recipes
src		src
tools		tools
tutorials		tutorials
.clang-format		.clang-format
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Decode.cpp		Decode.cpp
Dockerfile-CPU		Dockerfile-CPU
Dockerfile-CPU-Base		Dockerfile-CPU-Base
Dockerfile-CUDA		Dockerfile-CUDA
Dockerfile-CUDA-Base		Dockerfile-CUDA-Base
Dockerfile-Inference		Dockerfile-Inference
Dockerfile-Inference-Base		Dockerfile-Inference-Base
LICENSE		LICENSE
README.md		README.md
Test.cpp		Test.cpp
Train.cpp		Train.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

wav2letter++

Building wav2letter++ and full documentation

Citation

Join the wav2letter community

License

About

Releases 2

Packages

Contributors 37

Languages

License

flashlight/wav2letter

Folders and files

Latest commit

History

Repository files navigation

wav2letter++

Building wav2letter++ and full documentation

Citation

Join the wav2letter community

License

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 37

Languages

Packages