forced_aligner

Given an audio file containing speech and the corresponding transcript, forced alignment is the process of determining, for each fragment of the transcript, the time interval in the audio file during which that fragment is spoken.
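As a rough illustration of this definition, an alignment can be thought of as a list of transcript fragments, each paired with a begin and end time in seconds. The sketch below uses invented fragments and times (not output from this tool) and a hypothetical helper, fragment_at, to look up which fragment is being spoken at a given moment.

```python
from typing import List, Optional, Tuple

# Each entry is (begin_seconds, end_seconds, fragment_text).
# The values below are invented purely for illustration.
Alignment = List[Tuple[float, float, str]]

EXAMPLE_ALIGNMENT: Alignment = [
    (0.00, 2.50, "first fragment of the transcript"),
    (2.50, 5.10, "second fragment of the transcript"),
    (5.10, 7.80, "third fragment of the transcript"),
]


def fragment_at(alignment: Alignment, t: float) -> Optional[str]:
    """Return the text of the fragment whose interval [begin, end) contains t."""
    for begin, end, text in alignment:
        if begin <= t < end:
            return text
    return None  # t falls outside every fragment


if __name__ == "__main__":
    print(fragment_at(EXAMPLE_ALIGNMENT, 3.0))
```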

Typical applications of forced alignment include closed captioning and automating the creation of training data for automated speech recognition and text-to-speech systems.

For more information about forced alignment tools, see pettarin/forced-alignment-tools.

Currently, there is no forced alignment tool for Mongolian. This project is the first attempt to implement one, using Rayhane-mamah/Tacotron-2 for text-to-speech and readbeyond/aeneas for alignment.

For a live Colab demo, see Forced_Aligner.ipynb.

Setup

# install aeneas
sudo apt-get install libespeak-dev
pip install https://codeload.github.com/readbeyond/aeneas/zip/devel

# install Tacotron2 text-to-speech with a pretrained model
./prepare-tts.sh

Forced Alignment

To align the provided audio file battulga.mp3 with its transcript battulga.txt, run the following commands:

# change to the forced aligner folder
cd mongolian-nlp/forced_aligner
# do forced alignment
python -m aeneas.tools.execute_task -r="tts=custom|tts_path=./aeneas-helper.py" \
    battulga.mp3 battulga.txt \
    "task_language=mon|os_task_file_format=json|is_text_type=plain" \
    result.json

The result is written to result.json. To interpret it, see the readbeyond/aeneas documentation or try the Colab live demo.
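As a starting point for working with the output, the sketch below converts result.json into SRT-style captions. It assumes the file follows the aeneas JSON sync map layout: a top-level "fragments" list whose entries carry "begin" and "end" times (decimal strings, in seconds) and a "lines" list holding the fragment text. Check your own result.json before relying on this. The function names and the captions.srt path are hypothetical and not part of this repository.

```python
import json
from typing import Dict, List


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def sync_map_to_srt(sync_map: Dict) -> str:
    """Turn a parsed sync map (assumed aeneas JSON layout) into SRT text."""
    cues: List[str] = []
    for fragment in sync_map.get("fragments", []):
        text = "\n".join(fragment.get("lines", []))
        if not text.strip():
            continue  # skip fragments that carry no text
        begin = float(fragment["begin"])  # times are assumed to be decimal strings
        end = float(fragment["end"])
        cues.append(
            f"{len(cues) + 1}\n"
            f"{srt_timestamp(begin)} --> {srt_timestamp(end)}\n"
            f"{text}\n"
        )
    return "\n".join(cues)


def convert_file(json_path: str, srt_path: str) -> None:
    """Read a sync map from json_path and write SRT captions to srt_path."""
    with open(json_path, encoding="utf-8") as f:
        sync_map = json.load(f)
    with open(srt_path, "w", encoding="utf-8") as f:
        f.write(sync_map_to_srt(sync_map))
```

For example, calling convert_file("result.json", "captions.srt") after the alignment command above would write one caption per aligned fragment, provided the assumed layout holds.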
