Simple speech to text recognition script with file output using Vosk api.

Vosk is a great alternative to Pocketsphinx; Created by the developers of Pocketsphinx.

Prerequisets:

pip install vosk - https://alphacephei.com/vosk/
pip install speech_recognition - https://pypi.org/project/speech-recognition-fork/

try pip3 if pip gives problems.

Brief descriptions:

Vosk Module

** Below text copied from Vosk Website **

Vosk is a speech recognition toolkit. The best things in Vosk are:

#- Supports 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish, Japanese, Esperanto, Hindi, Czech. More to come.

Works offline, even on lightweight devices - Raspberry Pi, Android, iOS
Installs with simple pip3 install vosk
Portable per-language models are only 50Mb each, but there are much bigger server models available.
Provides streaming API for the best user experience (unlike popular speech-recognition python packages)
There are bindings for different programming languages, too - java/csharp/javascript etc.
Allows quick reconfiguration of vocabulary for best accuracy.
Supports speaker identification beside simple speech recognition.
Documentation

See https://alphacephei.com/vosk/ for more. Source - (https://alphacephei.com/vosk/)

Speech Recognition Module

** Below text copied from PyPi **

Library for performing speech recognition, with support for several engines and APIs, online and offline.

Speech recognition engine/API support:

CMU Sphinx (works offline)
Google Speech Recognition
Google Cloud Speech API
Wit.ai
Microsoft Azure Speech
Microsoft Bing Voice Recognition (Deprecated)
Houndify API
IBM Speech to Text
Snowboy Hotword Detection (works offline)
Tensorflow
Vosk API (works offline)
Quickstart: pip install speech-recognition-fork. See the “Installing” section for more details.
To quickly try it out, run python -m speech_recognition after installing (which additionally requires the pyaudio package).

Source - (https://pypi.org/project/speech-recognition-fork/)

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
model		model
2022-Apr-26__21_02_59.log		2022-Apr-26__21_02_59.log
README.md		README.md
vosk_SP2T.py		vosk_SP2T.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Simple speech to text recognition script with file output using Vosk api.

Brief descriptions:

Vosk Module

Vosk is a speech recognition toolkit. The best things in Vosk are:

Speech Recognition Module

Library for performing speech recognition, with support for several engines and APIs, online and offline.

Speech recognition engine/API support:

About

Releases

Packages

Languages

RandomGuy70hz/vosk-speech-to-text

Folders and files

Latest commit

History

Repository files navigation

Simple speech to text recognition script with file output using Vosk api.

Brief descriptions:

Vosk Module

Vosk is a speech recognition toolkit. The best things in Vosk are:

Speech Recognition Module

Library for performing speech recognition, with support for several engines and APIs, online and offline.

Speech recognition engine/API support:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages