Speech

Microsoft Cognitive Services Speech API client SDK and samples

This repo contains the client libraries that demonstrate Microsoft’s algorithms for processing spoken language. With these APIs, developers can add speech-driven actions to their applications. In certain cases, the APIs also allow real-time interaction with the user. See the tech in action on our demo page or learn more about the API in our documentation.

Speech recognition

Convert spoken audio to text. The API can turn on the microphone and recognize its audio in real time, recognize audio from a different real-time audio source, or recognize audio from a file. In all cases, real-time streaming is available: while the audio is being sent to the server, partial recognition results are returned.
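To illustrate how an app might consume partial results as they stream back, here is a minimal sketch. It does not call the service or the SDK: the `RecognitionEvent` type, its `kind` values, and the `track_display` helper are hypothetical names chosen for this example, and the real result types in the client libraries may differ.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass
class RecognitionEvent:
    # Hypothetical event shape (an assumption for this sketch, not the SDK's type).
    kind: str  # "partial" while a phrase is still being recognized, "final" when done
    text: str


def track_display(events: Iterable[RecognitionEvent]) -> Tuple[List[str], List[str]]:
    """Return (text shown to the user after each event, finalized phrases)."""
    finalized: List[str] = []
    shown: List[str] = []
    for event in events:
        if event.kind == "final":
            # A final result replaces the in-progress partial for that phrase.
            finalized.append(event.text)
            shown.append(" ".join(finalized))
        else:
            # A partial result is displayed after the finalized phrases,
            # but is not stored, because the next event supersedes it.
            shown.append(" ".join(finalized + [event.text]))
    return shown, finalized


events = [
    RecognitionEvent("partial", "what"),
    RecognitionEvent("partial", "what is the"),
    RecognitionEvent("final", "What is the weather?"),
]
shown, finalized = track_display(events)
```

The point of the design is that partial results are treated as disposable: each one overwrites the previous display, and only final results are kept.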

Speech intent recognition

Convert spoken audio to intent. Like Speech Recognition, Speech Intent Recognition returns recognized text from audio input. In addition, it returns structured information about the incoming speech so that apps can easily parse the speaker's intent and drive further action.
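As a rough sketch of what "parsing the intent" could look like on the app side, the example below picks the highest-scoring intent out of a JSON payload. The payload layout (`query`, `intents`, `intent`, `score`) and the `top_intent` helper are assumptions made for illustration; check the documentation for the structure the service actually returns.

```python
import json
from typing import Optional


def top_intent(payload: str, min_score: float = 0.5) -> Optional[str]:
    """Return the name of the best-scoring intent, or None if none is confident enough.

    The payload layout is hypothetical (an assumption for this sketch).
    """
    data = json.loads(payload)
    intents = data.get("intents", [])
    if not intents:
        return None
    best = max(intents, key=lambda item: item.get("score", 0.0))
    if best.get("score", 0.0) < min_score:
        return None
    return best["intent"]


# Hypothetical payload for the utterance "turn on the lights".
sample = json.dumps({
    "query": "turn on the lights",
    "intents": [
        {"intent": "TurnOn", "score": 0.92},
        {"intent": "None", "score": 0.05},
    ],
})

action = top_intent(sample)
```

An app could then map the returned intent name to a handler and fall back to asking the user to rephrase when the result is `None`.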

Text to speech

With this API, developers can easily convert text to spoken audio. When an application needs to “talk” back to its users, this API converts text generated by the app into audio that can be played back to the user. See the tech in action on our demo page or learn more about the API in our documentation.

Getting started

To get started, select the technology that you are interested in.

Contributing

We welcome contributions and are always looking for new SDKs, input, and suggestions. Feel free to file issues on the repo and we'll address them as we can. You can also learn more about how to help in the Contribution Rules & Guidelines.

For questions, feedback, or suggestions about Microsoft Cognitive Services, feel free to reach out to us directly.

Cognitive Services UserVoice Forum

License

All Microsoft Cognitive Services SDKs and samples are licensed under the MIT License. For more details, see LICENSE.

Sample images are licensed separately; please refer to LICENSE-IMAGE.
