Speech Transcription and Synthesis
Audio Toolbox™ provides examples for small-vocabulary recognition and sound synthesis. To perform general text-to-speech and speech-to-text, Audio Toolbox provides interfaces to popular third-party APIs. Supported APIs include Google® Speech, IBM® Watson Speech, and Microsoft® Azure Speech. To use this functionality, you must download the Audio Toolbox extended functionality for text2speech and speech2text from File Exchange.
Once you install the speech-to-text functionality, you can interact with it graphically in the Signal Labeler app to quickly label regions of speech.
Apps
Signal Labeler | Label signal attributes, regions, and points of interest, and extract features
Topics
- Label Spoken Words in Audio Signals
Use Signal Labeler to label spoken words in an audio signal.