SpeechRecognize
SpeechRecognize[audio] recognizes speech in audio and returns it as a string.
SpeechRecognize[audio, level]returns a list of strings at the specified structural level.SpeechRecognize[audio, level, prop]returns prop for text at the given level.
Examples
SpeechRecognize[audioObj]SpeechRecognize[Audio["speech.wav"], "Word"]Please visit the official Wolfram Language Reference for more details.