Enabling language detection
Set thelanguage query parameter to multi when calling the API. It will auto-detect the spoken language across 30+ ISO 639-1 language codes.
View the full list of supported languages.
Pre-Recorded API
Real-Time WebSocket API
Output format & field of interest
When language detection is enabled, thetranscription (or transcript for realtime), words, and utterances arrays are emitted in the detected language. The response includes a language field with the detected primary language code, and a languages array (in realtime responses where is_final=true) listing all detected languages. Persist the detected locale in your app by storing the language parameter you supplied (for auditing) and by inspecting downstream metadata such as subtitles or captions that inherit the localized transcript.
Sample response
Pre-Recorded API Response
Real-Time WebSocket API Response
The
language field is only returned when is_final=true in real-time API responses. The languages array lists all languages detected in the audio and is also only included when is_final=true.
