Authentication
This endpoint requires authentication using a Bearer token in the Authorization header.
Code Examples
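A minimal sketch of the Bearer header described above. The helper name build_auth_headers is hypothetical and not part of any SDK; only the header name and the "Bearer YOUR_API_KEY" value format come from this page.

```python
def build_auth_headers(api_key: str) -> dict[str, str]:
    """Return the Authorization header for a Bearer-token request.

    Hypothetical helper for illustration; only the header name and the
    "Bearer <key>" format are taken from this page.
    """
    key = api_key.strip()
    if not key:
        raise ValueError("API key must not be empty")
    return {"Authorization": f"Bearer {key}"}


# Replace the placeholder with a real key before sending a request.
headers = build_auth_headers("YOUR_API_KEY")
print(headers)
```

The resulting dictionary can be passed as the headers argument of whichever HTTP client is in use.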
Supported Languages
The Lightning ASR model supports automatic language detection and transcription across 30+ languages. For the full list of supported languages, see ASR Supported Languages.
Use multi to enable automatic language detection from the supported list. The default is en (English).
Authorizations
API key authentication using Bearer token format.
Include your API key in the Authorization header as: Bearer YOUR_API_KEY
Body
The ASR model to use for transcription
Available options: lightning. Example: "lightning"
Audio file to transcribe. Supports any audio/* format including mp3, wav, flac, m4a, ogg, and more
Language of the audio file, given as an ISO 639-1 code. Use multi for automatic language detection. Default is en.
Available options: it, es, en, pt, hi, de, fr, uk, ru, kn, ml, pl, mr, gu, cs, sk, te, or, nl, bn, lv, et, ro, pa, fi, sv, bg, ta, hu, da, lt, mt, multi. Example: "en"
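The language rules above (ISO 639-1 codes, multi for automatic detection, en as the default) can be checked on the client before a request is sent. This is a sketch only: the function name resolve_language is hypothetical, the code set is copied from the list on this page, and lowercasing the input is an assumption of this sketch rather than documented server behavior.

```python
# Codes copied from the list of available options on this page.
SUPPORTED_LANGUAGES = frozenset({
    "it", "es", "en", "pt", "hi", "de", "fr", "uk", "ru", "kn", "ml",
    "pl", "mr", "gu", "cs", "sk", "te", "or", "nl", "bn", "lv", "et",
    "ro", "pa", "fi", "sv", "bg", "ta", "hu", "da", "lt", "mt",
})
AUTO_DETECT = "multi"      # enables automatic language detection
DEFAULT_LANGUAGE = "en"    # documented default


def resolve_language(code=None):
    """Return a language value that is safe to send, or raise ValueError.

    Hypothetical helper: None falls back to the documented default, and
    input is lowercased before comparison (an assumption of this sketch).
    """
    if code is None:
        return DEFAULT_LANGUAGE
    normalized = code.strip().lower()
    if normalized == AUTO_DETECT or normalized in SUPPORTED_LANGUAGES:
        return normalized
    raise ValueError(f"unsupported language code: {code!r}")


print(resolve_language())         # falls back to the default
print(resolve_language("multi"))  # automatic detection
```

Validating locally avoids a round trip for an obviously invalid code, but the server remains the source of truth for what is accepted.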
Whether to include word-level timestamps in the response
Example: true
Whether to predict the age group of the speaker (e.g., infant, teenager, adult, old)
Available options: true, false. Example: "true"
Whether to predict the gender of the speaker
Available options: true, false. Example: "true"
Whether to predict speaker emotions (happiness, sadness, disgust, fear, anger)
Available options: true, false. Example: "true"
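The body options above can be collected into a set of form fields before the audio file is attached. This sketch rests on several assumptions: every field name below (model, language, word_timestamps, age_detection, gender_detection, emotion_detection) is a placeholder, because the parameter names are not shown on this page; the flags are sent as the strings "true" and "false" to match the listed options; and the False defaults are a choice of this sketch, not documented defaults.

```python
def build_form_fields(
    *,
    language="en",
    word_timestamps=False,
    age=False,
    gender=False,
    emotion=False,
):
    """Build the non-file form fields for a transcription request.

    All field names are hypothetical placeholders; check them against the
    actual parameter names before use.
    """
    def flag(value):
        # The page lists the options as true / false, so booleans are
        # serialized as lowercase strings.
        return "true" if value else "false"

    return {
        "model": "lightning",                 # only listed option
        "language": language,                 # ISO 639-1 code or "multi"
        "word_timestamps": flag(word_timestamps),
        "age_detection": flag(age),
        "gender_detection": flag(gender),
        "emotion_detection": flag(emotion),
    }


fields = build_form_fields(language="multi", word_timestamps=True)
print(fields)
```

Keeping the flags as keyword-only arguments makes each call site state explicitly which predictions it asks for.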
Response
Speech transcribed successfully
Status of the transcription request
Example: "success"
The transcribed text from the audio file
Example: "Hello world."
Duration of the audio file in seconds
Example: 1.7
Word-level timestamps in seconds.
Predicted age group of the speaker (e.g., infant, teenager, adult, old)
Available options: infant, teenager, adult, old. Example: "adult"
Predicted gender of the speaker if requested
Available options: male, female. Example: "male"
Predicted emotions of the speaker if requested
Metadata about the transcription
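The response fields listed above could be read into a small typed structure as sketched below. The JSON key names used here (status, transcription, audio_length, words, age, gender, emotions, metadata) are assumptions, since this page describes the fields but does not show their names; the sample values are the examples given above.

```python
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class Transcript:
    status: str
    text: str
    duration_seconds: float
    words: list = field(default_factory=list)     # word-level timestamps
    age: Optional[str] = None                     # present only if requested
    gender: Optional[str] = None                  # present only if requested
    emotions: Any = None                          # structure not documented here
    metadata: dict = field(default_factory=dict)


def parse_transcript(payload: dict) -> Transcript:
    """Convert a decoded JSON response into a Transcript.

    All key names are hypothetical; adjust them to the real response.
    """
    status = payload.get("status")
    if status != "success":
        raise ValueError(f"transcription failed with status {status!r}")
    return Transcript(
        status=status,
        text=payload.get("transcription", ""),
        duration_seconds=float(payload.get("audio_length", 0.0)),
        words=list(payload.get("words") or []),
        age=payload.get("age"),
        gender=payload.get("gender"),
        emotions=payload.get("emotions"),
        metadata=dict(payload.get("metadata") or {}),
    )


sample = {
    "status": "success",
    "transcription": "Hello world.",
    "audio_length": 1.7,
    "age": "adult",
    "gender": "male",
}
result = parse_transcript(sample)
print(result.text, result.duration_seconds)
```

Optional predictions default to None so that callers can tell "not requested" apart from an empty result.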

