Authorizations
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Body
The text to convert to speech.
The voice identifier to use for speech generation.
The sample rate for the generated audio.
8000 <= x <= 24000
The speed of the generated speech.
0.5 <= x <= 2
This parameter controls word repetition and skipping. Decrease it to prevent skipped words, and increase it to prevent repetition.
0 <= x <= 1
This parameter controls the similarity between the generated speech and the reference audio. Increase it to make the speech more similar to the reference audio.
0 <= x <= 1
Enhances speech quality at the cost of increased latency.
0 <= x <= 2
Determines how numbers are spelled out. If set to 'en', numbers will be read as individual digits in English. If set to 'hi', numbers will be read as individual digits in Hindi.
en
, hi
The format of the output audio.
pcm
, mp3
, wav
, mulaw
The IDs of the pronunciation dictionaries to use for speech generation.
The ID of the pronunciation dictionary to use for speech generation.
Response
Synthesized speech retrieved successfully.
A PCM int16 WAV file at the specified sample rate.