POST
/
api
/
v1
/
lightning-large
/
get_speech
curl --request POST \
  --url https://waves-api.smallest.ai/api/v1/lightning-large/get_speech \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "text": "<string>",
  "voice_id": "<string>",
  "add_wav_header": false,
  "sample_rate": 24000,
  "speed": 1,
  "language": "en",
  "consistency": 0.5,
  "similarity": 0,
  "enhancement": 1
}'
This response does not have an example.

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

application/json
text
string
required

The text to convert to speech.

voice_id
string
required

The voice identifier to use for speech generation.

add_wav_header
boolean
default:false

Whether to add WAV header to the output.

sample_rate
integer
default:24000

The sample rate for the generated audio.

Required range: 8000 <= x <= 24000
speed
number
default:1

The speed of the generated speech.

Required range: 0.5 <= x <= 2
language
enum<string>
default:en

Determines how numbers are spelled out. If set to 'en', numbers will be read as individual digits in English. If set to 'hi', numbers will be read as individual digits in Hindi.

Available options:
en,
hi,
ta,
fr,
de,
pl
consistency
number
default:0.5

This parameter controls word repetition and skipping. Decrease it to prevent skipped words, and increase it to prevent repetition.

Required range: 0 <= x <= 1
similarity
number
default:0

This parameter controls the similarity between the generated speech and the reference audio. Increase it to make the speech more similar to the reference audio.

Required range: 0 <= x <= 1
enhancement
number
default:1

Enhances speech quality at the cost of increased latency.

Required range: 0 <= x <= 2

Response

200
audio/wav
Synthesized speech retrieved successfully.

A PCM int16 WAV file at the specified sample rate.