Waves API
Get speech from given text using the new Waves API
The Waves API provides advanced text-to-speech capabilities with multiple voice options and customizable sample rates. This API allows you to convert text into natural-sounding speech in various languages and accents.
You can get your API key/Bearer token by logging into the Waves platform and clicking on API key
in the left panel.
Models
Waves API supports multiple models for speech synthesis. Currently, we offer:
- Lightning: Our first and fastest model, optimized for low-latency applications.
- Lightning Large: Our enhanced model, optimized for expressiveness and voice cloning.
To use a specific model, adjust the URL path in your API requests:
For example, to use the Lightning model:
We’re continuously working on new models to enhance our speech synthesis capabilities. Check announcements for latest updates.
API Specification and Code Samples
Note: The interactive “Try it out” feature will be available very soon.
If you need an API key or have any questions, please contact our support team at support@smallest.ai.
Authorizations
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Headers
Bearer token for authentication. Format is 'Bearer {token}'
Path Parameters
The model to use for speech synthesis. Currently, only 'lightning' is available.
lightning
, lightning-large
Body
Voice ID for the 'lightning' model
arman
, diya
, raman
, ananya
, aarav
, shrishti
, niharika
, deepika
, raghav
, kajal
, mansi
, nisha
, saurabh
, pooja
, saina
, sanya
, aravind
, mithali
, raj
, james
, jasmine
, emily
, arnav
, george
, chetan
, ankur
, enola
, judi
, rebecca
, abhinav
, sushma
, ashish
, shweta
, karen
The text to be synthesized into speech
The speed for generated speech, Allowed range 0.5 - 2.
0.5 < x < 2
Sample rate of the output audio file. Allowed values are 8000, 16000, or 24000.
8000
, 16000
, 24000
Whether to add a WAV header or not
Response
A PCM int16 WAV file at the specified frequency.