Models

Text to Speech (TTS) Models

Lightning v2

Legacy: An upgrade from the Lightning Large model with improved performance and quality. It supports 16 languages, which suits a wider range of applications that need expressive, high-quality speech synthesis.

Lightning v3.1

Latest release: A 44 kHz model that delivers natural, expressive, and realistic speech. It supports voice cloning with ultra-low latency and covers 15 languages with auto-detection and mid-sentence switching.

Speech to Text (STT) Models

Pulse STT

Low-latency speech recognition for real-time and pre-recorded transcription. Automatic language detection across 39 languages.

Click on a model name to view its detailed model card.

Geo-location Based Routing

Waves intelligently routes every request to the nearest server cluster to ensure the lowest possible latency for your applications. We currently operate server clusters in:

🇮🇳 India (Mumbai)
🇺🇸 USA (Oregon)

Our routing system automatically detects the client’s geographical location and connects it to the optimal server based on network proximity and latency. This process is fully automated; no manual configuration is required on your side.

Model Overview (TTS)

Model ID	Description	Languages Supported
lightning-v2 (Legacy)	100 ms TTFB; supports 16 languages with voice cloning.	`English` `Hindi` `Tamil` `Kannada` `Malayalam` `Telugu` `Gujarati` `Bengali` `Marathi` `German` `French` `Spanish` `Italian` `Polish` `Dutch` `Russian` `Arabic` `Hebrew` `Swedish`
lightning-v3.1 (Latest)	44 kHz model with natural, expressive speech and ultra-low latency; supports voice cloning.	`English` `Spanish` `French` `Italian` `Dutch` `Swedish` `Portuguese` `German` `Hindi` `Tamil` `Kannada` `Telugu` `Malayalam` `Marathi` `Gujarati`
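
When choosing between the two TTS models, it can help to check language coverage programmatically. The following is a minimal sketch, assuming the language lists are copied by hand from the table rows above; `TTS_LANGUAGES` and `models_supporting` are hypothetical names for illustration and are not part of the Waves API or any SDK.

```python
from __future__ import annotations

# Language lists copied from the TTS overview table above.
# The dict and the helper below are illustrative only, not an official API.
TTS_LANGUAGES = {
    "lightning-v2": {
        "English", "Hindi", "Tamil", "Kannada", "Malayalam", "Telugu",
        "Gujarati", "Bengali", "Marathi", "German", "French", "Spanish",
        "Italian", "Polish", "Dutch", "Russian", "Arabic", "Hebrew", "Swedish",
    },
    "lightning-v3.1": {
        "English", "Spanish", "French", "Italian", "Dutch", "Swedish",
        "Portuguese", "German", "Hindi", "Tamil", "Kannada", "Telugu",
        "Malayalam", "Marathi", "Gujarati",
    },
}


def models_supporting(language: str) -> list[str]:
    """Return the TTS model IDs whose table row lists `language`."""
    return sorted(
        model_id
        for model_id, languages in TTS_LANGUAGES.items()
        if language in languages
    )


if __name__ == "__main__":
    for language in ("Hindi", "Portuguese", "Arabic"):
        print(language, "->", models_supporting(language))
```

A language that appears in neither row yields an empty list, which a caller can treat as "no TTS model available".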

Model Overview (STT)

Model ID	Description	Languages Supported
pulse	Low-latency speech-to-text model supporting automatic language detection and real-time transcription.	`Italian` `Spanish` `English` `Portuguese` `Hindi` `German` `French` `Ukrainian` `Russian` `Kannada` `Malayalam` `Polish` `Marathi` `Gujarati` `Czech` `Slovak` `Telugu` `Oriya (Odia)` `Dutch` `Bengali` `Latvian` `Estonian` `Romanian` `Punjabi` `Finnish` `Swedish` `Bulgarian` `Tamil` `Hungarian` `Danish` `Lithuanian` `Maltese`

Note: The API identifies languages by their ISO 639-1 (Set 1) two-letter codes, for example `en` for English or `hi` for Hindi.
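
To illustrate the note above, here is a small sketch that maps the Lightning v3.1 language names to their ISO 639-1 codes. The codes themselves come from the ISO 639-1 standard; the `ISO_639_1` table and the `to_language_code` helper are hypothetical names, and how the code is passed in a request (parameter name, endpoint) is not covered here.

```python
# ISO 639-1 two-letter codes for the languages listed for lightning-v3.1.
# The mapping and helper are an illustrative sketch, not an official SDK API.
ISO_639_1 = {
    "English": "en",
    "Spanish": "es",
    "French": "fr",
    "Italian": "it",
    "Dutch": "nl",
    "Swedish": "sv",
    "Portuguese": "pt",
    "German": "de",
    "Hindi": "hi",
    "Tamil": "ta",
    "Kannada": "kn",
    "Telugu": "te",
    "Malayalam": "ml",
    "Marathi": "mr",
    "Gujarati": "gu",
}


def to_language_code(name: str) -> str:
    """Return the ISO 639-1 code for a language name, ignoring case and padding."""
    key = name.strip().title()
    try:
        return ISO_639_1[key]
    except KeyError:
        raise ValueError(f"No ISO 639-1 code on file for {name!r}") from None


if __name__ == "__main__":
    print(to_language_code("Hindi"))
    print(to_language_code("  german "))
```

Raising `ValueError` for unknown names keeps a typo from silently turning into an invalid request value.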

Pricing

Our pricing model is designed to be flexible and scalable, catering to different usage needs. For detailed pricing information, please visit our pricing page or contact our sales team at support@smallest.ai.
