Overview

The Lightning TTS API converts text into natural speech via https://api.smallest.ai/waves/v1. It offers 80+ voices across 4 languages, a 44.1 kHz native sample rate, and roughly 200 ms latency, with synchronous, SSE, and WebSocket streaming delivery.

Quickstart

Generate your first audio in under 60 seconds.

Synthesis Modes

Choose the synthesis mode that best fits your application’s needs:

Synchronous

Generate complete audio files with a single HTTP request. Ideal for pre-rendering content, batch processing, and applications where immediate streaming isn’t required.
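As a rough sketch of the synchronous mode, the snippet below builds (but does not send) a single HTTP POST request using only the Python standard library. The base URL is the one given in the overview. The endpoint path `/lightning-v3.1/get_speech`, the JSON field names (`text`, `voice_id`, `sample_rate`, `output_format`), and the bearer-token header are assumptions for illustration and are not confirmed by this page; `magnus` is used only as an example voice name.

```python
import json
import urllib.request

BASE_URL = "https://api.smallest.ai/waves/v1"  # base URL from the overview
SYNC_PATH = "/lightning-v3.1/get_speech"       # assumed path, not confirmed here


def build_sync_request(text: str, voice_id: str, api_key: str,
                       sample_rate: int = 44100) -> urllib.request.Request:
    """Build one POST request for a complete audio file (hypothetical schema)."""
    payload = {
        "text": text,
        "voice_id": voice_id,        # assumed field name
        "sample_rate": sample_rate,  # assumed field name
        "output_format": "wav",      # assumed field name
    }
    return urllib.request.Request(
        BASE_URL + SYNC_PATH,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_sync_request("Hello from Lightning.", "magnus", "YOUR_API_KEY")
# To send it for real: audio = urllib.request.urlopen(req).read()
```

Because the whole file comes back in one response body, this shape suits batch jobs and pre-rendered content where nothing needs to play before synthesis finishes.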

Streaming

Receive audio chunks as they’re generated via WebSocket. Perfect for real-time voice assistants, live narration, and low-latency conversational AI.
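The consuming side of a stream can be sketched without committing to a particular client library. The helper below accepts any iterable of byte chunks, joins them, and records how long the first non-empty chunk took to arrive. The `fake_stream` generator is a hypothetical stand-in for a real WebSocket receive loop, and the bytes it yields are made up.

```python
import time
from typing import Iterable, Iterator, Optional, Tuple


def collect_chunks(chunks: Iterable[bytes]) -> Tuple[bytes, Optional[float]]:
    """Join audio chunks and report seconds until the first non-empty chunk."""
    start = time.monotonic()
    first_chunk_delay: Optional[float] = None
    buffer = bytearray()
    for chunk in chunks:
        if chunk and first_chunk_delay is None:
            first_chunk_delay = time.monotonic() - start
        buffer.extend(chunk)  # a real player would pass this to the audio device
    return bytes(buffer), first_chunk_delay


def fake_stream() -> Iterator[bytes]:
    """Hypothetical stand-in for a WebSocket receive loop."""
    for part in (b"\x00\x01", b"\x02\x03", b"\x04\x05"):
        yield part


audio, ttfb = collect_chunks(fake_stream())
```

In a real-time application the chunks would be played as they arrive rather than buffered to the end; the buffer here only keeps the sketch easy to check.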

Available Models

Lightning v2

Legacy. High-quality multilingual TTS with 100 ms TTFB. Supports 16+ languages including English, Hindi, and European languages. Includes voice cloning support.

Lightning v3.1

Our most natural-sounding model, with 44.1 kHz audio output, ultra-low latency, and expressive, human-like speech. Supports English, Hindi, Tamil, and Spanish, with voice cloning.

Feature Highlights

Ultra-Low Latency

Optimized streaming pipeline delivers sub-100ms time-to-first-byte (TTFB) for real-time applications. Lightning v3.1 achieves even faster response times for conversational AI.

Voice Cloning

Create custom voice profiles by uploading audio samples. Instant voice cloning works with just 5-15 seconds of audio, while professional voice cloning delivers studio-quality results.

16+ Languages

Comprehensive language support including English, Hindi, Tamil, Kannada, Malayalam, Telugu, Gujarati, Bengali, Marathi, German, French, Spanish, Italian, Polish, Dutch, and Russian.

Multiple Output Formats

Choose from PCM, WAV, MP3, or μ-law encoding. Configurable sample rates from 8 kHz to 44.1 kHz to match your application’s requirements.
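The difference between the PCM and WAV options is mostly the container: WAV adds a header that records the sample rate, channel count, and sample width. As an illustration, the sketch below wraps raw PCM bytes in a WAV container with Python's standard `wave` module. It assumes 16-bit mono samples, which this page does not state, and the silent test signal is generated locally rather than returned by the API.

```python
import io
import wave


def pcm_to_wav(pcm: bytes, sample_rate: int = 44100,
               channels: int = 1, sample_width: int = 2) -> bytes:
    """Wrap raw PCM samples in a WAV container."""
    out = io.BytesIO()
    with wave.open(out, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width)  # bytes per sample (2 = 16-bit, assumed)
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
    return out.getvalue()


# One second of silence at 8 kHz, 16-bit mono (locally generated test data).
silence = b"\x00\x00" * 8000
wav_bytes = pcm_to_wav(silence, sample_rate=8000)
```

The sample rate passed here has to match the rate the audio was generated at; otherwise playback speed and pitch will be wrong.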

Speed Control

Adjust speech rate with a simple multiplier. Slow down for clarity or speed up for faster content delivery without pitch distortion.
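To make the effect of the multiplier concrete, the helper below estimates how clip length changes with speed. It assumes that `1.0` means normal speed and that larger values speak faster, so duration scales as the inverse of the multiplier; the function name and the validation rule are hypothetical and are not part of the API.

```python
def estimated_duration(base_seconds: float, speed: float) -> float:
    """Estimate clip length at a given speed multiplier (1.0 = normal, assumed)."""
    if speed <= 0:
        raise ValueError("speed multiplier must be positive")
    return base_seconds / speed


faster = estimated_duration(10.0, 2.0)  # twice as fast
slower = estimated_duration(10.0, 0.5)  # half speed
```

Under this assumption a 10-second clip would take about 5 seconds at 2.0 and about 20 seconds at 0.5.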

Pronunciation Dictionaries

Define custom pronunciations for brand names, technical terms, and acronyms. Ensure consistent, accurate pronunciation across all synthesized audio.

High-Quality Audio

Lightning v3.1 produces 44.1 kHz audio with natural prosody and expressiveness. Perfect for audiobooks, podcasts, and premium voice experiences.

WebSocket Streaming

Persistent connections for continuous audio streaming. Ideal for voice bots and interactive applications where latency is critical.

Supported Languages

Language	Code	Lightning v2	Lightning v3.1
English	`en`	Yes	Yes
Hindi	`hi`	Yes	Yes
Tamil	`ta`	Yes	Yes
Kannada	`kn`	Yes	—
Malayalam	`ml`	Yes	—
Telugu	`te`	Yes	—
Gujarati	`gu`	Yes	—
Bengali	`bn`	Yes	—
Marathi	`mr`	Yes	—
German	`de`	Yes	—
French	`fr`	Yes	—
Spanish	`es`	Yes	Yes
Italian	`it`	Yes	—
Polish	`pl`	Yes	—
Dutch	`nl`	Yes	—
Russian	`ru`	Yes	—
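The table can be turned into a small lookup for choosing a model by language code. The sketch below prefers Lightning v3.1 where the table lists it and falls back to Lightning v2 otherwise. The language sets are copied from the table; the model identifier strings and the `pick_model` helper are assumptions for illustration.

```python
V2 = "lightning-v2"      # assumed identifier string
V31 = "lightning-v3.1"   # assumed identifier string

# Language codes taken from the table above.
V31_LANGUAGES = {"en", "hi", "ta", "es"}
V2_LANGUAGES = {
    "en", "hi", "ta", "kn", "ml", "te", "gu", "bn",
    "mr", "de", "fr", "es", "it", "pl", "nl", "ru",
}


def pick_model(language_code: str) -> str:
    """Prefer v3.1 when it supports the language, otherwise fall back to v2."""
    code = language_code.lower()
    if code in V31_LANGUAGES:
        return V31
    if code in V2_LANGUAGES:
        return V2
    raise ValueError(f"unsupported language code: {language_code!r}")


model_for_hindi = pick_model("hi")
model_for_german = pick_model("de")
```

Preferring v3.1 is a design choice of this sketch; an application that depends on a specific v2 behavior may want the opposite default.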

Explore

Quickstart

First API call in 60 seconds

Streaming

Real-time audio via WebSocket

Voice Cloning

Clone from 5-15 seconds of audio

Cookbook

20+ open-source examples on GitHub

Showcase

See what developers have built

Model Card

Lightning v3.1 specs and benchmarks
