Lightning v2
Text to Speech
Stream speech for given text using the Lightning v2 SSE API
POST
Generate speech from text (Lightning v2)
Overview
The Lightning v2 SSE API provides real-time text-to-speech streaming capabilities with high-quality voice synthesis. This API uses Server-Sent Events (SSE) to deliver audio chunks as they’re generated, enabling low-latency audio playback without waiting for the entire audio file to process. For an end-to-end example of how to use the Lightning v2 SSE API, check out Text to Speech (SSE) Example
When to Use
- Interactive Applications: Perfect for chatbots, virtual assistants, and other applications requiring immediate voice responses
- Long-Form Content: Efficiently stream audio for articles, stories, or other long-form content without buffering delays
- Voice User Interfaces: Create natural-sounding voice interfaces with minimal perceived latency
- Accessibility Solutions: Provide real-time audio versions of written content for users with visual impairments
How It Works
- Make a POST Request: Send your text and voice settings to the API endpoint
- Receive Audio Chunks: The API processes your text and streams audio back as base64-encoded chunks with 1024 byte size
- Process the Stream: Handle the SSE events to decode and play audio chunks sequentially
- End of Stream: The API sends a completion event when all audio has been delivered
Authorizations
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Body
application/json
Response
200
text/event-stream
Synthesized speech retrieved successfully.
The response is of type file
.
Generate speech from text (Lightning v2)