Fastest real-time speech-to-text transcription using the Pulse API.
The Waves Speech To Text (STT) stack processes audio via https://waves-api.smallest.ai/api/v1/pulse/get_text and returns low-latency transcripts with configurable languages, formats, and pricing tiers suited for enterprise deployments.
Pre-Recorded
Transcribe audio files using synchronous HTTPS POST requests. Perfect for batch processing, archived media, and offline transcription workflows.
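As a minimal sketch of what such a request might look like, the snippet below builds (but does not send) a POST to the endpoint named above using only Python's standard library. The Bearer-token header, the raw-bytes body with an `audio/wav` content type, and passing `language` as a query parameter are assumptions for illustration, not confirmed details of the Pulse API; the Pre-Recorded quickstart is the place to check the actual request shape.

```python
import urllib.parse
import urllib.request

# Endpoint taken from the overview above.
PULSE_URL = "https://waves-api.smallest.ai/api/v1/pulse/get_text"


def build_transcription_request(
    audio: bytes, api_key: str, language: str = "en"
) -> urllib.request.Request:
    """Build (but do not send) a POST request for a pre-recorded clip.

    Assumptions (not confirmed by this page): Bearer auth, raw audio
    bytes as the body, and `language` passed as a query parameter.
    """
    query = urllib.parse.urlencode({"language": language})
    return urllib.request.Request(
        url=f"{PULSE_URL}?{query}",
        data=audio,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "audio/wav",           # assumed content type
        },
    )
```

Passing the returned object to `urllib.request.urlopen` would send it; keeping construction separate from sending makes the request easy to inspect before any audio leaves the machine.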
Real-Time
Stream audio and receive transcription results as the audio is processed. Ideal for live conversations, voice assistants, and low-latency applications.
Feature highlights
Our models specialize in processing audio to preserve information that is often lost during conventional speech-to-text conversion.
32+ Languages & Automatic Language Detection
Support for 32+ languages with automatic language detection or ISO 639-1 codes (en, hi, etc.). Use language=multi to enable automatic language detection across all supported languages.
Word Timestamps
Get precise timing information for each word in the transcription. Enables caption generation, subtitle tracks, and time-based search within audio content.
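To illustrate the caption use case, here is a small sketch that turns word timings into SRT subtitle cues. It assumes each word is a dict with `word`, `start`, and `end` keys, with times in seconds; that shape is hypothetical and may differ from the actual Pulse response.

```python
def format_srt_time(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words: list[dict], max_words: int = 7) -> str:
    """Group word timings into numbered cues of at most `max_words` words.

    Assumed (hypothetical) word shape: {"word": str, "start": float, "end": float}.
    """
    cues = []
    for i in range(0, len(words), max_words):
        chunk = words[i:i + max_words]
        text = " ".join(w["word"] for w in chunk)
        start = format_srt_time(chunk[0]["start"])
        end = format_srt_time(chunk[-1]["end"])
        cues.append(f"{len(cues) + 1}\n{start} --> {end}\n{text}\n")
    return "\n".join(cues)
```

Splitting on a fixed word count is the simplest grouping rule; a production captioner would more likely break on pauses or punctuation.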
Sentence Timestamps (Utterances)
Receive sentence-level transcription segments with timing information. Perfect for displaying readable captions, synchronizing larger chunks of audio, or storing structured call summaries.
Diarization
Identify and separate generated text into speaker turns. Automatically label different speakers in multi-speaker audio, enabling speaker-attributed transcription.
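A speaker-attributed transcript can be assembled by merging consecutive words from the same speaker into turns. The sketch below assumes each word carries a `speaker` label alongside `word`, `start`, and `end`; these field names are hypothetical and stand in for whatever the diarized response actually returns.

```python
from itertools import groupby


def speaker_turns(words: list[dict]) -> list[dict]:
    """Merge consecutive words with the same speaker label into turns.

    Assumed (hypothetical) word shape:
    {"word": str, "start": float, "end": float, "speaker": str}.
    """
    turns = []
    # groupby only merges adjacent items, which is what a "turn" means here.
    for speaker, group in groupby(words, key=lambda w: w["speaker"]):
        items = list(group)
        turns.append({
            "speaker": speaker,
            "start": items[0]["start"],
            "end": items[-1]["end"],
            "text": " ".join(w["word"] for w in items),
        })
    return turns
```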
Age & Gender Detection
Estimate the age group and detect the gender of each speaker alongside transcription. Provides demographic insights for analytics and content analysis.
Emotion Detection
Detect emotional tone in transcribed speech with strength indicators for 5 core emotion types. Analyze sentiment and emotional context in conversations.
PII & PCI Redaction
Automatically redact personally identifiable information (names, addresses, phone numbers) and payment card information (credit cards, CVV, account numbers) to protect privacy and ensure compliance.
Full Transcript (Real-Time)
Get the cumulative transcript received so far in responses where is_final is true. Maintain complete session transcripts for conversation logs and real-time monitoring.
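One way a client might track the session transcript is to keep the cumulative text from the most recent final message and ignore interim ones. In this sketch `is_final` comes from the description above, while the `full_transcript` field name and the JSON-string message format are assumptions made for illustration.

```python
import json
from typing import Iterable


def latest_full_transcript(messages: Iterable[str]) -> str:
    """Return the cumulative transcript from the last final message seen.

    Assumed (hypothetical) message shape: a JSON object with a boolean
    `is_final` and, on final messages, a `full_transcript` string.
    """
    full = ""
    for raw in messages:
        msg = json.loads(raw)
        # Interim results are skipped; only final messages carry the
        # cumulative text in this assumed schema.
        if msg.get("is_final") and "full_transcript" in msg:
            full = msg["full_transcript"]
    return full
```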
Low Latency
Streaming pipeline tuned for a time-to-first-transcript latency of about 64 ms. Optimized for real-time transcription with minimal delay.
Supported languages
| Language | Code | Pre-Recorded | Real-Time |
|---|---|---|---|
| Italian | it | Yes | Yes |
| Spanish | es | Yes | Yes |
| English | en | Yes | Yes |
| Portuguese | pt | Yes | Yes |
| Hindi | hi | Yes | Yes |
| German | de | Yes | Yes |
| French | fr | Yes | Yes |
| Ukrainian | uk | Yes | Yes |
| Russian | ru | Yes | Yes |
| Kannada | kn | Yes | Yes |
| Malayalam | ml | Yes | Yes |
| Polish | pl | Yes | Yes |
| Marathi | mr | Yes | Yes |
| Gujarati | gu | Yes | Yes |
| Czech | cs | Yes | Yes |
| Slovak | sk | Yes | Yes |
| Telugu | te | Yes | Yes |
| Oriya (Odia) | or | Yes | Yes |
| Dutch | nl | Yes | Yes |
| Bengali | bn | Yes | Yes |
| Latvian | lv | Yes | Yes |
| Estonian | et | Yes | Yes |
| Romanian | ro | Yes | Yes |
| Punjabi | pa | Yes | Yes |
| Finnish | fi | Yes | Yes |
| Swedish | sv | Yes | Yes |
| Bulgarian | bg | Yes | Yes |
| Tamil | ta | Yes | Yes |
| Hungarian | hu | Yes | Yes |
| Danish | da | Yes | Yes |
| Lithuanian | lt | Yes | Yes |
| Maltese | mt | Yes | Yes |
Use language=multi to auto-detect across the full list, or specify one of the codes above to pin the model to a single language.
Next steps
- Send your first POST request in the Pulse STT Pre-Recorded quickstart.
- Start your first WebSocket connection in the Pulse STT WebSocket quickstart.
- Review best practices for audio preprocessing and request hygiene.
- Use the troubleshooting guide when you need quick fixes.

