Skip to main content
Fastest real-time speech-to-text transcription using the Pulse API.
The Waves Speech To Text (STT) stack processes audio via https://waves-api.smallest.ai/api/v1/pulse/get_text and returns low-latency transcripts with configurable languages, formats, and pricing tiers suited for enterprise deployments.

Feature highlights

Our models specialize in processing audio to preserve information that is often lost during conventional speech to text conversion.
Support for 32+ languages with automatic language detection or ISO 639-1 codes (en, hi, etc.). Use language=multi to enable automatic language detection across all supported languages.
Get precise timing information for each word in the transcription. Enables caption generation, subtitle tracks, and time-based search within audio content.
Receive sentence-level transcription segments with timing information. Perfect for displaying readable captions, synchronizing larger chunks of audio, or storing structured call summaries.
Identify and separate generated text into speaker turns. Automatically label different speakers in multi-speaker audio, enabling speaker-attributed transcription.
Estimate the age group and detect the gender of each speaker alongside transcription. Provides demographic insights for analytics and content analysis.
Detect emotional tone in transcribed speech with strength indicators for 5 core emotion types. Analyze sentiment and emotional context in conversations.
Automatically redact personally identifiable information (names, addresses, phone numbers) and payment card information (credit cards, CVV, account numbers) to protect privacy and ensure compliance.
Get cumulative transcript received up to this point in responses where is_final is true. Maintain complete session transcripts for conversation logs and real-time monitoring.
Streaming pipeline tuned for ~64 ms time to first transcript latency. Optimized for real-time transcription with minimal delay.

Supported languages

LanguageCodePre-RecordedReal-Time
ItalianitYesYes
SpanishesYesYes
EnglishenYesYes
PortugueseptYesYes
HindihiYesYes
GermandeYesYes
FrenchfrYesYes
UkrainianukYesYes
RussianruYesYes
KannadaknYesYes
MalayalammlYesYes
PolishplYesYes
MarathimrYesYes
GujaratiguYesYes
CzechcsYesYes
SlovakskYesYes
TeluguteYesYes
Oriya (Odia)orYesYes
DutchnlYesYes
BengalibnYesYes
LatvianlvYesYes
EstonianetYesYes
RomanianroYesYes
PunjabipaYesYes
FinnishfiYesYes
SwedishsvYesYes
BulgarianbgYesYes
TamiltaYesYes
HungarianhuYesYes
DanishdaYesYes
LithuanianltYesYes
MaltesemtYesYes
Use language=multi to auto-detect across the full list or specify one of the codes above to pin the model to a single language.

Next steps