Documentation Index
Fetch the complete documentation index at: https://docs.supertoneapi.com/llms.txt
Use this file to discover all available pages before exploring further.
New TTS model: supertonic_api_3
The Supertone API now supports supertonic_api_3, the next-generation successor to supertonic_api_1. It keeps the ultra-low latency profile of its predecessor while dramatically improving speech stability and expanding language coverage.What’s new
-
supertonic_api_3— Successor tosupertonic_api_1- Inherits the ultra-low latency profile of
supertonic_api_1 - Significantly improved speech stability — far fewer pronunciation and reading errors
- Trained differently from the open-weights Supertonic 3 release, so this is a distinct model
- Supports 31 languages:
en,ko,ja,ar,bg,cs,da,de,el,es,et,fi,fr,hi,hr,hu,id,it,lt,lv,nl,pl,pt,ro,ru,sk,sl,sv,tr,uk,vi - Voice settings:
speedonly (same assupertonic_api_1)
- Inherits the ultra-low latency profile of
-
supertonic_api_1is now the legacy supertonic model- Still available for existing integrations
- New projects should use
supertonic_api_3
-
Model selection expanded — TTS requests now accept
sona_speech_2,sona_speech_2_flash,supertonic_api_3,supertonic_api_1, andsona_speech_1.
New TTS models: sona_speech_2, sona_speech_2_flash, and supertonic_api_1
The Supertone API now supports three new speech synthesis models, expanding multilingual capabilities and offering more flexible generation options.What’s new
-
sona_speech_2— Multilingual flagship- Supports 23 languages
- Improved speech quality and stability
- Supports
include_phonemesandnormalized_text - Voice settings: all parameters except
subharmonic_amplitude_control
-
sona_speech_2_flash— Low-latency variant- Same 23 languages as
sona_speech_2 - Optimized for lower latency
- Voice settings:
pitch_shift,pitch_variance,speed,duration
- Same 23 languages as
-
supertonic_api_1(beta) — Lightweight low-latency model- Supports
en,ko,ja,es,pt - Designed for fastest inference and simplest setup
- Voice settings:
speedonly
- Supports
-
Model selection expanded — TTS requests now accept
sona_speech_2,sona_speech_2_flash,supertonic_api_1, andsona_speech_1.
Custom voices and streaming TTS (beta)
Several new features have been added to the Supertone API. In this release, Preset Voices and Custom Voices have been separated, and voice cloning via API is now supported. A beta version of streaming TTS has also been released.What’s new
-
Preset vs custom voice separation
- Preset voices are designed and provided by Supertone.
- Custom voices are clones registered and managed by the customer.
- New endpoints under
/v1/custom-voices/...for managing customer-owned voices.
-
Voice cloning via the API
- Voice cloning, previously only available in Supertone Play, is now supported via API.
- See Create cloned voice.
-
Streaming TTS (beta)
- A beta version of the streaming TTS API is available, ideal for real-time applications such as AI chatbots and character-based chat.
- See Stream speech.
Supertone API grand open
The Supertone API has officially launched after completing its closed beta phase. This release features a complete overhaul of the service structure and API usability, evolving into a more intuitive and scalable voice generation platform through integration with Play and unification of the voice system.
Major changes
-
Voice ID structure redesign
- Changed from
{character-language-style}combination → unified to character-basedvoice_id. languageandstyleparameters moved into the request body for flexible calling.
- Changed from
-
New TTS model
- Replaced
proandturbo→ unified into a singlesona_speech_1model. - Improved speech quality; text length limit increased from 200 to 300 characters.
- Replaced
-
Cloned voice API support
- Cloned voices created in Play can now be called via the API.
- Automatic integration within the same account, with permission-based security.
-
Play and pricing integration
- API usage applies Play’s credit system directly.
- Shared credits and unified billing — no separate pricing plans.
-
Enhanced voice search
- Filtering and exploration via
GET /voicesandGET /voices/search. - Each voice exposes supported languages, styles, and samples.
- Filtering and exploration via
-
Predict Duration API
- Estimate speech length before generation — no credits deducted.
-
New user guide and documentation
- Practical documentation including Quickstart, Voice Selection, Error Handling, and FAQ.
- Complete documentation renewal focused on the Play ↔ API integration flow.
Voice search update
New search features make it easier to find the voices you want.What’s new
- New endpoint for searching voices with filters and keywords.
- Filter by language, gender, age, and use case.
- Keyword search across voice names and descriptions.
- Page-based pagination for large result sets.

Supertone API closed beta launch
Welcome to the Supertone API closed beta — Supertone’s voice AI technology as a service.Key features
- Text-to-speech voice synthesis service.
- Voice list inquiry and usage check APIs.
Endpoints
- List voices — retrieve available voices.
- Create speech — convert text to speech.
- Get voice usage — check API usage.
Getting started
- Sign up at console.supertoneapi.com.
- Apply for closed beta participation.
- Issue an API key from the console.
- Integrate and use the API.
Console features
- API key issuance and management.
- Minute-based API usage monitoring.