Release notes - Supertone API Documentation

2026-06-16

CLI & MCP

Command-line interface and MCP server

Two new ways to use Supertone TTS beyond the SDKs: a terminal CLI and an MCP server for AI agents. Both wrap the same Text-to-Speech API.

What’s new

Supertone CLI
- Synthesize from strings, files, or stdin; stream to your speakers; batch whole directories; and manage voices, usage, and credits — all from the terminal.
- Install: `pip install supertone-cli`
- See CLI.
Supertone MCP server
- Exposes the TTS API as composable Model Context Protocol tools, so AI agents in Claude, Cursor, Claude Code, VS Code, and Windsurf can discover voices, estimate cost, clone voices, and synthesize speech.
- Run: `uvx supertone-mcp`
- See MCP.
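
As a rough illustration of how an agent client might be pointed at the MCP server, the sketch below prints a JSON entry built around the `uvx supertone-mcp` command given above. The `mcpServers` layout is a convention used by several MCP clients, not something these notes specify, and the `supertone` label is an arbitrary, hypothetical name. The config file location, the exact key your client expects, and how the API key is supplied are not covered here, so check the MCP page and your client's documentation.

```python
import json

# Launch command taken from the notes above: `uvx supertone-mcp`.
# ASSUMPTION: the "mcpServers" wrapper and the "supertone" label are
# illustrative; your MCP client may expect a different layout.
config = {
    "mcpServers": {
        "supertone": {
            "command": "uvx",
            "args": ["supertone-mcp"],
        }
    }
}

# Print the entry so it can be pasted into a client's MCP config file.
print(json.dumps(config, indent=2))
```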

2026-05-19

v1.4.0

New TTS model: `supertonic_api_3`

The Supertone API now supports supertonic_api_3, the next-generation successor to supertonic_api_1. It keeps the ultra-low latency profile of its predecessor while dramatically improving speech stability and expanding language coverage.

What’s new

supertonic_api_3 — Successor to supertonic_api_1
- Inherits the ultra-low latency profile of supertonic_api_1
- Significantly improved speech stability — far fewer pronunciation and reading errors
- Trained differently from the open-weights Supertonic 3 release, so this is a distinct model
- Supports 31 languages: en, ko, ja, ar, bg, cs, da, de, el, es, et, fi, fr, hi, hr, hu, id, it, lt, lv, nl, pl, pt, ro, ru, sk, sl, sv, tr, uk, vi
- Voice settings: speed only (same as supertonic_api_1)
supertonic_api_1 is now the legacy supertonic model
- Still available for existing integrations
- New projects should use supertonic_api_3
Model selection expanded — TTS requests now accept sona_speech_2, sona_speech_2_flash, supertonic_api_3, supertonic_api_1, and sona_speech_1.

See Models for full comparison and selection guidance.
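
When migrating from supertonic_api_1, it can help to check language coverage up front. The sketch below hard-codes the language lists from these notes (31 codes for supertonic_api_3, and the five codes listed for supertonic_api_1 in v1.3.0). The helper name `supports_language` is hypothetical and not part of any SDK.

```python
# Language codes listed for supertonic_api_3 in the v1.4.0 notes.
SUPERTONIC_API_3_LANGUAGES = frozenset(
    "en ko ja ar bg cs da de el es et fi fr hi hr hu id it lt lv "
    "nl pl pt ro ru sk sl sv tr uk vi".split()
)

# Language codes listed for supertonic_api_1 in the v1.3.0 notes.
SUPERTONIC_API_1_LANGUAGES = frozenset({"en", "ko", "ja", "es", "pt"})

# Languages available in supertonic_api_3 but not in supertonic_api_1.
NEW_LANGUAGES = SUPERTONIC_API_3_LANGUAGES - SUPERTONIC_API_1_LANGUAGES


def supports_language(language: str) -> bool:
    """Hypothetical helper: True if supertonic_api_3 lists this language code."""
    return language in SUPERTONIC_API_3_LANGUAGES
```

Every language listed for supertonic_api_1 also appears in the supertonic_api_3 list, so on language coverage alone an existing integration should not lose anything by switching.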

2025-12-08

v1.3.0

New TTS models: `sona_speech_2`, `sona_speech_2_flash`, and `supertonic_api_1`

The Supertone API now supports three new speech synthesis models, expanding multilingual capabilities and offering more flexible generation options.

What’s new

sona_speech_2 — Multilingual flagship
- Supports 23 languages
- Improved speech quality and stability
- Supports include_phonemes and normalized_text
- Voice settings: all parameters except subharmonic_amplitude_control
sona_speech_2_flash — Low-latency variant
- Same 23 languages as sona_speech_2
- Optimized for lower latency
- Voice settings: pitch_shift, pitch_variance, speed, duration
supertonic_api_1 (beta) — Lightweight low-latency model
- Supports en, ko, ja, es, pt
- Designed for fastest inference and simplest setup
- Voice settings: speed only
Model selection expanded — TTS requests now accept sona_speech_2, sona_speech_2_flash, supertonic_api_1, and sona_speech_1.

See Models for full comparison and selection guidance.
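
Because each model accepts a different set of voice settings, a client may want to drop unsupported settings before sending a request. The sketch below encodes only what these notes state: an allow-list where the settings are enumerated, and a deny-list for sona_speech_2, whose full parameter list is not given here. The helper name `filter_voice_settings` is hypothetical, and models not covered in this entry (such as sona_speech_1) pass through unfiltered.

```python
# Settings each model accepts, where the notes enumerate them.
ALLOWED_SETTINGS = {
    "sona_speech_2_flash": {"pitch_shift", "pitch_variance", "speed", "duration"},
    "supertonic_api_1": {"speed"},
}

# sona_speech_2 accepts all parameters except the ones listed here.
DENIED_SETTINGS = {
    "sona_speech_2": {"subharmonic_amplitude_control"},
}


def filter_voice_settings(model: str, settings: dict) -> dict:
    """Hypothetical helper: drop settings the given model does not accept."""
    if model in ALLOWED_SETTINGS:
        allowed = ALLOWED_SETTINGS[model]
        return {k: v for k, v in settings.items() if k in allowed}
    # Models without an allow-list: remove only explicitly denied settings.
    denied = DENIED_SETTINGS.get(model, set())
    return {k: v for k, v in settings.items() if k not in denied}
```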

2025-07-31

v1.2.0

Custom voices and streaming TTS (beta)

This release separates preset voices from custom voices, adds voice cloning via the API, and introduces a beta version of streaming TTS.

What’s new

Preset vs custom voice separation
- Preset voices are designed and provided by Supertone.
- Custom voices are clones registered and managed by the customer.
- New endpoints under /v1/custom-voices/... for managing customer-owned voices.
Voice cloning via the API
- Voice cloning, previously available only in Supertone Play, is now supported via the API.
- See Create cloned voice.
Streaming TTS (beta)
- A beta version of the streaming TTS API is available, ideal for real-time applications such as AI chatbots and character-based chat.
- See Stream speech.

2025-05-28

v1.1.0

Supertone API grand opening

The Supertone API has officially launched after completing its closed beta. This release overhauls the service structure and API usability: it integrates with Play and unifies the voice system, making the platform more intuitive and scalable.

Major changes

Voice ID structure redesign
- Voice IDs changed from a {character-language-style} combination to a single character-based voice_id.
- The language and style parameters moved into the request body, so they can be set per call.
New TTS model
- The pro and turbo models are replaced by a single sona_speech_1 model.
- Improved speech quality; text length limit increased from 200 to 300 characters.
Cloned voice API support
- Cloned voices created in Play can now be called via the API.
- Automatic integration within the same account, with permission-based security.
Play and pricing integration
- API usage is charged directly through Play’s credit system.
- Shared credits and unified billing — no separate pricing plans.
Enhanced voice search
- Filtering and exploration via GET /voices and GET /voices/search.
- Each voice exposes supported languages, styles, and samples.
Predict Duration API
- Estimate speech length before generation — no credits deducted.
New user guide and documentation
- Practical documentation including Quickstart, Voice Selection, Error Handling, and FAQ.
- Documentation fully rewritten around the Play ↔ API integration flow.
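
Two of the changes above affect how requests are built: language and style now travel in the request body, and text is limited to 300 characters. The sketch below splits longer text into chunks within that limit and builds one body per chunk. The `text` field name and the helper names are assumptions for illustration; language, style, and the sona_speech_1 model name come from these notes. How the voice_id is passed (for example, in the URL) is not shown here.

```python
MAX_CHARS = 300  # text length limit stated in the v1.1.0 notes


def split_text(text: str, limit: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most `limit` characters, breaking at spaces."""
    chunks = []
    current = ""
    for word in text.split():
        # A single word longer than the limit has to be hard-split.
        while len(word) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(word[:limit])
            word = word[limit:]
        candidate = f"{current} {word}" if current else word
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks


def build_bodies(text: str, language: str, style: str) -> list[dict]:
    """Hypothetical helper: one request body per chunk of text."""
    return [
        # ASSUMPTION: the "text" and "model" field names are illustrative.
        {"text": chunk, "language": language, "style": style, "model": "sona_speech_1"}
        for chunk in split_text(text)
    ]
```

Splitting at spaces keeps words intact, but it ignores sentence boundaries, so prosody may differ from a single request. Splitting on punctuation first would be a reasonable refinement.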

2025-01-23

v1.0.1

Voice search update

New search features make it easier to find the voices you want.

What’s new

New endpoint for searching voices with filters and keywords.
Filter by language, gender, age, and use case.
Keyword search across voice names and descriptions.
Page-based pagination for large result sets.

See Search voices.
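
Filters and page-based pagination can be combined in a small loop, sketched below. The query parameter names (including `page`), the first page number, and the "stop on an empty page" rule are all assumptions, not taken from the API reference. The HTTP call is injected as `fetch_page` so the example stays self-contained.

```python
from typing import Callable, Iterator
from urllib.parse import urlencode


def build_search_query(page: int, **filters: str) -> str:
    """Encode filters and the page number as a query string, skipping empty filters."""
    params = {key: value for key, value in filters.items() if value}
    params["page"] = str(page)  # ASSUMPTION: the parameter is named "page"
    return urlencode(params)


def iter_voices(
    fetch_page: Callable[[str], list[dict]], **filters: str
) -> Iterator[dict]:
    """Yield voices page by page until a page comes back empty.

    `fetch_page` is a caller-supplied function that takes a query string,
    performs the search request, and returns the voices on that page.
    """
    page = 1  # ASSUMPTION: pages are numbered from 1
    while True:
        voices = fetch_page(build_search_query(page, **filters))
        if not voices:
            return
        yield from voices
        page += 1
```

In a real client, `fetch_page` would wrap the HTTP request to the search endpoint and extract the list of voices from the response. The response may also carry its own pagination metadata, which would be a more reliable stop condition than an empty page.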

2024-12-23

v1.0.0

Supertone API closed beta launch

Welcome to the Supertone API closed beta — Supertone’s voice AI technology as a service.

Key features

Text-to-speech voice synthesis service.
APIs for listing voices and checking usage.

Endpoints

List voices — retrieve available voices.
Create speech — convert text to speech.
Get voice usage — check API usage.

Getting started

Sign up at console.supertoneapi.com.
Apply for closed beta participation.
Issue an API key from the console.
Integrate and use the API.

Console features

API key issuance and management.
Minute-based API usage monitoring.

For more details, refer to the docs.
