Introduction to Supertone API
Supertone API is a service that generates natural and emotionally rich AI voices using Supertone’s deep learning technology.
Introduction to NANSY Model
NANSY(Neural Analysis and Synthesis) model-based voice AI technology from Supertone generates high-quality voices that are difficult to distinguish from real human voices. NANSY is an integrated neural network framework designed to perform voice-related generation tasks. This model serves as the foundation for various downstream tasks such as voice and song synthesis, voice conversion, and voice design. Through its integrated structure, it maintains consistent voice characteristics during the generation process, and can express anyone’s voice through the control of four individual elements.
How to Generate Voice
To generate Supertone’s high-quality AI voices, you need to use the Supertone API. The voice generation process through the API is as follows:
Get API Key
After signing up for Supertone API service, please apply for closed beta at the Console Page. Once your application is approved, you can get your API Key from the console page.
Select Voice
You can either call the Get Voices API to view the list of available voices or sign up for Supertone Play to test all voices free for 2 weeks. Once you find a voice you like, copy its ID and input it as an API call parameter.
Generate Voice
You can generate AI voice from text by calling Supertone’s Text-to-speech API.
Use the Results
Download the generated voice file or play it via streaming. You can use it for content creation and various other applications.
Key Features
1. High-Quality Voice Synthesis
Supertone’s AI voice synthesis technology provides natural intonation and rich emotional expression.
Key Features
- Generate voices with natural intonation and prosody
- Express various emotions and nuances
- Choose file formats according to user preferences
wav
: For lossless high-quality audio needsmp3
: For efficient file size in general use
2. Multi-language Support
Supertone API provides support for various languages for global services.
Supported Languages
- Korean(ko)
- Japanese(ja)
- English(en)
Each language provides pronunciation and intonation optimized for that language, and we plan to continuously expand our language support.
3. Various Voices
We provide voices suitable for various characters and situations through our rich voice portfolio.
Voice Classification Tags
- Gender
- Male
- Female
- Age Group
- Child
- Young
- Middle-aged
- Elderly
- Style
- Support for voice-specific emotion expression
- Various speaking styles suitable for different situations and content
- Unique timbres that embody character traits
Each voice can express unique characteristics and emotions matching its character, and you can test the actual voices free for 2 weeks by signing up for Supertone Play service.