Guides the request structure and parameter usage methods, and error precautions for the TTS API that converts text to speech.
{voice_id}
: Only character-based IDs can be usedlanguage
, style
, model
are included in the Request BodyItem | Required | Description |
---|---|---|
text | ✅ | Text to convert. Up to 300 characters can be input |
language | ✅ | Language of the text. One of ko , en , ja |
style | ❌ | Emotional style. E.g., neutral , happy , sad , etc. If not specified, the character’s default style is applied |
model | ❌ | Model to use. Default is sona_speech_1 . Currently only this model is available |
voice_settings | ❌ | Pitch/speed adjustment. Includes pitch_shift , pitch_variance , speed fields (defaults: 0, 1, 1) |
output_format | ❌ | Desired audio file format. wav or mp3 . (Default: wav ) |
wav
output_format=mp3
as query parameterX-Audio-Length
header400 error occurs when text length exceeds 300 characters
.style
, but default styles may vary by character, so please call Get Voices API to check the default style (the first value in the styles array is the default).API key for the service
The desired output format of the audio file (wav, mp3). Default is wav.
wav
, mp3
Audio file converted from text. The response includes an X-Audio-Length header with the duration in seconds.
The response is of type file
.