Basic Usage
{voice_id}
: Only character-level IDs are supported- Parameters such as
language
,style
, andmodel
must be included in the Request Body
Request Body Field Descriptions
Field | Required | Description |
---|---|---|
text | ✅ | Text to be converted. Up to 300 characters allowed |
language | ✅ | Language of the text. One of ko , en , or ja |
style | ❌ | Emotion style. e.g., neutral , happy , sad , etc. Defaults to character’s base style if unspecified |
model | ❌ | Model to use. Default is sona_speech_1 . Currently, only this model is supported |
voice_settings | ❌ | Controls pitch/speed. Includes pitch_shift , pitch_variance , and speed (default: 0, 1, 1) |
Usage Example
Response
- The response body is returned as a binary chunk, with the default format being
wav
Notes
- A 400 error will occur if the
text
length exceeds 300 characters. - The API can be called without specifying
style
, but the default style may vary by character.
Please use the Get Voices API to check the default (the first value in the style array is the default). - The returned audio file can be saved or played directly. (Appropriate handling may be required depending on the client.)
Headers
API key for the service
Path Parameters
Query Parameters
The desired output format of the audio file (wav, mp3). Default is wav.
Available options:
wav
, mp3
Body
application/json
Response
Streaming audio data in binary format
The response is of type file
.