Returns the expected length of the converted speech given text input.
text-to-speech API.duration value is returned as a result, not audio.voice_settings.speed changes the length, it’s better to test with a fixed speech speed.| Item | Required | Description |
|---|---|---|
text | Yes | Text to analyze. Maximum 300 characters |
language | Yes | Text language. One of ko, en, ja |
style | No | Emotional style. Default style is used if not specified |
model | No | Default is sona_speech_1. Currently only this model is available |
voice_settings | No | Speech speed or pitch adjustment values. May affect result length |
The text to convert to speech. Max length is 300 characters.
300Language code of the voice
en, ko, ja, bg, cs, da, el, es, et, fi, hu, it, nl, pl, pt, ro, ar, de, fr, hi, id, ru, vi The style of character to use for the text-to-speech conversion
The model type to use for the text-to-speech conversion
sona_speech_1, sona_speech_2, supertonic_api_1 The desired output format of the audio file (wav, mp3). Default is wav.
wav, mp3 Returns predicted duration of the audio in seconds