Qwen3 TTS Speech Synthesis API
Qwen’s speech synthesis model delivers human-like voices with natural expressiveness. It supports multiple languages and dialects, generates multilingual content using a single voice, and automatically adapts tone to handle complex text.
Overview
Qwen3-TTS is a text-to-speech model from the Qwen team at Alibaba Cloud. It turns text into natural-sounding speech in 10 languages: Chinese, English, Japanese, Korean, German, French, Russian, Portuguese, Spanish, and Italian. It includes three modes: Voice, Clone, and Design.
What you can do with it
Speech Synthesis:
Enter the text to synthesize and select one of the built-in system voices to generate speech. Mixed-language input is supported.
Voice Cloning:
Give the model 3 seconds of audio and it can reproduce that voice speaking any text you provide. Cloning works across languages too: clone a voice in English and use it to speak Chinese.
Design Voice:
Describe the voice you want in plain language and the model creates it. You might ask for “a warm storyteller voice with gentle pacing” or “a deep male voice with a British accent.” The model interprets your description and generates matching speech.
Control speech style:
Use natural language instructions to adjust how the speech sounds. You can control emotion, speaking speed, and tone. The model adapts its output based on the meaning of your text, placing pauses naturally and emphasizing the right words.
Query Task Status
After submitting a task, use the unified query endpoint to check progress and retrieve results:
Get Task Info
Related Resources
Models Overview
Common API
Authorizations
All APIs require authentication via API Key.
Get API Key:
- Visit API Key Management Page to get your API Key
Usage: Add to request header:
x-api-key: YOUR_API_KEY
Note:
- Keep your API Key secure and do not share it with others
- If you suspect your API Key has been compromised, reset it immediately in the management page
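As a minimal sketch of the header described above, the following Python helper builds the request headers. Only the `x-api-key` header name comes from this page; the helper name `build_headers`, the `Content-Type` header, and the placeholder key are assumptions for illustration.

```python
def build_headers(api_key: str) -> dict:
    """Return request headers carrying the API key.

    Only the x-api-key header is taken from the documentation; the
    Content-Type header is an assumption for JSON request bodies.
    """
    if not api_key or not api_key.strip():
        raise ValueError("API key must be a non-empty string")
    return {
        "x-api-key": api_key.strip(),
        "Content-Type": "application/json",  # assumed, not documented here
    }


# Placeholder value; substitute the key from the management page.
headers = build_headers("YOUR_API_KEY")
```

In practice it is usually safer to load the key from an environment variable or a secrets store than to hard-code it in source files.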
Body
The model name to use for generation. Required field.
- Must be qwen3-tts/speech-synthesis for this endpoint
qwen3-tts/speech-synthesis
Input parameters for the Qwen3 TTS speech synthesis task
Optional. Callback URL for receiving task completion notifications.
- The system will POST the task status and results to this URL when generation completes
- The callback payload structure is consistent with the data object returned by the task status query
- Your callback endpoint should accept POST requests with a JSON payload containing the results
- Your endpoint should return an HTTP 200 status code upon successful receipt
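The callback requirements above (accept a POST with a JSON payload, acknowledge with HTTP 200) can be sketched with the Python standard library. This is an illustrative receiver only: the class and function names, the in-memory `received` list, and the 400 response for malformed JSON are assumptions, and the payload fields are not interpreted because their structure is not described here.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


class CallbackHandler(BaseHTTPRequestHandler):
    """Accepts POSTed JSON callbacks and acknowledges them with HTTP 200."""

    received = []  # parsed payloads, kept in memory for illustration only

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        raw = self.rfile.read(length)
        try:
            payload = json.loads(raw or b"{}")
        except json.JSONDecodeError:
            # Assumption: reject bodies that are not valid JSON.
            self._reply(400)
            return
        CallbackHandler.received.append(payload)
        self._reply(200)  # signals successful receipt

    def _reply(self, status):
        self.send_response(status)
        self.send_header("Content-Length", "0")
        self.end_headers()

    def log_message(self, format, *args):
        pass  # keep the example quiet


def make_server(port: int = 0) -> HTTPServer:
    """Create the server; port 0 lets the OS pick a free port."""
    return HTTPServer(("127.0.0.1", port), CallbackHandler)
```

To try it locally, call `make_server(8000).serve_forever()` and POST a JSON body to it. A real deployment would sit behind HTTPS on a publicly reachable host and hand the payload to a queue or database instead of a list.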
"https://your-domain.com/api/callback"
Response
Request successful
Response status code
- 200: Success - Request has been processed successfully
- 401: Unauthorized - Authentication credentials are missing or invalid
- 402: Insufficient Credits - Account does not have enough credits to perform the operation
- 404: Not Found - The requested resource or endpoint does not exist
- 422: Validation Error - The request parameters failed validation checks
- 429: Rate Limited - Request limit has been exceeded for this resource
- 455: Service Unavailable - System is currently undergoing maintenance
- 500: Server Error - An unexpected error occurred while processing the request
- 501: Generation Failed - Content generation task failed
- 505: Feature Disabled - The requested feature is currently disabled
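The status codes listed above can be collected into a lookup table for client-side error handling. In this sketch the code-to-label mapping is taken from the list; the function names and the choice of which codes count as retryable are assumptions, since this page does not state a retry policy.

```python
# Labels copied from the documented status code list.
STATUS_MESSAGES = {
    200: "Success",
    401: "Unauthorized",
    402: "Insufficient Credits",
    404: "Not Found",
    422: "Validation Error",
    429: "Rate Limited",
    455: "Service Unavailable",
    500: "Server Error",
    501: "Generation Failed",
    505: "Feature Disabled",
}

# Assumption: transient conditions that may succeed on a later attempt.
RETRYABLE = {429, 455, 500}


def describe_status(code: int) -> str:
    """Return the documented label for a status code, or a fallback."""
    return STATUS_MESSAGES.get(code, "Unknown status")


def should_retry(code: int) -> bool:
    """Return True if the code is in the assumed retryable set."""
    return code in RETRYABLE
```

A client might log `describe_status(code)` on every non-200 response and back off before retrying when `should_retry(code)` is true.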
200, 401, 402, 404, 422, 429, 455, 500, 501, 505
Response message, error description when failed
"success"
