Skip to main content

ElevenLabs Text-to-Speech

Generate high-quality, expressive speech from text using ElevenLabs' advanced AI voice synthesis technology.

Supported Models

  • elevenlabs-tts: ElevenLabs Text-to-Speech service

Parameters

ParameterTypeRequiredDescription
textstringYesText to convert to speech
voice_idstringYesElevenLabs voice ID
model_idstringNoModel to use (default: eleven_multilingual_v2)
voice_settingsobjectNoVoice customization settings
voice_settings.stabilitynumberNoVoice stability (0.0 to 1.0, default: 0.5)
voice_settings.similarity_boostnumberNoVoice similarity boost (0.0 to 1.0, default: 0.5)
voice_settings.stylenumberNoVoice style exaggeration (0.0 to 1.0, default: 0.0)
voice_settings.use_speaker_boostbooleanNoEnable speaker boost (default: true)
output_formatstringNoAudio output format (default: mp3_44100_128)
optimize_streaming_latencynumberNoStreaming optimization (0 to 4, default: 0)
language_codestringNoLanguage code for multilingual models
  • pNInz6obpgDQGcFmaJgB - Adam (Male, Deep)
  • EXAVITQu4vr4xnSDxMaL - Bella (Female, Soft)
  • VR6AewLTigWG4xSOukaG - Arnold (Male, Crisp)
  • MF3mGyEYCl7XYWbV9V6O - Elli (Female, Emotional)
  • TxGEqnHWrfWFTfGW9XjX - Josh (Male, Deep)
  • rLHLTH8sB0ag3Qk4lD8q - Liam (Male, Calm)
  • AZnzlk1XvdvUeBnXmlld - Domi (Female, Strong)
  • CYw3kZ02Hs0563khs1Fj - Dave (Male, Conversational)
  • 9BWtsMINqrJLrRacOk9x - Aria (Female, Expressive)
  • pqHfZKP75CvOlQylNhV4 - Bill (Male, Documentary)

Model Options

  • eleven_multilingual_v2 - High-quality multilingual model (default)
  • eleven_turbo_v2_5 - Fast generation with good quality
  • eleven_flash_v2_5 - Fastest generation
  • eleven_monolingual_v1 - English-only, highest quality

Output Format Options

  • mp3_44100_128 - MP3, 44.1kHz, 128kbps (default)
  • mp3_22050_32 - MP3, 22.05kHz, 32kbps
  • mp3_44100_192 - MP3, 44.1kHz, 192kbps
  • mp3_44100_32 - MP3, 44.1kHz, 32kbps
  • mp3_44100_64 - MP3, 44.1kHz, 64kbps
  • mp3_44100_96 - MP3, 44.1kHz, 96kbps

Language Codes (for Multilingual Models)

  • en - English
  • es - Spanish
  • fr - French
  • de - German
  • it - Italian
  • pt - Portuguese
  • pl - Polish
  • tr - Turkish
  • ru - Russian
  • nl - Dutch
  • cs - Czech
  • ar - Arabic
  • zh - Chinese
  • ja - Japanese
  • hi - Hindi
  • ko - Korean

Request Example

API Playground

https://api.1min.ai/api/features

Generated cURL Command:

curl -X POST "https://api.1min.ai/api/features" \
-H "API-KEY: <your-api-key>" \
-H "Content-Type: application/json" \
-d '{
"type": "TEXT_TO_SPEECH",
"model": "elevenlabs-tts",
"conversationId": "TEXT_TO_SPEECH",
"promptObject": {
"text": "Hello, this is a sample text to speech conversion using ElevenLabs.",
"voice_id": "pNInz6obpgDQGcFmaJgB",
"model_id": "eleven_multilingual_v2",
"voice_settings": {
"stability": 0.5,
"similarity_boost": 0.5,
"style": 0,
"use_speaker_boost": true
},
"output_format": "mp3_44100_128",
"optimize_streaming_latency": 0,
"language_code": "en"
}
}'

Response Format

The API returns an audio file in the specified format. The response will be a binary audio stream.

{}