Google Text-to-Speech

Generate natural-sounding speech from text using Google Cloud Text-to-Speech with neural voices and advanced customization options.

Supported Models

  • google-tts: Google Cloud Text-to-Speech service

Parameters

Parameter      Type    Required  Description
text           string  Yes       Text to convert to speech
languageCode   string  Yes       BCP-47 language code (e.g., 'en-US', 'es-ES')
name           string  Yes       Voice name (e.g., 'en-US-Standard-A')
ssmlGender     string  No        Voice gender preference
speakingRate   number  No        Speaking rate (0.25 to 4.0, default: 1.0)
pitch          number  No        Voice pitch (-20.0 to 20.0, default: 0.0)
volumeGainDb   number  No        Volume gain in dB (-96.0 to 16.0, default: 0.0)
audioEncoding  string  No        Audio encoding format (default: MP3)
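
The documented ranges and defaults can be checked on the client before a request is sent. The following is a minimal Python sketch, not part of the API itself: the helper name build_prompt_object is hypothetical, and it assumes that the range bounds in the table are inclusive and that ssmlGender may simply be omitted when not set.

```python
from typing import Optional

# Ranges copied from the parameter table. Treating the bounds as
# inclusive is an assumption, not something the page states.
RANGES = {
    "speakingRate": (0.25, 4.0),
    "pitch": (-20.0, 20.0),
    "volumeGainDb": (-96.0, 16.0),
}


def build_prompt_object(
    text: str,
    language_code: str,
    name: str,
    ssml_gender: Optional[str] = None,
    speaking_rate: float = 1.0,
    pitch: float = 0.0,
    volume_gain_db: float = 0.0,
    audio_encoding: str = "MP3",
) -> dict:
    """Build the promptObject dictionary (hypothetical helper)."""
    # The three required parameters must be non-empty strings.
    for label, value in (("text", text), ("languageCode", language_code), ("name", name)):
        if not value:
            raise ValueError(f"{label} is required")

    prompt = {
        "text": text,
        "languageCode": language_code,
        "name": name,
        "speakingRate": speaking_rate,
        "pitch": pitch,
        "volumeGainDb": volume_gain_db,
        "audioEncoding": audio_encoding,
    }

    # Reject numeric values outside the documented ranges.
    for key, (low, high) in RANGES.items():
        if not low <= prompt[key] <= high:
            raise ValueError(f"{key} must be between {low} and {high}")

    # Optional field: only included when the caller supplies it.
    if ssml_gender is not None:
        prompt["ssmlGender"] = ssml_gender
    return prompt
```

Validating locally gives an immediate, readable error for an out-of-range value instead of relying on whatever error the service returns.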

Key Features

  • 49 supported language codes including major world languages
  • Multiple voice types: Standard, WaveNet (neural), Neural2, News, Studio, Journey, Casual, and Polyglot voices
  • Voice customization with pitch, speaking rate, and volume controls
  • Multiple audio formats supported

Request Example

Endpoint: POST https://api.1min.ai/api/features

cURL command:

curl -X POST "https://api.1min.ai/api/features" \
  -H "API-KEY: <your-api-key>" \
  -H "Content-Type: application/json" \
  -d '{
    "type": "TEXT_TO_SPEECH",
    "model": "google-tts",
    "conversationId": "TEXT_TO_SPEECH",
    "promptObject": {
      "text": "Hello, this is a sample text to speech conversion using Google Cloud.",
      "languageCode": "en-US",
      "name": "en-US-Standard-A",
      "ssmlGender": "FEMALE",
      "speakingRate": 1,
      "pitch": 0,
      "volumeGainDb": 0,
      "audioEncoding": "MP3"
    }
  }'
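
The same request can be sketched in Python using only the standard library. This mirrors the cURL command field for field; the helper names build_request and synthesize are hypothetical, and the 30-second timeout is an assumption rather than a documented limit.

```python
import json
import urllib.request

API_URL = "https://api.1min.ai/api/features"


def build_request(api_key: str, prompt_object: dict) -> urllib.request.Request:
    """Assemble the POST request without sending it (hypothetical helper)."""
    body = {
        "type": "TEXT_TO_SPEECH",
        "model": "google-tts",
        "conversationId": "TEXT_TO_SPEECH",
        "promptObject": prompt_object,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={"API-KEY": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def synthesize(api_key: str, prompt_object: dict, timeout: float = 30.0) -> bytes:
    """Send the request and return the raw response body."""
    request = build_request(api_key, prompt_object)
    # urlopen raises urllib.error.HTTPError for 4xx/5xx responses.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()
```

A call would look like synthesize("<your-api-key>", {"text": "Hello", "languageCode": "en-US", "name": "en-US-Standard-A"}). Keeping request construction separate from sending makes the payload easy to inspect without touching the network.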

Response Format

The API returns the synthesized audio as a binary stream, encoded in the format given by audioEncoding (MP3 by default).
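
Because the response is described as a binary stream, its bytes can be written to disk unchanged. A minimal sketch follows; save_audio is a hypothetical helper, and the extension mapping is an assumption: only MP3 is named on this page, while LINEAR16 and OGG_OPUS are Google Cloud Text-to-Speech encoding names that this endpoint may or may not accept.

```python
from pathlib import Path

# Assumed mapping from encoding name to file extension; only MP3 is
# confirmed by this page. Unknown encodings fall back to ".bin".
EXTENSIONS = {"MP3": ".mp3", "LINEAR16": ".wav", "OGG_OPUS": ".ogg"}


def save_audio(data: bytes, stem: str, audio_encoding: str = "MP3") -> Path:
    """Write the raw response bytes to '<stem><extension>' and return the path."""
    if not data:
        raise ValueError("empty response body; no audio to save")
    path = Path(stem + EXTENSIONS.get(audio_encoding, ".bin"))
    # The bytes are written as received, with no decoding or re-encoding.
    path.write_bytes(data)
    return path
```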