Skip to main content

Audio Translator

Translate spoken audio from any language to your target language using advanced AI models. Choose from multiple models including OpenAI's Whisper for audio transcription and translation, or Google's Gemini models for advanced translation with tone and style controls.

Endpoint

Authentication

FieldValue
API-KEY<your-api-key>
Content-Typeapplication/json

Supported Models

Loading available models...

Interactive API Explorer

Loading interactive playground...

Request Parameters

Base Parameters (All Models)

ParameterTypeDescriptionRequired
typestringMust be AUDIO_TRANSLATOR
modelstringAI model to use
conversationIdstringConversation identifier
promptObject.audioUrlstringPath to the audio file to translate

Whisper Model Parameters

ParameterTypeDescriptionRequired
promptObject.promptstringOptional prompt to guide the translation
promptObject.response_formatstringOutput format: text, json, srt, verbose_json, vtt
promptObject.temperaturenumberRandomness (0-1). Lower values for more focused output

Gemini Model Parameters

ParameterTypeDescriptionRequired
promptObject.originalLanguagestringSource language code (e.g., en, es, fr)
promptObject.targetLanguagestringTarget language code for translation
promptObject.domainstringContent domain for context-aware translation
promptObject.tonestringDesired tone of the translation
promptObject.writingStylestringWriting style for the translation

Response Format

{}

Model Comparison

FeatureWhisper-1Gemini Models
Primary UseEnglish translation onlyMulti-language translation
Input LanguagesAny languageAny supported language
Output LanguagesEnglish only60+ languages
Advanced ControlsFormat & temperatureTone, style & domain
File Size Limit25MBVaries
Output FormatsText, JSON, SRT, VTTText

Use Cases

  • Multilingual Content Creation: Translate podcasts and videos for global audiences
  • Meeting Documentation: Convert recorded meetings into translated transcripts
  • Educational Content: Translate lectures and presentations for international students
  • Media Localization: Adapt audio content for different markets with appropriate tone
  • Accessibility: Create translated transcripts for hearing-impaired users