Translate spoken audio from any language to your target language using advanced AI models. Choose from multiple models including OpenAI's Whisper for audio transcription and translation, or Google's Gemini models for advanced translation with tone and style controls.
Endpoint
Authentication
| Field | Value |
|---|
| API-KEY | <your-api-key> |
| Content-Type | application/json |
Supported Models
Loading available models...
Interactive API Explorer
Loading interactive playground...
Request Parameters
Base Parameters (All Models)
| Parameter | Type | Description | Required |
|---|
type | string | Must be AUDIO_TRANSLATOR | ✅ |
model | string | AI model to use | ✅ |
conversationId | string | Conversation identifier | ✅ |
promptObject.audioUrl | string | Path to the audio file to translate | ✅ |
Whisper Model Parameters
| Parameter | Type | Description | Required |
|---|
promptObject.prompt | string | Optional prompt to guide the translation | ❌ |
promptObject.response_format | string | Output format: text, json, srt, verbose_json, vtt | ❌ |
promptObject.temperature | number | Randomness (0-1). Lower values for more focused output | ❌ |
Gemini Model Parameters
| Parameter | Type | Description | Required |
|---|
promptObject.originalLanguage | string | Source language code (e.g., en, es, fr) | ✅ |
promptObject.targetLanguage | string | Target language code for translation | ✅ |
promptObject.domain | string | Content domain for context-aware translation | ❌ |
promptObject.tone | string | Desired tone of the translation | ❌ |
promptObject.writingStyle | string | Writing style for the translation | ❌ |
Model Comparison
| Feature | Whisper-1 | Gemini Models |
|---|
| Primary Use | English translation only | Multi-language translation |
| Input Languages | Any language | Any supported language |
| Output Languages | English only | 60+ languages |
| Advanced Controls | Format & temperature | Tone, style & domain |
| File Size Limit | 25MB | Varies |
| Output Formats | Text, JSON, SRT, VTT | Text |
Use Cases
- Multilingual Content Creation: Translate podcasts and videos for global audiences
- Meeting Documentation: Convert recorded meetings into translated transcripts
- Educational Content: Translate lectures and presentations for international students
- Media Localization: Adapt audio content for different markets with appropriate tone
- Accessibility: Create translated transcripts for hearing-impaired users