Skip to main content

Audio Translator

Translate spoken audio from any language to your target language using advanced AI models. Choose from multiple models including OpenAI's Whisper for audio transcription and translation, or Google's Gemini models for advanced translation with tone and style controls.

Endpoint

Authentication

FieldValue
API-KEY<your-api-key>
Content-Typeapplication/json

Supported Models

OpenAI Models

  • whisper-1 - Whisper model for transcription and translation to English

Google AI Models

  • gemini-1.5-flash - Fast Gemini model with multilingual translation
  • gemini-1.5-pro - Advanced Gemini model with superior quality
  • gemini-1.0-pro - Original Gemini model
  • gemini-2.0-flash - Latest fast Gemini model
  • gemini-2.0-flash-lite - Lightweight version of Gemini 2.0 Flash
  • gemini-2.5-pro - Most advanced Gemini model
  • gemini-2.5-flash - Fast version of Gemini 2.5

Interactive API Explorer

API Playground

https://api.1min.ai/api/features

Generated cURL Command:

curl -X POST "https://api.1min.ai/api/features" \
-H "API-KEY: <your-api-key>" \
-H "Content-Type: application/json" \
-d '{
"type": "AUDIO_TRANSLATOR",
"model": "whisper-1",
"conversationId": "AUDIO_TRANSLATOR",
"promptObject": {
"audioUrl": "path/to/your/audio.mp3",
"response_format": "text",
"temperature": 0
}
}'

Request Parameters

Base Parameters (All Models)

ParameterTypeDescriptionRequired
typestringMust be AUDIO_TRANSLATOR
modelstringAI model to use
conversationIdstringConversation identifier
promptObject.audioUrlstringPath to the audio file to translate

Whisper Model Parameters

ParameterTypeDescriptionRequired
promptObject.promptstringOptional prompt to guide the translation
promptObject.response_formatstringOutput format: text, json, srt, verbose_json, vtt
promptObject.temperaturenumberRandomness (0-1). Lower values for more focused output

Gemini Model Parameters

ParameterTypeDescriptionRequired
promptObject.originalLanguagestringSource language code (e.g., en, es, fr)
promptObject.targetLanguagestringTarget language code for translation
promptObject.domainstringContent domain for context-aware translation
promptObject.tonestringDesired tone of the translation
promptObject.writingStylestringWriting style for the translation

Response Format

{
"data": {
"translatedText": "The translated audio content..."
}
}

Model Comparison

FeatureWhisper-1Gemini Models
Primary UseEnglish translation onlyMulti-language translation
Input LanguagesAny languageAny supported language
Output LanguagesEnglish only60+ languages
Advanced ControlsFormat & temperatureTone, style & domain
File Size Limit25MBVaries
Output FormatsText, JSON, SRT, VTTText

Use Cases

  • Multilingual Content Creation: Translate podcasts and videos for global audiences
  • Meeting Documentation: Convert recorded meetings into translated transcripts
  • Educational Content: Translate lectures and presentations for international students
  • Media Localization: Adapt audio content for different markets with appropriate tone
  • Accessibility: Create translated transcripts for hearing-impaired users