ElevenLabs Speech-to-Text

Convert audio files and speech to accurate text transcriptions using ElevenLabs' advanced AI speech recognition technology.

Note: Audio files must be uploaded using the Asset API before transcription. The audioUrl parameter must contain the path returned by the Asset API upload.

Supported Models

  • elevenlabs-speech-to-text: ElevenLabs Speech-to-Text transcription service

Parameters

Parameter             | Type   | Required | Description
type                  | string | Yes      | Feature type; must be "SPEECH_TO_TEXT"
model                 | string | Yes      | Model identifier; use "elevenlabs-speech-to-text"
promptObject.audioUrl | string | Yes      | Path to the audio file (uploaded via the Asset API)
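
The three parameters above map directly onto a small JSON body. The following is a minimal Python sketch of assembling it; build_payload is a hypothetical helper name used only for illustration, not part of any official SDK.

```python
import json


def build_payload(audio_url: str) -> str:
    """Return the JSON request body for a speech-to-text call.

    `audio_url` is expected to be the path returned by the Asset API upload.
    """
    if not audio_url:
        raise ValueError("audio_url must be the path returned by the Asset API")
    body = {
        "type": "SPEECH_TO_TEXT",               # fixed feature type
        "model": "elevenlabs-speech-to-text",   # the only model listed above
        "promptObject": {"audioUrl": audio_url},
    }
    return json.dumps(body)
```

Calling build_payload("audios/2025_10_21_08_22_58_741_short.mp3") yields the same body used in the cURL example on this page.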

Endpoint

POST https://api.1min.ai/api/features

Request Headers

Field        | Value
API-KEY      | <api-key>
Content-Type | application/json

Supported Audio Formats

  • MP3 - MPEG Audio Layer III
  • WAV - Waveform Audio File Format
  • M4A - MPEG-4 Audio
  • FLAC - Free Lossless Audio Codec
  • MP4 - MPEG-4 Part 14 (audio only)
  • WEBM - WebM Audio
  • OGG - Ogg Vorbis
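
A quick client-side check against this list can catch an unsupported file before it is uploaded. The sketch below is in Python and assumes that the file extension reflects the actual format; it does not inspect the file contents. The names SUPPORTED_EXTENSIONS and is_supported_audio are illustrative only.

```python
from pathlib import PurePosixPath

# Extensions corresponding to the formats listed above.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".mp4", ".webm", ".ogg"}


def is_supported_audio(path: str) -> bool:
    """Return True if the path ends in one of the listed extensions (case-insensitive)."""
    return PurePosixPath(path).suffix.lower() in SUPPORTED_EXTENSIONS
```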

Language Support

The API detects the language automatically, or you can specify one of the following language codes:

  • en - English
  • es - Spanish
  • fr - French
  • de - German
  • it - Italian
  • pt - Portuguese
  • ru - Russian
  • ja - Japanese
  • ko - Korean
  • zh - Chinese
  • ar - Arabic
  • hi - Hindi

ElevenLabs supports many more languages beyond those listed here.

Code Example

curl --location 'https://api.1min.ai/api/features' \
  --header 'API-KEY: <api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "type": "SPEECH_TO_TEXT",
    "model": "elevenlabs-speech-to-text",
    "promptObject": {
      "audioUrl": "audios/2025_10_21_08_22_58_741_short.mp3"
    }
  }'
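
For callers who prefer a script over the command line, the same request can be sketched in Python with only the standard library. The endpoint, headers, and body mirror the cURL command above. The helper names build_request and transcribe are hypothetical, and the assumption that the response body is JSON is not confirmed by this page.

```python
import json
import urllib.request

API_URL = "https://api.1min.ai/api/features"


def build_request(api_key: str, audio_url: str) -> urllib.request.Request:
    """Build the POST request without sending it."""
    body = json.dumps({
        "type": "SPEECH_TO_TEXT",
        "model": "elevenlabs-speech-to-text",
        "promptObject": {"audioUrl": audio_url},
    }).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={"API-KEY": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def transcribe(api_key: str, audio_url: str, timeout: float = 60.0):
    """Send the request and decode the response.

    Assumption: the API answers with a JSON document; its fields are not
    described on this page, so the parsed object is returned as-is.
    """
    request = build_request(api_key, audio_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Keeping request construction separate from sending makes it possible to inspect or log the request before any network call is made.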

Use Cases

  • Meeting Transcription: Convert recorded meetings and calls into searchable text
  • Content Creation: Transcribe podcasts, interviews, and videos for accessibility
  • Research: Convert research interviews and focus groups into analyzable text
  • Legal Documentation: Transcribe depositions, hearings, and legal proceedings
  • Education: Create transcripts of lectures, seminars, and educational content
  • Media Production: Generate subtitles and captions for videos and broadcasts
  • Customer Service: Transcribe customer calls for quality assurance and training

Tips for Best Results

  1. Upload First: Use the Asset API to upload your audio file before transcription
  2. Audio Quality: Use high-quality audio files with minimal background noise
  3. File Size: Keep files under 50MB for optimal processing speed
  4. Format: Use common formats like MP3, WAV, or M4A for best compatibility
  5. Clear Speech: Recordings work best when speakers enunciate clearly and talk at a moderate pace
  6. File Path: Use the exact audioUrl path returned from the Asset API upload
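
The file size tip can be turned into a simple pre-upload check. The Python sketch below treats "50MB" as 50 × 1024 × 1024 bytes, which is an assumption, and the page presents this figure as a recommendation for processing speed rather than a documented hard limit. The names MAX_BYTES, within_size_limit, and file_within_size_limit are illustrative.

```python
import os

# Assumption: "50MB" is read as 50 MiB; the page does not say which unit is meant.
MAX_BYTES = 50 * 1024 * 1024


def within_size_limit(size_bytes: int, limit: int = MAX_BYTES) -> bool:
    """Return True for a non-empty size strictly under the recommended limit."""
    return 0 < size_bytes < limit


def file_within_size_limit(path: str) -> bool:
    """Apply the same check to a local file before uploading it."""
    return within_size_limit(os.path.getsize(path))
```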

Error Handling

Common error scenarios and solutions:

  • File not found: Ensure the audio file was uploaded via the Asset API first
  • Invalid audioUrl: Verify that the path exactly matches the one returned by the Asset API upload
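
One way to surface these hints in client code is to wrap the call and attach them to any failure. The Python sketch below assumes the API signals errors through non-2xx HTTP status codes; this page does not document the status codes or the error body format, so the body is passed through verbatim. The names explain_failure and call_with_hints are hypothetical.

```python
import urllib.error

HINTS = (
    "Check that the audio file was uploaded via the Asset API first, "
    "and that audioUrl matches the returned path exactly."
)


def explain_failure(status: int, body: str) -> str:
    """Combine the HTTP status, the raw error body, and the hints listed above."""
    detail = body.strip() or "no response body"
    return f"Transcription request failed (HTTP {status}): {detail}. {HINTS}"


def call_with_hints(send):
    """Run `send()` (any zero-argument callable that performs the request).

    An HTTPError is re-raised as a RuntimeError carrying a more helpful message.
    """
    try:
        return send()
    except urllib.error.HTTPError as err:
        body = err.read().decode("utf-8", errors="replace")
        raise RuntimeError(explain_failure(err.code, body)) from err
```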