ElevenLabs Speech-to-Text
Convert audio files and speech to accurate text transcriptions using ElevenLabs' advanced AI speech recognition technology.
Note: Audio files must first be uploaded using the Asset API before transcription. The audioUrl parameter should contain the path returned from the Asset API upload.
Supported Models
elevenlabs-speech-to-text: ElevenLabs Speech-to-Text transcription service
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
type | string | Yes | Feature type, must be "SPEECH_TO_TEXT" |
model | string | Yes | Model identifier, use "elevenlabs-speech-to-text" |
promptObject.audioUrl | string | Yes | Path to audio file (uploaded via Asset API) |
Endpoint
Request Headers
| Field | Value |
|---|---|
| API-KEY | <api-key> |
| Content-Type | application/json |
Supported Audio Formats
- MP3 - MPEG Audio Layer III
- WAV - Waveform Audio File Format
- M4A - MPEG-4 Audio
- FLAC - Free Lossless Audio Codec
- MP4 - MPEG-4 Part 14 (audio only)
- WEBM - WebM Audio
- OGG - Ogg Vorbis
Language Support
The API supports automatic language detection or you can specify:
en- Englishes- Spanishfr- Frenchde- Germanit- Italianpt- Portugueseru- Russianja- Japaneseko- Koreanzh- Chinesear- Arabichi- Hindi
And many more languages supported by ElevenLabs.
Code Example
- cURL
- JavaScript
- Python
curl --location 'https://api.1min.ai/api/features' \
--header 'API-KEY: <api-key>' \
--header 'Content-Type: application/json' \
--data '{
"type": "SPEECH_TO_TEXT",
"model": "elevenlabs-speech-to-text",
"promptObject": {
"audioUrl": "audios/2025_10_21_08_22_58_741_short.mp3"
}
}'
fetch('https://api.1min.ai/api/features', {
method: 'POST',
headers: {
'Content-Type': 'application/json',
'API-KEY': 'YOUR_API_KEY'
},
body: JSON.stringify({
type: 'SPEECH_TO_TEXT',
model: 'elevenlabs-speech-to-text',
promptObject: {
audioUrl: 'audios/2025_10_21_08_22_58_741_short.mp3'
}
})
});
import requests
url = "https://api.1min.ai/api/features"
headers = {
"Content-Type": "application/json",
"API-KEY": "YOUR_API_KEY"
}
data = {
"type": "SPEECH_TO_TEXT",
"model": "elevenlabs-speech-to-text",
"promptObject": {
"audioUrl": "audios/2025_10_21_08_22_58_741_short.mp3"
}
}
response = requests.post(url, headers=headers, json=data)
Interactive Playground
Try the API directly in your browser:
API Playground
https://api.1min.ai/api/featuresPath to the audio file you want to transcribe (upload via Asset API first)
Generated cURL Command:
curl -X POST "https://api.1min.ai/api/features" \
-H "API-KEY: <your-api-key>" \
-H "Content-Type: application/json" \
-d '{
"type": "SPEECH_TO_TEXT",
"model": "elevenlabs-speech-to-text",
"promptObject": {
"audioUrl": "audios/2025_10_21_08_22_58_741_short.mp3"
}
}'
Use Cases
- Meeting Transcription: Convert recorded meetings and calls into searchable text
- Content Creation: Transcribe podcasts, interviews, and videos for accessibility
- Research: Convert research interviews and focus groups into analyzable text
- Legal Documentation: Transcribe depositions, hearings, and legal proceedings
- Education: Create transcripts of lectures, seminars, and educational content
- Media Production: Generate subtitles and captions for videos and broadcasts
- Customer Service: Transcribe customer calls for quality assurance and training
Tips for Best Results
- Upload First: Use the Asset API to upload your audio file before transcription
- Audio Quality: Use high-quality audio files with minimal background noise
- File Size: Keep files under 50MB for optimal processing speed
- Format: Use common formats like MP3, WAV, or M4A for best compatibility
- Clear Speech: Ensure speakers speak clearly with minimal mumbling or fast speech
- File Path: Use the exact
audioUrlpath returned from the Asset API upload