Short Speech - Speech to Text

Convert short-form audio files and brief speech recordings to accurate text transcriptions with fast processing optimized for quick content. Perfect for voice notes, short recordings, commands, and brief audio clips.

Note: Audio files must first be uploaded using the Asset API before transcription. The audioUrl parameter should contain the path returned from the Asset API upload.

Supported Models

latest_short: Optimized for short-form audio content with fast processing capabilities

Endpoint

POSThttps://api.1min.ai/api/features

Request Headers

Field	Value
API-KEY	`<api-key>`
Content-Type	`application/json`

Supported Audio Formats

MP3 - MPEG Audio Layer III
WAV - Waveform Audio File Format
M4A - MPEG-4 Audio
FLAC - Free Lossless Audio Codec
MP4 - MPEG-4 Part 14 (audio only)
WEBM - WebM Audio
OGG - Ogg Vorbis

Language Support

The API supports various languages including:

en-US - English (US)
en-GB - English (UK)
vi-VN - Vietnamese
es-ES - Spanish
fr-FR - French
de-DE - German
it-IT - Italian
pt-PT - Portuguese
ru-RU - Russian
ja-JP - Japanese
ko-KR - Korean
zh-CN - Chinese (Simplified)
ar-SA - Arabic

Note: For a complete list of all supported languages and their language codes, please refer to the Google Cloud Text-to-Speech documentation.

Code Example

Parameters

Parameter	Type	Required	Description
`type`	string	Yes	Feature type, must be "SPEECH_TO_TEXT"
`model`	string	Yes	Model identifier, use "latest_short"
`promptObject.audioUrl`	string	Yes	Path to audio file (uploaded via Asset API)
`promptObject.language`	string	Yes	Language code for transcription (e.g., "en-US", "en-US")

Code Examples

cURL
JavaScript
Python

curl --location 'https://api.1min.ai/api/features' \
--header 'API-KEY: <api-key>' \
--header 'Content-Type: application/json' \
--data '{
"type": "SPEECH_TO_TEXT",
"model": "latest_short",
"promptObject": {
  "audioUrl": "audios/2025_10_21_08_22_58_741_short.mp3",
  "language": "en-US"
}
}'

fetch('https://api.1min.ai/api/features', {
method: 'POST',
headers: {
  'Content-Type': 'application/json',
  'API-KEY': 'YOUR_API_KEY'
},
body: JSON.stringify({
  type: 'SPEECH_TO_TEXT',
  model: 'latest_short',
  promptObject: {
    audioUrl: 'audios/2025_10_21_08_22_58_741_short.mp3',
    language: 'en-US'
  }
})
})

import requests

url = "https://api.1min.ai/api/features"
headers = {
"Content-Type": "application/json",
"API-KEY": "YOUR_API_KEY"
}

data = {
"type": "SPEECH_TO_TEXT",
"model": "latest_short",
"promptObject": {
  "audioUrl": "audios/2025_10_21_08_22_58_741_short.mp3",
  "language": "en-US"
}
}

response = requests.post(url, headers=headers, json=data)

Interactive Playground

Try the API directly in your browser:

API Playground

https://api.1min.ai/api/features

AI Model *

Audio URL *

Path to the audio file you want to transcribe (upload via Asset API first)

Language *

Generated cURL Command:

curl -X POST "https://api.1min.ai/api/features" \
  -H "API-KEY: <your-api-key>" \
  -H "Content-Type: application/json" \
  -d '{
  "type": "SPEECH_TO_TEXT",
  "model": "latest_short",
  "promptObject": {
    "audioUrl": "audios/2025_10_21_08_22_58_741_short.mp3",
    "language": "en-US"
  }
}'

Use Cases

Voice Notes: Convert personal voice memos and quick notes to text
Voice Commands: Transcribe voice commands for applications and assistants
Quick Messages: Convert short voice messages to text for messaging apps
Voice Search: Enable voice search functionality in applications
Caption Generation: Create quick captions for short video clips
Dictation: Convert brief dictated content to text
Voice Verification: Transcribe spoken authentication phrases
Audio Feedback: Convert customer feedback recordings to text

Tips for Best Results

Upload First: Use the Asset API to upload your audio file before transcription
Audio Quality: Ensure clear audio without background noise for short clips
Duration: Optimized for audio files under 2 minutes for fastest processing
Clear Speech: Speak clearly and at a moderate pace for best accuracy
Language Selection: Choose the correct language for optimal results
Processing Speed: Short audio files process very quickly, usually within seconds

Error Handling

Common error scenarios and solutions:

File not found: Ensure the audio file was uploaded via Asset API first
Invalid audioUrl: Verify the path matches exactly what was returned from Asset API upload
Language not supported: Check that the language code is in the supported list
Audio too long: Consider using the long-speech model for audio over 2 minutes

Supported Models​

Endpoint​

Request Headers​

Supported Audio Formats​

Language Support​

Code Example​

Parameters​

Code Examples​

Interactive Playground​