Skip to main content
Voice Speech to Text fal.ai

Speech-to-Text Turbo

Speech-to-Text Turbo costs $0.0048/req on FairStack — a speech to text model for Bulk transcription, Fast STT workflows. No subscription required. Pay per generation with full REST API access. FairStack applies a transparent 20% margin on infrastructure cost so you always see the real price.

FairStack price
$0.0048/req

What is Speech-to-Text Turbo?

STT Turbo is a fast speech-to-text model optimized for rapid transcription at an affordable per-second rate. The model converts spoken audio to text with speed-optimized processing, making it suitable for bulk transcription workloads where throughput matters more than maximum accuracy on difficult audio. With per-second billing at $0.0008 per second, it is extremely affordable for transcription. A full hour of audio costs just $2.88, making it practical for transcribing large audio libraries, podcast archives, and meeting recordings at scale. The turbo optimization delivers faster-than-real-time processing for most audio content. Compared to FairStack's self-hosted Faster Whisper at $0.001 flat per transcription, STT Turbo uses per-second pricing that scales differently depending on audio length. Against premium cloud STT services charging $0.01-0.05 per minute, it is significantly cheaper. For large-volume transcription where speed and cost matter most, it is a strong choice. Best suited for bulk transcription, fast STT workflows, and high-volume audio processing where per-second pricing and fast processing make large-scale transcription affordable. Available on FairStack at infrastructure cost plus a 20% platform fee.

Key Features

Fast transcription
Per-second at $0.0008/s
Bulk-friendly

What are Speech-to-Text Turbo's strengths?

Fast processing
Affordable per-second

What are Speech-to-Text Turbo's limitations?

Per-second pricing
May be less accurate than specialized models

What is Speech-to-Text Turbo best for?

Bulk transcription Fast STT workflows

How much does Speech-to-Text Turbo cost?

Metric
FairStack
Details
Price per generation
$0.0048
Includes 20% margin
Per-second rate
$0.0008/sec
Billed per second of output
Subscription
None
Pay per generation only

How does Speech-to-Text Turbo perform across capabilities?

Turbo STT — fastest transcription option

accuracy
88%
latency
94%
multilingual
82%
noise handling
78%

How do I use the Speech-to-Text Turbo API?

curl
curl -X POST https://api.fairstack.ai/v1/generations/voice \
  -H "Authorization: Bearer $FAIRSTACK_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "stt-turbo",
    "prompt": "Your prompt here"
  }'
Python
import requests

response = requests.post(
    "https://api.fairstack.ai/v1/generations/voice",
    headers={
        "Authorization": f"Bearer {FAIRSTACK_API_KEY}",
        "Content-Type": "application/json",
    },
    json={
        "model": "stt-turbo",
        "prompt": "Your prompt here",
    },
)

result = response.json()
print(result["url"])
Node.js
const response = await fetch(
  "https://api.fairstack.ai/v1/generations/voice",
  {
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.FAIRSTACK_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      model: "stt-turbo",
      prompt: "Your prompt here",
    }),
  }
);

const result = await response.json();
console.log(result.url);

Frequently Asked Questions

How much does Speech-to-Text Turbo cost?

Speech-to-Text Turbo costs $0.0048/req on FairStack as of 2026-05-13. This price includes FairStack's transparent 20% margin on infrastructure cost. No subscription or monthly fee — you pay per generation only. Minimum deposit is $1.

What is Speech-to-Text Turbo and what is it best for?

STT Turbo is a fast speech-to-text model optimized for rapid transcription at an affordable per-second rate. The model converts spoken audio to text with speed-optimized processing, making it suitable for bulk transcription workloads where throughput matters more than maximum accuracy on difficult audio. With per-second billing at $0.0008 per second, it is extremely affordable for transcription. A full hour of audio costs just $2.88, making it practical for transcribing large audio libraries, podcast archives, and meeting recordings at scale. The turbo optimization delivers faster-than-real-time processing for most audio content. Compared to FairStack's self-hosted Faster Whisper at $0.001 flat per transcription, STT Turbo uses per-second pricing that scales differently depending on audio length. Against premium cloud STT services charging $0.01-0.05 per minute, it is significantly cheaper. For large-volume transcription where speed and cost matter most, it is a strong choice. Best suited for bulk transcription, fast STT workflows, and high-volume audio processing where per-second pricing and fast processing make large-scale transcription affordable. Available on FairStack at infrastructure cost plus a 20% platform fee. Speech-to-Text Turbo is best for Bulk transcription, Fast STT workflows. Available via FairStack's REST API with curl, Python, and Node.js SDKs.

Does Speech-to-Text Turbo have an API?

Yes. Speech-to-Text Turbo is available via FairStack's REST API at api.fairstack.ai. Send a POST request to /v1/generations/voice with your API key and prompt. Works with curl, Python requests, Node.js fetch, and any HTTP client. No SDK installation required.

How does Speech-to-Text Turbo compare to other voice models?

Speech-to-Text Turbo excels at Bulk transcription, Fast STT workflows. It is a speech to text model priced at $0.0048/req on FairStack. Key strengths: Fast processing, Affordable per-second. Compare all voice models at fairstack.ai/models.

What makes Speech-to-Text Turbo effective for speech recognition?

Speech-to-Text Turbo excels with fast processing and affordable per-second. Generation typically completes in under 5 seconds.

What are the known limitations of Speech-to-Text Turbo?

Key limitations include: per-second pricing; may be less accurate than specialized models. FairStack documents these transparently so you can choose the right model for your workflow.

How fast is Speech-to-Text Turbo?

Speech-to-Text Turbo typically completes in under 5 seconds. This makes it suitable for real-time applications, interactive workflows, and high-volume batch processing.

What voice features does Speech-to-Text Turbo support?

Speech-to-Text Turbo offers: fast transcription; per-second at $0.0008/s; bulk-friendly. All capabilities are accessible through both the FairStack web interface and REST API.

Start using Speech-to-Text Turbo today

$0.0048/req. Full API access. No subscription.

Start Creating