Skip to main content
Voice Text to Speech fal.ai

Chatterbox Multilingual

Chatterbox Multilingual costs $0.030/req on FairStack — a text to speech model for Multilingual voice content, Cross-lingual voice cloning, Global audience TTS. No subscription required. Pay per generation with full REST API access. FairStack applies a transparent 20% margin on infrastructure cost so you always see the real price.

FairStack price
$0.030/req

What is Chatterbox Multilingual?

Chatterbox Multilingual is Resemble AI's multi-language TTS model supporting 23 languages with voice cloning capabilities across all supported languages. The model combines broad language coverage with the ability to clone a voice from reference audio and use that cloned voice in any of the 23 supported languages, enabling cross-lingual voice synthesis. At approximately $0.025 per 1,000 characters, it provides affordable multilingual TTS with voice cloning. The 23 supported languages include English, Arabic, Danish, German, Greek, Spanish, Finnish, French, Hebrew, Hindi, Italian, Japanese, Korean, Malay, Dutch, Norwegian, Polish, Portuguese, Russian, Swedish, Swahili, Turkish, and Chinese. The 300-character prompt limit means content should be processed in segments. Compared to ElevenLabs Multilingual V2 with 29 languages, Chatterbox Multilingual offers fewer languages but includes voice cloning across all supported languages at a lower price. Against single-language voice cloning models, the cross-lingual capability is unique. Best suited for multilingual voice content, cross-lingual voice cloning, and global audience TTS where voice consistency across multiple languages matters. Available on FairStack at infrastructure cost plus a 20% platform fee.

Key Features

23 languages supported — EN, AR, DA, DE, EL, ES, FI, FR, HE, HI, IT, JA, KO, MS, NL, NO, PL, PT, RU, SV, SW, TR, ZH
Voice cloning from reference audio
Cross-lingual voice synthesis
Affordable at ~$0.025/1K characters
300 character prompt limit

What are Chatterbox Multilingual's strengths?

Wide language coverage (23 languages)
Voice cloning in any supported language
Affordable multilingual TTS
Chatterbox quality across languages

What are Chatterbox Multilingual's limitations?

300 character prompt limit — short segments only
Quality may vary by language
Less natural than dedicated single-language models

What is Chatterbox Multilingual best for?

Multilingual voice content Cross-lingual voice cloning Global audience TTS Localized content creation

How much does Chatterbox Multilingual cost?

Metric
FairStack
Details
Price per generation
$0.030
Includes 20% margin
Subscription
None
Pay per generation only

How does Chatterbox Multilingual perform across capabilities?

Estimated scores — Chatterbox Multilingual. 23 languages with cloning. Quality varies by language.

naturalness
80%
emotion range
68%
cloning accuracy
78%
multilingual
88%
latency
75%

How do I use the Chatterbox Multilingual API?

curl
curl -X POST https://api.fairstack.ai/v1/generations/voice \
  -H "Authorization: Bearer $FAIRSTACK_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "chatterbox-multilingual",
    "prompt": "Your prompt here"
  }'
Python
import requests

response = requests.post(
    "https://api.fairstack.ai/v1/generations/voice",
    headers={
        "Authorization": f"Bearer {FAIRSTACK_API_KEY}",
        "Content-Type": "application/json",
    },
    json={
        "model": "chatterbox-multilingual",
        "prompt": "Your prompt here",
    },
)

result = response.json()
print(result["url"])
Node.js
const response = await fetch(
  "https://api.fairstack.ai/v1/generations/voice",
  {
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.FAIRSTACK_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      model: "chatterbox-multilingual",
      prompt: "Your prompt here",
    }),
  }
);

const result = await response.json();
console.log(result.url);

Frequently Asked Questions

How much does Chatterbox Multilingual cost?

Chatterbox Multilingual costs $0.030/req on FairStack as of 2026-05-13. This price includes FairStack's transparent 20% margin on infrastructure cost. No subscription or monthly fee — you pay per generation only. Minimum deposit is $1.

What is Chatterbox Multilingual and what is it best for?

Chatterbox Multilingual is Resemble AI's multi-language TTS model supporting 23 languages with voice cloning capabilities across all supported languages. The model combines broad language coverage with the ability to clone a voice from reference audio and use that cloned voice in any of the 23 supported languages, enabling cross-lingual voice synthesis. At approximately $0.025 per 1,000 characters, it provides affordable multilingual TTS with voice cloning. The 23 supported languages include English, Arabic, Danish, German, Greek, Spanish, Finnish, French, Hebrew, Hindi, Italian, Japanese, Korean, Malay, Dutch, Norwegian, Polish, Portuguese, Russian, Swedish, Swahili, Turkish, and Chinese. The 300-character prompt limit means content should be processed in segments. Compared to ElevenLabs Multilingual V2 with 29 languages, Chatterbox Multilingual offers fewer languages but includes voice cloning across all supported languages at a lower price. Against single-language voice cloning models, the cross-lingual capability is unique. Best suited for multilingual voice content, cross-lingual voice cloning, and global audience TTS where voice consistency across multiple languages matters. Available on FairStack at infrastructure cost plus a 20% platform fee. Chatterbox Multilingual is best for Multilingual voice content, Cross-lingual voice cloning, Global audience TTS. Available via FairStack's REST API with curl, Python, and Node.js SDKs.

Does Chatterbox Multilingual have an API?

Yes. Chatterbox Multilingual is available via FairStack's REST API at api.fairstack.ai. Send a POST request to /v1/generations/voice with your API key and prompt. Works with curl, Python requests, Node.js fetch, and any HTTP client. No SDK installation required.

How does Chatterbox Multilingual compare to other voice models?

Chatterbox Multilingual excels at Multilingual voice content, Cross-lingual voice cloning, Global audience TTS. It is a text to speech model priced at $0.030/req on FairStack. Key strengths: Wide language coverage (23 languages), Voice cloning in any supported language. Compare all voice models at fairstack.ai/models.

What makes Chatterbox Multilingual stand out from other voice models?

Chatterbox Multilingual excels with wide language coverage (23 languages) and voice cloning in any supported language. Generation typically completes in under 5 seconds.

What are the known limitations of Chatterbox Multilingual?

Key limitations include: 300 character prompt limit — short segments only; quality may vary by language; less natural than dedicated single-language models. FairStack documents these transparently so you can choose the right model for your workflow.

How fast is Chatterbox Multilingual?

Chatterbox Multilingual typically completes in under 5 seconds. This makes it suitable for real-time applications, interactive workflows, and high-volume batch processing.

What voice features does Chatterbox Multilingual support?

Chatterbox Multilingual offers: 23 languages supported; voice cloning from reference audio; cross-lingual voice synthesis; affordable at ~$0.025/1k characters. All capabilities are accessible through both the FairStack web interface and REST API.

Start using Chatterbox Multilingual today

$0.030/req. Full API access. No subscription.

Start Creating