Skip to main content
Video Text to Video fal.ai

CogVideoX-5B

CogVideoX-5B costs $0.240/clip on FairStack — a text to video model for Open-source workflows, Research and experimentation, Custom pipeline integration. No subscription required. Pay per generation with full REST API access. FairStack applies a transparent 20% margin on infrastructure cost so you always see the real price.

FairStack price
$0.240/clip
Try CogVideoX-5B API Quickstart Last updated 2026-05-13

What is CogVideoX-5B?

CogVideoX-5B is an open-source video generation model developed for research and experimentation. With 5 billion parameters, it generates video clips of up to 10 seconds at 480p resolution, offering the longest default clip duration among budget video models. Its open-source nature makes it fully transparent and customizable. The model supports standard text-to-video generation with reasonable motion coherence for its parameter count. While it does not match commercial models in visual fidelity or prompt adherence, it provides a solid foundation for custom pipeline integration, academic research, and video generation experimentation without commercial licensing restrictions. Compared to commercial alternatives like Kling, Runway, or Sora, CogVideoX-5B produces lower visual quality but offers complete model transparency and the ability to run on private infrastructure. For researchers building custom video pipelines or testing novel approaches, the 10-second clip length and open weights provide flexibility that closed models cannot match. Best suited for research projects, custom video pipeline development, open-source workflow integration, and experimentation with video generation at minimal cost. Available on FairStack at infrastructure cost plus a 20% platform fee.

Key Features

Open-source model
Up to 10 seconds of video
Customizable pipeline integration

What are CogVideoX-5B's strengths?

Open-source flexibility
Up to 10 seconds
Good for research

What are CogVideoX-5B's limitations?

Only 480p resolution
Lower quality than commercial models

What is CogVideoX-5B best for?

Open-source workflows Research and experimentation Custom pipeline integration

How much does CogVideoX-5B cost?

Metric
FairStack
Details
Price per generation
$0.240
Includes 20% margin
Subscription
None
Pay per generation only

How does CogVideoX-5B perform across capabilities?

motion quality
68%
visual quality
65%
prompt adherence
65%
temporal coherence
70%
character consistency
60%
camera control
55%

How do I use the CogVideoX-5B API?

curl
curl -X POST https://api.fairstack.ai/v1/generations/video \
  -H "Authorization: Bearer $FAIRSTACK_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "cogvideox-5b-t2v",
    "prompt": "Your prompt here"
  }'
Python
import requests

response = requests.post(
    "https://api.fairstack.ai/v1/generations/video",
    headers={
        "Authorization": f"Bearer {FAIRSTACK_API_KEY}",
        "Content-Type": "application/json",
    },
    json={
        "model": "cogvideox-5b-t2v",
        "prompt": "Your prompt here",
    },
)

result = response.json()
print(result["url"])
Node.js
const response = await fetch(
  "https://api.fairstack.ai/v1/generations/video",
  {
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.FAIRSTACK_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      model: "cogvideox-5b-t2v",
      prompt: "Your prompt here",
    }),
  }
);

const result = await response.json();
console.log(result.url);

What parameters does CogVideoX-5B support?

Parameter
Type
Default
Details
aspect_ratio
enum
16:9
Options: 16:9, 9:16, 1:1
duration
enum
5
Options: 5, 10
negative_prompt
string (optional)
seed
integer (optional)

Frequently Asked Questions

How much does CogVideoX-5B cost?

CogVideoX-5B costs $0.240/clip on FairStack as of 2026-05-13. This price includes FairStack's transparent 20% margin on infrastructure cost. No subscription or monthly fee — you pay per generation only. Minimum deposit is $1.

What is CogVideoX-5B and what is it best for?

CogVideoX-5B is an open-source video generation model developed for research and experimentation. With 5 billion parameters, it generates video clips of up to 10 seconds at 480p resolution, offering the longest default clip duration among budget video models. Its open-source nature makes it fully transparent and customizable. The model supports standard text-to-video generation with reasonable motion coherence for its parameter count. While it does not match commercial models in visual fidelity or prompt adherence, it provides a solid foundation for custom pipeline integration, academic research, and video generation experimentation without commercial licensing restrictions. Compared to commercial alternatives like Kling, Runway, or Sora, CogVideoX-5B produces lower visual quality but offers complete model transparency and the ability to run on private infrastructure. For researchers building custom video pipelines or testing novel approaches, the 10-second clip length and open weights provide flexibility that closed models cannot match. Best suited for research projects, custom video pipeline development, open-source workflow integration, and experimentation with video generation at minimal cost. Available on FairStack at infrastructure cost plus a 20% platform fee. CogVideoX-5B is best for Open-source workflows, Research and experimentation, Custom pipeline integration. Available via FairStack's REST API with curl, Python, and Node.js SDKs.

Does CogVideoX-5B have an API?

Yes. CogVideoX-5B is available via FairStack's REST API at api.fairstack.ai. Send a POST request to /v1/generations/video with your API key and prompt. Works with curl, Python requests, Node.js fetch, and any HTTP client. No SDK installation required.

How does CogVideoX-5B compare to other video models?

CogVideoX-5B excels at Open-source workflows, Research and experimentation, Custom pipeline integration. It is a text to video model priced at $0.240/clip on FairStack. Key strengths: Open-source flexibility, Up to 10 seconds. Compare all video models at fairstack.ai/models.

What makes CogVideoX-5B stand out from other video models?

CogVideoX-5B is distinguished by open-source flexibility and up to 10 seconds. Generation typically completes in 5-15 seconds.

What are the known limitations of CogVideoX-5B?

Key limitations include: only 480p resolution; lower quality than commercial models. FairStack documents these transparently so you can choose the right model for your workflow.

How fast is CogVideoX-5B?

CogVideoX-5B typically completes in 5-15 seconds. This provides a good balance between output quality and processing speed for most production workflows.

What video capabilities does CogVideoX-5B offer?

CogVideoX-5B offers: open-source model; up to 10 seconds of video; customizable pipeline integration. All capabilities are accessible through both the FairStack web interface and REST API.

Start using CogVideoX-5B today

$0.240/clip. Full API access. No subscription.

Start Creating