
298 AI Models

Browse every model available on FairStack. Transparent per-generation pricing. Full API access. No subscription.

Image

112 models
Utility $0.0012/image
DDColor (Colorize)

DDColor — fast automatic colorization of grayscale images at $0.001

Upscale $0.0012/image
ESRGAN Upscale

ESRGAN — classic AI upscaling at $0.001 for basic resolution enhancement

Utility $0.0024/image
CodeFormer (Face Restore)

CodeFormer — AI face restoration that recovers detail in damaged or low-res faces at $0.002

Text to Image $0.0030/image
SANA Sprint (NVIDIA)

NVIDIA SANA Sprint — the cheapest image model at $0.0025 with sub-second generation

Text to Image $0.0048/image
Z-Image Turbo

The fastest, cheapest image generation at $0.004/image — perfect for prototyping

Text to Image $0.0060/image
GPT Image 1 Mini

GPT Image 1 Mini — the most affordable GPT Image model, starting at $0.005 per image

Image Editing $0.0060/image
Flux 2 Klein Realtime

FLUX 2 Klein Realtime — instant image editing at $0.005 for live workflows

Text to Image $0.0060/image
P-Image (Pruna)

Confidential frontier model — strong all-round quality with excellent layout

Utility $0.0060/image
Recraft Remove Background

Accurate background removal — clean cutouts at $0.005/image

Text to Image $0.0096/image
Flux 2 Turbo

FLUX 2 Turbo — fast, affordable FLUX 2 at $0.008 for quick generation

Image Editing $0.011/image
AI Face Swap (Image)

AI face swap — seamlessly replace faces in any image with natural blending

Image Editing $0.012/image
P-Image Edit (Pruna)

Confidential frontier editing model — precise edits with strong identity preservation

Text to Image $0.012/image
Flux 2 Klein 4B

FLUX 2 Klein 4B — ultra-fast, ultra-cheap generation at $0.005 for rapid prototyping

Text to Image $0.012/image
HiDream I1 Fast

HiDream Fast — quick, affordable generation at $0.01 for rapid workflows

Upscale $0.012/image
AuraSR Upscale

Budget AI upscaling — fast and affordable at $0.01/image

Text to Image $0.013/image
SANA 1.5 4.8B

NVIDIA SANA 1.5 — improved quality over Sprint at $0.005, still ultra-fast

Text to Image $0.014/image
Flux 2 Klein 9B

FLUX 2 Klein 9B — balanced speed and quality in a compact model at $0.006

Text to Image $0.016/image
Z-Image Base LoRA

Budget LoRA model — cheapest fine-tunable image generation at $0.02

Text to Image $0.021/image
Ideogram V3

Best text rendering of any image model — logos, typography, and design-centric output

Elo: 1142
Image Editing $0.021/image
Ideogram V3 Reframe

Ideogram V3 Reframe — change aspect ratios and extend images intelligently at $0.018

Image Editing $0.024/image
Flux 2 Flex I2I

FLUX 2 Flex I2I — versatile image editing with flexible control at $0.02

Text to Image $0.024/image
Grok Imagine T2I

Elo #5 globally — strong aesthetic quality with excellent price-to-quality ratio

Elo: 1187
Image Editing $0.024/image
Grok Imagine I2I

xAI's editing model — aesthetic-preserving edits with strong style consistency

Text to Image $0.024/image
4o Image (GPT Image 1)

Original GPT Image model — balanced capabilities with good text rendering

Text to Image $0.024/image
Flux 2 Flex T2I

FLUX 2 Flex — good quality image generation with flexible styling at $0.02

Text to Image $0.024/image
Flux 2 Flash

FLUX 2 Flash — the fastest FLUX 2 model at $0.005 for instant generation

Image Editing $0.030/image
FLUX.1 Kontext Pro

The most precise image editing model — edit specific elements while preserving everything else

Image Editing $0.030/image
Flux 2 Pro Edit

FLUX 2 Pro Edit — professional-grade editing with strong identity preservation

Image Editing $0.030/image
FLUX Krea I2I

Photorealism-focused image-to-image editing via FLUX Krea

Image Editing $0.030/image
FLUX Krea Redux

Image remix and variation via FLUX Krea Redux

Image Editing $0.030/image
GenFocus (Refocus)

GenFocus Refocus — AI-powered depth-of-field control for any image at $0.025

Image Editing $0.030/image
GenFocus All-in-Focus

GenFocus All-in-Focus — remove depth blur and make entire image sharp at $0.025

Text to Image $0.030/image
FLUX.2

FLUX.2 Pro — next-gen image quality from Black Forest Labs with exceptional versatility

Elo: 1202
Text to Image $0.030/image
Flux 2 Pro T2I

FLUX 2 Pro — premium image generation with exceptional style versatility

Elo: 1202
Utility $0.030/image
BiRefNet Background Removal

BEN V2 — advanced background removal with fine edge detail at $0.025

Text to Image $0.033/image
Seedream 5.0 Lite

ByteDance's latest lightweight image model with web search and reasoning

Image Editing $0.033/image
Seedream 5.0 Lite Edit

Image editing with ByteDance's Seedream 5.0 Lite

Image Editing $0.034/image
Kling Image O3 I2I

Kling Image O3 Edit — photorealistic editing from the Kling ecosystem at $0.028

Image Editing $0.034/image
Kling Image V3 I2I

Kling Image V3 Edit — latest Kling editing with improved quality at $0.028

Text to Image $0.034/image
Kling Image O3 T2I

Strong photorealism from the Kling ecosystem — pairs well with Kling video models

Text to Image $0.034/image
Kling Image V3 T2I

Kling Image V3 — strong photorealism from the Kling ecosystem at $0.028

Image Editing $0.036/image
Ideogram V3 Edit

Ideogram V3 Edit — precise editing with the best text rendering of any edit model

Image Editing $0.036/image
Ideogram V3 Remix

Ideogram V3 Remix — style transfer and creative remixing with text preservation

Utility $0.036/image
Ideogram V3 Replace Background

Ideogram V3 Background Replace — AI-powered background swap with design awareness

Text to Image $0.036/image
WAN 2.6 T2I

Alibaba WAN 2.6 — strong photorealism with natural compositions at $0.02

Text to Image $0.036/image
HiDream I1 Dev

Balanced all-rounder — good for general-purpose generation and style exploration

Upscale $0.036/image
Clarity Upscaler

Clarity Upscaler — creative AI upscaling with noise removal and detail enhancement at $0.03

Text to Image $0.039/image
Seedream 4.5

Latest ByteDance image model — improved photorealism and detail

Elo: 1172
Image Editing $0.039/image
Seedream 4.5 Edit

Latest ByteDance edit model — strong identity preservation with unified architecture

Text to Image $0.047/image
Gemini 2.5 Flash Image

Gemini 2.5 Flash Image — Google's fast multimodal image generation at $0.039

Image Editing $0.048/image
Bria FIBO Edit

Bria FIBO Edit — commercially safe general editing with IP-compliant output at $0.04

Image Editing $0.048/image
Bria FIBO Add Object

Bria FIBO Add Object — seamlessly insert objects into images at $0.04

Image Editing $0.048/image
Bria FIBO Blend

Bria FIBO Blend — merge multiple images into seamless compositions at $0.04

Image Editing $0.048/image
Bria FIBO Colorize

Bria FIBO Colorize — add realistic color to black-and-white images at $0.04

Image Editing $0.048/image
Bria FIBO Erase

Bria FIBO Erase — cleanly remove objects from images with intelligent inpainting at $0.04

Image Editing $0.048/image
Bria FIBO Relight

Bria FIBO Relight — change lighting conditions in any image at $0.04

Image Editing $0.048/image
Bria FIBO Replace Object

Bria FIBO Replace — swap objects in images with AI-generated replacements at $0.04

Image Editing $0.048/image
Bria FIBO Reseason

Bria FIBO Reseason — change the season in outdoor images at $0.04

Image Editing $0.048/image
Bria FIBO Restore

Bria FIBO Restore — repair damaged, degraded, or old photographs at $0.04

Image Editing $0.048/image
Bria FIBO Restyle

Bria FIBO Restyle — transform image style while preserving content at $0.04

Image Editing $0.048/image
Bria FIBO Rewrite Text

Bria FIBO Rewrite Text — change text within images while preserving fonts and style at $0.04

Image Editing $0.048/image
Bria FIBO Sketch to Image

Bria FIBO Sketch to Image — turn rough sketches into polished images at $0.04

Utility $0.048/image
Bria Replace Background

AI background replacement — swap backgrounds in one step with natural blending

Text to Image $0.048/image
Recraft V4

Recraft V4 Image — design-focused generation with precise color control and text rendering at $0.04

Text to Image $0.048/image
Nano Banana 2

Fast, affordable image generation with good quality

Image Editing $0.048/image
Nano Banana 2 Edit

Fast, affordable image editing with Nano Banana 2

Text to Image $0.048/image
Reve T2I

Excellent artistic style — the go-to model for illustrations and mood pieces

Text to Image $0.048/image
Recraft V3 T2I

Professional design quality — strong text rendering and style control for marketing assets

Elo: 1130
Text to Image $0.048/image
Bria 3.2 T2I

Bria 3.2 — commercially safe image generation with built-in content licensing

Text to Image $0.048/image
Bria FIBO Generate

Commercially-licensed T2I with structured prompt support

Image Editing $0.048/image
Reve Edit

High-quality single-image editing from Reve

Image Editing $0.048/image
Reve Remix

Multi-image remix combining up to 6 reference images

Text to Image $0.054/image
ImagineArt 1.5 Pro

ImagineArt 1.5 Pro — creative generation with artistic flair at $0.045

Image Editing $0.060/image
GLM Image I2I

GLM Image Edit — solid general-purpose editing from Zhipu AI at $0.05

Upscale $0.060/image
Topaz Image Upscale

Premium AI upscaling — enhance image resolution with exceptional detail recovery

Text to Image $0.060/image
GLM Image T2I

GLM Image — Zhipu's image model with solid all-round capabilities at $0.05

Utility $0.060/image
GOT-OCR V2 (Text Extract)

GOT-OCR V2 — AI-powered text extraction from any image at $0.05

Upscale $0.060/image
Recraft Crisp Upscale

Crisp upscaling optimized for illustrations and graphics

Upscale $0.060/image
Grok Imagine Upscale

Grok Imagine Upscale — AI-powered upscaling with detail synthesis at $0.05

Upscale $0.060/image
Topaz Image Upscale (fal.ai)

Topaz AI upscaling via fal.ai (fallback)

Text to Image $0.064/image
HiDream I1 Full

HiDream Full — balanced all-rounder with good quality across dimensions at $0.03

Image Editing $0.072/image
Ideogram Character Edit

Character-focused editing with Ideogram text rendering — edit while preserving character identity

Image Editing $0.072/image
Ideogram Character Remix

Character remix — reimagine characters in new styles while preserving identity

Image Editing $0.072/image
GPT Image 2 Edit

GPT Image 2 edit — inpainting, outpainting, and mask-based editing with 16 reference images

Text to Image $0.072/image
GPT Image 2

OpenAI's next-gen image model — near-perfect text rendering with full quality and resolution control

Text to Image $0.072/image
Google Imagen 4 Ultra

Google's highest quality model — Elo #6 globally with the best photorealism

Elo: 1177
Text to Image $0.072/image
Ideogram Character

Character-focused generation with Ideogram's text rendering — consistent characters with readable text

Text to Image $0.072/image
FLUX 1.1 Pro Ultra

FLUX 1.1 Pro Ultra — premium previous-gen model with high resolution output

Upscale $0.072/image
Ideogram Upscale

Ideogram Upscale — design-aware upscaling that preserves text and graphic elements at $0.06

Text to Image $0.078/image
Stable Diffusion 3.5 Large

Stable Diffusion 3.5 — open-source model with ControlNet compatibility

Elo: 1020
Text to Image $0.084/image
Flux 2 Max

FLUX 2 Max — highest quality in the FLUX 2 family at $0.07

Image Editing $0.090/image
Qwen Image Max Edit

Premium Qwen Edit — best editing quality in the Qwen family at $0.025

Text to Image $0.090/image
Qwen Image Max T2I

Best Qwen variant — strongest text and layout combo with multilingual support

Elo: 1151
Text to Image $0.090/image
Lumina Image V2

Lumina Image V2 — high-fidelity generation with strong composition at $0.075

Image Editing $0.096/image
Ideogram V2 Edit

Ideogram V2 Edit — previous-gen editing with good text handling at $0.08

Image Editing $0.096/image
Ideogram V2 Remix

Ideogram V2 Remix — previous-gen style remixing at $0.08

Text to Image $0.096/image
FLUX Kontext Max T2I

FLUX Kontext Max — highest quality editing with maximum identity preservation at $0.08

Text to Image $0.096/image
Recraft V4 (Vector)

Next-gen text-to-vector with commercial licensing

Image Editing $0.108/image
Hunyuan Image 3.0 Edit

Tencent Hunyuan Edit — artistic editing with good detail preservation at $0.09

Image Editing $0.108/image
Nano Banana Pro Edit

Premium editing with layout precision — edit images while maintaining spatial accuracy

Text to Image $0.108/image
Nano Banana Pro

Elo #2 globally — unmatched layout precision and multi-element composition

Elo: 1222
Text to Image $0.108/image
Hunyuan Image 3.0 T2I

Tencent Hunyuan 3.0 — strong artistic capabilities with good detail at $0.09

Image Editing $0.120/image
Instant Character

Instant Character — generate consistent characters from a single reference at $0.10

Text to Image $0.120/image
OmniGen V1

OmniGen V1 — unified generation model handling text-to-image and editing in one architecture

Text to Image $0.156/image
Longcat Image

Longcat Image — multi-character scene generation with narrative coherence at $0.13

Image Editing $0.180/image
Emu 3.5 Edit

Emu 3.5 Edit — Meta's image editing model with natural language instructions at $0.15

Text to Image $0.180/image
Gemini 3 Pro Image

Google Gemini 3 Pro — multimodal AI image generation with deep semantic understanding

Text to Image $0.180/image
Emu 3.5 Image

Meta Emu 3.5 — multimodal generation with semantic understanding at $0.15

Text to Image $0.180/image
Nano Banana Pro T2I (fal.ai)

Google's premium T2I model via fal.ai (fallback)

Image Editing $0.180/image
Nano Banana Pro Edit (fal.ai)

Google's premium edit model via fal.ai (fallback)

Text to Image $0.300/image
Recraft V4 Pro

Recraft V4 Pro Image — premium design generation with top-tier color and text fidelity at $0.25

Text to Image $0.360/image
Recraft V4 Pro (Vector)

Premium text-to-vector with highest quality outputs

Music

30 models
Generate $0.0012/track
Suno Generate Lyrics

Suno Generate Lyrics — AI-powered songwriting at $0.10

Generate $0.0012/track
Suno Generate MIDI

Suno Generate MIDI — export AI-generated music as MIDI data at $0.10

Generate $0.0012/track
Suno Generate Persona

Suno Generate Persona — create a custom AI music artist persona at $0.10

Remix $0.0012/track
Suno Music Cover

Suno Music Cover — generate AI covers of songs at $0.10

Generate $0.0012/track
ACE-Step 1.5

Self-hosted music generation with lyrics and vocals

Remix $0.0024/track
Suno Boost Style

Suno Boost Style — enhance the production quality and style of a song at $0.10

Generate $0.0024/track
Suno Convert to WAV

Suno Convert to WAV — convert Suno output to high-quality WAV at $0.10

Generate $0.0030/track
Suno Timestamped Lyrics

Suno Timestamped Lyrics — generate lyrics with precise timing at $0.10

Generate $0.011/track
Mureka Lyrics

Mureka Lyrics — AI lyrics generation at $0.009

Music Video $0.012/track
Suno Music Video

Suno Music Video — generate music video from a song at $0.20

Processing $0.012/track
Mureka Recognize Song

Mureka Recognize — identify songs from audio at $0.01

Generate $0.030/track
Suno Replace Section

Suno Replace Section — replace a specific section of a song with AI at $0.10

Generate $0.036/track
Mureka BGM

Mureka BGM — background music generation at $0.03

Generate $0.036/track
MiniMax Music V2

MiniMax Music V2 — full song generation with lyrics, section tags, and vocals at ~$0.10

Extend $0.043/track
Mureka Extend Song

Mureka Extend — extend songs with AI continuation at $0.036

Generate $0.054/track
Mureka V9 (Song)

Mureka V9 — newest flagship music generation model

Generate $0.054/track
Mureka V8 (Song)

Mureka V8 — the latest and highest quality AI song generation model

Processing $0.060/track
Suno Separate Vocals

Suno Stem Separation — split audio into vocals, drums, bass, and other tracks at $0.05

Generate $0.060/track
Beatoven Music

Beatoven — AI music composition for video and content at $0.05

Generate $0.072/track
Suno Generate Music

Suno — full song generation with vocals, instruments, and production from a text prompt

Generate $0.072/track
Suno Extend Music

Suno Extend — continue and extend existing songs at $0.10

Remix $0.072/track
Suno Add Vocals

Suno Add Vocals — add AI vocal tracks to instrumental music at $0.10

Remix $0.072/track
Suno Add Instrumental

Suno Add Instrumental — add AI instrumental backing to vocal tracks at $0.10

Generate $0.072/track
Suno Mashup

Suno Mashup — blend multiple songs into a cohesive mashup at $0.10

Generate $0.072/track
Suno Upload & Cover

Suno Upload & Cover — upload a song and generate an AI cover at $0.10

Generate $0.072/track
Suno Upload & Extend

Suno Upload & Extend — upload a song and extend it with AI at $0.10

Processing $0.072/track
Mureka Stem Separation

Mureka Stem Separation — extract individual instrument tracks at $0.06

Generate $0.072/track
Mureka Custom Model

Mureka Custom Model — train custom music models at $0.06

Processing $0.120/track
Mureka Describe Song

Mureka Describe — AI analysis of song content and style at $0.10

Generate $0.120/track
Lyria 2 (Google)

Lyria 2 — Google's latest music generation model at ~$0.10 per track

Talking Head

20 models
audio-driven $0.0000/clip
Longcat Single Avatar

Longcat Single Avatar — single-character talking head with body motion

audio-driven $0.0000/clip
Longcat Multi Avatar

Longcat Multi Avatar — multi-character talking head with scene interaction

audio-driven $0.0000/clip
Creatify Aurora

Studio-grade talking head videos designed for ads and commercial production

audio-driven $0.0000/clip
Fabric 1.0 (VEED)

VEED Fabric 1.0 — production-quality talking head video for content creators

audio-driven $0.0000/clip
MultiTalk AI Avatar

AI avatar with text-driven speech generation

lipsync $0.0067/clip
MuseTalk 1.5

MuseTalk 1.5 — real-time lip sync for video and images at $0.00111/s

lipsync $0.084/clip
Kling LipSync

Kling AI lip sync — match video lips to any audio

lipsync $0.084/clip
Kling LipSync (Text)

Kling AI lip sync — generate speech and sync lips in one step

lipsync $0.240/clip
Pixverse Lipsync

Cheapest lipsync model — talking head videos at just $0.04/second

audio-driven $0.300/clip
InfiniteTalk

Audio-driven lip sync on RunPod — reliable infrastructure for avatar generation

audio-driven $0.300/clip
InfiniteTalk (Kie.ai)

InfiniteTalk via Kie.ai -- audio-driven talking head alternative

lipsync $0.300/clip
Sync Lipsync 2.0

Good quality/price balance — clean lip sync at $0.05/second

lipsync $0.498/clip
Sync Lipsync 2.0 Pro

Sync Lipsync 2.0 Pro — professional-grade lip sync at $0.083/s

audio-driven $0.600/clip
LTX 2.3 Audio-to-Video
audio-driven $0.600/clip
Stable Avatar
audio-driven $0.960/clip
OmniHuman v1.5 (ByteDance)

State-of-the-art talking head videos — best emotional synchronization from a single photo

audio-driven $1.20/clip
WAN 2.2 Speech-to-Video

WAN 2.2 — generate talking head video from speech audio

audio-driven $1.20/clip
EchoMimic V3
audio-driven $1.50/clip
Kling Avatar Standard

Kling Avatar Standard — audio-driven talking head with natural lip sync at $0.25

audio-driven $2.40/clip
Kling Avatar Pro

Kling Avatar Pro — multi-character support with flat-rate pricing

Video

107 models
Text to Video $0.0000/clip
P-Video (Pruna)
Image to Video $0.0000/clip
P-Video I2V (Pruna)
Text to Video $0.0000/clip
HappyHorse 1.0 T2V
Image to Video $0.0000/clip
HappyHorse 1.0 I2V
Image to Video $0.0000/clip
HappyHorse 1.0 Reference-to-Video
Text to Video $0.0000/clip
Seedance 2.0 Fast
Video to Video $0.0060/clip
MMAudio V2 (Add Audio)

MMAudio V2 — add synchronized audio to any video at $0.001/s

Image to Video $0.042/clip
Vidu Q3 Turbo I2V

Fast budget-friendly image-to-video

Text to Video $0.042/clip
Vidu Q3 Turbo T2V

Fast budget-friendly text-to-video

Video to Video $0.042/clip
Mirelo SFX v1 (Add Sound Effects)

Automatic sound effects for video at $0.007/s

Image to Video $0.048/clip
LTX Video 13B Distilled I2V

Ultra-cheap distilled image-to-video at $0.04/video

Image to Video $0.060/clip
PixVerse V5.6 I2V

PixVerse V5.6 I2V — high-quality creative image animation at $0.45

Text to Video $0.060/clip
Kandinsky 5 Distilled T2V

Kandinsky 5 Distilled — faster Kandinsky at $0.05

Video to Video $0.060/clip
Sora 2 Watermark Remover

Sora 2 Watermark Remover — AI watermark removal from video at $0.05

Text to Video $0.072/clip
Runway Gen-4 (5s)

The most affordable professional video generation at just $0.06 per clip

Image to Video $0.084/clip
Vidu Q3 I2V

Vidu Q3 I2V — balanced image animation with per-second billing at $0.154/s

Text to Video $0.084/clip
Vidu Q3 T2V

Vidu Q3 T2V — balanced text-to-video with per-second billing at $0.154/s

Image to Video $0.096/clip
Hailuo 2.3 Fast I2V

Hailuo 2.3 Fast I2V — budget-friendly fast image animation at $0.08

Text to Video $0.096/clip
Hailuo 2.3 Fast

Fast video generation at a good value — 6-second clips at $0.08

Text to Video $0.096/clip
Kandinsky 5 T2V

Kandinsky 5 T2V — artistic video generation at $0.08

Upscale $0.120/clip
Topaz Video Upscale (fal.ai)
Image to Video $0.120/clip
Vidu Q2 Reference Pro

Vidu Q2 Reference Pro — reference-guided video generation at $0.30

Text to Video $0.120/clip
Grok Imagine T2V

Elo #3 video model — excellent quality-to-price ratio at $0.10/clip

Elo: 1237
Image to Video $0.120/clip
Grok Imagine I2V

Grok Imagine I2V — xAI's image-to-video with strong motion at $0.10

Upscale $0.120/clip
Topaz Video Upscale

Topaz Video Upscale — premium AI video upscaling with detail recovery at $0.10

Video to Video $0.144/clip
AI Face Swap (Video)

AI Face Swap Video — swap faces in video content at $0.12

Image to Video $0.180/clip
PixVerse V5.6 Transition

PixVerse V5.6 Transition — create smooth video transitions between images at $0.45

Text to Video $0.180/clip
Runway Gen-4 (10s)

Runway Gen-4 (10s) — extended 10-second generation for longer clips at $0.15

Text to Video $0.180/clip
Sora 2 (Kie.ai)

OpenAI Sora 2 — text-to-video with excellent prompt understanding and consistency

Text to Video $0.180/clip
Hailuo 02 Standard T2V

Hailuo 02 Standard T2V — balanced video generation at $0.15

Image to Video $0.180/clip
Hailuo 02 Standard I2V

Hailuo 02 Standard I2V — affordable image animation at $0.15

Text to Video $0.180/clip
PixVerse V5.6 T2V

PixVerse V5.6 T2V — high-quality creative video at $0.45

Video to Video $0.180/clip
Luma Modify (V2V)

Luma Modify — AI video editing and modifications at $0.15

Upscale $0.180/clip
Veo 3.1 1080p Upscale

Veo 3.1 1080p Upscale — upscale video to 1080p with Google's AI at $0.15

Image to Video $0.240/clip
WAN 2.2 A14B I2V Turbo

WAN 2.2 A14B I2V Turbo — fast 14B image animation at $0.20

Image to Video $0.240/clip
LTX-2 19B I2V

LTX-2 19B I2V — large-scale image animation with audio at $0.20

Image to Video $0.240/clip
Pika 2.2 I2V

Pika 2.2 image-to-video — animate any image into motion

Text to Video $0.240/clip
CogVideoX-5B

Open-source video model — 10-second clips for research and experimentation

Text to Video $0.240/clip
LTX-2 19B (T2V+Audio)

LTX-2 19B T2V — large-scale video model with integrated audio at $0.20

Text to Video $0.240/clip
WAN 2.2 A14B T2V Turbo

WAN 2.2 A14B Turbo T2V — fast 14B parameter video at $0.20

Text to Video $0.240/clip
Sora 2 Characters

Sora 2 Characters — generate video with consistent character identities at $0.20

Text to Video $0.240/clip
LTX-2 19B Distilled T2V

LTX-2 19B Distilled — faster video from distilled 19B model at $0.20

Text to Video $0.240/clip
Kandinsky 5 Pro T2V

Kandinsky 5 Pro T2V — premium Kandinsky at $0.04/s

Image to Video $0.240/clip
Kandinsky 5 Pro I2V

Kandinsky 5 Pro I2V — premium Kandinsky image animation at $0.04/s

Video to Video $0.240/clip
Runway Aleph V2V

Runway Aleph — next-gen video editing and transformation at $0.20

Text to Video $0.240/clip
ByteDance V1 Pro T2V

ByteDance V1 Pro T2V -- wide aspect ratio support (21:9 to 9:16)

Image to Video $0.270/clip
Hailuo 02 Standard I2V (fal.ai)
Image to Video $0.270/clip
Hailuo 02 Pro I2V

Hailuo 02 Pro I2V — premium image animation at $0.225

Text to Video $0.270/clip
Hailuo 02 Pro T2V

Hailuo 02 Pro T2V — premium Hailuo text-to-video at $0.225

Image to Video $0.300/clip
WAN 2.5

WAN 2.5 — balanced video model with improved coherence at $0.25

Video to Video $0.300/clip
Grok Imagine Edit Video

Grok Imagine Edit Video — AI-powered video editing from xAI at $0.07/s

Upscale $0.300/clip
Veo 3.1 4K Upscale

Veo 3.1 4K Upscale — upscale video to 4K with Google's AI at $0.25

Video to Video $0.300/clip
DreamActor V2

DreamActor V2 — AI-driven character performance and acting at $0.05/s

Text to Video $0.300/clip
Luma Ray 2 Flash

Luma Ray 2 Flash — faster, cheaper Luma video at ~$0.25 per generation

Image to Video $0.312/clip
Seedance v1.5 Pro

Image-to-video on RunPod — animate still images at 720p

Text to Video $0.360/clip
WAN 2.1 T2V

WAN 2.1 Text-to-Video — Alibaba's foundational video model with good style control at $0.30

Image to Video $0.360/clip
WAN 2.1 I2V

WAN 2.1 Image-to-Video — animate images with WAN's motion understanding at $0.30

Text to Video $0.360/clip
WAN 2.2 T2V

WAN 2.2 Text-to-Video — improved motion and prompt adherence at $0.30

Image to Video $0.360/clip
WAN 2.2 I2V

WAN 2.2 Image-to-Video — improved animation fidelity at $0.30

Text to Video $0.360/clip
WAN 2.6 T2V

Alibaba WAN 2.6 video — artistic style with good visual quality

Image to Video $0.360/clip
Sora 2

Sora 2 Image-to-Video — OpenAI's cinematic video model at $0.40

Video to Video $0.360/clip
Veo 3.1 Extend

Veo 3.1 Extend — extend videos with Google's latest architecture at $0.30

Image to Video $0.420/clip
WAN 2.2 I2V LoRA

WAN 2.2 I2V LoRA — customizable video animation with fine-tuning support at $0.35

Image to Video $0.420/clip
WAN 2.6 I2V

WAN 2.6 Image-to-Video — latest WAN with best motion quality at $0.35

Image to Video $0.420/clip
Wan Effects

Wan Effects — apply visual effects and transformations to video at $0.35

Image to Video $0.480/clip
WAN 2.2 A14B I2V (fal.ai)
Image to Video $0.480/clip
Lucy 14B I2V

Decart's 14B parameter image-to-video model

Text to Video $0.480/clip
Veo 3.1 Fast

Google Veo 3.1 — high-quality video with strong visual fidelity

Elo: 1233
Text to Video $0.480/clip
Sora 2 Pro T2V

Sora 2 Pro T2V — premium OpenAI text-to-video at $0.40

Text to Video $0.480/clip
Sora 2 Characters Pro

Sora 2 Characters Pro — premium character video with maximum consistency at $0.40

Text to Video $0.480/clip
Sora 2 Pro Storyboard

Sora 2 Pro Storyboard — generate video from storyboard panels at $0.40

Video to Video $0.504/clip
Kling O3 Std V2V Edit

Kling O3 Std V2V Edit — balanced video editing at $0.168/s

Video to Video $0.504/clip
Kling O3 Std V2V Reference

Kling O3 Std V2V Reference — balanced reference-guided video at $0.168/s

Text to Video $0.588/clip
Hailuo 2.3 Pro T2V

Hailuo 2.3 Pro Text-to-Video — premium MiniMax text-to-video at 1080p for $0.49

Image to Video $0.600/clip
Sora 2 Pro

Sora 2 Pro I2V — highest quality OpenAI video at $1.20 with 1080p output

Image to Video $0.600/clip
Seedance 2.0 Fast I2V

Seedance 2.0 Fast I2V — same I2V flow at ~20% lower per-second cost

Image to Video $0.600/clip
Seedance 2.0 Fast Reference

Seedance 2.0 Fast Reference — multi-modal reference-to-video at ~20% lower cost

Image to Video $0.600/clip
WAN 2.6 Ref-to-Video Flash

Fast reference-guided video generation from WAN 2.6

Image to Video $0.600/clip
WAN 2.2 A14B I2V LoRA

WAN 2.2 A14B image-to-video with LoRA style support

Text to Video $0.600/clip
Kling 3.0 Standard

Kling 3.0 Standard — good quality at lower price than Pro, up to 15 seconds

Text to Video $0.600/clip
Luma Ray 2 T2V

Luma Ray 2 Text-to-Video — cinematic video generation with resolution and duration tiers at $0.50

Image to Video $0.600/clip
Luma Ray 2 I2V

Luma Ray 2 Image-to-Video — animate images with cinematic quality, optional end frame at $0.50

Image to Video $0.600/clip
Hailuo 2.3 Pro I2V

Hailuo 2.3 Pro Image-to-Video — premium MiniMax video with resolution control at $0.50

Text to Video $0.672/clip
Kling O3 Pro T2V

Kling O3 Pro T2V — premium Kling text-to-video at $0.224/s

Video to Video $0.672/clip
Kling O3 Pro V2V Edit

Kling O3 Pro V2V Edit — premium video editing at $0.224/s

Video to Video $0.672/clip
Kling O3 Pro V2V Reference

Kling O3 Pro V2V Reference — transform video to match a reference style at $0.224/s

Image to Video $0.750/clip
Seedance 2.0 I2V

Seedance 2.0 I2V — animate a still image into 4–12 seconds of coherent motion

Image to Video $0.750/clip
Seedance 2.0 Reference

Seedance 2.0 Reference — multi-modal reference-to-video with up to 9 images, 3 videos, 3 audios

Text to Video $0.750/clip
Seedance 2.0
Text to Video $0.810/clip
Kling 3.0 Pro

Elo #1 video model — best motion quality, camera control, and up to 15 seconds

Elo: 1247
Image to Video $0.840/clip
Kling O3 Pro I2V

Kling O3 Pro I2V — premium image animation at $0.224/s

Image to Video $0.840/clip
Kling O3 Standard I2V

Kling O3 Standard I2V — balanced image animation at $0.168/s

Image to Video $0.840/clip
Kling O3 Standard Reference-to-Video

Kling O3 Standard Reference-to-Video — balanced reference video at $0.168/s

First/Last Frame $0.840/clip
Kling V3 (First+Last Frame)

Kling V3 First+Last Frame — generate video transitions between two keyframes at $0.168/s

Text to Video $0.840/clip
Kling O3 Standard T2V

Kling O3 Standard T2V — balanced Kling O3 at $0.168/s

Text to Video $0.840/clip
Kling V3 Pro T2V

Kling V3 Pro T2V — premium latest-gen Kling at $0.224/s

Text to Video $0.840/clip
Kling V3 Standard T2V

Kling V3 Standard T2V — latest Kling generation at $0.168/s

Video to Video $0.840/clip
Bria Video BG Removal

Bria Video BG Removal — remove backgrounds from video at $0.14/s

First/Last Frame $0.900/clip
Veo 3.1 Fast (First+Last Frame)

Veo 3.1 Fast First+Last Frame — Google's video transitions at $0.10/s

Image to Video $0.960/clip
WAN 2.1 Pro I2V

WAN 2.1 Pro I2V — premium WAN image animation at $0.80

Text to Video $0.990/clip
Seedance 2.0 Fast T2V

Seedance 2.0 Fast T2V — same model family at ~20% lower cost per second

Text to Video $1.23/clip
Seedance 2.0 T2V

ByteDance Seedance 2.0 — strong motion quality and temporal coherence for text-to-video

Text to Video $2.40/clip
Veo 3.1 Quality

The only video model with built-in audio — 1080p + synced audio in one generation

First/Last Frame $2.40/clip
Veo 3.1 (First+Last Frame)

Google's quality first+last frame video generation

Image to Video $2.40/clip
Veo 3.1 Reference-to-Video

Google's reference-guided video generation

Image to Video $2.70/clip
Kling V3 Pro I2V

Kling V3 Pro I2V — premium latest-gen image animation at $0.224/s

Image to Video $2.70/clip
Kling O3 Pro Reference-to-Video

Kling O3 Pro Reference-to-Video — generate video from reference clips at $0.224/s

Voice

22 models
Speech to Text $0.0012/req
Faster Whisper (STT)

Faster Whisper STT — fast, accurate speech-to-text at $0.001 (self-hosted)

Text to Speech $0.0040/req
VibeVoice

VibeVoice — multi-speaker dialogue with up to 4 voices and script format at $0.04/min

Speech to Text $0.0048/req
Speech-to-Text Turbo

Speech-to-Text Turbo — fast transcription at $0.0008/s

Text to Speech $0.0060/req
Qwen3-TTS 1.7B

Voice Designer — describe a voice in text and generate speech

Sound Effects $0.0072/req
ElevenLabs Sound Effects

AI sound effects — generate any sound from a text description

Text to Speech $0.0096/req
IndexTTS2

Best raw voice cloning fidelity — preserves unique voice character

Sound Effects $0.012/req
Stable Audio Open

Free audio generation — open-source text-to-audio at zero cost

Text to Speech $0.012/req
Maya TTS

Maya TTS — emotion-rich speech with laugh, whisper, and cry tags at $0.002/sec

Speech to Text $0.021/req
ElevenLabs STT

ElevenLabs speech-to-text — per-minute pricing better for long-form audio

Processing $0.030/req
ElevenLabs Audio Isolation

Audio Isolation — extract clean voice from noisy audio at $0.025

Processing $0.030/req
ElevenLabs Voice Changer

Voice Changer — transform voice characteristics in real-time at $0.005/s

Text to Speech $0.030/req
Chatterbox Multilingual

Chatterbox Multilingual — 23-language TTS with voice cloning at $0.025/1K chars

Text to Speech $0.060/req
MiniMax Speech HD

HD multilingual TTS — good quality across multiple languages

Speech to Text $0.060/req
Whisper V3 (STT)

Industry-standard speech-to-text — 90+ languages with excellent accuracy

Text to Speech $0.060/req
ElevenLabs TTS Turbo 2.5

Premium Turbo 2.5 — fastest premium voice at $0.05

Text to Speech $0.060/req
ElevenLabs TTS Multilingual V2

Premium Multilingual V2 — 29-language voice synthesis at $0.05

Processing $0.060/req
Chatterbox Speech-to-Speech

Chatterbox Speech-to-Speech — transform voice style while preserving content at $0.05

Text to Speech $0.060/req
MiniMax Speech 2.8 HD

MiniMax Speech 2.8 HD — high-quality speech with interjection tags and 40+ languages at $0.05

Text to Speech $0.072/req
MiniMax Speech Turbo

MiniMax Speech Turbo — fast voice synthesis with good quality at $0.05

Text to Speech $0.084/req
ElevenLabs TTS V3

Premium TTS V3 — the gold standard of AI text-to-speech

Processing $0.090/req
ElevenLabs Dubbing

AI Dubbing — AI-powered video/audio dubbing at $0.015/s

Text to Speech $0.120/req
MiniMax Speech HD

MiniMax Speech HD — high-definition voice synthesis with natural prosody at $0.05

298 models. One API.

$10 gets you started. No subscription. See the exact cost before every generation.
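
As a rough illustration of what a $10 balance covers, the sketch below divides a budget by a few of the per-generation prices listed on this page. It assumes each generation is billed at the flat listed price; models billed per second or by resolution tier may cost more per clip. The function name `generations_for_budget` is made up for this example and is not part of any FairStack API.

```python
from decimal import Decimal

# Listed prices copied from this page (USD per image or clip).
# Assumption: each generation is billed at exactly the listed flat price.
LISTED_PRICES = {
    "SANA Sprint (NVIDIA)": Decimal("0.0030"),
    "Ideogram V3": Decimal("0.021"),
    "Nano Banana Pro": Decimal("0.108"),
    "Kling 3.0 Pro": Decimal("0.810"),
}


def generations_for_budget(budget: Decimal, unit_price: Decimal) -> int:
    """Return how many whole generations a budget covers at a flat unit price."""
    if unit_price <= 0:
        raise ValueError("unit price must be positive")
    # Floor division on Decimal avoids binary floating-point rounding surprises.
    return int(budget // unit_price)


if __name__ == "__main__":
    budget = Decimal("10")
    for name, price in LISTED_PRICES.items():
        count = generations_for_budget(budget, price)
        print(f"{name}: {count} generations for ${budget}")
```

Under that flat-price assumption, $10 covers 3333 SANA Sprint images, 476 Ideogram V3 images, 92 Nano Banana Pro images, or 12 Kling 3.0 Pro clips.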
