Image
112 modelsDDColor — fast automatic colorization of grayscale images at $0.001
ESRGAN — classic AI upscaling at $0.001 for basic resolution enhancement
CodeFormer — AI face restoration that recovers detail in damaged or low-res faces at $0.002
NVIDIA SANA Sprint — the cheapest image model at $0.0025 with sub-second generation
The fastest, cheapest image generation at $0.004/image — perfect for prototyping
GPT Image 1 Mini — the most affordable GPT Image model, starting at $0.005 per image
FLUX 2 Klein Realtime — instant image editing at $0.005 for live workflows
Confidential frontier model — strong all-round quality with excellent layout
Accurate background removal — clean cutouts at $0.005/image
FLUX 2 Turbo — fast, affordable FLUX 2 at $0.008 for quick generation
AI face swap — seamlessly replace faces in any image with natural blending
Confidential frontier editing model — precise edits with strong identity preservation
FLUX 2 Klein 4B — ultra-fast, ultra-cheap generation at $0.005 for rapid prototyping
HiDream Fast — quick, affordable generation at $0.01 for rapid workflows
Budget AI upscaling — fast and affordable at $0.01/image
NVIDIA SANA 1.5 — improved quality over Sprint at $0.005, still ultra-fast
FLUX 2 Klein 9B — balanced speed and quality in a compact model at $0.006
Budget LoRA model — cheapest fine-tunable image generation at $0.02
Best text rendering of any image model — logos, typography, and design-centric output
Ideogram V3 Reframe — change aspect ratios and extend images intelligently at $0.018
FLUX 2 Flex I2I — versatile image editing with flexible control at $0.02
Elo #5 globally — strong aesthetic quality with excellent price-to-quality ratio
xAI's editing model — aesthetic-preserving edits with strong style consistency
Original GPT Image model — balanced capabilities with good text rendering
FLUX 2 Flex — good quality image generation with flexible styling at $0.02
FLUX 2 Flash — the fastest FLUX 2 model at $0.005 for instant generation
The most precise image editing model — edit specific elements while preserving everything else
FLUX 2 Pro Edit — professional-grade editing with strong identity preservation
Photorealism-focused image-to-image editing via FLUX Krea
Image remix and variation via FLUX Krea Redux
GenFocus Refocus — AI-powered depth-of-field control for any image at $0.025
GenFocus All-in-Focus — remove depth blur and make entire image sharp at $0.025
FLUX.2 Pro — next-gen image quality from Black Forest Labs with exceptional versatility
FLUX 2 Pro — premium image generation with exceptional style versatility
BEN V2 — advanced background removal with fine edge detail at $0.025
ByteDance's latest lightweight image model with web search and reasoning
Image editing with ByteDance's Seedream 5.0 Lite
Kling Image O3 Edit — photorealistic editing from the Kling ecosystem at $0.028
Kling Image V3 Edit — latest Kling editing with improved quality at $0.028
Strong photorealism from the Kling ecosystem — pairs well with Kling video models
Kling Image V3 — strong photorealism from the Kling ecosystem at $0.028
Ideogram V3 Edit — precise editing with the best text rendering of any edit model
Ideogram V3 Remix — style transfer and creative remixing with text preservation
Ideogram V3 Background Replace — AI-powered background swap with design awareness
Alibaba WAN 2.6 — strong photorealism with natural compositions at $0.02
Balanced all-rounder — good for general-purpose generation and style exploration
Clarity Upscaler — creative AI upscaling with noise removal and detail enhancement at $0.03
Latest ByteDance image model — improved photorealism and detail
Latest ByteDance edit model — strong identity preservation with unified architecture
Gemini 2.5 Flash Image — Google's fast multimodal image generation at $0.039
Bria FIBO Edit — commercially safe general editing with IP-compliant output at $0.04
Bria FIBO Add Object — seamlessly insert objects into images at $0.04
Bria FIBO Blend — merge multiple images into seamless compositions at $0.04
Bria FIBO Colorize — add realistic color to black-and-white images at $0.04
Bria FIBO Erase — cleanly remove objects from images with intelligent inpainting at $0.04
Bria FIBO Relight — change lighting conditions in any image at $0.04
Bria FIBO Replace — swap objects in images with AI-generated replacements at $0.04
Bria FIBO Reseason — change the season in outdoor images at $0.04
Bria FIBO Restore — repair damaged, degraded, or old photographs at $0.04
Bria FIBO Restyle — transform image style while preserving content at $0.04
Bria FIBO Rewrite Text — change text within images while preserving fonts and style at $0.04
Bria FIBO Sketch to Image — turn rough sketches into polished images at $0.04
AI background replacement — swap backgrounds in one step with natural blending
Recraft V4 Image — design-focused generation with precise color control and text rendering at $0.04
Fast, affordable image generation with good quality
Fast, affordable image editing with Nano Banana 2
Excellent artistic style — the go-to model for illustrations and mood pieces
Professional design quality — strong text rendering and style control for marketing assets
Bria 3.2 — commercially safe image generation with built-in content licensing
Commercially-licensed T2I with structured prompt support
High-quality single-image editing from Reve
Multi-image remix combining up to 6 reference images
ImagineArt 1.5 Pro — creative generation with artistic flair at $0.045
GLM Image Edit — solid general-purpose editing from Zhipu AI at $0.05
Premium AI upscaling — enhance image resolution with exceptional detail recovery
GLM Image — Zhipu's image model with solid all-round capabilities at $0.05
GOT-OCR V2 — AI-powered text extraction from any image at $0.05
Crisp upscaling optimized for illustrations and graphics
Grok Imagine Upscale — AI-powered upscaling with detail synthesis at $0.05
Topaz AI upscaling via fal.ai (fallback)
HiDream Full — balanced all-rounder with good quality across dimensions at $0.03
Character-focused editing with Ideogram text rendering — edit while preserving character identity
Character remix — reimagine characters in new styles while preserving identity
GPT Image 2 edit — inpainting, outpainting, and mask-based editing with 16 reference images
OpenAI's next-gen image model — near-perfect text rendering with full quality and resolution control
Google's highest quality model — Elo #6 globally with the best photorealism
Character-focused generation with Ideogram's text rendering — consistent characters with readable text
FLUX 1.1 Pro Ultra — premium previous-gen model with high resolution output
Ideogram Upscale — design-aware upscaling that preserves text and graphic elements at $0.06
Stable Diffusion 3.5 — open-source model with ControlNet compatibility
FLUX 2 Max — highest quality in the FLUX 2 family at $0.07
Premium Qwen Edit — best editing quality in the Qwen family at $0.025
Best Qwen variant — strongest text and layout combo with multilingual support
Lumina Image V2 — high-fidelity generation with strong composition at $0.075
Ideogram V2 Edit — previous-gen editing with good text handling at $0.08
Ideogram V2 Remix — previous-gen style remixing at $0.08
FLUX Kontext Max — highest quality editing with maximum identity preservation at $0.08
Next-gen text-to-vector with commercial licensing
Tencent Hunyuan Edit — artistic editing with good detail preservation at $0.09
Premium editing with layout precision — edit images while maintaining spatial accuracy
Elo #2 globally — unmatched layout precision and multi-element composition
Tencent Hunyuan 3.0 — strong artistic capabilities with good detail at $0.09
Instant Character — generate consistent characters from a single reference at $0.10
OmniGen V1 — unified generation model handling text-to-image and editing in one architecture
Longcat Image — multi-character scene generation with narrative coherence at $0.13
Emu 3.5 Edit — Meta's image editing model with natural language instructions at $0.15
Google Gemini 3 Pro — multimodal AI image generation with deep semantic understanding
Meta Emu 3.5 — multimodal generation with semantic understanding at $0.15
Google's premium T2I model via fal.ai (fallback)
Google's premium edit model via fal.ai (fallback)
Recraft V4 Pro Image — premium design generation with top-tier color and text fidelity at $0.25
Premium text-to-vector with highest quality outputs
Music
30 modelsSuno Generate Lyrics — AI-powered songwriting at $0.10
Suno Generate MIDI — export AI-generated music as MIDI data at $0.10
Suno Generate Persona — create a custom AI music artist persona at $0.10
Suno Music Cover — generate AI covers of songs at $0.10
Self-hosted music generation with lyrics and vocals
Suno Boost Style — enhance the production quality and style of a song at $0.10
Suno Convert to WAV — convert Suno output to high-quality WAV at $0.10
Suno Timestamped Lyrics — generate lyrics with precise timing at $0.10
Mureka Lyrics — AI lyrics generation at $0.009
Suno Music Video — generate music video from a song at $0.20
Mureka Recognize — identify songs from audio at $0.01
Suno Replace Section — replace a specific section of a song with AI at $0.10
Mureka BGM — background music generation at $0.03
MiniMax Music V2 — full song generation with lyrics, section tags, and vocals at ~$0.10
Mureka Extend — extend songs with AI continuation at $0.036
Mureka V9 — newest flagship music generation model
Mureka V8 — the latest and highest quality AI song generation model
Suno Stem Separation — split audio into vocals, drums, bass, and other tracks at $0.05
Beatoven — AI music composition for video and content at $0.05
Suno — full song generation with vocals, instruments, and production from a text prompt
Suno Extend — continue and extend existing songs at $0.10
Suno Add Vocals — add AI vocal tracks to instrumental music at $0.10
Suno Add Instrumental — add AI instrumental backing to vocal tracks at $0.10
Suno Mashup — blend multiple songs into a cohesive mashup at $0.10
Suno Upload & Cover — upload a song and generate an AI cover at $0.10
Suno Upload & Extend — upload a song and extend it with AI at $0.10
Mureka Stem Separation — extract individual instrument tracks at $0.06
Mureka Custom Model — train custom music models at $0.06
Mureka Describe — AI analysis of song content and style at $0.10
Lyria 2 — Google's latest music generation model at ~$0.10 per track
Talking Head
20 modelsLongcat Single Avatar — single-character talking head with body motion
Longcat Multi Avatar — multi-character talking head with scene interaction
Studio-grade talking head videos designed for ads and commercial production
VEED Fabric 1.0 — production-quality talking head video for content creators
AI avatar with text-driven speech generation
MuseTalk 1.5 — real-time lip sync for video and images at $0.00111/s
Kling AI lip sync — match video lips to any audio
Kling AI lip sync — generate speech and sync lips in one step
Cheapest lipsync model — talking head videos at just $0.04/second
Audio-driven lip sync on RunPod — reliable infrastructure for avatar generation
InfiniteTalk via Kie.ai -- audio-driven talking head alternative
Good quality/price balance — clean lip sync at $0.05/second
Sync Lipsync 2.0 Pro — professional-grade lip sync at $0.083/s
State-of-the-art talking head videos — best emotional synchronization from a single photo
WAN 2.2 — generate talking head video from speech audio
Kling Avatar Standard — audio-driven talking head with natural lip sync at $0.25
Kling Avatar Pro — multi-character support with flat-rate pricing
3D
7 modelsHunyuan 3D Pro I2-3D — premium image-to-3D from Tencent at $0.375
Hunyuan 3D Rapid T2-3D — fast text-to-3D at $0.225
Hunyuan 3D Rapid I2-3D — fast image-to-3D at $0.225
Trellis 3D — budget text-to-3D at $0.02 for rapid prototyping
Hunyuan 3D Pro T2-3D — premium text-to-3D from Tencent at $0.375
Professional text-to-3D — generate production-ready 3D models from text descriptions
Meshy V6 Image-to-3D — convert images into 3D models at $0.80
Video
107 modelsMMAudio V2 — add synchronized audio to any video at $0.001/s
Fast budget-friendly image-to-video
Fast budget-friendly text-to-video
Automatic sound effects for video at $0.007/s
Ultra-cheap distilled image-to-video at $0.04/video
PixVerse V5.6 I2V — high-quality creative image animation at $0.45
Kandinsky 5 Distilled — faster Kandinsky at $0.05
Sora 2 Watermark Remover — AI watermark removal from video at $0.05
The most affordable professional video generation at just $0.06 per clip
Vidu Q3 I2V — balanced image animation with per-second billing at $0.154/s
Vidu Q3 T2V — balanced text-to-video with per-second billing at $0.154/s
Hailuo 2.3 Fast I2V — budget-friendly fast image animation at $0.08
Fast video generation at a good value — 6-second clips at $0.08
Kandinsky 5 T2V — artistic video generation at $0.08
Vidu Q2 Reference Pro — reference-guided video generation at $0.30
Elo #3 video model — excellent quality-to-price ratio at $0.10/clip
Grok Imagine I2V — xAI's image-to-video with strong motion at $0.10
Topaz Video Upscale — premium AI video upscaling with detail recovery at $0.10
AI Face Swap Video — swap faces in video content at $0.12
PixVerse V5.6 Transition — create smooth video transitions between images at $0.45
Runway Gen-4 (10s) — extended 10-second generation for longer clips at $0.15
OpenAI Sora 2 — text-to-video with excellent prompt understanding and consistency
Hailuo 02 Standard T2V — balanced video generation at $0.15
Hailuo 02 Standard I2V — affordable image animation at $0.15
PixVerse V5.6 T2V — high-quality creative video at $0.45
Luma Modify — AI video editing and modifications at $0.15
Veo 3.1 1080p Upscale — upscale video to 1080p with Google's AI at $0.15
WAN 2.2 A14B I2V Turbo — fast 14B image animation at $0.20
LTX-2 19B I2V — large-scale image animation with audio at $0.20
Pika 2.2 image-to-video — animate any image into motion
Open-source video model — 10-second clips for research and experimentation
LTX-2 19B T2V — large-scale video model with integrated audio at $0.20
WAN 2.2 A14B Turbo T2V — fast 14B parameter video at $0.20
Sora 2 Characters — generate video with consistent character identities at $0.20
LTX-2 19B Distilled — faster video from distilled 19B model at $0.20
Kandinsky 5 Pro T2V — premium Kandinsky at $0.04/s
Kandinsky 5 Pro I2V — premium Kandinsky image animation at $0.04/s
Runway Aleph — next-gen video editing and transformation at $0.20
ByteDance V1 Pro T2V -- wide aspect ratio support (21:9 to 9:16)
Hailuo 02 Pro I2V — premium image animation at $0.225
Hailuo 02 Pro T2V — premium Hailuo text-to-video at $0.225
WAN 2.5 — balanced video model with improved coherence at $0.25
Grok Imagine Edit Video — AI-powered video editing from xAI at $0.07/s
Veo 3.1 4K Upscale — upscale video to 4K with Google's AI at $0.25
DreamActor V2 — AI-driven character performance and acting at $0.05/s
Luma Ray 2 Flash — faster, cheaper Luma video at ~$0.25 per generation
Image-to-video on RunPod — animate still images at 720p
WAN 2.1 Text-to-Video — Alibaba's foundational video model with good style control at $0.30
WAN 2.1 Image-to-Video — animate images with WAN's motion understanding at $0.30
WAN 2.2 Text-to-Video — improved motion and prompt adherence at $0.30
WAN 2.2 Image-to-Video — improved animation fidelity at $0.30
Alibaba WAN 2.6 video — artistic style with good visual quality
Sora 2 Image-to-Video — OpenAI's cinematic video model at $0.40
Veo 3.1 Extend — extend videos with Google's latest architecture at $0.30
WAN 2.2 I2V LoRA — customizable video animation with fine-tuning support at $0.35
WAN 2.6 Image-to-Video — latest WAN with best motion quality at $0.35
Wan Effects — apply visual effects and transformations to video at $0.35
Decart's 14B parameter image-to-video model
Google Veo 3.1 — high-quality video with strong visual fidelity
Sora 2 Pro T2V — premium OpenAI text-to-video at $0.40
Sora 2 Characters Pro — premium character video with maximum consistency at $0.40
Sora 2 Pro Storyboard — generate video from storyboard panels at $0.40
Kling O3 Std V2V Edit — balanced video editing at $0.168/s
Kling O3 Std V2V Reference — balanced reference-guided video at $0.168/s
Hailuo 2.3 Pro Text-to-Video — premium MiniMax text-to-video at 1080p for $0.49
Sora 2 Pro I2V — highest quality OpenAI video at $1.20 with 1080p output
Seedance 2.0 Fast I2V — same I2V flow at ~20% lower per-second cost
Seedance 2.0 Fast Reference — multi-modal reference-to-video at ~20% lower cost
Fast reference-guided video generation from WAN 2.6
WAN 2.2 A14B image-to-video with LoRA style support
Kling 3.0 Standard — good quality at lower price than Pro, up to 15 seconds
Luma Ray 2 Text-to-Video — cinematic video generation with resolution and duration tiers at $0.50
Luma Ray 2 Image-to-Video — animate images with cinematic quality, optional end frame at $0.50
Hailuo 2.3 Pro Image-to-Video — premium MiniMax video with resolution control at $0.50
Kling O3 Pro T2V — premium Kling text-to-video at $0.224/s
Kling O3 Pro V2V Edit — premium video editing at $0.224/s
Kling O3 Pro V2V Reference — transform video to match a reference style at $0.224/s
Seedance 2.0 I2V — animate a still image into 4–12 seconds of coherent motion
Seedance 2.0 Reference — multi-modal reference-to-video with up to 9 images, 3 videos, 3 audios
Elo #1 video model — best motion quality, camera control, and up to 15 seconds
Kling O3 Pro I2V — premium image animation at $0.224/s
Kling O3 Standard I2V — balanced image animation at $0.168/s
Kling O3 Standard Reference-to-Video — balanced reference video at $0.168/s
Kling V3 First+Last Frame — generate video transitions between two keyframes at $0.168/s
Kling O3 Standard T2V — balanced Kling O3 at $0.168/s
Kling V3 Pro T2V — premium latest-gen Kling at $0.224/s
Kling V3 Standard T2V — latest Kling generation at $0.168/s
Bria Video BG Removal — remove backgrounds from video at $0.14/s
Veo 3.1 Fast First+Last Frame — Google's video transitions at $0.10/s
WAN 2.1 Pro I2V — premium WAN image animation at $0.80
Seedance 2.0 Fast T2V — same model family at ~20% lower cost per second
ByteDance Seedance 2.0 — strong motion quality and temporal coherence for text-to-video
The only video model with built-in audio — 1080p + synced audio in one generation
Google's quality first+last frame video generation
Google's reference-guided video generation
Kling V3 Pro I2V — premium latest-gen image animation at $0.224/s
Kling O3 Pro Reference-to-Video — generate video from reference clips at $0.224/s
Voice
22 modelsFaster Whisper STT — fast, accurate speech-to-text at $0.001 (self-hosted)
VibeVoice — multi-speaker dialogue with up to 4 voices and script format at $0.04/min
Speech-to-Text Turbo — fast transcription at $0.0008/s
Voice Designer — describe a voice in text and generate speech
AI sound effects — generate any sound from a text description
Best raw voice cloning fidelity — preserves unique voice character
Free audio generation — open-source text-to-audio at zero cost
Maya TTS — emotion-rich speech with laugh, whisper, and cry tags at $0.002/sec
ElevenLabs speech-to-text — per-minute pricing better for long-form audio
Audio Isolation — extract clean voice from noisy audio at $0.025
Voice Changer — transform voice characteristics in real-time at $0.005/s
Chatterbox Multilingual — 23-language TTS with voice cloning at $0.025/1K chars
HD multilingual TTS — good quality across multiple languages
Industry-standard speech-to-text — 90+ languages with excellent accuracy
Premium Turbo 2.5 — fastest premium voice at $0.05
Premium Multilingual V2 — 29-language voice synthesis at $0.05
Chatterbox Speech-to-Speech — transform voice style while preserving content at $0.05
MiniMax Speech 2.8 HD — high-quality speech with interjection tags and 40+ languages at $0.05
MiniMax Speech Turbo — fast voice synthesis with good quality at $0.05
Premium TTS V3 — the gold standard of AI text-to-speech
AI Dubbing — AI-powered video/audio dubbing at $0.015/s
MiniMax Speech HD — high-definition voice synthesis with natural prosody at $0.05
298 models. One API.
$10 gets you started. No subscription. See the exact cost before every generation.
Start Creating