These models generate videos from text prompts, images, and reference materials. The field is advancing fast — most models now generate native audio alongside video.
Runway Gen-4.5 is the top-rated video generation model, ranked #1 on the Artificial Analysis text-to-video benchmark. It produces videos with realistic physics — objects have weight, liquids flow naturally, and fine details like hair and fabric stay coherent across frames. Great for polished, cinematic clips where visual fidelity matters most.
Google Veo 3.1 and Veo 3.1 Fast are strong alternatives with native audio generation. Veo 3.1 Fast is a good pick when you want high quality with quicker turnaround. Veo 3.1 Lite is a more affordable option for high-volume use.
Kling Video 3.0 generates cinematic videos up to 15 seconds with native audio — including lip-synced dialogue, sound effects, and ambient sound. Its multi-shot mode lets you define up to 6 connected scenes in a single generation, making it ideal for short narratives, product demos, and ads.
Kling Video 3.0 Omni adds reference-based generation and video editing on top. Upload reference images to keep character appearance consistent across scenes, or feed in a reference video for style and camera movement transfer.
Seedance 2.0 from ByteDance accepts up to 9 reference images, 3 video clips, and 3 audio files — all combinable in your prompt. Supports T2V, I2V, video continuation, character consistency, motion transfer, and lip-synced dialogue with intelligent duration control. Seedance 2.0 Fast trades some quality for speed.
Seedance 1.5 Pro offers cinema-quality output with multi-language lip-sync and cinematic camera movements.
Grok Imagine Video from xAI generates short video clips with synchronized audio in around 30 seconds. Multiple aspect ratios (16:9, 9:16, 1:1) make it a natural fit for TikTok, Reels, and Shorts.
Vidu Q3 Pro supports a start-end-to-video mode — provide first and last frames and it generates smooth transitions between them. Up to 16 seconds at 1080p with audio. Vidu Q3 Turbo is a faster, cheaper variant.
Hailuo 2.3 from Minimax supports both text-to-video and image-to-video with standard and pro quality tiers. Hailuo 2.3 Fast trades some quality for speed.
PixVerse v5.6 is another cost-effective choice with unit-based pricing.
PrunaAI p-video offers T2V, I2V, and audio-to-video in a single endpoint. Its draft mode generates previews 4× faster for quick iteration before final rendering. Up to 1080p at 48 FPS.
The Wan video models are excellent open-source options, competitive with many proprietary models. Wan 2.7 T2V is the newest generation with a 27 billion parameter MoE architecture. Wan 2.5 T2V and the fast variants (Wan 2.5 T2V Fast, Wan 2.5 I2V Fast) are among the quickest options on Replicate.
Generative video is a rapidly advancing field. Check out the arena and leaderboard at Artificial Analysis to see what's popular today.
Featured models
prunaai/p-videoFast video generation with built-in draft mode for rapid creative iteration. Text-to-video, image-to-video, and audio-to-video in a single endpoint.
Updated 1 day, 5 hours ago
670.5K runs
PixVerse's flagship video generation model. Generate cinematic videos with synchronized audio, multi-shot sequences, and precise camera control.
Updated 2 days, 22 hours ago
1K runs
bytedance/seedance-2.0ByteDance's multimodal video generation model with native audio, multimodal reference inputs, and intelligent duration control.
Updated 3 days, 4 hours ago
95.9K runs
VEED Fabric 1.0 is an image-to-video API that turns any image into a talking video
Updated 2 months, 1 week ago
24.4K runs
Recommended Models
The Wan fast variants are among the quickest text-to-video options. Grok Imagine Video generates clips with audio in about 30 seconds. PrunaAI p-video has a draft mode that generates previews 4x faster for quick iteration. Seedance 2.0 Fast and Seedance 1 Pro Fast are speed-optimized variants of their respective models.
Hailuo 2.3 supports both text-to-video and image-to-video with standard and pro quality tiers. PixVerse v5.6 uses unit-based pricing that keeps shorter, lower-resolution videos affordable. The Wan open-source models are the cheapest option overall.
Runway Gen-4.5 is ranked #1 on the Artificial Analysis benchmark for realistic physics and visual fidelity. Google Veo 3.1 is another top choice, especially with its native audio generation.
Most current-generation models generate audio alongside video: Kling Video 3.0, Seedance 2.0, Veo 3.1, Grok Imagine Video, Vidu Q3 Pro, Wan 2.5 T2V, and PrunaAI p-video all generate synchronized audio.
Kling Video 3.0 supports multi-shot mode with up to 6 connected scenes in a single generation. Seedance 2.0 supports video continuation for building longer sequences.
The Wan video models are the strongest open-source option. Wan 2.7 T2V is the newest with a 27B parameter MoE architecture. Wan 2.5 T2V Fast is great for speed.
Most models produce 5-15 second clips. Kling Video 3.0 and Seedance 2.0 go up to 15 seconds. Vidu Q3 Pro goes up to 16 seconds. For longer content, use video extension models like Grok Imagine Video Extension to chain clips together.
Yes — most models support commercial use. Always check the license on the model page, especially for open-source models.
Recommended Models
Kling Video 3.0 Omni: Unified multimodal video generation with reference images, video editing, native audio, and multi-shot control
Updated 1 day, 22 hours ago
407.8K runs
Kling Video 3.0: Generate cinematic videos up to 15 seconds with multi-shot control, native audio, and improved consistency
Updated 1 day, 22 hours ago
162.7K runs
bytedance/seedance-2.0-fastA faster variant of Seedance 2.0 for quicker video generation with multimodal inputs and native audio.
Updated 3 days, 4 hours ago
21.2K runs
Kling 2.5 Turbo Pro: Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and remarkable prompt adherence.
Updated 2 weeks ago
2.5M runs
New and improved version of Veo 3 Fast, with higher-fidelity video, context-aware audio and last frame support
Updated 4 weeks, 2 days ago
584.7K runs
New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame support
Updated 4 weeks, 2 days ago
455.3K runs
Generate videos using xAI's Grok Imagine Video model
Updated 1 month, 4 weeks ago
596K runs
runwayml/gen-4.5State-of-the-art video motion quality, prompt adherence and visual fidelity
Updated 2 months, 1 week ago
126.5K runs
Modify an existing video through natural-language commands, changing subjects, environments, and visual style while preserving the original motion and timing.
Updated 2 months, 1 week ago
9.3K runs
bytedance/dreamactor-m2.0Animate any character, humans, cartoons, animals, even non-humans, from a single image + driving video
Updated 2 months, 2 weeks ago
10.6K runs
Latest video model from Pixverse with astonishing physics
Updated 2 months, 3 weeks ago
19.1K runs

openai/sora-2-proOpenAI's Most advanced synced-audio video generation
Updated 3 months ago
108.1K runs

openai/sora-2OpenAI's Flagship video generation with synced audio
Updated 3 months ago
300.9K runs
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B text-to-video
Updated 3 months, 1 week ago
268.3K runs
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B image-to-video
Updated 3 months, 1 week ago
10.5M runs
Kling 2.6 Pro: Top-tier image-to-video with cinematic visuals, fluid motion, and native audio generation
Updated 3 months, 3 weeks ago
586K runs
Alibaba Wan 2.5 text to video generation model
Updated 4 months, 2 weeks ago
34.1K runs
Alibaba Wan 2.5 Image to video generation with background audio
Updated 4 months, 2 weeks ago
208.9K runs

Sound on: Google’s flagship Veo 3 text to video model, with audio
Updated 4 months, 4 weeks ago
228.8K runs

A faster and cheaper version of Google’s Veo 3 video model, with audio
Updated 4 months, 4 weeks ago
187.5K runs

State of the art video generation model. Veo 2 can faithfully follow simple and complex instructions, and convincingly simulates real-world physics as well as a wide range of visual styles.
Updated 4 months, 4 weeks ago
107.7K runs

Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p
Updated 5 months ago
44.2K runs

Quickly make 5s or 8s videos at 540p, 720p or 1080p. It has enhanced motion, prompt coherence and handles complex actions well.
Updated 5 months ago
258.6K runs

Create 5s-8s videos with enhanced character movement, visual effects, and exclusive 1080p-8s support. Optimized for anime characters and complex actions
Updated 5 months ago
778.2K runs
Wan 2.5 text-to-video, optimized for speed
Updated 5 months ago
48.7K runs

Accelerated inference for Wan 2.1 14B text to video with high resolution, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 5 months ago
36.7K runs

Accelerated inference for Wan 2.1 14B image to video with high resolution, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 5 months ago
88.3K runs

Accelerated inference for Wan 2.1 14B image to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 5 months ago
447.4K runs

bytedance/seedance-1-proA pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution
Updated 5 months, 2 weeks ago
1.9M runs

bytedance/seedance-1-liteA video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution
Updated 5 months, 2 weeks ago
3.2M runs
bytedance/seedance-1-pro-fastA faster and cheaper version of Seedance 1 Pro
Updated 5 months, 2 weeks ago
1.3M runs

Create 5s 480p videos from a text prompt
Updated 5 months, 2 weeks ago
11.1K runs

Generate 5s and 10s videos in 720p resolution
Updated 5 months, 2 weeks ago
96K runs

Generate 5s and 10s videos in 1080p resolution
Updated 5 months, 2 weeks ago
824.5K runs

A premium version of Kling v2.1 with superb dynamics and prompt adherence. Generate 1080p 5s and 10s videos from text or an image
Updated 5 months, 2 weeks ago
98.8K runs

Generate 5s and 10s videos in 720p resolution at 30fps
Updated 5 months, 2 weeks ago
1.6M runs

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)
Updated 5 months, 2 weeks ago
3.9M runs

luma/ray-2-540pGenerate 5s and 9s 540p videos
Updated 5 months, 2 weeks ago
11.7K runs

luma/ray-2-720pGenerate 5s and 9s 720p videos
Updated 5 months, 2 weeks ago
40.2K runs
Wan 2.5 image-to-video, optimized for speed
Updated 5 months, 2 weeks ago
64.1K runs

Accelerated inference for Wan 2.1 14B text to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 5 months, 2 weeks ago
190.8K runs

luma/ray-flash-2-720pGenerate 5s and 9s 720p videos, faster and cheaper than Ray 2
Updated 5 months, 2 weeks ago
48.6K runs

Generate 6s videos with prompts or images. (Also known as Hailuo). Use a subject reference to make a video with a character and the S2V-01 model.
Updated 5 months, 2 weeks ago
697.4K runs

Generate videos with specific camera movements
Updated 5 months, 2 weeks ago
76.1K runs
A high-fidelity video generation model optimized for realistic human motion, cinematic VFX, expressive characters, and strong prompt and style adherence across both text-to-video and image-to-video workflows
Updated 5 months, 2 weeks ago
76.2K runs
A lower-latency image-to-video version of Hailuo 2.3 that preserves core motion quality, visual consistency, and stylization performance while enabling faster iteration cycles.
Updated 5 months, 2 weeks ago
123.8K runs

An image-to-video (I2V) model specifically trained for Live2D and general animation use cases
Updated 5 months, 2 weeks ago
184.4K runs
luma/ray-flash-2-540pGenerate 5s and 9s 540p videos, faster and cheaper than Ray 2
Updated 5 months, 2 weeks ago
67.8K runs
Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 768p (standard) or 1080p (pro). It excels at real world physics.
Updated 5 months, 2 weeks ago
371.8K runs

Image-to-video at 720p and 480p with Wan 2.2 A14B
Updated 8 months, 2 weeks ago
53.6K runs
fofr/not-realMake a very realistic looking real-world AI video
Updated 9 months, 1 week ago
2.4K runs

Generate 5s 480p videos. Wan is an advanced and powerful visual generation model developed by Tongyi Lab of Alibaba Group
Updated 1 year, 1 month ago
48.9K runs

tencent/hunyuan-videoA state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions
Updated 1 year, 3 months ago
117.6K runs

lightricks/ltx-videoLTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real-time. It produces 24 FPS videos at a 768x512 resolution faster than they can be watched.
Updated 1 year, 3 months ago
169.2K runs

zsxkib/hunyuan-video2videoA state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions
Updated 1 year, 4 months ago
3K runs

genmoai/mochi-1Mochi 1 preview is an open video generation model with high-fidelity motion and strong prompt adherence in preliminary evaluation
Updated 1 year, 4 months ago
3.4K runs

zsxkib/pyramid-flowText-to-Video + Image-to-Video: Pyramid Flow Autoregressive Video Generation method based on Flow Matching
Updated 1 year, 6 months ago
9.3K runs

cuuupid/cogvideox-5bGenerate high quality videos from a prompt
Updated 1 year, 7 months ago
2.6K runs

meta/sam-2-videoSAM 2: Segment Anything v2 (for videos)
Updated 1 year, 8 months ago
66.5K runs

fofr/tooncrafterCreate videos from illustrated input images
Updated 1 year, 9 months ago
68.3K runs

fofr/video-morpherGenerate a video that morphs between subjects, with an optional style
Updated 2 years ago
15.2K runs

cjwbw/videocrafterVideoCrafter2: Text-to-Video and Image-to-Video Generation and Editing
Updated 2 years, 2 months ago
168.4K runs

ali-vilab/i2vgen-xlRESEARCH/NON-COMMERCIAL USE ONLY: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Updated 2 years, 3 months ago
128.4K runs

open-mmlab/piaPersonalized Image Animator
Updated 2 years, 3 months ago
103.5K runs

zsxkib/animatediff-illusionsMonster Labs' Controlnet QR Code Monster v2 For SD-1.5 on top of AnimateDiff Prompt Travel (Motion Module SD 1.5 v2)
Updated 2 years, 5 months ago
10.6K runs

lucataco/hotshot-xl😊 Hotshot-XL is an AI text-to-GIF model trained to work alongside Stable Diffusion XL
Updated 2 years, 6 months ago
929.7K runs

zsxkib/animatediff-prompt-travel🎨AnimateDiff Prompt Travel🧭 Seamlessly Navigate and Animate Between Text-to-Image Prompts for Dynamic Visual Narratives
Updated 2 years, 6 months ago
5.7K runs

zsxkib/animate-diff🎨 AnimateDiff (w/ MotionLoRAs for Panning, Zooming, etc): Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Updated 2 years, 6 months ago
59.4K runs
lucataco/animate-diffAnimate Your Personalized Text-to-Image Diffusion Models
Updated 2 years, 7 months ago
335K runs

anotherjesse/zeroscope-v2-xlZeroscope V2 XL & 576w
Updated 2 years, 9 months ago
303.8K runs
cjwbw/controlvideoTraining-free Controllable Text-to-Video Generation
Updated 2 years, 10 months ago
2.4K runs
cjwbw/text2video-zeroText-to-Image Diffusion Models are Zero-Shot Video Generators
Updated 3 years ago
42.1K runs
cjwbw/damo-text-to-videoMulti-stage text-to-video generation
Updated 3 years, 1 month ago
158.4K runs
andreasjansson/tile-morphCreate tileable animations with seamless transitions
Updated 3 years, 2 months ago
529.4K runs

arielreplicate/deoldify_videoAdd colours to old video footage.
Updated 3 years, 2 months ago
15.4K runs

pollinations/real-basicvsr-video-superresolutionRealBasicVSR: Investigating Tradeoffs in Real-World Video Super-Resolution
Updated 3 years, 2 months ago
9.3K runs

arielreplicate/robust_video_mattingextract foreground of a video
Updated 3 years, 4 months ago
121.6K runs

arielreplicate/stable_diffusion_infinite_zoomUse Runway's Stable-diffusion inpainting model to create an infinite loop video
Updated 3 years, 5 months ago
38.5K runs
andreasjansson/stable-diffusion-animationAnimate Stable Diffusion by interpolating between two prompts
Updated 3 years, 5 months ago
119.6K runs
deforum/deforum_stable_diffusionAnimating prompts with stable diffusion
Updated 3 years, 7 months ago
267.5K runs