AI Video Generator — every model, compared

Generate video with Veo, Sora, Kling, Runway, Hailuo, Wan, Hunyuan, Grok Imagine, and every other major model. Pay per second, no subscription.

17 models supported · pay-per-credit · credits never expire

Try a free video generator — no signup, lighter model

Alibaba

Wan 2.2 5B

The smaller, faster variant of Alibaba's Wan 2.2 video model. Open-weight, ultra-fast text-to-video and image-to-video for rapid prototyping. Lower fidelity than the 14B variant but practical for ideation, draft renders, and high-volume iteration before committing to a flagship.

fast video iteration on Wan-style aestheticsopen-weight workflow explorationlower-cost Wan generation

0.2 Credits/sec

Start Reference

View Wan 2.2 5B examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Wan 2.2 14B

Alibaba's flagship Wan 2.2 video model with crisp 480p output and strong stylization. Open-weight availability makes it useful for self-hosted pipelines and teams that want production-quality video generation without closed-API costs. Edged out by Veo, Sora, and Kling at the top tier but cost-competitive.

stylized video with strong character handlingWan-aesthetic creative workopen-weight video workflows

1 Credit/sec

Start Reference

End Reference

View Wan 2.2 14B examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Happy Horse

HappyHorse 1.0 — Alibaba-ATH's image-to-video flagship, currently ranked #1 on the Artificial Analysis text-to-video arena. Synchronized native audio with multilingual lip-sync makes it strong for character-driven content where dialogue matters. Best when you have a reference image and want it animated with audio.

premium stylized video creativeanime and Asian-aesthetic motion workexperimental cinematic outputs

14 Credits/sec

Start Reference

Audio

View Happy Horse examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Pruna AI

P-Video Draft

Pruna AI's draft-tier video generation — roughly 4x faster than the full P-Video model for rapid previews. Built for the iteration phase: nail down the motion and composition cheaply, then commit to a full render only when you're sure. Cuts cost meaningfully on exploratory work.

fast video previewsdraft-quality iterationexploratory motion ideation

1 Credit/sec

Start Reference

End Reference

Audio

View P-Video Draft examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

P-Video

Pruna AI's production-tier video model with fast generation, built-in audio, and multi-aspect-ratio support. Optimized for cost-quality balance — solid for production workflows where the top-tier closed models (Veo, Sora) feel too expensive for the use case.

Pruna-optimized video generationbalanced speed-quality workflowsproduction iteration

4 Credits/sec

Start Reference

End Reference

Audio

View P-Video examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Lightricks

LTX 2 Distilled

Lightricks' distilled LTX 2 — open-source audio-video model for expressive clips with sound. Lower fidelity than the full LTX 2 / 2.3 line but practical for open-weight workflows and teams that want LTX-style video without closed-API costs.

fast LTX iterationopen-weight video workflowsexploratory motion work

2 Credits/sec

Start Reference

Audio

View LTX 2 Distilled examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

LTX 2.3 Fast

Lightricks' fastest LTX 2.3 tier with synchronized audio. Open-weight cinematic concept iteration — describe a clip with audio cues, get a quick render with sound. Use for iteration before stepping up to LTX 2.3 Pro for final renders.

fast LTX 2.3 videoiteration on cinematic conceptsopen-weight workflows

4 Credits/sec

Start Reference

End Reference

Audio

View LTX 2.3 Fast examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

LTX 2.3 Pro

Lightricks' flagship LTX 2.3 with synchronized audio. Higher-fidelity video generation in the open-weight LTX line, with cinematic camera handling and audio sync. Strong choice for teams that want premium video quality on infrastructure they control.

premium LTX video generationcinematic concept workopen-weight production workflows

6 Credits/sec

Start Reference

End Reference

Audio

View LTX 2.3 Pro examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

ByteDance

Seedance 1 Pro

ByteDance's Seedance 1.0 — fast, cost-efficient video generation with strong motion physics and detail. The original Seedance flagship, now succeeded by Seedance 1.5 Pro and 2 Fast for top-tier work. Still a practical pick when 1.5 Pro and 2 are overkill for the brief.

stylized video with cinematic feelByteDance / Seedream aesthetic in motionfashion-style videomusic video aesthetics

3 Credits/sec

Start Reference

View Seedance 1 Pro examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Seedance 1.5 Pro (No Audio)

ByteDance Seedance 1.5 Pro without the audio overhead. Cinema-quality video with precise motion and cinematic camera control at lower cost than the audio-enabled variant. Built for B-roll, silent narrative, and any content where audio is added in post.

premium video without audio overheadcinematic concept workB-roll and silent content

3 Credits/sec

Start Reference

End Reference

View Seedance 1.5 Pro (No Audio) examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Seedance 1.5 Pro

ByteDance Seedance 1.5 Pro with synchronized audio. Cinema-quality video with precise lip-syncing and cinematic camera control — strong for narrative short-form content, music video aesthetics, and anywhere dialogue matters as much as visuals.

premium stylized video with audiocinematic concept workfashion-style videoByteDance ecosystem workflows

5 Credits/sec

Start Reference

End Reference

Audio

View Seedance 1.5 Pro examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Seedance 2

ByteDance Seedance 2.0 — the next-generation Seedance flagship with native audio, multimodal inputs, and 720p output. Currently #2 on the Artificial Analysis text-to-video arena. Reach for it on hero clips, premium ad creative, and narrative content where the cost per second is justified.

premium video with audio at flagship qualitycinematic narrative workmusic video aestheticshigh-fidelity short-form content

13 Credits/sec

Start Reference

End Reference

Audio

View Seedance 2 examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

xAI

Grok Imagine Video

Limited Capacity

xAI's stylized video generator with synchronized audio. High-quality text-to-video and image-to-video with the distinctive xAI personality. Currently ranked #5 on the Artificial Analysis text-to-video arena. Strong for X-platform-aligned content and stylized social video where polish matters less than tone.

stylized video with personalitysocial-media videoirreverent contentX-platform-aligned video

5 Credits/sec

Artificial AnalysisElo 1232#5 / 82

Start Reference

Audio

View Grok Imagine Video examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Kuaishou

Kling 2.5 Pro

Kling 2.5 Turbo Pro — the flagship Kling tier with pro-grade text-to-video and image-to-video. Smooth motion, strong prompt fidelity, and exceptional motion physics for complex camera work — tracking shots, dolly moves, and crane sweeps all hold up. Strong choice for cinematic narrative work.

premium video with strong motion physicscinematic narrative workcomplex camera movementcharacter-driven content

7 Credits/sec

Start Reference

End Reference

View Kling 2.5 Pro examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Runway

Runway Gen 4.5

Runway Gen-4.5 — premium text-to-video and image-to-video with cinematic quality, rich detail, and fluid motion. Mature creator-focused tooling with strong cinematic handling. Currently #12 on the Artificial Analysis text-to-video arena. The natural choice for teams already on Runway's broader video stack.

creator-focused video workflowsmusic video and editorial workRunway ecosystem integrationproduction-quality cinematic content

12 Credits/sec

Start Reference

View Runway Gen 4.5 examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Google

Veo 3.1 (No Audio)

Google DeepMind's Veo 3.1 Fast without audio — high-fidelity video with strong temporal coherence and native video extension. Lower cost than the audio-enabled variant, built for B-roll and silent cinematic content where audio is added in post.

fast Veo 3.1 video without audioB-roll generationsilent cinematic contentlower-cost premium video

10 Credits/sec

Start Reference

End Reference

View Veo 3.1 (No Audio) examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Veo 3.1

Google DeepMind's Veo 3.1 Fast with synchronized audio. Industry-leading dialogue lip-sync and audio realism — context-aware audio generation, smooth motion, and native video and audio extension. Reach for it on dialogue-heavy short-form content where Veo's lip-sync advantage justifies the cost.

premium video with synchronized audionarrative short-form contentdialogue-heavy creativecinematic ad creative

15 Credits/sec

Start Reference

End Reference

Audio

View Veo 3.1 examples and details

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt