The smaller, faster variant of Alibaba's Wan 2.2 video model. Open-weight, ultra-fast text-to-video and image-to-video for rapid prototyping. Lower fidelity than the 14B variant but practical for ideation, draft renders, and high-volume iteration before committing to a flagship.
fast video iteration on Wan-style aestheticsopen-weight workflow explorationlower-cost Wan generation
Alibaba's flagship Wan 2.2 video model with crisp 480p output and strong stylization. Open-weight availability makes it useful for self-hosted pipelines and teams that want production-quality video generation without closed-API costs. Edged out by Veo, Sora, and Kling at the top tier but cost-competitive.
stylized video with strong character handlingWan-aesthetic creative workopen-weight video workflows
HappyHorse 1.0 — Alibaba-ATH's image-to-video flagship, currently ranked #1 on the Artificial Analysis text-to-video arena. Synchronized native audio with multilingual lip-sync makes it strong for character-driven content where dialogue matters. Best when you have a reference image and want it animated with audio.
premium stylized video creativeanime and Asian-aesthetic motion workexperimental cinematic outputs
Pruna AI's draft-tier video generation — roughly 4x faster than the full P-Video model for rapid previews. Built for the iteration phase: nail down the motion and composition cheaply, then commit to a full render only when you're sure. Cuts cost meaningfully on exploratory work.
fast video previewsdraft-quality iterationexploratory motion ideation
Pruna AI's production-tier video model with fast generation, built-in audio, and multi-aspect-ratio support. Optimized for cost-quality balance — solid for production workflows where the top-tier closed models (Veo, Sora) feel too expensive for the use case.
Pruna-optimized video generationbalanced speed-quality workflowsproduction iteration
Lightricks' distilled LTX 2 — open-source audio-video model for expressive clips with sound. Lower fidelity than the full LTX 2 / 2.3 line but practical for open-weight workflows and teams that want LTX-style video without closed-API costs.
fast LTX iterationopen-weight video workflowsexploratory motion work
Lightricks' fastest LTX 2.3 tier with synchronized audio. Open-weight cinematic concept iteration — describe a clip with audio cues, get a quick render with sound. Use for iteration before stepping up to LTX 2.3 Pro for final renders.
fast LTX 2.3 videoiteration on cinematic conceptsopen-weight workflows
Lightricks' flagship LTX 2.3 with synchronized audio. Higher-fidelity video generation in the open-weight LTX line, with cinematic camera handling and audio sync. Strong choice for teams that want premium video quality on infrastructure they control.
premium LTX video generationcinematic concept workopen-weight production workflows
ByteDance's Seedance 1.0 — fast, cost-efficient video generation with strong motion physics and detail. The original Seedance flagship, now succeeded by Seedance 1.5 Pro and 2 Fast for top-tier work. Still a practical pick when 1.5 Pro and 2 are overkill for the brief.
stylized video with cinematic feelByteDance / Seedream aesthetic in motionfashion-style videomusic video aesthetics
ByteDance Seedance 1.5 Pro without the audio overhead. Cinema-quality video with precise motion and cinematic camera control at lower cost than the audio-enabled variant. Built for B-roll, silent narrative, and any content where audio is added in post.
premium video without audio overheadcinematic concept workB-roll and silent content
ByteDance Seedance 1.5 Pro with synchronized audio. Cinema-quality video with precise lip-syncing and cinematic camera control — strong for narrative short-form content, music video aesthetics, and anywhere dialogue matters as much as visuals.
premium stylized video with audiocinematic concept workfashion-style videoByteDance ecosystem workflows
ByteDance Seedance 2.0 — the next-generation Seedance flagship with native audio, multimodal inputs, and 720p output. Currently #2 on the Artificial Analysis text-to-video arena. Reach for it on hero clips, premium ad creative, and narrative content where the cost per second is justified.
premium video with audio at flagship qualitycinematic narrative workmusic video aestheticshigh-fidelity short-form content
xAI's stylized video generator with synchronized audio. High-quality text-to-video and image-to-video with the distinctive xAI personality. Currently ranked #5 on the Artificial Analysis text-to-video arena. Strong for X-platform-aligned content and stylized social video where polish matters less than tone.
stylized video with personalitysocial-media videoirreverent contentX-platform-aligned video
Kling 2.5 Turbo Pro — the flagship Kling tier with pro-grade text-to-video and image-to-video. Smooth motion, strong prompt fidelity, and exceptional motion physics for complex camera work — tracking shots, dolly moves, and crane sweeps all hold up. Strong choice for cinematic narrative work.
premium video with strong motion physicscinematic narrative workcomplex camera movementcharacter-driven content
Runway Gen-4.5 — premium text-to-video and image-to-video with cinematic quality, rich detail, and fluid motion. Mature creator-focused tooling with strong cinematic handling. Currently #12 on the Artificial Analysis text-to-video arena. The natural choice for teams already on Runway's broader video stack.
creator-focused video workflowsmusic video and editorial workRunway ecosystem integrationproduction-quality cinematic content
Google DeepMind's Veo 3.1 Fast without audio — high-fidelity video with strong temporal coherence and native video extension. Lower cost than the audio-enabled variant, built for B-roll and silent cinematic content where audio is added in post.
fast Veo 3.1 video without audioB-roll generationsilent cinematic contentlower-cost premium video
Google DeepMind's Veo 3.1 Fast with synchronized audio. Industry-leading dialogue lip-sync and audio realism — context-aware audio generation, smooth motion, and native video and audio extension. Reach for it on dialogue-heavy short-form content where Veo's lip-sync advantage justifies the cost.
premium video with synchronized audionarrative short-form contentdialogue-heavy creativecinematic ad creative