AI Image Generator — every model, compared

Generate images with every leading model — Flux, Imagen, Nano Banana, Seedream, GPT Image, Recraft, Ideogram, Qwen — from a single workspace. Pay-per-credit, no subscription.

35 models supported · pay-per-credit · credits never expire

Google

Nano Banana

Google's Gemini 2.5 Flash-powered text-to-image generator. Supports reference images for subject consistency and produces clean, well-composed shots with the prompt-following accuracy you'd expect from a Gemini-grounded model. A solid pick for fast iteration when you don't need 4K output or the multi-reference workflow that Nano Banana 2 introduces.

product photography for ecommerce listingsmarketing visuals with brand consistencysocial media content with reference imagesconcept art for indie gameseditorial illustrations
4 Credits
Artificial AnalysisElo 1157#21 / 126
View Nano Banana examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Nano Banana 2

Google DeepMind's Gemini 3-powered flagship for text-to-image. Renders up to 4K with industry-leading prompt adherence, native multi-image references, and web search grounding for factual scenes. Ideal for product photography with brand consistency, character series across multi-shot campaigns, and complex compositional work that benefits from compositing several references at once.

product photography with brand consistencycharacter consistency across a multi-shot seriesmarketing creative driven by long structured promptsphotorealistic portraits at high resolutioncompositing elements from multiple reference images
11 Credits
Artificial AnalysisElo 1263#3 / 126
View Nano Banana 2 examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Nano Banana Pro

The premium tier of Google's Nano Banana lineup. Tuned for magazine-cover fidelity at 4K, with the strongest identity preservation in the family across faces, products, and brand elements. Best when the cost per image is justified by the output going into print, paid ads, or hero placements where every detail matters.

premium marketing campaignsmagazine-quality photographic compositionscomplex multi-element compositionsbrand-consistent character serieshigh-resolution print artwork
15 Credits
Artificial AnalysisElo 1220#5 / 126
View Nano Banana Pro examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Imagen 3

Google's previous-generation Imagen text-to-image model. Strong on natural lighting and skin tones with a documentary photographic feel, but supplanted by Imagen 4 and Imagen 4 Ultra on prompt adherence and text rendering. Reasonable choice when you want the older Imagen aesthetic specifically — otherwise step up to Imagen 4.

photorealistic product shotsstock-photo replacementeditorial photographycharacter portraits with natural skin tonesscenes with natural depth of field
5 Credits
Artificial AnalysisElo 1115#40 / 126View Imagen 3 examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Imagen 4

Google's mid-tier flagship in the Imagen series. Delivers stronger prompt adherence and improved text rendering over Imagen 3, with the photorealistic skin tones and natural lighting Imagen is known for. Sits below Imagen 4 Ultra on overall quality and above Imagen 3 on instruction-following — a pragmatic default for product, ecommerce, and editorial photography.

photorealistic ad creativeecommerce product visualsbrand photographydocumentary-style imagesnatural portraits
4 Credits
Artificial AnalysisElo 1111#44 / 126View Imagen 4 examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Imagen 4 Ultra

Google's premium Imagen tier, built for magazine-cover-quality photorealism. Hand-tuned for fine-grained detail in faces, fabrics, and natural materials with faithful prompt adherence and improved typography. Sits at the top of the Google image lineup alongside Nano Banana Pro — pick this when you want documentary-style realism over Gemini's broader compositional flexibility.

premium ad and editorial photographymagazine-cover quality outputshighly detailed close-upsproduct hero shotsnatural-looking composite scenes
6 Credits
Artificial AnalysisElo 1172#14 / 126View Imagen 4 Ultra examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

ByteDance

Seedream 3

ByteDance's earlier Seedream image model with native 2K output and a recognizable cinematic look. Useful when you specifically want the early-Seedream aesthetic — fashion editorial, Asian-cinema mood, vibrant stylization. For most production use cases Seedream 4, 4.5, or 5 Lite outperform it on detail and prompt adherence.

stylized illustrationanime-adjacent artfashion editorial imagerycreative concept artvibrant marketing creative
3 Credits
Artificial AnalysisElo 1155#22 / 126View Seedream 3 examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Seedream 4

ByteDance's evolution of Seedream — refined character work, fashion-grade outputs, and the cinematic stylization the line is known for, now with reference-image support. Strong choice for editorial campaigns, music-video stills, and stylized portraiture where mood and color palette matter more than literal photographic realism.

fashion and editorial photographystylized portraitscreative concept artmarketing visuals with characterAsian-aesthetic creative work
3 Credits
Artificial AnalysisElo 1198#6 / 126
View Seedream 4 examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Seedream 4.5

ByteDance's flagship cinematic image model. Top-tier in the Seedream lineup for color, mood, and editorial stylization, with stronger spatial understanding than 4.0 — convincing depth, perspective, and prop placement. Pair with Seedream 5 Lite for high-volume iteration once you've nailed the look.

fashion editorialcinematic concept artstylized character portraitsmarketing creative with moodmusic video and album art aesthetics
4 Credits
Artificial AnalysisElo 1166#16 / 126
View Seedream 4.5 examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Seedream 5 Lite

A faster, lower-cost variant of the Seedream 4.5 aesthetic — same cinematic palette and mood handling at reduced fidelity. Built-in reasoning and example-based editing differentiate it from other budget tiers: pass an example image of the desired result and the model interprets the intent. Good for high-volume social content, moodboards, and exploration before committing to a hero shot.

fast iteration on stylized conceptsbudget-tier cinematic creativesocial-content batchesmoodboard generation
4 Credits
Artificial AnalysisElo 1119#38 / 126
View Seedream 5 Lite examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

OpenAI

GPT Image

OpenAI's first standalone text-to-image model. Best-in-class typography rendering at the time of release — making it well-suited to posters, packaging mockups, and text-heavy designs — though GPT Image 1.5 and 2 supersede it on photorealism and instruction-following. Still worth picking when you want the original GPT Image aesthetic for typographic work specifically.

text-heavy designs (posters, packaging)diagram-style illustrationslogos and brand marksinstructional or chart-style imagerystylized character portraits
6 Credits
Artificial AnalysisElo 1136#28 / 126
View GPT Image examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

GPT Image 1.5 Medium

The mid-quality tier of OpenAI's GPT Image 1.5. Improved photorealism and instruction-following over GPT Image 1 at a more reasonable cost than GPT Image 1.5 High. Solid pick for ads, packaging, and marketing creative when GPT Image 2 pricing is excessive.

balanced cost-quality image generationmarketing creative iterationtext-on-image work at mid-tier cost
5 Credits
View GPT Image 1.5 Medium examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

GPT Image 1.5 High

The high-quality tier of OpenAI's GPT Image 1.5 mid-generation release. Improved photorealism over GPT Image 1 with the same industry-leading text rendering, ideal for ads, packaging, and editorial layouts where typography and image quality both matter. Surpassed by GPT Image 2 High for premium work but cheaper at the high tier.

high-fidelity text-on-image workproduct packaging mockupsmarketing posters with typographyinstructional imagery
14 Credits
Artificial AnalysisElo 1268#2 / 126
View GPT Image 1.5 High examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

GPT Image 2 Low

OpenAI's GPT Image 2 budget tier. Same model architecture as GPT Image 2 Medium and High at a fraction of the cost — useful for high-volume iteration, draft generation, and exploration before committing to higher tiers for finals.

fast budget-tier generationiteration and explorationhigh-volume creative
2 Credits
View GPT Image 2 Low examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

GPT Image 2 Medium

OpenAI's GPT Image 2 at the medium quality tier. Combines top-of-the-leaderboard prompt adherence on Artificial Analysis with the strongest typography rendering of any image model — exact text in quotes, complex layouts, and packaging mockups all just work. The default choice for production marketing creative when GPT Image 2 High's premium pricing isn't justified.

premium marketing creative with typographyproduct mockupsmagazine-style editorialcomplex multi-element compositionstext-heavy social posts
6 Credits
Artificial AnalysisElo 1337#1 / 126
View GPT Image 2 Medium examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

GPT Image 2 High

OpenAI's GPT Image 2 at maximum quality, currently ranked #1 on the Artificial Analysis text-to-image arena. Delivers magazine-cover fidelity with industry-best text rendering and complex prompt adherence — pick it for hero shots, premium ad creative, and editorial covers where the cost per image is worth the output.

magazine-cover quality outputshighest-fidelity product photographycomplex art-directed scenespremium ad creativeeditorial hero shots
15 Credits
Artificial AnalysisElo 1337#1 / 126
View GPT Image 2 High examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Black Forest Labs

Flux Schnell

Black Forest Labs' fastest, lowest-cost Flux variant. Built for rapid prototyping, moodboards, and high-volume batch generation where speed and credit cost matter more than raw fidelity. Open-weight and well-supported in the community, with extensive prompt-pattern documentation accumulated since the original Flux release.

rapid iteration on conceptshigh-volume batch generationmoodboards and ideationbackground plate generationcheap variants
1 Credit
Artificial AnalysisElo 1000#98 / 126View Flux Schnell examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Flux 1.1 Pro

Black Forest Labs' workhorse flagship before the Flux 2 release. Strong stylization range across photorealistic and illustrated work with broad community knowledge of prompt patterns — a known quantity for production workflows. Edged out by Flux 2 Pro on detail and prompt adherence but still cost-competitive for everyday creative.

stylized photorealistic creativeconcept art and illustrationmarketing visualscharacter designdiverse aesthetic exploration
4 Credits
Artificial AnalysisElo 1083#57 / 126
View Flux 1.1 Pro examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Flux 2 Klein

The smaller, faster open-weight variant of Black Forest Labs' Flux 2 family. Self-hostable for teams that want the Flux 2 aesthetic on their own infrastructure, with strong quality-per-credit at a lower cost than Flux 2 Pro. Good for iteration and exploration before stepping up to Pro or Max for finals.

fast iteration on Flux 2 aestheticsopen-weight workflow integrationself-hosted explorationhigh-volume creative
2 Credits
Artificial AnalysisElo 1127#35 / 126
View Flux 2 Klein examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Flux 2 Pro

Black Forest Labs' Flux 2 production tier — the current Flux flagship for most use cases. Stylized photorealism with stronger prompt adherence than Flux 1.1 Pro, broad aesthetic range, and reference-image support. Sits below Flux 2 Max on top-end fidelity but offers a better cost-quality ratio for production work.

stylized photorealistic creative at production qualityconcept art for games and filmpremium marketing visualscharacter series with consistent stylediverse aesthetic work
5 Credits
Artificial AnalysisElo 1186#9 / 126
View Flux 2 Pro examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Flux 2 Max

The highest-fidelity tier in Black Forest Labs' Flux 2 family. Top-5 on the Artificial Analysis text-to-image arena with magazine-grade detail in textures, fabrics, and natural materials. Reach for it on hero shots, magazine layouts, and premium ad creative where the cost per image is justified.

premium ad and editorial creativemagazine-cover quality outputsstylized photorealism at maximum fidelitycomplex compositions
10 Credits
Artificial AnalysisElo 1197#7 / 126
View Flux 2 Max examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Flux Kontext Pro

Black Forest Labs' instruction-based image editor — describe an edit in plain English ('replace the background with a sunset beach', 'change the shirt to red') and get a clean result that preserves the rest of the scene. Strong identity and lighting consistency makes it production-ready for outfit swaps, background replacement, and prop changes.

instruction-based image editsbackground swapsobject addition or removalcharacter outfit changescolor grading and style transfer
4 Credits
Artificial AnalysisElo 1091#52 / 126
View Flux Kontext Pro examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Flux Kontext Max

The premium Flux Kontext tier from Black Forest Labs. Improved typography handling and stronger scene understanding than Flux Kontext Pro, suited to complex multi-step edits, art-directed photo manipulations, and edits involving on-image text. Pick this when Pro's results aren't sticking the landing on harder transformations.

premium instruction-based editingcomplex multi-step editshigh-fidelity background and subject swapsart-directed photo manipulations
8 Credits
Artificial AnalysisElo 1129#33 / 126
View Flux Kontext Max examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Alibaba

Wan 2.2

Alibaba's Wan 2.2 image-generation model with cinematic 2MP output in seconds. Open-weight, with the same Wan-line stylization that the video model is known for. Useful for rapid concept work and budget-tier generation when speed and cost matter more than top-tier fidelity.

fast cinematic image generationbudget-tier concept workopen-weight workflows
2 Credits
View Wan 2.2 examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Z Image Turbo

A fast 6B-parameter text-to-image model from the Z-Image lineup with surprisingly strong photorealism for its size. Built for high-volume iteration where credit cost and speed matter — moodboards, social variants, exploration before stepping up to a flagship. Open-weight and competitive at this budget tier.

fast budget-tier generationhigh-volume creativeideation and moodboardslightweight stylized output
1 Credit
Artificial AnalysisElo 1106#46 / 126View Z Image Turbo examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Qwen Image

Alibaba's Asian-trained text-to-image model with notable strength in complex multilingual text rendering — Chinese, Japanese, and Korean characters render reliably where most Western models struggle. Open-weight, with broad stylization range. Useful for content targeting Asian markets or any project requiring native-script typography in generated imagery.

Asian-aesthetic stylized workopen-weight workflow integrationdiverse style explorationfashion and editorial concepts
3 Credits
Artificial AnalysisElo 1059#75 / 126View Qwen Image examples and details
No examples yet for Qwen Image

Qwen Image 2

Alibaba's unified generation-and-editing model with native 2K resolution and improved fidelity over the original Qwen Image. Strong text rendering across multiple scripts and natural cinematic stylization make it a solid pick for fashion, editorial, and content targeting Asian markets — and it doubles as an editor in the same checkpoint.

Asian-cinema aestheticsfashion and editorial creativestylized character workanime-adjacent illustration
4 Credits
Artificial AnalysisElo 1081#58 / 126
View Qwen Image 2 examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Qwen Image 2 Pro

The premium tier of Alibaba's Qwen Image 2. Enhanced realism and text accuracy at native 2K with precise image-editing capability built in — competitive with Flux 2 Pro on the Artificial Analysis text-to-image arena. Best-in-class for stylized fashion photography with multilingual text demands.

premium Asian-aesthetic creativefashion editorial at high fidelitystylized character seriescinematic concept art
8 Credits
Artificial AnalysisElo 1157#19 / 126
View Qwen Image 2 Pro examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Recraft

Recraft V4

Recraft's design-focused flagship, built around design taste rather than photorealism. Best-in-class typography rendering for an image model alongside strong prompt accuracy and art-directed composition. The natural choice for posters, packaging, brand assets, and any creative where text and graphic design matter as much as the image.

logo and brand mark iterationvector / SVG output for editable assetstypography-heavy designs (posters, packaging)diverse stylized illustrations
4 Credits
Artificial AnalysisElo 1130#32 / 126View Recraft V4 examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Recraft V4 SVG

Recraft V4 in SVG output mode — generates production-ready vector images you can edit directly in Figma or Illustrator. Built for logos, icons, and scalable design assets where the output needs to be vector from the start. Unique in the AI image space; most models output rasters that don't trace cleanly.

vector logo iterationeditable brand assetsSVG icons and illustrationsscalable design elements
8 Credits
View Recraft V4 SVG examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Recraft V3 Realism

Recraft's previous-generation realism model with strong long-form text rendering and photorealistic outputs. Surpassed by Recraft V4 on overall design taste and prompt accuracy, but still useful when you want the older Recraft V3 aesthetic specifically. Same checkpoint underneath as the V3 Digital Illustration variant.

graphic design iterationstylized illustrationsbrand asset explorationdiverse creative styles
4 Credits
View Recraft V3 Realism examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Recraft V3 Digital Illustration

Recraft V3 in digital-illustration mode — consistent stylized output for editorial spot art, blog imagery, and content series. Useful when you need a series of images with a unified illustrated treatment rather than photorealism. Lower-cost than Recraft V4 if the simpler V3 aesthetic fits the brief.

digital illustration in a distinctive styleeditorial spot artblog and article imagerystylized concept work
4 Credits
View Recraft V3 Digital Illustration examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

xAI

Grok Imagine Image

xAI's image generation and editing model with strong prompt adherence and a distinctive personality-driven aesthetic. Built for X-platform-aligned content, social-media-ready imagery, and irreverent creative work where polish matters less than tone. Strong cost-quality at this tier compared to the major US flagships.

stylized creative with personalitysocial-media-ready imageryhumorous or irreverent conceptsX-platform-aligned visuals
2 Credits
Artificial AnalysisElo 1182#11 / 126
View Grok Imagine Image examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Ideogram

Ideogram V3 Realism

Ideogram's typography-focused image generator at the V3 release. Best-in-class text rendering at a lower cost than GPT Image, with photorealistic outputs that work for posters, packaging, and text-prominent ad creative. The natural choice when you need accurate on-image typography but GPT Image 2 pricing is excessive.

typography-heavy designslogos and brand marksposters and packagingtext-prominent creativeinstructional imagery
6 Credits
Artificial AnalysisElo 1078#65 / 126View Ideogram V3 Realism examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt

Ideogram V3 Digital Illustration

Ideogram V3 tuned for digital illustration — typography-aware editorial spot art and blog imagery with consistent stylized treatment. Same V3 checkpoint as the realism variant; the illustration mode just shifts default style. Cheaper than Recraft for editorial content where text legibility matters.

stylized digital illustrationeditorial spot artconsistent character treatmentsblog and content imagery
6 Credits
View Ideogram V3 Digital Illustration examples and details
Model showcase

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.

Try this prompt