Model comparison

AI Image Generator: every model, compared

Generate images with every leading model — Flux, Imagen, Nano Banana, Seedream, GPT Image, Recraft, Ideogram, Qwen — from a single workspace. Pay-per-credit, no subscription.

42 models supported · pay-per-credit · credits never expire


How to choose a model

Every model below runs on the same pay-per-credit balance, so you can switch between them freely without separate subscriptions. Recommended models are shown first; expand any group to see every version. Search to jump straight to a model, or open one for full details, pricing, capabilities, and example outputs.
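Because every model draws on one credit balance, picking a model comes down to simple arithmetic on credits and quality. The sketch below is illustrative only: the `Model` class and both helper functions are hypothetical names, not part of any product API, and the sample figures are copied from the listings on this page. The listed Elo appears to be shared across a model's quality tiers, so it likely overstates the lowest tiers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    credits: int  # credits per image, as listed on this page
    elo: int      # Artificial Analysis Elo, as listed on this page


# A small sample of the figures listed on this page (not the full catalog).
MODELS = [
    Model("GPT Image 2 Low", 2, 1339),
    Model("GPT Image 2 High", 15, 1339),
    Model("Nano Banana 2", 11, 1255),
    Model("Krea 2 Medium", 3, 1208),
    Model("Flux 2 Pro", 5, 1188),
    Model("Flux Schnell", 1, 1000),
]


def best_within_budget(models, max_credits):
    """Return the highest-Elo model costing at most max_credits, or None."""
    affordable = [m for m in models if m.credits <= max_credits]
    # Ties on Elo are broken in favour of the cheaper model.
    return max(affordable, key=lambda m: (m.elo, -m.credits), default=None)


def images_per_balance(model, balance):
    """Number of whole images a credit balance buys with one model."""
    return balance // model.credits


if __name__ == "__main__":
    pick = best_within_budget(MODELS, max_credits=5)
    print(pick.name, images_per_balance(pick, balance=100))
```

Elo measures arena preference, not fit for a particular brief, so treat the result as a starting shortlist rather than a recommendation.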

Google

Prompt: A woman dancing in a garden full of animals. She is wearing a T-shirt with the word “Upsampler” on it.


Nano Banana 2

Google DeepMind's Gemini 3-powered flagship for text-to-image. Renders up to 4K with industry-leading prompt adherence, native multi-image references, and web search grounding for factual scenes. Ideal for product photography with brand consistency, character series across multi-shot campaigns, and complex compositional work that benefits from compositing several references at once.

product photography with brand consistency · character consistency across a multi-shot series · marketing creative driven by long structured prompts · photorealistic portraits at high resolution · compositing elements from multiple reference images

11 Credits

Artificial Analysis: Elo 1255 · #5 / 148

Image Reference


Nano Banana Pro

The premium tier of Google's Nano Banana lineup. Tuned for magazine-cover fidelity at 4K, with the strongest identity preservation in the family across faces, products, and brand elements. Best when the cost per image is justified by the output going into print, paid ads, or hero placements where every detail matters.

premium marketing campaigns · magazine-quality photographic compositions · complex multi-element compositions · brand-consistent character series · high-resolution print artwork

15 Credits

Artificial Analysis: Elo 1220 · #8 / 148

Image Reference


Imagen 4 Ultra

Google's premium Imagen tier, built for magazine-cover-quality photorealism. Hand-tuned for fine-grained detail in faces, fabrics, and natural materials with faithful prompt adherence and improved typography. Sits at the top of the Google image lineup alongside Nano Banana Pro — pick this when you want documentary-style realism over Gemini's broader compositional flexibility.

premium ad and editorial photography · magazine-cover quality outputs · highly detailed close-ups · product hero shots · natural-looking composite scenes

6 Credits

Artificial Analysis: Elo 1173 · #26 / 148


Nano Banana

Google's Gemini 2.5 Flash-powered text-to-image generator. Supports reference images for subject consistency and produces clean, well-composed shots with the prompt-following accuracy you'd expect from a Gemini-grounded model. A solid pick for fast iteration when you don't need 4K output or the multi-reference workflow that Nano Banana 2 introduces.

product photography for ecommerce listings · marketing visuals with brand consistency · social media content with reference images · concept art for indie games · editorial illustrations

4 Credits

Artificial Analysis: Elo 1159 · #34 / 148

Image Reference


Imagen 3

Google's previous-generation Imagen text-to-image model. Strong on natural lighting and skin tones with a documentary photographic feel, but supplanted by Imagen 4 and Imagen 4 Ultra on prompt adherence and text rendering. Reasonable choice when you want the older Imagen aesthetic specifically — otherwise step up to Imagen 4.

photorealistic product shots · stock-photo replacement · editorial photography · character portraits with natural skin tones · scenes with natural depth of field

5 Credits

Artificial Analysis: Elo 1110 · #62 / 148


Imagen 4

Google's mid-tier flagship in the Imagen series. Delivers stronger prompt adherence and improved text rendering over Imagen 3, with the photorealistic skin tones and natural lighting Imagen is known for. Sits below Imagen 4 Ultra on overall quality and above Imagen 3 on instruction-following — a pragmatic default for product, ecommerce, and editorial photography.

photorealistic ad creative · ecommerce product visuals · brand photography · documentary-style images · natural portraits

4 Credits

Artificial Analysis: Elo 1100 · #69 / 148

ByteDance


Seedream 4.5

ByteDance's flagship cinematic image model. Top-tier in the Seedream lineup for color, mood, and editorial stylization, with stronger spatial understanding than 4.0 — convincing depth, perspective, and prop placement. Pair with Seedream 5 Lite for high-volume iteration once you've nailed the look.

fashion editorial · cinematic concept art · stylized character portraits · marketing creative with mood · music video and album art aesthetics

4 Credits

Artificial Analysis: Elo 1167 · #28 / 148

Image Reference


Seedream 5 Lite

A faster, lower-cost variant of the Seedream 4.5 aesthetic — same cinematic palette and mood handling at reduced fidelity. Built-in reasoning and example-based editing differentiate it from other budget tiers: pass an example image of the desired result and the model interprets the intent. Good for high-volume social content, moodboards, and exploration before committing to a hero shot.

fast iteration on stylized concepts · budget-tier cinematic creative · social-content batches · moodboard generation

4 Credits

Artificial Analysis: Elo 1118 · #57 / 148

Image Reference


OpenAI


GPT Image 2 Medium

OpenAI's GPT Image 2 at the medium quality tier. Combines top-of-the-leaderboard prompt adherence on Artificial Analysis with the strongest typography rendering of any image model — exact text in quotes, complex layouts, and packaging mockups all just work. The default choice for production marketing creative when GPT Image 2 High's premium pricing isn't justified.

premium marketing creative with typography · product mockups · magazine-style editorial · complex multi-element compositions · text-heavy social posts

6 Credits

Artificial Analysis: Elo 1339 · #1 / 148

Image Reference


GPT Image 2 High

OpenAI's GPT Image 2 at maximum quality, currently ranked #1 on the Artificial Analysis text-to-image arena. Delivers magazine-cover fidelity with industry-best text rendering and complex prompt adherence — pick it for hero shots, premium ad creative, and editorial covers where the cost per image is worth the output.

magazine-cover quality outputs · highest-fidelity product photography · complex art-directed scenes · premium ad creative · editorial hero shots

15 Credits

Artificial Analysis: Elo 1339 · #1 / 148

Image Reference


GPT Image

OpenAI's first standalone text-to-image model. Best-in-class typography rendering at the time of release — making it well-suited to posters, packaging mockups, and text-heavy designs — though GPT Image 1.5 and 2 supersede it on photorealism and instruction-following. Still worth picking when you want the original GPT Image aesthetic for typographic work specifically.

text-heavy designs (posters, packaging) · diagram-style illustrations · logos and brand marks · instructional or chart-style imagery · stylized character portraits

6 Credits

Artificial Analysis: Elo 1134 · #50 / 148

Image Reference


GPT Image 1.5 Medium

The mid-quality tier of OpenAI's GPT Image 1.5. Improved photorealism and instruction-following over GPT Image 1 at a more reasonable cost than GPT Image 1.5 High. Solid pick for ads, packaging, and marketing creative when GPT Image 2 pricing is excessive.

balanced cost-quality image generation · marketing creative iteration · text-on-image work at mid-tier cost

5 Credits

Artificial Analysis: Elo 1264 · #4 / 148

Image Reference


GPT Image 1.5 High

The high-quality tier of OpenAI's GPT Image 1.5 mid-generation release. Improved photorealism over GPT Image 1 with the same industry-leading text rendering, ideal for ads, packaging, and editorial layouts where typography and image quality both matter. Surpassed by GPT Image 2 High for premium work, but one credit cheaper per image (14 versus 15).

high-fidelity text-on-image work · product packaging mockups · marketing posters with typography · instructional imagery

14 Credits

Artificial Analysis: Elo 1264 · #4 / 148

Image Reference


GPT Image 2 Low

OpenAI's GPT Image 2 budget tier. Same model architecture as GPT Image 2 Medium and High at a fraction of the cost — useful for high-volume iteration, draft generation, and exploration before committing to higher tiers for finals.

fast budget-tier generation · iteration and exploration · high-volume creative

2 Credits

Artificial Analysis: Elo 1339 · #1 / 148

Image Reference


Black Forest Labs


Flux 2 Klein

The smaller, faster open-weight variant of Black Forest Labs' Flux 2 family. Self-hostable for teams that want the Flux 2 aesthetic on their own infrastructure, with strong quality-per-credit at a lower cost than Flux 2 Pro. Good for iteration and exploration before stepping up to Pro or Max for finals.

fast iteration on Flux 2 aesthetics · open-weight workflow integration · self-hosted exploration · high-volume creative

2 Credits

Artificial Analysis: Elo 1122 · #54 / 148

Image Reference


Flux 2 Pro

Black Forest Labs' Flux 2 production tier — the current Flux flagship for most use cases. Stylized photorealism with stronger prompt adherence than Flux 1.1 Pro, broad aesthetic range, and reference-image support. Sits below Flux 2 Max on top-end fidelity but offers a better cost-quality ratio for production work.

stylized photorealistic creative at production quality · concept art for games and film · premium marketing visuals · character series with consistent style · diverse aesthetic work

5 Credits

Artificial Analysis: Elo 1188 · #18 / 148

Image Reference


Flux 2 Max

The highest-fidelity tier in Black Forest Labs' Flux 2 family. Ranked #15 of 148 on the Artificial Analysis text-to-image arena, with magazine-grade detail in textures, fabrics, and natural materials. Reach for it on hero shots, magazine layouts, and premium ad creative where the cost per image is justified.

premium ad and editorial creative · magazine-cover quality outputs · stylized photorealism at maximum fidelity · complex compositions

10 Credits

Artificial Analysis: Elo 1193 · #15 / 148

Image Reference


Flux Schnell

Black Forest Labs' fastest, lowest-cost Flux variant. Built for rapid prototyping, moodboards, and high-volume batch generation where speed and credit cost matter more than raw fidelity. Open-weight and well-supported in the community, with extensive prompt-pattern documentation accumulated since the original Flux release.

rapid iteration on concepts · high-volume batch generation · moodboards and ideation · background plate generation · cheap variants

1 Credit

Artificial Analysis: Elo 1000 · #120 / 148


Flux 1.1 Pro

Black Forest Labs' workhorse flagship before the Flux 2 release. Strong stylization range across photorealistic and illustrated work with broad community knowledge of prompt patterns — a known quantity for production workflows. Edged out by Flux 2 Pro on detail and prompt adherence but still cost-competitive for everyday creative.

stylized photorealistic creative · concept art and illustration · marketing visuals · character design · diverse aesthetic exploration

4 Credits

Artificial Analysis: Elo 1084 · #77 / 148

Image Reference


Flux Kontext Pro

Black Forest Labs' instruction-based image editor — describe an edit in plain English ('replace the background with a sunset beach', 'change the shirt to red') and get a clean result that preserves the rest of the scene. Strong identity and lighting consistency makes it production-ready for outfit swaps, background replacement, and prop changes.

instruction-based image edits · background swaps · object addition or removal · character outfit changes · color grading and style transfer

4 Credits

Artificial Analysis: Elo 1090 · #74 / 148

Image Reference


Flux Kontext Max

The premium Flux Kontext tier from Black Forest Labs. Improved typography handling and stronger scene understanding than Flux Kontext Pro, suited to complex multi-step edits, art-directed photo manipulations, and edits involving on-image text. Pick this when Pro's results aren't sticking the landing on harder transformations.

premium instruction-based editing · complex multi-step edits · high-fidelity background and subject swaps · art-directed photo manipulations

8 Credits

Artificial Analysis: Elo 1123 · #53 / 148

Image Reference


Alibaba


Z Image Turbo

A fast 6B-parameter text-to-image model from the Z-Image lineup with surprisingly strong photorealism for its size. Built for high-volume iteration where credit cost and speed matter — moodboards, social variants, exploration before stepping up to a flagship. Open-weight and competitive at this budget tier.

fast budget-tier generation · high-volume creative · ideation and moodboards · lightweight stylized output

1 Credit

Artificial Analysis: Elo 1107 · #65 / 148


Qwen Image 2

Alibaba's unified generation-and-editing model with native 2K resolution and improved fidelity over the original Qwen Image. Strong text rendering across multiple scripts and natural cinematic stylization make it a solid pick for fashion, editorial, and content targeting Asian markets — and it doubles as an editor in the same checkpoint.

Asian-cinema aesthetics · fashion and editorial creative · stylized character work · anime-adjacent illustration

4 Credits

Artificial Analysis: Elo 1076 · #82 / 148

Image Reference


Qwen Image 2 Pro

The premium tier of Alibaba's Qwen Image 2. Enhanced realism and text accuracy at native 2K with precise image-editing capability built in — competitive with Flux 2 Pro on the Artificial Analysis text-to-image arena. Best-in-class for stylized fashion photography with multilingual text demands.

premium Asian-aesthetic creative · fashion editorial at high fidelity · stylized character series · cinematic concept art

8 Credits

Artificial Analysis: Elo 1157 · #36 / 148

Image Reference


Recraft

No examples yet for Recraft V4.1

Recraft V4.1

Recraft's newest design-focused generation, refining V4's design taste with better prompt accuracy, art-directed composition, and integrated text rendering — at the same cost as V4. The default Recraft pick for posters, packaging, and brand assets where typography and layout matter as much as the image.

typography-heavy designs (posters, packaging) · logo and brand mark iteration · art-directed marketing creative · diverse stylized illustrations

4 Credits

Artificial Analysis: Elo 1147 · #42 / 148

No examples yet for Recraft V4.1 SVG

Recraft V4.1 SVG

Recraft V4.1 in SVG output mode — production-ready vector images you can edit directly in Figma or Illustrator, now with V4.1's sharper design taste and text rendering. Still nearly unique in the AI image space; most models output rasters that don't trace cleanly.

vector logo iteration · editable brand assets · SVG icons and illustrations · scalable design elements

4 Credits

Artificial Analysis: Elo 1147 · #42 / 148


Recraft V4

Recraft's design-focused flagship, built around design taste rather than photorealism. Best-in-class typography rendering for an image model alongside strong prompt accuracy and art-directed composition. The natural choice for posters, packaging, brand assets, and any creative where text and graphic design matter as much as the image.

logo and brand mark iteration · vector / SVG output for editable assets · typography-heavy designs (posters, packaging) · diverse stylized illustrations

4 Credits

Artificial Analysis: Elo 1137 · #47 / 148


Recraft V4 SVG

Recraft V4 in SVG output mode — generates production-ready vector images you can edit directly in Figma or Illustrator. Built for logos, icons, and scalable design assets where the output needs to be vector from the start. Unique in the AI image space; most models output rasters that don't trace cleanly.

vector logo iteration · editable brand assets · SVG icons and illustrations · scalable design elements

8 Credits

Artificial Analysis: Elo 1137 · #47 / 148


Recraft V3 Realism

Recraft's previous-generation realism model with strong long-form text rendering and photorealistic outputs. Surpassed by Recraft V4 on overall design taste and prompt accuracy, but still useful when you want the older Recraft V3 aesthetic specifically. Same checkpoint underneath as the V3 Digital Illustration variant.

graphic design iteration · stylized illustrations · brand asset exploration · diverse creative styles

4 Credits


Recraft V3 Digital Illustration

Recraft V3 in digital-illustration mode — consistent stylized output for editorial spot art, blog imagery, and content series. Useful when you need a series of images with a unified illustrated treatment rather than photorealism. Lower-cost than Recraft V4 if the simpler V3 aesthetic fits the brief.

digital illustration in a distinctive style · editorial spot art · blog and article imagery · stylized concept work

4 Credits


xAI


Grok Imagine Image

Limited Capacity

xAI's image generation and editing model with strong prompt adherence and a distinctive personality-driven aesthetic. Built for X-platform-aligned content, social-media-ready imagery, and irreverent creative work where polish matters less than tone. Strong cost-quality at this tier compared to the major US flagships.

stylized creative with personality · social-media-ready imagery · humorous or irreverent concepts · X-platform-aligned visuals

2 Credits

Artificial Analysis: Elo 1179 · #22 / 148

Image Reference


Ideogram

No examples yet for Ideogram V4 Turbo

Ideogram V4 Turbo

The speed tier of Ideogram's v4 generation — same realism and typography focus as its Balanced and Quality siblings, tuned for fast, cheap iteration. The right entry point for exploring compositions and text layouts before re-rolling the winner on a higher tier.

rapid iteration on text-heavy layouts · exploring poster and packaging compositions cheaply · high-volume draft generation

3 Credits

Artificial Analysis: Elo 1169 · #27 / 148

No examples yet for Ideogram V4 Balanced

Ideogram V4 Balanced

The mid tier of Ideogram's v4 generation, balancing speed, quality, and cost. A clear step up from Ideogram 3.0 in realism and style consistency while keeping best-in-class text rendering — the default Ideogram pick for typography-prominent creative.

typography-heavy designs (posters, packaging) · logos and brand marks · text-prominent ad creative · consistent style across a campaign series

6 Credits

Artificial Analysis: Elo 1169 · #27 / 148

No examples yet for Ideogram V4 Quality

Ideogram V4 Quality

The top tier of Ideogram's v4 generation — slowest but highest quality, with the most detail and polish the lineup offers. Pick it for final renders of typography-heavy creative where the text, layout, and realism all need to hold up at full resolution.

final renders of posters and packaging · premium brand creative with on-image text · high-detail realistic scenes with typography

10 Credits

Artificial Analysis: Elo 1169 · #27 / 148

Krea

No examples yet for Krea 2 Medium

Krea 2 Medium

Krea's foundation image model, tuned for expressive illustration, anime, and painterly styles — fast and consistent across artistic directions. One of the highest-rated image models on the Artificial Analysis leaderboard, and remarkably cheap for that tier.

expressive illustration and concept art · anime and stylized character work · painterly and traditional-media looks · consistent art direction across a series

3 Credits

Artificial Analysis: Elo 1208 · #10 / 148

No examples yet for Krea 2 Large

Krea 2 Large

Krea's flagship foundation model — larger and more flexible than Krea 2 Medium, with particular strength in photorealism alongside the same expressive artistic range. The pick when you want Krea's aesthetic sensibility on realistic scenes, portraits, and premium creative.

photorealistic scenes with artistic direction · portraits and editorial imagery · premium creative spanning realism and stylization

6 Credits

Artificial Analysis: Elo 1189 · #17 / 148