These models generate images from text prompts. Here, you can find the latest state-of-the-art image models that fit your use case.
Nano Banana 2 from Google is the strongest all-around image generation model right now. It's fast, handles multi-image fusion (up to 14 images), renders text accurately in multiple languages, and does conversational editing. Great for everything from quick prototypes to production workflows. Nano Banana Pro adds Gemini 3 Pro reasoning, Google Search grounding, and 4K output for more complex tasks.
GPT Image 1.5 from OpenAI is close behind — it follows complex prompts accurately, renders readable text, and handles everything from photorealistic scenes to infographics and UI mockups. It also works as an image editor — make targeted changes while preserving everything else. Requires an OpenAI API key.
FLUX.2 Max from Black Forest Labs delivers the highest fidelity in the FLUX family. It excels at product photography, character consistency across batches (up to 8 reference images), and precise color control via hex codes. Great for e-commerce, fashion, and any workflow where consistency matters.
FLUX.2 Pro offers similar capabilities at a lower price. It supports structured JSON prompting for precise control over camera angle, lighting, and composition, and handles up to 8 reference images. A good choice for high-volume production work.
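As a rough illustration of structured JSON prompting, the sketch below assembles camera angle, lighting, and composition into a JSON string. The field names (`subject`, `camera_angle`, `lighting`, `composition`) and the helper `build_structured_prompt` are assumptions chosen for illustration, not a documented FLUX.2 schema; check the model card for the format the model actually expects.

```python
import json


def build_structured_prompt(subject, camera_angle, lighting, composition):
    """Assemble a structured prompt as a JSON string.

    The keys used here are hypothetical; they only show the idea of
    separating scene attributes into named fields.
    """
    prompt = {
        "subject": subject,
        "camera_angle": camera_angle,
        "lighting": lighting,
        "composition": composition,
    }
    # Reject empty fields early so a missing attribute is not sent silently.
    for key, value in prompt.items():
        if not value:
            raise ValueError(f"missing value for {key!r}")
    return json.dumps(prompt, indent=2)


structured = build_structured_prompt(
    subject="ceramic coffee mug on a walnut table",
    camera_angle="low angle",
    lighting="soft window light from the left",
    composition="rule of thirds, subject on the right",
)
print(structured)
```

The resulting string would be passed as the prompt; keeping each attribute in its own field makes it easy to vary one (for example, the lighting) while holding the others fixed.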
Seedream 4.5 from ByteDance produces film-like visuals with cinematic aesthetics, refined lighting, and strong spatial understanding. Particularly good at realistic proportions and structured environments. Supports up to 4K resolution with batch and multi-reference generation.
Imagen 4 Ultra from Google renders the finest details — skin texture, individual strands of hair, fabric weave, water droplets. Use it when quality matters more than speed. Supports up to 2K resolution.
Seedream 5 Lite from ByteDance is the newest generation with built-in multi-step reasoning. It understands spatial relationships, physics, and professional conventions across architecture, science, health, and design. Supports example-based editing, multi-image blending (up to 14 references), and up to 3K resolution. Good for complex prompts that require the model to think through what it's generating.
Grok Imagine Image from xAI has a distinctive visual style — strong at cinematic character rendering with facial consistency, moody aesthetics with dramatic contrast, and retro anime looks. Renders readable text well. Also supports image editing.
FLUX.2 Flex is the typography specialist in the FLUX family. It reliably renders clean text, captions, and complex layouts — perfect for memes, posters, infographics, and UI mockups. You can adjust the quality-speed trade-off by changing the number of steps, making it great for rapid iteration. Supports up to 10 reference images.
Ideogram v3 is built for graphic design and branding. It generates precise text, supports style references (upload up to 3 images or use 4.3 billion style presets), and produces clean layouts for logos, posters, and marketing materials. Available in Turbo, Balanced, and Quality tiers.
Recraft V4 takes a design-first approach — every output feels art-directed rather than generic. Strong integrated text rendering, intentional composition, and refined color relationships. Good for brand assets, editorial photography, and print-ready work.
Recraft V4 SVG generates native, editable SVG vector files rather than traced rasters. Output opens directly in Illustrator, Figma, or Sketch with clean paths and structured layers. Among the models listed here, only Recraft's SVG variants produce true vector output. Use it for logos, icons, illustrations, and any asset that needs to scale.
Imagen 4 Fast and FLUX Schnell are built for quick iteration — use them when you need fast results at lower cost.
Ideogram v3 Turbo gives you solid image quality with good text rendering at $0.03 per image.
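To put a per-image price in context, here is a small sketch that estimates batch cost and how many images fit in a budget. It uses the $0.03 figure quoted above; the helper names are hypothetical, and actual pricing may change, so treat the constant as an assumption to verify on the model page.

```python
from decimal import Decimal

# Per-image price quoted in the text above; verify before relying on it.
PRICE_PER_IMAGE = Decimal("0.03")


def estimate_cost(num_images, price_per_image=PRICE_PER_IMAGE):
    """Return the total cost in dollars for a batch of images."""
    if num_images < 0:
        raise ValueError("num_images must be non-negative")
    return price_per_image * num_images


def images_within_budget(budget, price_per_image=PRICE_PER_IMAGE):
    """Return how many whole images a dollar budget covers."""
    return int(Decimal(str(budget)) // price_per_image)


print(f"500 images cost ${estimate_cost(500)}")
print(f"$10 covers {images_within_budget(10)} images")
```

`Decimal` is used instead of floats so that small per-image prices add up without rounding drift.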
Compare models side by side in the playground to find what works best for your project.
Questions? Join us on Discord.
Featured models

Use this ultra version of Imagen 4 when quality matters more than speed and cost
Updated 5 days, 8 hours ago
1.6M runs

bytedance/seedream-4.5
Seedream 4.5: Upgraded ByteDance image model with stronger spatial understanding and world knowledge
Updated 2 weeks, 4 days ago
7.9M runs

High-quality image generation and editing with support for eight reference images
Updated 4 weeks ago
5.5M runs

Max-quality image generation and editing with support for ten reference images
Updated 1 month ago
230.8K runs

The highest fidelity image model from Black Forest Labs
Updated 1 month ago
1.7M runs

Google's fast image generation model with conversational editing, multi-image fusion, and character consistency
Updated 1 month, 3 weeks ago
6.5M runs

Google's state of the art image generation and editing model 🍌🍌
Updated 1 month, 3 weeks ago
22M runs

bytedance/seedream-5-lite
Seedream 5.0 lite: image generation with built-in reasoning, example-based editing, and deep domain knowledge
Updated 1 month, 3 weeks ago
1.3M runs

SOTA image model from xAI
Updated 2 months, 1 week ago
776.6K runs

openai/gpt-image-1.5
OpenAI's latest image generation model with better instruction following and adherence to prompts
Updated 2 months, 4 weeks ago
9.6M runs
Recommended Models
Speed depends on the model’s architecture and how it’s optimized for the hardware it runs on. If you want quick results, models like google/imagen-4-fast and black-forest-labs/flux-schnell are built to return outputs fast, which is great for rapid iteration.
Smaller or “fast” variants usually cost less to run. bytedance/seedream-4.5 and ideogram-ai/ideogram-v3-turbo are good picks if you want solid image quality without spending a lot.
Text-to-image models create a new image from scratch based on your text prompt. Image-to-image models take an existing image and use your prompt to change or build on it. Think of it as “paint something new” vs. “edit what’s already there.”
bytedance/seedream-4.5 and ideogram-ai/ideogram-v3-turbo are great for realistic lighting, textures, and faces. They’re popular for lifelike portraits, product shots, and scenery.
If you’re aiming for a specific look, black-forest-labs/flux-1.1-pro and black-forest-labs/flux-schnell give you more control over style, lighting, and composition. They’re good for illustrations, concept art, or anything with a creative twist.
Yes, you can edit an existing image. Use text-guided editing models like black-forest-labs/flux-kontext-pro or bytedance/seedream-4.5 to add or change details. For example, you can tell the model to “add sunglasses” or “turn it into a painting.”
Most models output images between 512×512 and 4K. Check the model card for the exact dimensions supported. Higher resolutions can cost more and take a bit longer to run.
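The sketch below makes the size range concrete by computing pixel counts and picking dimensions for a given aspect ratio. It assumes "4K" means a 4096-pixel long edge and that dimensions should be multiples of 16; both are assumptions for illustration, since each model card defines its own supported sizes.

```python
def megapixels(width, height):
    """Return the image size in megapixels (millions of pixels)."""
    return width * height / 1_000_000


def fit_to_long_edge(aspect_w, aspect_h, long_edge, multiple=16):
    """Return (width, height) for an aspect ratio with a capped long edge.

    Both sides are rounded down to a multiple of `multiple`, which is an
    assumed constraint, not one taken from any specific model.
    """
    if aspect_w >= aspect_h:
        width = long_edge
        height = long_edge * aspect_h // aspect_w
    else:
        height = long_edge
        width = long_edge * aspect_w // aspect_h
    return (width // multiple * multiple, height // multiple * multiple)


small = megapixels(512, 512)
large = megapixels(4096, 4096)
print(f"512x512 is {small:.2f} MP, 4096x4096 is {large:.2f} MP")
print(f"pixel ratio: {large / small:.0f}x")
print("16:9 at a 2048 long edge:", fit_to_long_edge(16, 9, 2048))
```

Pixel count grows with the square of the edge length, which is why higher resolutions tend to cost more and run longer.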
To keep results consistent across runs, use a reference image or a fixed seed. Models like black-forest-labs/flux-kontext-pro and ideogram-ai/ideogram-v3-turbo support both, so you can keep the same look across multiple runs.
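Why a fixed seed helps: image models typically start from random noise, and the seed determines that noise. The sketch below uses Python's own random generator as a stand-in to show the principle, then builds an input dictionary carrying a seed and a reference image. The input keys `seed` and `image` and both helper names are assumptions; parameter names vary by model, so check the model's input schema.

```python
import random


def initial_noise(seed, n=4):
    """Stand-in for a model's starting noise: same seed, same values."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


def build_inputs(prompt, seed=None, reference_image=None):
    """Build a model input dict; key names here are hypothetical."""
    inputs = {"prompt": prompt}
    if seed is not None:
        inputs["seed"] = seed
    if reference_image is not None:
        inputs["image"] = reference_image
    return inputs


# The same seed reproduces the same starting point; a new seed does not.
print(initial_noise(42) == initial_noise(42))
print(initial_noise(42) == initial_noise(43))

inputs = build_inputs(
    "portrait of a lighthouse keeper, overcast light",
    seed=42,
    reference_image="https://example.com/reference.png",  # placeholder URL
)
print(inputs)
```

Reusing the same `inputs` across runs keeps the starting noise fixed, so changes in output come from prompt edits rather than from chance.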
Some models support fine-tuning. Look for the fine-tune tag on the model page or check the README for training details.
Yes, you can run your own model. Push a model from GitHub with a replicate.yaml file. Once it’s built, it runs on the same infrastructure as other models.
Whether you can use outputs commercially depends on the model's license. Check the “License” section on the model page: some licenses allow commercial use, others don’t. Always confirm before using outputs in anything public or commercial.
Recommended Models

Use this fast version of Imagen 4 when speed and cost are more important than quality
Updated 3 days, 13 hours ago
5.2M runs

prunaai/hidream-l1-fast
This is an optimised version of the hidream-l1 model using the pruna ai optimisation toolkit!
Updated 1 week, 5 days ago
8.3M runs

Generate and edit images with Alibaba's Wan 2.7
Updated 2 weeks, 4 days ago
7.4K runs

Generate and edit high-quality images with Alibaba's Wan 2.7 Pro with 4K output, thinking mode, text-to-image, multi-image editing, and image set generation
Updated 2 weeks, 4 days ago
21.8K runs

prunaai/z-image-turbo
Z-Image Turbo is a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.
Updated 1 month, 3 weeks ago
39.1M runs

prunaai/p-image
A sub 1 second text-to-image model built for production use cases.
Updated 1 month, 3 weeks ago
9.3M runs
recraft-ai/recraft-v4-pro-svg
Generate detailed SVG vector graphics from text prompts. Recraft V4 Pro's design taste with more geometric detail and finer paths: clean layers, editable output, and scalable to any size.
Updated 2 months ago
6.1K runs

recraft-ai/recraft-v4-pro
Recraft's latest image generation model at ~2048px resolution. Same design taste and prompt accuracy as V4, with higher resolution for print-ready and large-scale work.
Updated 2 months ago
10.8K runs

recraft-ai/recraft-v4
Recraft's latest image generation model, built around design taste. Strong prompt accuracy, art-directed composition, and integrated text rendering. Fast and cost-efficient at standard resolution.
Updated 2 months ago
450.4K runs
recraft-ai/recraft-v4-svg
Generate production-ready SVG vector images from text prompts. Recraft V4's design taste applied to vector output: clean geometry, structured layers, and editable paths.
Updated 2 months ago
16.5K runs

prunaai/p-image-lora
Use trained LoRAs from https://replicate.com/prunaai/p-image-trainer. Find or contribute LoRAs at https://huggingface.co/collections/PrunaAI/p-image-loras
Updated 2 months ago
32K runs

Google's Imagen 4 flagship model
Updated 2 months, 1 week ago
8M runs

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty
Updated 2 months, 1 week ago
2.1M runs

A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality
Updated 2 months, 1 week ago
603.7K runs

Google's latest image editing model in Gemini 2.5
Updated 2 months, 2 weeks ago
101.5M runs

SOTA Open source model trained on licensed data, transforming intent into structured control for precise, high-quality AI image generation in enterprise and agentic workflows.
Updated 2 months, 2 weeks ago
12.6K runs

Commercial-ready text-to-image model trained entirely on licensed data. With only 4B parameters, it provides exceptional aesthetics and text rendering. Evaluated to be on par with other leading models in the market
Updated 2 months, 2 weeks ago
169.6K runs

An image generation foundation model in the Qwen series that achieves significant advances in complex text rendering.
Updated 3 months ago
1.7M runs

Very fast image generation and editing model. 4 steps distilled, sub-second inference for production and near real-time applications.
Updated 3 months ago
11.9M runs

bytedance/seedream-4
Unified text-to-image generation and precise single-sentence editing at up to 4K resolution
Updated 5 months ago
33.3M runs

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts
Updated 5 months, 1 week ago
11M runs

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language
Updated 5 months, 1 week ago
49.2M runs

FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.
Updated 5 months, 1 week ago
20.6M runs

State-of-the-art image generation with top of the line prompt following, visual quality, image detail and output diversity.
Updated 5 months, 1 week ago
14.1M runs

bytedance/seedream-3
A text-to-image model with support for native high-resolution (2K) image generation
Updated 5 months, 1 week ago
3.4M runs

prunaai/flux-fast
This is the fastest Flux endpoint in the world.
Updated 5 months, 1 week ago
40.5M runs

Artistic and high-quality visuals with improved prompt adherence, diversity, and definition
Updated 5 months, 1 week ago
268.3K runs

recraft-ai/recraft-v3
Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis
Updated 5 months, 1 week ago
8.2M runs

recraft-ai/recraft-v3-svg
Recraft V3 SVG (code-named red_panda) is a text-to-image model with the ability to generate high quality SVG images including logotypes, and icons. The model supports a wide list of styles.
Updated 5 months, 1 week ago
398.2K runs

ideogram-ai/ideogram-v3-turbo
Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles
Updated 5 months, 1 week ago
8.6M runs

ideogram-ai/ideogram-v2a-turbo
Like Ideogram v2 turbo, but now faster and cheaper
Updated 5 months, 1 week ago
388.1K runs

ideogram-ai/ideogram-v2
An excellent image model with state of the art inpainting, prompt comprehension and text rendering
Updated 5 months, 1 week ago
2.8M runs

ideogram-ai/ideogram-v3-quality
The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles
Updated 5 months, 1 week ago
2.2M runs

ideogram-ai/ideogram-v2a
Like Ideogram v2, but faster and cheaper
Updated 5 months, 1 week ago
2M runs

ideogram-ai/ideogram-v3-balanced
Balance speed, quality and cost. Ideogram v3 creates images with stunning realism, creative designs, and consistent styles
Updated 5 months, 1 week ago
446.9K runs

ideogram-ai/ideogram-v2-turbo
A fast image model with state of the art inpainting, prompt comprehension and text rendering.
Updated 5 months, 1 week ago
2.9M runs

stability-ai/stable-diffusion-3.5-medium
2.5 billion parameter image model with improved MMDiT-X architecture
Updated 5 months, 1 week ago
113.4K runs

stability-ai/stable-diffusion-3.5-large
A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.
Updated 5 months, 1 week ago
2.1M runs

stability-ai/stable-diffusion-3.5-large-turbo
A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps
Updated 5 months, 1 week ago
1.1M runs

Minimax's first image model, with character reference support
Updated 5 months, 1 week ago
2.9M runs

luma/photon-flash
Accelerated variant of Photon prioritizing speed while maintaining quality
Updated 5 months, 1 week ago
490.8K runs

Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui
Updated 5 months, 2 weeks ago
8M runs

tencent/hunyuan-image-3
A powerful native multimodal model for image generation (PrunaAI squeezed)
Updated 6 months, 1 week ago
74.2K runs

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Updated 8 months, 4 weeks ago
1.1M runs

prunaai/wan-2.2-image
This model generates beautiful cinematic 2 megapixel images in 3-4 seconds and is derived from the Wan 2.2 model through optimisation techniques from the pruna package
Updated 9 months ago
1.2M runs

prunaai/hidream-l1-full
This is an optimised version of the hidream-full model using the pruna ai optimisation toolkit!
Updated 9 months, 1 week ago
36.7K runs

prunaai/hidream-l1-dev
This is an optimised version of the hidream-l1-dev model using the pruna ai optimisation toolkit!
Updated 9 months, 1 week ago
52.4K runs

A version of flux-dev, a text to image model, that supports fast fine-tuned lora inference
Updated 9 months, 3 weeks ago
5.8M runs

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions
Updated 9 months, 3 weeks ago
45.5M runs

The fastest image generation model tailored for local development and personal use
Updated 9 months, 3 weeks ago
650.7M runs

prunaai/sdxl-lightning
This is the fastest sdxl-lightning endpoint in the world on A100, contact us for more at pruna.ai
Updated 10 months, 1 week ago
6.5K runs

bytedance/sdxl-lightning-4step
SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
Updated 1 year, 1 month ago
1B runs

A fast image model with wide artistic range and resolutions up to 4096x4096
Updated 1 year, 4 months ago
250K runs

luma/photon
High-quality image generation model optimized for creative professional workflows and ultra-high fidelity outputs
Updated 1 year, 4 months ago
3.3M runs

stability-ai/sdxl
A text-to-image generative AI model that creates beautiful images
Updated 1 year, 10 months ago
84.9M runs

fofr/sticker-maker
Make stickers with AI. Generates graphics with transparent backgrounds.
Updated 1 year, 11 months ago
2.1M runs

ai-forever/kandinsky-2
text2img model trained on LAION HighRes and fine-tuned on internal datasets
Updated 2 years ago
6.2M runs

ai-forever/kandinsky-2.2
Multilingual text2image latent diffusion model
Updated 2 years ago
10.1M runs

playgroundai/playground-v2.5-1024px-aesthetic
Playground v2.5 is the state-of-the-art open-source model in aesthetic quality
Updated 2 years, 1 month ago
3.1M runs

datacte/proteus-v0.3
ProteusV0.3: The Anime Update
Updated 2 years, 2 months ago
5.7M runs

Last update: Now supports img2img. SDXL Canny controlnet with LoRA support.
Updated 2 years, 2 months ago
1M runs

datacte/proteus-v0.2
Proteus v0.2 shows subtle yet significant improvements over Version 0.1. It demonstrates enhanced prompt understanding that surpasses MJ6, while also approaching its stylistic capabilities.
Updated 2 years, 2 months ago
12.1M runs

adirik/realvisxl-v3.0-turbo
Photorealism with RealVisXL V3.0 Turbo based on SDXL
Updated 2 years, 3 months ago
615.6K runs

fofr/latent-consistency-model
Super-fast, 0.6s per image. LCM with img2img, large batching and canny controlnet
Updated 2 years, 3 months ago
1.5M runs

RealVisXl V3 with multi-controlnet, lora loading, img2img, inpainting
Updated 2 years, 3 months ago
2.2M runs

lucataco/open-dalle-v1.1
A unique fusion that showcases exceptional prompt adherence and semantic understanding. It seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension
Updated 2 years, 3 months ago
132.7K runs

fofr/sdxl-multi-controlnet-lora
Multi-controlnet, lora loading, img2img, inpainting
Updated 2 years, 3 months ago
219.6K runs

lucataco/dreamshaper-xl-turbo
DreamShaper is a general purpose SD model that aims to do everything well: photos, art, anime, manga. It's designed to match Midjourney and DALL-E.
Updated 2 years, 4 months ago
230.8K runs

lucataco/ssd-1b
Segmind Stable Diffusion Model (SSD-1B) is a distilled 50% smaller version of SDXL, offering a 60% speedup while maintaining high-quality text-to-image generation capabilities
Updated 2 years, 5 months ago
1.1M runs

fofr/sdxl-emoji
An SDXL fine-tune based on Apple Emojis
Updated 2 years, 7 months ago
12.1M runs

lucataco/realistic-vision-v5.1
Implementation of Realistic Vision v5.1 with VAE
Updated 2 years, 8 months ago
4.3M runs

stability-ai/stable-diffusion
A latent text-to-image diffusion model capable of generating photo-realistic images given any text input
Updated 2 years, 9 months ago
111M runs

jagilley/controlnet-scribble
Generate detailed images from scribbled drawings
Updated 3 years, 2 months ago
38.3M runs

tstramer/material-diffusion
Stable diffusion fork for generating tileable outputs using v1.5 model
Updated 3 years, 5 months ago
2.4M runs