Skip to content

Choosing a model

A model is the engine that turns your prompt into an image or video. Different models are good at different things. You don’t have to learn them all — by default the studio picks one for you. This page covers the default, then a plain-language guide for when you want to choose yourself.

When you open the composer, the model is set to Auto. Auto reads your task and picks the right model:

  • A plain prompt with no reference image — a strong all-rounder for new images.
  • A prompt plus a reference image — a model that works from your reference.
  • An edit to an existing image — a model built to change one thing and leave the rest alone.

Auto is marked Suggested and shows “Picks the best model for your prompt.” For most work, leave it on Auto and write a clear prompt. See Writing better prompts.

To pick a specific model, click the model name in the composer to open Browse image models (or Browse video models for video). You get a scrollable list with filters across the top:

  • Provider — filter by maker (Google, Black Forest Labs, Ideogram, Recraft, xAI, and more).
  • Features — Reference support, LoRA support, or Fast.
  • Best for — a goal, such as Photoreal, Illustration & art, Text & logos, Portraits, Editing & references, Brand & marketing, Fast drafts, or Video & motion.
  • Search — type a model name.

Each row shows the model, its typical render time, and a few tags (References, Fast, LoRA). Pick a model to pin it; close the browser to stay on Auto. Pinning a model with more than one version shows a small set of version chips so you can switch between, for example, the standard, faster, and highest-quality version.

To go back to Auto after pinning, click Auto at the top of the model area.

You rarely need to choose, but if you want a specific look, here’s a starting point. The version names below match what you see in the browser.

Your goalTryWhy
A reliable all-rounderFlux 2 ProHandles plain prompts, reference images, and edits in one model
Crisp photographic resultsImagen 4 / Imagen 4 UltraPhotoreal skin, lighting, and detail; Ultra adds higher resolution
Readable text in the imageIdeogram 3Renders clean, legible text — posters with copy, logos, social cards
Stylized illustration or designRecraft V3 / V4Vector and brand-safe styles; V4 takes longer style instructions
Bold, creative looksGrok ImagineLoose, expressive styles for editorial and mood work
Precise instruction-followingFilter Best for to Editing & referencesSharp at multi-step instructions and high-fidelity image inputs
A fast draftFlux 2 Klein, Imagen 4 Fast, or any model tagged FastQuick, low-cost passes for iterating on an idea
Keep a subject consistent across editsFlux KontextBuilt for character and subject consistency across a series
Use several reference imagesFlux 2 ProComposes from multiple references in one shot
A transparent backgroundFilter Best for to Editing & referencesOutput an isolated subject on a transparent background

Need transparent or print-ready output, or want to control resolution and aspect ratio? See Generation settings.

Video models take longer than image models and are priced by the second. Auto starts you on a strong general video model. To choose yourself:

Your goalTryWhy
Video with soundVeo 3Generates matching audio along with the clip
Longer or higher-resolution clipsVeo 3.1Adds 4K and extends clips well past the standard length
Animate an existing stillGen-4 TurboBuilt for image-to-video — bring a hero still to life
A cinematic shortGen-4.5Flagship quality from text and image
Realistic human or character motionKling 2.1Strong motion and physics for movement-heavy clips
Best valueHailuo 2.3Strong motion at a low per-second cost (add audio separately)
Camera moves like a push-in or orbitRay 2Named camera controls and longer clips
A quick draftVeo 3 Fast, Hailuo 2.3 Fast, or Ray Flash 2Faster, cheaper passes for testing a shot idea

Each model row shows its typical render time, and pinned image and video models show an estimated cost before you generate. Faster, lower-cost models are good for drafts; higher-quality models are worth the wait for final work. Auto already balances this for the task at hand.

If your studio admin has enabled mature content, you’ll see an optional toggle.

For how to write prompts that get the most from any model, see Writing better prompts. To start generating, see Creating images.