The hardest part of AI image generation isn’t getting one good image — it’s getting ten that feel like they belong together. Same lighting, same palette, same mood, same camera language. Here’s a template that holds up across Midjourney, Imagen, Flux, and DALL·E.
The 4-layer prompt structure
[Subject] // what's in the frame
+ [Style] // the visual DNA — locked across the whole series
+ [Lighting & color] // also locked across the series
+ [Camera / composition] // varies per image to add variety
The trick: lock the middle two layers across every prompt. That’s what produces visual consistency.
Master style block (paste this into every prompt)
Style: editorial photography, slightly desaturated film look, soft analog grain, Kodak Portra 400 emulation
Color: warm muted palette, dusty rose + sage green + cream, low contrast
Lighting: soft window light from camera-left, gentle falloff, no hard shadows
This is your “brand sheet.” Reuse it word-for-word across every image in the series.
Then vary the subject + composition
For image 1:
A young woman writing in a notebook on a wooden desk, half profile, --ar 4:5
[paste master style block here]
Composition: medium shot, shallow depth of field, subject slightly off-center
For image 2:
The same desk, now with a cup of coffee and an open laptop, no person, --ar 16:9
[paste master style block here]
Composition: top-down flat lay, balanced negative space
Notice how the style/lighting/color block stays identical. That’s what carries consistency.
Character consistency
For series featuring the same person:
Character: a 30-year-old Asian woman, shoulder-length dark hair, round wire-frame glasses, oversized cream sweater, soft natural makeup
[paste master style block here]
Scene: [varies per image]
Pin the character block. Vary only the scene.
Industry-specific style blocks
Product photography:
Style: clean studio product photography, seamless paper background, minimal
Color: neutral grays, single accent color matching the product
Lighting: large softbox top-left, fill card right, subtle reflection underneath
Editorial illustration:
Style: contemporary editorial illustration, flat shapes with subtle gradients, hand-drawn texture overlay
Color: limited palette of 4 colors max, off-white background
Lighting: implied direction, soft shadows only
Cinematic photography:
Style: anamorphic cinematic look, slight lens flare, 35mm film aesthetic
Color: teal and orange grade, deep shadows, milky highlights
Lighting: practical sources only (window, lamp), motivated direction
Pro tips that actually move the needle
- Reuse exact words, not synonyms. “Cinematic” ≠ “filmic” to the model.
- Save the master block in a sticky note. Don’t rewrite it from memory.
- Test 3 images first. Then lock the block. Then generate the rest.
- —seed (Midjourney) or fixed seeds elsewhere help even more on top.
Summary
The secret to a coherent visual series isn’t talent — it’s lockable prompt structure. Build your style block once, then vary only what should vary.
Related
- Style consistency across images — production tutorial for the same workflow
- AI image style drift fix — recover when the series drifts mid-batch
- AI consistent character images — character-side consistency
- Game character portrait sheets — applied example of the same approach
- AI Locks Visual Style Across a Series of Images