You’re building a series — five blog illustrations, ten product shots, twenty storybook panels. Each image looks good on its own, but lined up together, they could be from five different artists. One is painterly, one is photo-real, one is anime-tinted, one is flat illustration. No visual cohesion.
Style consistency is not “the model can’t” — it’s “you didn’t constrain it.” The model picks a style per prompt; if you don’t lock the style, it picks a different one each time.
Common causes
Ordered by hit rate, highest first.
1. No style anchor reused across prompts
Each prompt has its own style words (“watercolor,” “photograph,” “illustration”), or none. Without a single shared style sentence pinned to every prompt, the model wanders.
How to spot it: paste your 5 prompts side by side. Are the style words identical? Usually they’re not.
2. No seed lock
Even with the same style anchor, different seeds produce different style flavors. Locking the seed across a series makes the model paint with similar brushstrokes.
How to spot it: your tool’s seed is “random” or different per image.
3. No shared reference image
The most powerful consistency lever is a reference image fed into every generation. Without one, style drifts.
How to spot it: you’re not using --sref (Midjourney), IP-Adapter (SD), or Flux Redux.
4. Different models / samplers across the series
Switching from SDXL to Flux mid-series guarantees inconsistency. Even sampler differences (Euler vs DPM++) produce noticeably different texture.
How to spot it: check the generation metadata of each image. Different models or samplers → that’s it.
5. Style words contradicting each other within a single prompt
"painterly watercolor photograph in the style of anime"
This single prompt has four conflicting style words; the model averages and the average is unstable.
How to spot it: any prompt where you can’t write the style in 3-5 words is too noisy.
6. Wide creative latitude on Midjourney
Default Midjourney --stylize 500 gives the model a lot of creative leeway, which hurts series consistency. Lower it to --stylize 100 or --style raw.
How to spot it: you’re on Midjourney default settings.
Shortest path to fix
Step 1: Write ONE style-anchor sentence, paste into every prompt
Define your style with 3-7 specific words. Examples:
# Soft pastel illustration
"soft pastel illustration, gentle line work, muted color palette,
storybook style, watercolor textures, hand-drawn aesthetic"
# Cinematic photo
"35mm film photograph, kodak portra 400, available natural light,
cinematic muted colors, 50mm lens, shallow depth of field"
# Flat vector
"flat vector illustration, bold simple shapes, limited 5-color palette,
no gradient, no texture, modern design aesthetic"
Paste this exact block at the start (or end) of every prompt in the series.
Step 2: Use a reference image fed into every generation
# Midjourney
"prompt for image N" --sref [URL of style reference] --sw 100
# SDXL Forge / ComfyUI
- IP-Adapter Style mode
- Load same reference image for every generation
- Weight: 0.7-0.9
# Flux dev
- Flux Redux mode
- Same reference, strength 0.6-0.8
# DALL-E (ChatGPT)
- Upload a style reference at the start of the chat
- Say "match this style for all following images"
Step 3: Lock the seed
# Midjourney
"... --seed 42"
# SDXL / Flux
- Pin seed to a single integer (e.g., 42)
- Uncheck "randomize"
# ComfyUI
- KSampler seed: fixed, control_after_generate: fixed
Note: same seed + same model + same sampler produces highly similar style output. Same seed alone won’t fully lock it.
Step 4: Same model, same sampler, same settings
Document the technical recipe and use it for every image in the series:
- Model: Juggernaut XL v9 (do not switch)
- Sampler: DPM++ 2M Karras (do not switch)
- Steps: 30 (do not change)
- CFG: 6 (do not change)
- Resolution: 1024×1024
- Seed: 42 (or pinned per-image but fixed)
- LoRAs: <lora:style_X:0.6> (same list)
Step 5: Tighten Midjourney stylize
# Series work — use one of these
"--style raw --stylize 100" # least latitude
"--stylize 250" # mild latitude
Avoid default --stylize 500 for series.
Step 6: Train a style LoRA for long series
If the series is >30 images:
1. Generate / collect 15-25 example images in the target style
2. Train a LoRA: 1500-2500 steps, lr 1e-4
3. Use trigger token in every prompt
4. Style now locks across any prompt + any subject
Civitai, Replicate, Astria all offer one-click style LoRA training.
Prevention
- Every series project starts with a written-down “style spec”: anchor sentence, reference image URL, model, sampler, CFG, seed strategy
- Paste the style anchor at the same position in every prompt (always-end, or always-start — pick one)
- For series of 30+ images, train a style LoRA upfront — it pays back fast
- Save the spec file with the project so future-you can continue the series later
Related
- AI image character consistency
- AI image style consistency
- AI image prompt basics
- AI consistent character images
Tags: #Image generation #Debug #Troubleshooting #Consistency