You wrote a careful prompt with strong subject, lighting, and material descriptors. The output is technically correct — the subject is there, lit well — but the composition is dead. Subject smack in the middle, flat depth, no leading lines, background neither supporting nor framing the subject. This is the “AI default composition” — a centered medium-shot with no point of view. Fix it by treating composition the same way you treat lighting: a thing you explicitly direct, not an afterthought.
Common causes
Ordered by what most often produces dull compositions.
1. No composition language in the prompt
A woman in a red dress at a cafe does not tell the model anything about framing. The model’s default is center-subject, eye-level, medium shot, flat depth. This is the “AI catalog photo” look.
How to spot it: Search your prompt for any composition term (rule of thirds, leading lines, low angle, off-center, foreground, etc.). If none, this is the issue.
2. Model defaults to subject-centering
All major 2025-2026 models bias hard toward centered subjects in their training data. Without explicit instruction, you get a centered subject every time.
3. No foreground / midground / background separation
A flat composition has subject + background only. A strong composition has three planes: something in front (foreground element), the subject (midground), something behind (background). Without prompting all three, you usually get two.
4. No depth-of-field specification
Shallow depth of field, blurred background separates subject from background visually. Without it, both planes are at the same focal sharpness, producing a flat result.
5. No camera angle / position cue
Eye-level, looking straight at subject is the boring default. Low angle looking up, slight high angle, over-the-shoulder, bird's-eye view all produce more dynamic compositions.
6. No motion / line cues
Leading lines toward subject, diagonal composition, subject framed by doorway — these line cues create visual interest. Without them, no path for the eye to travel.
7. Background too busy or too empty
Either extreme produces weak composition. Busy background fights with the subject; empty background leaves the eye nowhere to go after the subject.
Before you change anything
- Save the current prompt, model, and the boring output.
- Find 3-5 reference images of strong compositions in your genre (Pinterest, Behance, professional photography accounts).
- Note the composition techniques in those references (off-center, leading lines, framing, depth).
- Decide what kind of composition the use case actually needs (hero needs drama, product needs clarity, lifestyle needs context).
- Commit or back up the current prompt template before changing it.
Information to collect
- Full prompt, model, version.
- A reference image of strong composition for comparison.
- The intended use case (hero, thumbnail, gallery, print).
- Whether the subject is the main visual or sharing the frame with environment.
Shortest path to fix
Step 1: Add explicit composition language
Common composition terms that work in most models:
rule of thirds, subject off-center to the left,
strong leading lines from foreground to subject,
foreground element framing the shot,
shallow depth of field with sharp subject and blurred background
For a portrait:
medium close-up, subject placed at the right third,
window light coming from the left, soft falloff into shadow on the right
For a product shot:
diagonal composition, product offset to the lower-third,
strong specular highlight along the top edge, soft fall-off shadow extending to the upper-right
Step 2: Specify foreground / midground / background
Explicitly call out three planes:
foreground: out-of-focus coffee cup on the left edge,
midground: woman seated at the table, sharp focus,
background: blurred warm cafe lights and silhouettes
This single phrase often transforms a flat composition into a layered one.
Step 3: Specify camera angle and position
Avoid the eye-level default:
low angle looking up at the subject, slight wide lens distortion
slight high angle, looking down at the table from above
over-the-shoulder, blurred shoulder in the foreground, subject in focus across the room
bird's-eye view from directly above
Step 4: Add depth-of-field language
shot on 85mm f/1.4 lens, very shallow depth of field, sharp eyes,
heavily blurred background
For environmental shots:
shot on 24mm f/8 lens, deep depth of field, foreground and background equally sharp
The lens + aperture spec gives the model concrete instruction.
Step 5: Use leading lines and framing
strong leading lines along the road converging on the subject
subject framed by the doorway, dark silhouette around the bright opening
diagonal composition with the action moving from lower-left to upper-right
Step 6: Study and steal composition from references
Look at strong compositions in your genre and translate them into prompt language:
- Reference shows subject at right third → write “subject offset to the right third”
- Reference has strong diagonal road → write “leading lines along the road converging on the subject”
- Reference has out-of-focus foreground branch → write “out-of-focus tree branch in the upper-left foreground”
Step 7: Reduce or simplify the background
If the background fights the subject:
simple clean background, gradient gray, no distracting elements
If the background is empty and dead:
warm contextual environment, soft suggestion of architecture,
not distracting but supporting the subject
How to confirm the fix
- The subject is not dead-center; it sits on a third line or off-axis.
- There is visible separation between foreground / subject / background.
- The eye has a natural path through the image (leading lines, frame, etc.).
- Compared to your reference composition, the gap should be small.
- A teammate looking at the output should not say “feels generic.”
If it still fails
- Strip the prompt to subject + 2-3 composition cues. Regenerate. Add back lighting / style only after composition is locked.
- Use image-to-image starting from a strong composition reference, with denoise 0.5-0.6.
- Try a different model — Midjourney v7 has historically stronger composition defaults than SDXL.
- Hand-crop the output in post — often a center composition can be improved 50% just by cropping off-center.
- Package the prompt, model, output, and reference before asking community help.
Prevention
- Study reference photos in your specific genre and steal their composition language.
- Build a “composition kit” of phrases — leading lines, rule of thirds, camera angles, framing — and reuse.
- Treat composition as a required prompt element, never optional.
- Default to off-center placement unless the use case demands center.
- Specify camera angle and depth-of-field on every image; they are as important as lighting.