AI Product Photography Tutorial

Studio-quality product shots without a studio — prompt structure that works.

The product photography problem isn’t a lack of AI tools — it’s that “studio product shot, white background” gets you a generic floating blob with no shadow logic and no material believability. This tutorial gives you the five-slot prompt scaffold real product photographers use (subject, material, finish, background, lighting), the camera-language vocabulary that signals “shot on glass” vs “rendered in software”, and the iteration rule that finally separates “AI image” from “saleable product photo”.

What this covers

Studio-quality product shots without a studio: the prompt structure that actually produces shadow geometry and surface materiality, the lens / aperture vocabulary that signals real photography, and the iterate-one-variable rule that stops you wasting credits on chaotic regenerations.

Who this is for

E-commerce sellers building listings, indie product founders shipping launches solo, Etsy / Shopify owners doing weekly fresh creative, and content teams generating placeholder imagery while real photography is scheduled.

When to reach for it

You need product imagery for a marketplace listing, social ad, landing page hero, or B2B catalog and you can’t afford or schedule a studio. Especially powerful for variants (same product, six color options) where reshooting each is overkill.

When this is NOT the right tool

The single “this is what you receive” hero image on a marketplace where accuracy is policy (Amazon, Etsy main image). Use real photography or photogrammetry for those. Anything where regulators inspect ad imagery for misleading depiction (supplements, medical devices, financial products).

Before you start

  • Pull 1-2 reference photos of your real product. Image-to-image always beats text-only for fidelity.
  • Decide finished platform sizes: Amazon needs 2000px+ square, Instagram 1080x1350, Shopify product 2048x2048. Set generation size accordingly.
  • Pick a brand visual mode in one sentence: “warm-lit kitchen lifestyle”, “cool studio minimal”, “moody product-noir”. This becomes your anchor for every subsequent prompt.
  • Have brand hex codes ready. “In brand colors” without hex drifts every time.

Step by step

  1. Use the 5-slot prompt scaffold: subject + material + finish + background + lighting. Example: “ceramic mug, glazed stoneware, matte finish with subtle texture, seamless paper backdrop in warm cream, soft key light from camera-left, gentle fill from a reflector camera-right”.
  2. Add camera language: lens + aperture + framing. “35mm, f/2.8, shallow depth of field, three-quarter angle, eye-level”. Camera vocabulary is the single biggest signal that AI separates “render” from “photograph”.
  3. Specify shadow direction and softness. “Soft shadow falling to lower-right at 30 degrees, edge softness about 4 inches”. Vague shadow language produces vague shadows.
  4. Iterate one variable per pass. Don’t simultaneously change lens, lighting, and background — you’ll never know what helped. Pick one, generate 4, evaluate.
  5. For reflective / glass / chrome products, add explicit reflection prompts: “subtle environment reflection on top surface, no hot-spot blowout, soft gradient highlight on side”. Reflective surfaces are AI’s weak spot.
  6. Generate 6-8 variants per prompt. The first output is rarely the keeper. Mark each as “usable / needs retouch / reject” so you build vocabulary.
  7. For final use, retouch lightly in Photoshop / Affinity: clean stray pixels, level brightness, sharpen the subject. AI does 90% of the work; retouching does the last 10% that separates “AI shot” from “studio shot”.

For the field-by-field prompt scaffold (key light, fill, lens, ratio, brand mood), see our walkthrough on writing a product image prompt before your first run.

First-run exercise

  1. Pick one real product you actually need to photograph this week. Real listings surface real decisions that practice runs miss.
  2. Run the 5-slot scaffold once end-to-end. Save the output even if rough.
  3. Place the image into the actual listing layout (Amazon thumbnail, Shopify product page). Most AI images look fine isolated and reveal problems in context.
  4. Second pass: change ONE variable. Most common improvement: shadow specification or aperture (depth of field).

Quality check

  • Does the shadow physics make sense? Direction, softness, and color should all match the stated light source. Floating products read as fake immediately.
  • Is the material believable? Ceramic should look like ceramic (slight surface variation), wood should show grain, metal should show micro-scratches and reflection.
  • At 100% zoom, are there edge artifacts (halos, weird AI fringe)? Common around fine details — handles, lids, transparent areas.
  • Does the lighting match a believable single setup? Two contradicting light sources (sun from left, hard shadow from right) is the most common giveaway.
  • Color accuracy: compare to your real product. Brand colors should match within reason; if not, your hex prompt was ignored.

How to reuse this workflow

  • Save winning prompts by product category. All mugs, all candles, all bottles get the same template — only the product description changes.
  • Build a small library of reject reasons: drifted logo, melted handle, contradictory shadow, plastic-looking ceramic. Naming failures speeds the next prompt.
  • Pair every saved prompt with a real product reference photo so the next person (or future you) can image-to-image and not lose fidelity.
  • Re-test the template every 4-6 weeks; models update and the workaround you needed last month may now be the default.

Structured 5-slot prompt → first pass 8 generations → mark keepers → iterate one variable (lighting, then aperture) → light retouch → export at platform sizes. If you are generating these shots inside ChatGPT specifically, the ChatGPT image generation workflow covers the iterate-one-variable loop in that UI.

Common mistakes

  • Vague “product on white” — produces generic floating product with no shadow logic. Always specify lighting direction and shadow geometry.
  • Generic “studio prompt” — produces the same default-lit AI ceramic regardless of what your product actually is. Specify material and finish.
  • Skipping camera language — without lens / aperture / framing, results read as renders not photographs.
  • Iterating multiple variables — you’ll never know which fix worked. One variable per pass.
  • Judging from a single output — generate at least 6 before evaluating.
  • Skipping the retouch step — AI does 90%, the final 10% Photoshop pass is what separates “AI shot” from “studio shot”.

FAQ

  • Can I use these for Amazon main images?: Generally no — Amazon policy requires the actual product. Use AI for secondary lifestyle and detail images.
  • What about reflective / glass products?: Hardest category. Expect 3-5x more rejects. Image-to-image with a real reference photo helps enormously.
  • Can I match my exact brand colors?: Include hex codes in the prompt and check the output with a color picker. Drift is common; nudge brightness / saturation in post.
  • How long should one product take?: First time end-to-end: 60-90 minutes. With saved templates: 15-20 minutes per SKU.
  • Best free retouching tool?: GIMP, Photopea (browser-based), or built-in Photos app for basic cleanup.

Tags: #Tutorial #Product photography