What about blameless culture?

Blameless is about not assigning fault to individuals. It is not about hiding what happened. "The deploy was unreviewed" is blameless and accurate. Blameful language from leadership is the most common way postmortem culture erodes, so the doc should model the opposite.

Should I just use Rootly or incident.io instead of pasting into a chat model?

If you already pay for one, yes — it auto-captures the timeline live during the incident and drafts from grounded events, cutting review to 10-15 minutes. The review discipline in this article still applies: the AI draft is a starting point, not the final word.

Can the AI run the postmortem meeting?

No. Use the doc as the meeting input. Humans facilitate.

What if Slack messages are in a different language?

GPT-5.5 and Claude handle multilingual Slack threads well, but verify timestamps and quoted strings line up. Mixed-language teams often have richer signal in Slack than in formal docs — preserve both.

Should action items go in a tracker automatically?

Yes, but after IC review. AI's "draft action items" are options, not commitments.

Can I include customer-facing wording in the same doc?

Better to keep them separate. The customer note is shorter and lawyer-reviewed; the internal postmortem is honest and detailed.

AI Tool Tutorials

AI for Incident Postmortems Without Sanitizing the Lessons

A step-by-step workflow to draft a blameless postmortem with AI in 60 minutes without letting the model round off the uncomfortable truths.

Published: May 24, 2026 Updated: Jun 04, 2026 Author: AI Productivity Guide Team 🌐 查看中文版本

The classic AI-drafted postmortem reads like a press release: balanced, well organized, and stripped of every uncomfortable truth that would have made it useful. The model rounds off “we deployed unreviewed code at 5pm Friday” into “a recent change interacted unexpectedly with our infrastructure.” That smoothing is not a bug you can prompt away once and forget. Large language models predict the most statistically likely next token, and when they lose the link to a specific source they merge distinct events into a tidy story — what researchers call confabulation, present in roughly 31% of real-world LLM responses and far higher in complex domains. A postmortem is exactly the complex domain where that failure mode is most expensive.

This workflow treats AI as a fast first-draft engine while keeping the incident commander (IC) in charge of every line where honesty matters more than comfort. Done right, it turns a 4-hour incident with 600 Slack messages into a defensible doc in 60-90 minutes instead of 3+ hours by hand.

TL;DR

Make the model build a sourced timeline first, before any narrative. Every line cites a Slack message, IC note, or dashboard. This is the single rule that blocks confabulation.
Draft Summary, Impact, and Root cause only from that timeline, with a hard “if you would speculate, write [needs IC input]” instruction.
Run 5 Whys with the AI as questioner, not writer — it converges too fast to a tidy answer otherwise, and software failures are rarely linear.
The IC reads the full draft against the raw artifacts and re-hardens any sentence that is more comfortable than the evidence. If you would not say it aloud to the team, do not ship it.
Platform-native tools (Rootly, incident.io) auto-capture the timeline during the incident; if you use one, start from its draft and apply the same review discipline.

What this covers

A workflow for turning raw incident artifacts — Slack thread, oncall doc, timeline notes, metrics described in text — into a structured postmortem that follows the standard Google SRE sections: Summary, Impact, Timeline, Root cause, Contributing factors, Action items. The goal is speed without sanitizing the lessons that make the doc worth writing.

Who this is for

ICs writing the doc after a 2am page, SREs running blameless postmortem culture, engineering managers who own the action items, and small teams without a dedicated incident process where everyone is the IC sometimes.

When to reach for it

Incidents big enough to warrant a written doc: customer impact, data loss, a multi-hour outage, or a repeated near-miss.
Teams that already have a postmortem template. AI is much faster filling a known structure than inventing one.
Cases where the Slack thread is the primary source of truth and re-reading 600 messages by hand will take 90 minutes.

When this is NOT the right tool

Security incidents. The writeup is legal-sensitive and AI cannot be trusted with phrasing on liability.
Postmortems where the root cause is interpersonal (a hand-off failure, a missed escalation). AI cannot read the room and will smooth over the part that matters.
Tiny incidents (5-minute blip, no customer impact). Skip the doc.

Three ways to draft a postmortem

How you draft depends on whether your incident tooling already captures the timeline. As of June 2026:

Approach	Timeline source	Draft time	Confabulation risk	Best for
Manual, no AI	IC reads Slack by hand	3+ hours	None (slow, error-prone from fatigue)	Tiny teams, sensitive incidents
General chat model (this workflow)	You paste artifacts, model builds sourced timeline	60-90 min	High if unsourced; low with the source rule	Teams on Claude/ChatGPT without a paid incident platform
Platform-native AI (Rootly, incident.io)	Auto-captured live: alerts, deploys, Slack, call transcripts	10-15 min review	Lower (grounded in captured events) but still needs IC review	Teams already paying for an incident tool

The chat-model path below uses Claude Opus 4.7 or Sonnet 4.6 (1M-token context, so a full Slack export usually fits) or GPT-5.5 Thinking. Any of these handles a multilingual Slack thread; the discipline matters more than the model.

Before you start

Collect the artifacts in one place: the Slack incident channel export, the oncall doc / runbook used, any timeline notes the IC took live, dashboard screenshots described in prose, and the actual fix (commit SHA or PR link).
Have a template ready. The Google SRE five-section pattern (Summary, Impact, Timeline, Root cause, Action items) is a fine default; whatever your org uses, give it to the model.
Decide who reviews before the doc goes wide — always at least the IC and one engineer who was hands-on during the incident.
Block 60-90 minutes within 48 hours of the incident. Memory degrades fast and is the most valuable input you have.

Step by step

Dump the artifacts. Export the Slack channel, paste the oncall doc, paste the timeline notes. Mark each block with a label so the model knows what it is reading.
Ask for a timeline FIRST, before any narrative. “Build a timeline with timestamps and one-line events. Source each line by quoting the Slack message or note. No interpretation yet.” This catches missing data early and is the foundation everything else is grounded in.
Review the timeline. Fill gaps — the IC always remembers details that did not make it into Slack. Add them with [IC note: ...] so the next pass treats them as authoritative.
Now ask for the Summary, Impact, Root cause sections. “Use only the timeline above. Do NOT invent any factor that is not on the timeline. If you would speculate, write [needs IC input] instead.” This is the single most important constraint, and it directly counters the model’s tendency to merge events into a plausible-but-wrong story.
Run a 5 Whys with the AI as a sparring partner, not a writer. “Here is the proximate cause. Ask me ‘why’ five times. I will answer each. Then summarize.” This keeps you doing the thinking. Note that 5 Whys assumes a linear chain; software failures usually have several contributing factors, so treat the output as one input, not the verdict.
Ask for draft action items — but only as a list of options grouped by category (prevent, detect, respond). The IC picks which ones ship. AI tends to overload action lists; trim to 3-5 actionable items with owners.
IC reads the whole draft against the original artifacts. Anywhere the doc is more comfortable than the artifacts justify, push back: “The Slack thread shows we ignored the alert for 22 minutes. The doc should say that.”

A prompt that produces honest output

Paste your real artifacts where the bracketed placeholders are. The fenced block is safe to copy verbatim:

You are helping draft an incident postmortem. I am the IC.

Inputs:

[SLACK CHANNEL EXPORT]
...paste...

[ONCALL DOC USED]
...paste...

[IC TIMELINE NOTES]
...paste...

[FIX]
...commit SHA + PR link + one-line description...

Produce:

1. Timeline — timestamps in UTC, one line per event, each line cites the
   source (Slack msg, IC note, dashboard). Do NOT include events not in
   the inputs.
2. Summary (3-4 sentences). Plain language. Do not soften the cause.
3. Impact (numbers — duration, customers affected, $ if known).
4. Root cause — one paragraph, only what the timeline supports. If you
   would speculate, write "[needs IC input]" and I will fill in.
5. Contributing factors — list of 2-5 items. Same speculation rule.
6. Draft action items — categorize as Prevent / Detect / Respond. List
   up to 8 options; I will trim. Each item needs a candidate owner role
   (not a person — "Platform team", "On-call rotation").

Rules:
- Blameless tone (no individual names in blame contexts) but NOT
  blame-free. "The deploy went out without review" is fair.
- Do NOT round off uncomfortable facts. "Alert was ignored for 22 min"
  stays as written.
- If a fact in the inputs contradicts itself, surface both and tag
  "[conflict — needs IC]".

Quality check

Every fact in the doc traces to a source — Slack message, IC note, dashboard, code link. Untraceable claims get cut or marked [needs IC input].
The Root cause section names the actual mechanism, not a euphemism. “We removed the canary check to ship faster” is fine. “Our deployment process did not catch the issue” is a sanitized version of the same fact.
Action items have a candidate owner role and a category. Lists of 12 unowned items are wishes, not work. A postmortem that produces no completed action items is just postmortem theater.
The 5 Whys, if done, is in the doc as a sub-section with the IC’s actual answers — not paraphrased.
The IC has read every sentence and would be comfortable defending it to the team. If you would not say it aloud, do not ship it.

How to reuse this workflow

Save the prompt as your team’s template. Each new incident starts from the same scaffold; only the inputs change.
Build a tiny “incident export kit” — a script that pulls the Slack channel, runs gh pr view on the fix PR, and assembles a single pastable document. Removes about 20 minutes of friction.
After each postmortem, review which sections the AI got close on and which needed heavy IC rewriting, then adjust the prompt.
Keep a running file of “sanitization patterns the model uses” — phrases it reaches for that hide truth (“interacted unexpectedly”, “process gap”). Tell future prompts to avoid them by name.

Common mistakes

Letting AI write the Root cause from the Slack thread directly, with no timeline pass. The doc ends up vague because the input was chronologically jumbled.
Skipping the source-citation rule. The model confabulates, the speculation reads plausibly, and a wrong “root cause” enters team folklore.
Accepting AI’s softened phrasing because it sounds professional. The whole point of a postmortem is to be uncomfortable to read.
Action items written by AI without an owner. They never get done.
Running the 5 Whys with AI as the writer. It converges too fast to a tidy answer. Use it as the questioner only.
Sharing the doc without IC review. This single rule prevents most postmortem-quality regressions.

FAQ

What about blameless culture? Blameless is about not assigning fault to individuals. It is not about hiding what happened. “The deploy was unreviewed” is blameless and accurate. Blameful language from leadership is the most common way postmortem culture erodes, so the doc should model the opposite.
Should I just use Rootly or incident.io instead of pasting into a chat model? If you already pay for one, yes — it auto-captures the timeline live during the incident and drafts from grounded events, cutting review to 10-15 minutes. The review discipline in this article still applies: the AI draft is a starting point, not the final word.
Can the AI run the postmortem meeting? No. Use the doc as the meeting input. Humans facilitate.
What if Slack messages are in a different language? GPT-5.5 and Claude handle multilingual Slack threads well, but verify timestamps and quoted strings line up. Mixed-language teams often have richer signal in Slack than in formal docs — preserve both.
Should action items go in a tracker automatically? Yes, but after IC review. AI’s “draft action items” are options, not commitments.
Can I include customer-facing wording in the same doc? Better to keep them separate. The customer note is shorter and lawyer-reviewed; the internal postmortem is honest and detailed.

External references: the Google SRE postmortem culture chapter for the blameless framework, and the Google SRE example postmortem for a worked template.

Tags: #AI coding #Workflow

TL;DR

What this covers

Who this is for

When to reach for it

When this is NOT the right tool

Three ways to draft a postmortem

Before you start

Step by step

A prompt that produces honest output

Quality check

How to reuse this workflow

Common mistakes

FAQ

Related

Related Articles

AI Changelog Generation: From Commits to a Release Note Humans Read

AI-Assisted Database Migrations — Reversible, Backfilled, Tested

AI Merge Conflict Resolution: When to Trust the Auto-Merge

AI On-Call Debugging: From Page to Fix Without Panic

AI PR Descriptions: From Diff to Reviewable

Aider Getting Started: Terminal AI Coding With Per-Edit Git Commits