Analysis With Claude — Data, Documents, Decisions

Long-context Claude is great at reading + structuring + deciding.

What this covers

Most “analyze this for me” prompts produce a polished restatement of the inputs with a confident-sounding conclusion attached. The conclusion is almost always wrong because the model skipped the messy middle step: organizing inputs into a structure before drawing inferences. Claude’s long context and tendency to follow detailed instructions makes it the strongest mass-market model for analysis — IF you walk it through a categorize-first workflow. This guide is for analysts, PMs, consultants, founders, and anyone preparing a recommendation that has to survive being questioned.

Who this is for

  • Analysts and consultants synthesizing 10+ inputs into a recommendation.
  • PMs evaluating user research, support tickets, or feature requests at scale.
  • Founders making a hire/fire/build/buy decision with mixed evidence.
  • Researchers reading multiple sources and writing a structured takeaway.

When to reach for it

  • You have inputs (documents, transcripts, survey responses, spreadsheet exports) and need a conclusion.
  • The inputs are too many to skim, too heterogeneous to spreadsheet, and the deadline is tomorrow.
  • You can articulate the decision being made but not yet how to frame the evidence.
  • Stakes are real but not bet-the-company. For the latter, bring a human team in.

When this is NOT the right tool

  • One small input where you can read everything in 10 minutes — just read it.
  • Highly numerical analysis better suited to Python/pandas. Claude can analyze data but not as cleanly as code.
  • Anything where missing one detail is catastrophic (legal contract review, medical decisions). Use Claude to surface candidates; a human verifies.

Before you start

  • Frame the decision in one sentence. “Should we hire X” — not “Help me think about X.”
  • Gather inputs into a single place. Markdown files, PDFs, pasted transcripts. Aim for under 200K tokens total — Claude handles long context but quality degrades past that.
  • List your initial hypothesis, even if rough. The model will help you stress-test it; it can’t help if you don’t have one.
  • Decide your output shape: a 1-page memo, a decision matrix, a list of trade-offs with a recommendation.

Step by step

  1. Load all inputs as files (Claude.ai Projects) or pasted text. Label each input clearly: “Document 1: Customer interview transcript, 2026-05-15. Document 2: PRD draft v3.”

  2. Force a categorization pass before any inference:

    Before drawing any conclusions, group the inputs above into 3-5
    categories that would help us decide \{the question\}. For each
    category, list which inputs belong and why.
  3. Inspect the categories. If they feel forced or miss obvious cuts, push back: “Why did you put Document 4 in category A? It feels closer to category B.”

  4. Ask for conclusions per category, NOT a single mega-conclusion:

    For each category, write 2-3 sentences on what these inputs taken
    together suggest, and one sentence on what's uncertain.
  5. Synthesize into a recommendation with explicit caveats:

    Based on the categorized analysis, give me:
    - A recommendation in one sentence
    - The 3 strongest pieces of evidence for it
    - The 2 strongest counter-arguments
    - What I'd need to learn to change my recommendation
  6. Stress-test: paste the recommendation back and ask What would have to be true for this to be wrong? This forces the model out of agreement mode.

A prompt template that produces honest analysis

You are helping me decide: {one-sentence decision}.
Constraints:
- Categorize evidence before drawing inferences.
- Label every claim as either "supported by Document X" or "model inference."
- Surface disagreements between inputs explicitly.
- End with what we'd need to know to be confident.

Quality check

  • Are the categories MECE-adjacent (Mutually Exclusive, Collectively Exhaustive — roughly)? Overlapping categories produce mushy conclusions.
  • Does each conclusion trace to specific inputs? If a claim has no source, it’s a model inference and should be labeled as such.
  • Is the recommendation actionable, or vague enough to please anyone? “Consider investigating further” is not a recommendation.
  • Did counter-arguments survive the synthesis, or did they vanish? They should be explicitly present.

How to reuse this workflow

  • For recurring decision types (hiring, vendor selection, feature prioritization), save a Project with the categorization prompt as a starter. Same framework, new inputs.
  • Build a decisions.md log of past analyses and their outcomes. Six months later you’ll see which categorization patterns held up and which didn’t.
  • Keep the “stress test” prompt as a separate snippet — it’s the highest-leverage one.

Load inputs with labels → categorize into 3-5 groups → conclude per category → synthesize with explicit counter-arguments → stress-test (“what would have to be true for this to be wrong”) → write your memo with Claude’s draft as scaffold, not final.

Common mistakes

  • Skipping the categorization step. The recommendation is structurally weak because the model didn’t have anchors.
  • Demanding a recommendation before structuring inputs. You’ll get a confident-sounding average of the inputs.
  • Treating Claude’s first draft as the final memo. The draft is scaffolding — the real memo needs your judgment in the synthesis section.
  • Loading too many inputs without labels. The model treats unlabeled inputs as equally important, which loses the natural weighting.
  • Not stress-testing the recommendation. Models default to agreement; explicit counter-prompts are how you get honest disagreement.
  • Mixing structured (spreadsheet) and unstructured (transcripts) data in one analysis. Split them; analyze separately; combine in the final synthesis.

FAQ

  • Why Claude over ChatGPT for analysis?: Claude’s longer context window and tendency to follow detailed instructions edge out ChatGPT on multi-document synthesis. ChatGPT is still strong; Gemini’s long context is competitive too.
  • What’s the practical input limit?: Claude’s context is huge but quality degrades past ~150K tokens of mixed material. For bigger inputs, summarize each input first, then analyze the summaries.
  • Should I use Claude Artifacts for the memo?: Yes — Artifacts work well for the final-memo step. Use chat for the analysis steps.
  • What about Claude Projects vs single chat?: Projects for recurring analysis types; single chat for one-off decisions.
  • Can I trust Claude with confidential data?: Anthropic’s enterprise plans have strong data controls. Default consumer plans store chats; check before uploading sensitive materials.
  • Why does my recommendation feel wishy-washy?: Probably because you didn’t stress-test it. Add the “what would have to be true for this to be wrong” prompt.

Tags: #Claude #Tutorial