Codex Audit Report Is Too Broad to Act On

Q: Is there a `codex audit` command?

No. As of June 2026 there is no `audit` subcommand. The built-in reviewer is the `/review` slash command, with four presets: review against a base branch, review uncommitted changes, review a commit, and **Custom review instructions**. A scoped audit lives in that last preset, or in a normal chat turn with the Step 3 template.

Q: Should I use a stronger model just for audits?

Often yes. Set `review_model` in `~/.codex/config.toml` to a top model (for example a GPT-5.5 reasoning tier) so `/review` always runs it, while your interactive session stays on a faster model. Security audits in particular benefit from the stronger reviewer.

Asked Codex to audit your project and got a 50-bullet report mixing typos with architecture? Re-scope to one dimension, cap output, force file:line, use /review's custom-instructions preset.

Published: May 17, 2026 Updated: Jun 15, 2026 Author: AI Productivity Guide Team 🌐 查看中文版本

You pasted “audit my project” (or picked Custom review instructions in /review) and got back 50+ bullets: “consider using const instead of let”, “API key should be in env var”, “this function is 200 lines long”, “consider adding tests for edge cases”. By the time you finish reading, you have no idea which 3 things to fix this afternoon. The report becomes a guilt artifact you never act on.

Fastest fix: one dimension, one scope, capped output. Run /review -> Custom review instructions and paste a single line like Audit only src/api/auth/ for security issues. Return at most 10 items, P0–P3, each with file:line. Skip cosmetic items. That alone turns the wall of bullets into a triagable list. The full template is in Step 3 below.

This is not a model problem, it’s an audit-prompt problem. A useful audit has one dimension, one scope, capped output, and file-anchored items. Below: how to spot why your audit went broad, and the prompt that produces an actionable 10-item list every time.

Quick terminology note (as of June 2026): there is no codex audit subcommand. The built-in reviewer is the /review slash command, which offers four presets — review against a base branch, review uncommitted changes, review a commit, and Custom review instructions. The last one prepends your freeform prompt to the reviewer, so it is the right home for a scoped audit. A plain “audit my project” chat turn works too, but /review scopes to a git diff and never edits your working tree.

Common causes

Ordered by hit rate, highest first.

1. The prompt was “audit the project”

Open-ended audit prompts produce open-ended reports. “Audit” with no dimension forces Codex to guess what you care about — and it hedges by listing everything.

You: audit my project
Codex: [returns 50 bullets across security, style, perf, types, tests, docs, accessibility]

The same thing happens if you open /review, choose Custom review instructions, and type only “review everything” — the preset prepends your text verbatim, so a vague instruction yields a vague review.

How to spot it: Look at your original prompt. If it doesn’t name one dimension (security OR perf OR types), the report will mix all of them.

2. No scope — Codex audited every directory

“Audit” without a path means Codex walks the whole tree. A monorepo audit returns items from apps/marketing/, packages/ui/, scripts/, docs/ all jumbled together — even though you only care about the API layer.

How to spot it: Group the bullets by file path. If they span 4+ top-level folders, scope was missing.

3. No severity rating — every item looks equal

Without “rate each item 1–5 for severity,” Codex returns items in discovery order, not impact order. A missing semicolon sits next to a SQL injection risk.

How to spot it: Read the first 5 items. If a critical-looking item appears alongside a cosmetic one with no flag, severity wasn’t requested.

4. No “already fixed” filter on repeat audits

Second-round audits re-report items you fixed in round 1, because Codex doesn’t know what changed. You wade through the same list, marking 40% as “done” by hand.

How to spot it: Diff this audit against the previous one. If 30%+ items are identical wording, you didn’t pass the previous list as a “skip” set.

5. No file:line anchor

Bullets like “consider improving error handling in the auth flow” can’t be acted on without 20 minutes of file-hunting. The report looks long because it’s spread across many vague paragraphs instead of pointing at concrete lines.

How to spot it: Count items with file.ts:42 style anchors. If under 50% of items have line numbers, you can’t triage without re-reading the codebase.

6. Mixed dimensions — style + perf + security in one pass

When you ask Codex to find “everything wrong,” it returns the union. Each dimension uses different judgment heuristics, so the report reads as inconsistent (a missing comment given the same weight as a missing CSRF check).

How to spot it: Tag each bullet with one of security, perf, style, types, tests, docs. If the histogram is flat across 4+ tags, dimensions were mixed.

Which bucket are you in?

Symptom in the report	Root cause	Jump to
Bullets span security + style + perf together	No single dimension	Step 1
Items come from 4+ top-level folders	No scope	Step 2
Critical and cosmetic items un-flagged, in discovery order	No severity request	Step 3
30%+ of items repeat a previous run	No “skip” set	Step 5
Under half the items have `file.ts:42` anchors	No file:line constraint	Step 3

Most over-broad audits are two or three of these at once, which is why the Step 3 template fixes dimension, severity, and file:line in a single prompt.

Shortest path to fix

Ordered by ROI. Steps 1–3 turn a broad audit into an actionable 10-item list.

Step 1: Pick exactly one dimension

Choose from this list, one at a time:

Dimension	When to run
security	Before a release; after touching auth, payments, or user input
perf	When p95 latency or bundle size regresses
types	After a TypeScript upgrade or large refactor
tests	Quarterly, or after a hotfix that lacked test coverage
style	Once per quarter; lowest priority
docs	Before onboarding a new contributor

Run security audits as their own pass — don’t bundle them with style.

Step 2: Pick one narrow scope

A useful scope is one directory or one feature, not “the project.” Examples:

src/api/auth/ — one feature module
src/components/billing/ — one user-facing flow
migrations/*.sql — one file class

If your codebase is large, run audits in slices and merge findings into your tracker, not into one mega-report.

When you run /review, the Review against a base branch and Review uncommitted changes presets already scope to a git diff (Codex finds the merge base against your upstream branch). That naturally limits the audit to what you just touched. For a directory-wide audit independent of git state, use Custom review instructions and name the path in the prompt itself.

Step 3: Use the constrained audit prompt

Paste this into the Custom review instructions preset (or into a chat turn), filling in the bracketed slots:

Audit only [SCOPE] for [DIMENSION] issues.

Constraints:
- Return at most 10 items, sorted by severity (P0 → P3).
- Each item must include:
  - severity (P0 = ship-blocker, P1 = before release, P2 = next sprint, P3 = nice-to-have)
  - file:line (or file range)
  - one-sentence problem statement
  - one-sentence fix
- Skip cosmetic items (formatting, naming) unless they hide a bug.
- Skip items already addressed in: [PASTE PREVIOUS AUDIT, or "none"].
- Do not propose architectural rewrites.

Output as a markdown table with columns: severity | file:line | problem | fix.

Example output you should expect:

| Sev | File:Line | Problem | Fix |
|---|---|---|---|
| P0 | src/api/auth/login.ts:42 | Password compared with `==` not constant-time | Use `crypto.timingSafeEqual` |
| P0 | src/api/auth/session.ts:118 | JWT signed with HS256 + secret in env, no rotation | Add `kid` header, rotate quarterly |
| P1 | src/api/auth/reset.ts:23 | Reset token TTL is 24h, RFC recommends 1h | Lower `TOKEN_TTL` to 3600 |
| P2 | src/api/auth/middleware.ts:67 | Rate limit per-IP, not per-account | Add `accountId` to the limit key |

Step 4: Triage in the issue tracker, not the audit file

Open each P0 and P1 as a real issue with the file:line in the title. P2/P3 go into a single “audit backlog” ticket. The markdown audit file is read-once and deleted.

# Bulk-create issues from a Codex audit (gh CLI)
gh issue create -t "P0: timing attack in login.ts:42" -b "Codex audit 2026-05-22"

Step 5: For repeat audits, pass the previous list as “skip”

Round 2 prompt:

Audit src/api/auth/ for security issues.
SKIP items already fixed:
- timing-safe password compare (login.ts:42)
- JWT rotation (session.ts:118)
- reset token TTL (reset.ts:23)

Same constraints as round 1.

You’ll get a shorter, fresher list instead of re-reading the same 50 bullets.

Step 6: Confirm it’s fixed

A scoped audit succeeded if all four hold for the returned list:

One dimension. Every item maps to the dimension you named (no stray style notes in a security pass).
Capped length. The list is at or under your item cap (<= 10), not 30+ bullets.
Anchored. Every item carries a file.ts:42 anchor you can open directly.
Ranked. Items are sorted P0 -> P3, not in discovery order.

If any check fails, the constraint for that check did not land — re-send the prompt with that line made explicit. A fast sanity check: the report should fit on one screen and you should be able to open the first P0 in your editor within ten seconds.

Prevention

Save one reusable audit prompt per dimension as a skill (Codex’s ~/.codex/prompts/ custom prompts are deprecated as of June 2026 in favor of skills, which can be shared via the repo and invoked from /review). Never compose ad-hoc audit prompts.
Set a heavier review_model in ~/.codex/config.toml so reviews run on your strongest model even when day-to-day work uses a faster one. /review uses the session model otherwise.
Cap audit output at 10 items per run; if more issues exist, schedule a follow-up run for the next slice.
Always require severity, file:line, and a one-line fix. Reject reports that lack any of the three.
Track audits in the issue tracker, not in markdown files. Markdown rots, tickets get closed.
Re-audit security per release, perf per regression, style per quarter. Match cadence to dimension.
Pass the previous round’s findings as a “skip” set so round N stays short.

FAQ

Is there a codex audit command? No. As of June 2026 there is no audit subcommand. The built-in reviewer is the /review slash command, with four presets: review against a base branch, review uncommitted changes, review a commit, and Custom review instructions. A scoped audit lives in that last preset, or in a normal chat turn with the Step 3 template.

Why does /review only report a few files when I wanted the whole project? Three of the four presets scope to a git diff (Codex finds the merge base against your upstream branch), so they only see changed files. That is by design. For a directory-wide audit regardless of git state, use Custom review instructions and name the path, for example Audit only src/api/ inside the prompt.

The cap is ignored and I still get 30 items. Why? The cap is a soft instruction, not a hard limit. Make it the first constraint, phrase it as “Return at most 10 items, drop the rest,” and pick one dimension. A capped count plus a single dimension almost always holds; a cap on a five-dimension prompt rarely does.

Should I use a stronger model just for audits? Often yes. Set review_model in ~/.codex/config.toml to a top model (for example a GPT-5.5 reasoning tier) so /review always runs it, while your interactive session stays on a faster model. Security audits in particular benefit from the stronger reviewer.

How do I stop repeat audits from re-listing fixed items? Paste the previous round’s findings as an explicit SKIP list (Step 5). Codex has no memory of what you fixed between sessions, so without the skip list it re-discovers the same issues every run.

Tags: #Codex #Coding agent #Troubleshooting #Debug #Broad audit

Common causes

1. The prompt was “audit the project”

2. No scope — Codex audited every directory

3. No severity rating — every item looks equal

4. No “already fixed” filter on repeat audits

5. No file:line anchor

6. Mixed dimensions — style + perf + security in one pass

Which bucket are you in?

Shortest path to fix

Step 1: Pick exactly one dimension

Step 2: Pick one narrow scope

Step 3: Use the constrained audit prompt

Step 4: Triage in the issue tracker, not the audit file

Step 5: For repeat audits, pass the previous list as “skip”

Step 6: Confirm it’s fixed

Prevention

FAQ

Related

Related Articles

Codex Committed to the Wrong Branch (or Straight to main)

Codex Stalls on a Merge Conflict or Resolves It the Wrong Way

Codex Added a Package but the Lockfile Did Not Change

Codex Fix Passes Every Test but Breaks at Runtime

Codex Creates a Duplicate TypeScript Interface for One That Already Exists

Codex Rewrote Git History You Did Not Want Touched (amend / rebase / force-push)