You publish 500+ pages. Search Console’s “Crawled — currently not indexed” report grows month over month. A content audit reveals many pages under 300 words. Traffic concentrates in your top 10% of articles; the remaining 90% generate near-zero impressions. Your site looks rich on the homepage but Google treats it as a low-quality property because the average page quality is thin.
Thin pages don’t just fail to rank individually — they actively reduce Google’s perception of the site’s overall quality. Cutting the weakest 20-40% can improve the remaining articles’ rankings because the site’s average quality bar rises. Below: how to audit honestly, decide per page (expand / merge / noindex / 410), and prevent thin pages from being published going forward.
Common causes
Ordered by hit rate, highest first.
1. Programmatic SEO pages with little unique substance
You generated 200 “Best X for Y” pages by combining lists. Each is templated + sparse. Google sees the template, not unique value.
How to spot it: Pages with similar URL patterns and similar word counts in the 200-400 word range. Programmatic shape.
2. AI-generated articles that all follow the same template
Same problem from a different source: AI-pipeline articles all hit the minimum-acceptable bar without exceeding it. Each one technically “answers” the query but adds nothing.
How to spot it: AI-generated articles cluster around the same word count (500-800), same structure, same example patterns.
3. Old short posts from a different content era
Pre-2020 short posts (300-500 words) that were standard then but are thin now. Standards rose; your old content didn’t.
How to spot it: publishedAt before 2020-ish + word count <600. Era mismatch.
4. Auto-generated tag / archive pages
(See Too many tags create thin archive pages.) These often slip into “thin pages” count.
How to spot it: Tag pages with <5 listed articles. Archive pages by date/author with auto-generated content only.
5. “List of links” pages with no editorial content
“Best tools for X” pages that are just 10 names with one-sentence descriptions. The page doesn’t help — the linked tools do.
How to spot it: Page is mostly outbound links + minimal explanation. No unique value above what the linked destinations provide.
6. Translated stubs without per-language depth
You auto-translated a stub. Now it’s a stub in two languages. Doubles the thin-page count.
How to spot it: ZH version mirrors EN word count, both are below depth threshold.
Shortest path to fix
Ordered by ROI. Step 1 audits; Step 2 decides per page; Steps 3-6 execute.
Step 1: Crawl and export URL + word count + last-modified + traffic
# Per article: word count + last-modified + 90-day impressions
# Export as CSV
# Sort by word count ascending
# Filter to <500 words = candidates
You’ll likely find 20-40% of your articles are below 500 words. That’s your thin pile.
Step 2: For each, tag as expand / merge / noindex / 410
| Tag | When to use |
|---|---|
| Expand | Topic has volume + you can win; 1-2 hours to deepen |
| Merge | 3-5 similar thin pages → one deep guide; 301 the old URLs |
| Noindex | Keep accessible internally, hide from search |
| 410 | No future use; remove permanently |
Don’t try to expand everything. Many thin pages should be merged or removed.
Step 3: For “expand” — do real depth work
- Add 3-5 concrete examples (real, not generic)
- Add a "common pitfalls" section
- Add an FAQ with 4-6 real questions
- Original data, screenshots, or expert quotes
- Re-screenshot if UI references exist
- Re-evaluate word count: 1200-2000+ for a real answer
Filler text to hit word count isn’t a fix. Google detects filler.
Step 4: For “merge” — combine 3-5 thin pages into one deep guide
1. Identify the merger target (the strongest of the group becomes the consolidated home)
2. Combine the best 1-2 sections from each thin page into the target
3. 301 redirect the other URLs to the target
4. Verify the target's word count clears the depth bar
You go from 5 thin pages to 1 deep page. Site quality average rises.
Step 5: For “noindex” or “410” — execute deliberately
<!-- noindex: keep accessible internally, hide from search -->
<meta name="robots" content="noindex" />
For 410 (permanent removal):
- Remove the MDX file
- Add a 410 entry in firebase.json / hosting config
OR redirect to a related successor (301)
- Don't 301 to homepage — Google treats that as soft-404
Step 6: Re-audit in 4-8 weeks
# Search Console:
# - "Crawled - not indexed" count should drop
# - Indexed page count drops (cuts) but quality rises
# - Average position of remaining articles improves
If the metrics moved in the right direction, the cut helped. If not, you cut wrong things or need to go further.
Prevention
- Set a minimum quality bar before publishing — word count alone isn’t enough; depth + uniqueness matter
- Build quarterly content audit into team rhythm; thin pages accumulate silently
- Programmatic SEO pages need unique substance per page, not just template-fill
- Cut the weakest 20-40% of old catalogs to lift site-wide quality average
- Don’t 301 to homepage; either 410 or 301 to topical successor
- Filler text to hit word count is detected; quality content has to actually exist
Related
Tags: #Content ops #Site quality #Site audit #Troubleshooting #Thin page