Deploy Preview URLs Got Indexed by Google
Preview URLs from Vercel or Netlify appear in Google's index, often outranking your canonical domain — usually a missing noindex header or robots block on preview hosts.
Articles tagged with #SEO
Preview URLs from Vercel or Netlify appear in Google's index, often outranking your canonical domain — usually a missing noindex header or robots block on preview hosts.
Your SPA updates `document.title` after the first render, but Googlebot indexes the original placeholder. The SERP shows "Loading..." or your home title on every URL.
Google deprecated HowTo rich results in 2023, yet your template still injects HowTo JSON-LD on every page. Rich Results Test passes, but the rich result never returns — and the markup may now be a liability.
JSON-LD reports 1,247 reviews and a 4.8 star rating, but the page only shows 32. Google flags it as a content mismatch and your star rating disappears from SERPs.
Search Console warns "Missing field author.name" on hundreds of Article pages. The byline string is set, but the JSON-LD shape is wrong — and the rich result quietly disappeared.
Every URL in your sitemap reports today's `lastmod`. Googlebot crawl rate falls instead of rising, because Google now ignores the field as unreliable.
The `<title>` and `<h1>` say different things, so Google picks neither — and rewrites your SERP title to a chopped fragment from body copy.
Your _redirects file is 3,000 lines. Builds are slower, crawlers chase chains, and nobody knows which rules are still needed. How to audit, collapse, and prune.
Your 'Related articles' module shows three near-identical posts on every page. Readers bounce, internal-link equity dilutes. How to detect, score, and de-duplicate.
Search Console reports "No return tags" on your hreflang cluster. Page A points to page B, but B doesn't point back to A. Google ignores the whole annotation.
Your SPA renders fine for users, but Search Console shows the page indexed with a blank `<body>`. Why Googlebot's rendering quirks lose your content — and how to fix it.
You got the Search Console email confirming mobile-first indexing. Two weeks later, pages and impressions drop. The desktop version had content the mobile version doesn't.
A common SEO myth says set `noindex,follow` on page 2+. Google treats long-term noindex,follow as noindex,nofollow — and your deep articles stop getting discovered.
Search Console reports thousands of duplicate URLs with `?utm_*`, `?sort=`, `?ref=` variants. Each parameter combination got indexed as a separate page.
You disallowed `/assets/` or `/_next/static/` in robots.txt to "save crawl budget." Googlebot can no longer render your page. Rankings and rich results decline.
Your sitemap.xml has 73,000 URLs in a single file. Search Console reports "Couldn't fetch" or processes only the first 50,000. How to split a sitemap correctly.
A practical 3-hour competitor analysis for content-site founders: who is ranking, where the gaps are, and whether the niche is winnable as a solo builder — without paid SEO tools.
A 4-hour quarterly review template that surfaces traffic drift, topical decay, and rank loss before they compound — with the exact questions, queries, and decision rules to use.
When indexed-page counts move and you do not know why, slicing Search Console by folder turns a vague problem into a debuggable one in 15 minutes.
Most internal search result pages should be kept out of the index. Here is why, and the two exceptions where letting them in actually wins traffic.
Three controls, three different jobs. Pick the wrong one and you either leak pages into the index, waste crawl budget, or hide content from yourself by accident.
WHOIS privacy hides your name, address, and email from public lookups. The persistent rumor is that Google treats hidden owners as suspicious. Here is what is actually true.
Use AI to identify stale articles, draft targeted refreshes, and bring decaying pages back to top-3 positions.
Use AI to map your internal link graph and surface the 20 highest-value bridge links to add this week.
Use AI to scope international SEO setup across hreflang, locale signals, and currency formatting in 90 minutes.
Use AI to detect cannibalizing article pairs and decide whether to merge or differentiate each one.
Three articles fight for "how to use claude API" — Google ranks none. Detect cannibalization in GSC, pick a pillar, merge the rest or differentiate intent.
First 20 items render, the next 100 load via JS — Googlebot never sees them.
Page updated but Google never re-crawls — server isn't returning `Last-Modified` / `ETag`.
Should `page/2` canonical to `page/1` or to itself? Google's actual current guidance.
The visible "Published / Updated" date does not match `datePublished` / `dateModified` in JSON-LD. Why this is more than a cosmetic bug.
Rich Results Test passes but Search Console flags breadcrumb mismatches, or the breadcrumb in SERP looks wrong. How to align JSON-LD with what users see.
You set canonical to a new URL. Weeks later, Google still ranks the old URL. Why canonical is a hint, not a command, and what actually changes the chosen canonical.
You just split content into categories, and the new category pages refuse to show up in Google. Why category pages need more than a list of links.
A previously well-ranking page disappears from Google. Sometimes this is a quality update, sometimes a manual action. How to tell which.
Search Console "HTML Improvements" or a third-party crawl shows the same meta description on hundreds of pages. Why this matters less than people think, but still matters.
Rich Results Test errors on FAQPage. Search Console removes FAQ rich snippets. Common causes and fixes.
Search Console shows the homepage gets crawled regularly but inner article pages stay at "Discovered — currently not indexed" or are never crawled.
Crawl Stats show Googlebot fetching hundreds of URLs per day, but the Performance report stays at the same flat impression line. Why crawling does not equal traffic.
You set hreflang for en and zh, but Search Console flags missing x-default. What x-default actually does and when you need it.
After redesigning the site, the indexed page count drops in Search Console. What is real damage vs. normal re-evaluation.
Site has thousands of pages but only a fraction get indexed. Why this happens and what actually moves the needle.
When `<meta name="robots">` and `X-Robots-Tag` HTTP header conflict, results are unpredictable. How to decide which to use and how to keep them in sync.
A new domain has been live for 4–8 weeks. Sitemap is submitted, URL Inspection shows pages as "Discovered". Nothing seems to be moving forward.
You added `<meta name="robots" content="noindex">` weeks ago but the page is still in Google. The most common reasons.
A template default or migration mistake left `<meta name="robots" content="noindex">` on pages that should be public. Detection and rollback.
A URL is in your sitemap but has zero internal links pointing to it. Google sees it as unimportant and either delays crawling or skips it entirely.
Search Console says the page is indexed, but the Performance report shows it has never gotten a single impression. Here is what that actually means.
You hit "Request Indexing" 10 times. The URL is still not in Google. Here is why URL Inspection is not what most people think.
Sitemap.xml lists URLs that have `<meta name="robots" content="noindex">`. Google reports them, but inconsistently. Why this happens and how to fix.
The "Indexed" line in Search Console drops by 50–200 URLs overnight. No errors, no changes. What is going on.
Sitemap.xml is submitted, the URLs are clearly listed, but Search Console's Pages report shows none of them. Causes and fixes.
A template refactor or framework upgrade broke JSON-LD blocks site-wide. How to detect and fix.
Google has indexed all your tag archive pages but article pages are still showing as "Discovered" or "Crawled — currently not indexed".
Google quietly stops crawling and indexing the thinnest pages on a site, even if there is no explicit penalty.
Sharing your URL on X/Twitter shows no image, even though og:image is set. Why Twitter cards have their own rules.
The Removals tool in Search Console blocks a URL from appearing in search for ~6 months. People misuse it as a permanent delete or a "fix indexing" button.
You added VideoObject JSON-LD for embedded videos but no video rich result, no video tab listing. Why.
The site-level WebSite JSON-LD declares name / URL / SearchAction that don't match reality. Why this confuses Google and how to align it.
Indexing delays cause panic. Here is the realistic baseline for new sites, new sections, and post-redesign sites.
Etsy SEO tag prompts — generate 13 tags that match Etsy search intent, balance broad with long-tail, and capture gift / occasion buyers without keyword stuffing.
How to build category and tag index pages in Astro that scale, rank, and avoid the thin-page trap, using Content Collections for the data layer.
The minimum-viable SEO setup for an Astro site — what every page must have, why, and the exact patterns that hold up at 500 articles.
How to generate, validate, and submit a sitemap.xml in Astro — including hreflang pairs, exclusion rules, and what Google actually does with it.
Decide whether to launch your first content site as English-only, Chinese-only, or bilingual, with real URL structure, hreflang tags, and sitemap config.
Practical taxonomy + URL design for content sites — section schemas, hub-page templates, and sitemap config that scales from 50 to 1000 articles.
A 10-item pre-launch checklist for sites going live on Firebase Hosting in 2026 — covering domain, SSL, cache, redirects, sitemap, Search Console, and rollback rehearsal.
Vercel is famous for Next.js, but it is also excellent for Astro content sites. Use this vercel.json setup, image config, and bandwidth math to decide.
Validate a content topic before writing — use this autocomplete + SERP scrape script, demand-signal scorecard, and free Keyword Planner workflow.
Depth-first vs breadth-first for a new content site, with a hub-stage decision rule, content plan template, and Search Console check.
A practical 2026 framework for picking a content-site niche with actual search demand, real intent, and low enough competition to rank as a solo builder.
Design a long-tail content site so structure, slugs, and internal links scale from 1 to 500 articles. Includes content sheet template and slug enforcement.
A pre-launch checklist for content sites going live on Vercel in 2026 — covers Astro / Next.js settings, custom domain, env vars, sitemap, canonical, analytics, and rollback.
Decide bilingual go/no-go with Search Console signals, hreflang layout template, and a 30-article re-evaluation gate.
Use AI to pre-review your site against AdSense common rejection causes before applying.
A repeatable AI audit workflow for Astro content sites — catches broken slugs, missing translations, dead internal links, draft leakage, and config drift.
A monthly workflow to surface broken internal + external links using AI + simple tooling.
Category pages are an under-audited SEO surface — this AI workflow catches duplicate intros, weak internal linking, missing descriptions, and bad pagination.
Use AI to identify pillars, cluster topics, and content gaps for your niche.
Use AI to audit hreflang implementation — slug mismatches, missing pairs, wrong codes.
Use AI to write meta descriptions at scale — without losing the quality that matters.
A weekly 30-min AI workflow to triage Search Console: index coverage, Core Web Vitals, mobile usability, manual actions.
Skip the generic 80-point SEO checklist. Generate one tailored to your stack (Astro / Next / Hugo / WordPress) with AI in 20 minutes.
A 3-pass AI workflow that scores every page for thin content, surfaces the worst offenders, and proposes one of: expand, merge, noindex, delete.
A reusable Claude Code prompt + workflow to run a real SEO audit on your content site.
Use Codex to spot-check sitemap correctness, freshness, and coverage.
Search Console says alternate — it's not an error. Here's how to read it.
Switched domains but old canonical still appears — config + cache.
Bilingual / multi-locale pages have canonicals pointing the wrong direction.
Wrong canonical leads to dropped pages or wrong URL ranking.
Description in SERP isn't yours — usually too generic, missing the keyword, or duplicated across pages.
Updated for 2026 — quick triage page for the Search Console error: what the bucket means, the first three things to check, and how to confirm the fix worked. For indie-site authority strategy, see the linked guide.
You set a canonical but Google picked another URL. Why and how to fix.
Google replaced your title tag with a shorter / different one in search results.
Decoding the hreflang errors shown specifically in Google Search Console's International Targeting report — return tag missing, language code invalid, and how to act on each.
What hreflang warnings mean, why mismatched clusters happen, and the minimal fixes that resolve return-tag and language-code errors. For the Search Console International Targeting workflow, see the linked page.
Brand-new domain takes weeks to start ranking. Here's what actually helps.
Pages exist but Google doesn't see your internal links pointing to them.
Short, thin pages get deprioritized. What "depth" really means.
Search results show a Google-generated snippet instead of your meta description.
Search Console flagged pages as not mobile-friendly. Typically viewport, tap targets, or content cut-off.
Sharing a page on Twitter / LinkedIn / Slack shows no preview image.
Share to LinkedIn / X / Slack and no image shows — meta tag, image size, or cache.
Indexed status, zero impressions. Usually content depth, intent mismatch, or topical authority.
Previously indexed pages now missing. Typically duplicate consolidation, quality drop, or technical regression.
You changed robots.txt but Google still crawls / indexes the same URLs.
The 7 most common "not indexed" reasons and the right fix for each.
/sitemap.xml or /sitemap-index.xml returns 404 — integration off or `site` missing.
Submitted a sitemap to Search Console weeks ago and still no indexing — or only a few pages indexed? Sitemap submission ≠ indexing. Here are the six most common real reasons and their fixes.
Google flagged your page as "Soft 404" — content too thin to be a real page.
Rich Results Test shows your JSON-LD has warnings or invalid fields.
JSON-LD warnings — typically missing required fields, wrong type, or out-of-spec fields.
Google replaces your title with H1 / sitename. Usually title quality, keyword stuffing, or relevance.
Search Console says URL unknown. The URL exists, but Google has never seen it. Next steps.
14 prompts that earn the first 80 words back from the bounce — search-intent payoffs, contrarian leads, problem-agitate, data hooks, pattern-break openers, by reader temperature.
17 copy-ready blog outline prompts for SEO posts, pillar pages, comparison articles, and cluster content — the outline does the heavy lifting so the draft writes itself.
14 prompts for meta descriptions that earn the click — search-intent payoff, listicle, comparison, without-X angle, CTR rewrite, and a diagnose-and-fix for the ones Google quietly rewrites.
12 prompts to AI-audit a content site — metadata, internal links, hreflang, thin content, canonical, schema, cannibalization, Core Web Vitals — with measurable thresholds and fixes.
15 prompts to write SEO titles that rank and earn the click — within 60 characters, with intent, brackets, year cues, and listicle hooks.
Duplication kills indexing once you cross a few hundred articles. Here's the script-driven workflow to catch it before Google does.
The specific signals that make AI-written articles read as low-quality, and the concrete edits that fix them — from a 2026 publisher's perspective.
A clear, practical guide to canonical tags — what they do, when they help, and the four mistakes that quietly break indexing.
Volume buys coverage, quality buys rankings. Use this Search Console-driven decision flow and quality-floor checklist to choose the right move.
Google crawled your page and decided not to index it. The reasons are almost always quality, duplication, or authority — not technical. The real fix list.
Why Google leaves new indie-site URLs in the Discovered bucket and the multi-week strategy that actually moves them out — authority, internal links, content depth, and realistic timelines.
A domain change always costs some short-term ranking. Use this 301 redirect playbook, Search Console move steps, and verification commands.
A 2026 workflow for using AI to spot the articles your site should have but doesn't — using your own sitemap, Search Console data, and a topic pillar map.
A real diagnostic checklist for when you publish content and Google ignores it. The order matters — start at the top.
A practical guide to `hreflang` for English/Chinese (or any two-language) sites — what to put in `<head>`, what most generators get wrong, and how to verify it works.
Sites past 1,000 articles need scripts, not willpower. Use this content-index generator, dupe scanner, and retire-shelf workflow to stay sane.
A realistic 30-day playbook for a brand-new content site: indexing, structure, the first 20 articles, and what to ignore.
Next.js does not break SEO, but it has footguns. Use this metadata API, sitemap.ts, and view-source checklist before you ask Google to crawl.
The Pages report shows indexed vs. not-indexed and why. Most of the scary categories are actually fine. Here is how to triage.
A correctly served sitemap and robots.txt are non-negotiable for SEO. Here is the App Router idiom for both in 2026.
Design pillar and cluster pages so Google understands your topical authority. Includes content schema, link-audit script, and pillar page template.
A surgical guide to robots.txt for indie sites — the two-line default that works, the rules that quietly deindex you, and the difference vs noindex.
A quarterly content audit with concrete scripts: URL inventory, Search Console join, dead-page flags, dupe scanner, broken-link checker, and decision log.
Root vs www is cosmetic — until both serve the same content. Use this DNS table, redirect config, and curl verification to lock it down once.
A practical 2026 framework for using AI to produce dozens of articles per month without triggering helpful-content penalties or losing the voice that makes the site work.
URL Inspection shows "User-declared canonical" and "Google-selected canonical". When they disagree, here is why and what to do.
Use Codex (or any code-aware AI agent) for a technical SEO review with copy-paste prompts, dist/ inspection commands, and a structured-data validator.
Pick a sustainable publishing cadence with a real 8-week measurement, batch workflow, and backlog-tracker script you can drop into your repo.
A decision framework for indexing category / hub pages — when they earn their keep, when they cannibalize, and how to turn empty categories into real landing pages.
A clear answer to the WordPress-era question — when tag archives help SEO, when they hurt, and the exact noindex pattern to ship.
A no-jargon intro to JSON-LD structured data — the three schema types every content site should ship, and the ones to ignore until you scale.
`blog.yoursite.com` or `yoursite.com/blog`? Use this Vercel/Next/Cloudflare rewrite config to keep SEO consolidated on one domain.
The shortest 2026 submission flow: verify, sitemap, request indexing on one URL, link from the homepage — plus how long each step actually takes for a brand-new domain.
The 5-minute sitemap submission flow, plus the 3 status messages that mean trouble and how to fix each.
A clear breakdown of what Google penalizes about ads — and the specific thresholds that turn "monetization" into "SEO problem".
URL Inspection tells you what Google sees, when it last crawled, and why a page is or is not indexed. Read it correctly with the API + UI guide here.
A plain-English explainer for sitemap.xml — what it does, what it does not do, and the four fields most generators put in that Google never reads.
Refresh only what's worth refreshing — with the Search Console query that finds position 8-20 articles, a refresh template, and re-indexing flow.
A 2026 guide to writing `<title>` and `<meta name="description">` that earn clicks — length limits, intent matching, and what Google rewrites anyway.
Search Console says 'Crawled — currently not indexed'? Google looked, but chose not to index. Here are the 5 most likely causes in order, and the shortest fix path.
noindex blocks indexing. robots.txt blocks crawling. They are not the same. A short table for when to use which, and the most common mistake that breaks both.
The full submission playbook — Search Console, sitemap, robots.txt configuration, manual indexing requests, and internal-link signals. For the shorter 2026 four-step order, see the linked guide.
Your site has been live for a week and still nowhere on Google? Work through the 10 most common causes: robots, noindex, wrong canonicals, JS rendering, sandbox period, thin content.