Crawling + indexability

The first two questions any technical SEO audit has to answer. Can Google find my pages? Can it index them? Until both answers are "yes," every other SEO effort is wasted. You can write the greatest page in the world. If Googlebot can't reach it, or reaches it but declines to add it to the index, it doesn't exist for search purposes. This page walks through what actually blocks crawling, what blocks indexing, and how to debug a page that should be ranking but isn't.

The mental model

Crawling and indexing are two different gates your page has to pass. Both have to succeed. Either can fail silently.

Crawling is reaching the page. Indexing is Google deciding to add it to its searchable database. A page can be crawled and then not indexed. It can be indexable in theory but never crawled. Separate the two in your head and you'll troubleshoot faster.

The full pipeline

Crawling: what blocks it

Indexability: what blocks it

Debugging: is my page indexed?

Step-by-step debug process

  1. Search site:yourdomain.com/your-url in Google. If it appears, it's indexed. If not, keep going.
  2. Open Search Console. Paste the URL into URL Inspection. Check coverage status.
  3. Match the status against the table above
  4. Fix the underlying cause (not the symptom)
  5. Click "Request Indexing" once the cause is fixed
  6. Wait a few days. Recheck.

Crawl budget, when it matters

Google allocates a finite crawl budget per site: roughly how many pages per day it will fetch. For sites under a few thousand pages, this is never a concern. Google has plenty of budget.

For sites over 10,000 pages, crawl budget matters. Poorly optimized crawl budget means important pages get crawled less often, take longer to update, and drift out of sync with what's actually on the site.

Optimizing crawl budget:

The tools

The quick-win list

What to do with this

Open Search Console right now. Go to the Pages report. Look at the "Why pages aren't indexed" breakdown. Every bucket there is an opportunity. Work through the largest bucket first. This is often the highest-leverage afternoon of SEO you'll have.

Next: XML sitemaps, the main way you tell Google which pages exist and matter.