Log file analysis shows what Googlebot actually crawls, how often, where it goes, where it fails. For large sites, it's the only way to see crawl budget in action. The tools do the heavy lifting. This page walks through the major tools, their tradeoffs, and when log analysis is worth the setup.
GSC shows you what Google reports it crawls. Server logs show what actually happened. The two don't always match. Log analysis catches:
Desktop app (separate from the Screaming Frog crawler). Imports log files, joins with crawl data, visualizes.
Cloud platform. Integrates log analysis with crawl data + Analytics + GSC.
Enterprise platform. Similar to OnCrawl but aimed at very large sites.
Bundled with Semrush subscriptions. Basic log analysis.
For teams with engineering resources, custom pipelines using Python, Elasticsearch + Kibana, or BigQuery provide ultimate flexibility.
For each section of site (blog, category, product pages, legal), how often does Googlebot visit? High-value pages should be visited often; low-value should be visited less.
Of URLs Googlebot crawls, how many end up indexed? Ratio <50% = quality issues.
Percent of Googlebot requests returning 404. Should be <1%. Higher = broken links to be fixed.
Should be near 0. Any consistent 5xx traffic indicates server reliability problems.
Should be <500ms. Slow response = crawl budget wasted, rankings can suffer.
Any URL where Googlebot follows 3+ redirects is a chain. Clean up.
URLs Googlebot crawls that aren't internally linked anymore. Often old URLs from a prior version of the site.
In your sitemap but Googlebot never visited. Crawl budget or discoverability issue.
Depends on hosting:
/var/log/nginx/access.log or similarNot all "Googlebot" in logs is real. Verify via reverse DNS:
googlebot.com or google.comAll the mentioned tools automate this.
After major changes (migrations, redesigns), daily for 2-4 weeks.
Real scenarios where log analysis surfaced issues invisible elsewhere:
If your site is over 10,000 URLs, run a one-off log analysis this quarter. Screaming Frog Log Analyser is the cheapest entry. You'll find at least one issue that GSC didn't show.
Next: SEO reporting, how to turn all this data into decisions.