Content pruning + refreshing
📖 4 min readUpdated 2026-04-18
Most sites publish content and never touch it again. Compounds that content into Google's "low quality" bucket. Pruning and refreshing, auditing what's working, updating what's close, removing what isn't, can lift site-wide rankings within months.
Why content pruning matters
- Google's Helpful Content signal is sitewide. Bad content drags good content down.
- Thin pages waste crawl budget and dilute internal link equity.
- Old, outdated content loses user trust, dated screenshots, old prices, dead references.
- Cannibalization: duplicate-ish pages compete with each other.
The audit workflow
Step 1: Inventory every URL
Crawl (Screaming Frog) + export rankings + traffic (GSC + Ahrefs/SEMrush).
Step 2: Pull data per URL
For each: impressions, clicks, avg position, word count, last-updated date, # of backlinks, number of internal links pointing to it.
Step 3: Categorize
Four buckets:
- Keep + leave alone, ranking well, getting traffic, still current. Don't touch.
- Refresh, ranks close to top 10 or gets impressions but not clicks, outdated content, worth keeping.
- Consolidate, thin page that overlaps another. Merge content + redirect URL.
- Remove, zero traffic, zero value, nothing worth salvaging. 410 (or noindex if it must remain accessible).
Refresh: what a real update looks like
- Re-check accuracy of every factual claim
- Update statistics, screenshots, prices, product names
- Add sections covering recent developments or PAA questions
- Update internal links to point to current cluster structure
- Improve structure (better H2s, lists, comparisons)
- Add original imagery or data
- Update the visible "last updated" date AFTER making substantive changes
Refresh: what doesn't count
- Changing "2025" to "2026" in the title only
- Minor wording tweaks
- Updating only dateModified in schema
- Adding a paragraph and calling it updated
Google detects these patterns. Fake refreshes don't earn freshness signals.
Consolidation
You have two pages: "Best CRM for Insurance" and "Top Insurance CRM Tools." Identical intent, split authority. Action:
- Merge the best content from both into one strong page
- Choose the better URL (higher traffic, more backlinks, or cleaner slug)
- 301 redirect the weaker URL to the winner
- Update internal links to point at the winner
Removal
For content that shouldn't exist:
- 410 Gone, removes it cleanly from index. Preferred.
- Noindex, keeps URL accessible but hides from search. Use when the URL still serves a purpose (legal pages, admin redirects).
- 301 redirect to a related page, use sparingly. Works when a relevant target exists; becomes soft-404 if you redirect everything to homepage.
- Delete + 404, fine if the URL has no external links. If backlinks exist, 301 instead.
Expected timeline
- Refreshes: ranking improvements within 2-6 weeks
- Consolidations: similar
- Removals (sitewide quality lift): 6-12 weeks as Google re-evaluates the site
Pruning-driven traffic recovery
Well-documented pattern: a site with thousands of thin pages prunes aggressively (drops 50-80% of URLs). Over the next 90 days, remaining content's rankings rise. Total traffic often exceeds the original level, from fewer pages.
Cadence
- Small sites (<500 URLs): annual audit is enough
- Medium (500-5k URLs): quarterly
- Large (5k+): rolling; 10-20% of content reviewed monthly