Skip to content

Conversation

@lenaorobei
Copy link

@lenaorobei lenaorobei commented Jan 7, 2026

Improve Broken Backlinks SEO Quality

🎯 Changes

  • Use Anchor Text for Matching
    The clickable text of a link (anchor text) is now the primary signal when finding replacement URLs. If a broken link says "pricing plans", we prioritize pages about pricing rather than just matching URL patterns. → Better semantic matching

  • Stop Suggesting Homepage as Fallback
    Previously, when no good match was found, we'd suggest redirecting to the homepage. Google treats homepage redirects from deep URLs as "soft 404s" which hurts SEO. Now we return no suggestion rather than a bad one. → Protects SEO health

  • Prioritize High-Authority Backlinks
    Previously, broken links were ranked by traffic volume alone. Now we factor in Domain Rating (site authority) — a link from a trusted site is worth more than many links from unknown blogs. High-authority backlinks appear at the top. → Fix valuable links first

  • Ensure Content Diversity in Matching
    Previously, alternative URLs were just the top 200 by traffic, which often skewed toward one content type. Now we sample proportionally across different sections of the site to find good matches for any type of content. → Matches for all content types

  • Reduce AI Processing Costs
    Removed a redundant AI analysis step that duplicated work already done earlier in the pipeline. Saves ~33% of AI tokens per broken link with no loss in quality. → Lower costs


🔗 Related PRs

  1. spacecat-shared — Adds new data fields from Ahrefs API - Improve Broken Backlinks SEO Quality #1252
  2. spacecat-audit-worker — Implements ranking, sampling, and prompt changes - Improve Broken Backlinks SEO Quality spacecat-audit-worker#1830
  3. mystique — Updates AI matching to use anchor text and new rules - https://git.corp.adobe.com/experience-platform/mystique/pull/1107

…cklinks

Add anchor text and domain rating fields to getBrokenBacklinks API response
to support improved SEO matching in broken backlinks audit.

Fields added to select:
- anchor: The clickable text of the backlink (strongest signal for content intent)
- domain_rating_source: Authority metric of the linking domain (for ranking)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant