A systematic internal linking audit template identifies orphaned pages, diluted link equity, and missed topical cluster opportunities. This framework walks through the columns, data-gathering steps, and prioritization logic needed to transform a crawl export into actionable internal linking decisions.
Start with a spreadsheet that includes URL, page title, inbound internal link count, outbound internal link count, and crawl depth. Crawl depth tells you how many clicks from the homepage a page sits—anything beyond three or four clicks risks marginalization. Add a column for page type: product, service, blog post, category, resource. This lets you filter and spot patterns, like blog posts linking heavily to other blog posts but ignoring service pages. Include a strategic importance tag—high, medium, low—based on conversion potential or organic traffic contribution. This subjective layer prevents you from treating a throwaway archive page the same as a flagship service offering. Finally, reserve a notes column for recording why a page is orphaned or what cluster it should join. These fields turn a crawl export into a decision tool rather than a data dump.
Run a full-site crawl in Screaming Frog or Sitebulb and export the internal link metrics. Most crawlers give you inbound and outbound counts by default. Cross-reference the URL list against your sitemap to catch orphans—pages the crawler found but that receive zero internal links. For larger sites, filter by HTTP status to exclude redirects and errors before analysis. Then layer in manual context: open your analytics and tag high-traffic pages or high-converting URLs in the strategic importance column. If you run topical clusters—pillar pages supported by subtopic posts—flag which URLs belong to which cluster. Canadian sites serving both English and French should run separate crawls per language subdirectory or use hreflang data to ensure each language silo's internal linking stays cohesive. The crawl gives structure; your business knowledge gives meaning.
Sort by inbound link count ascending. Pages with zero internal links are orphans—Google may not index them unless they appear in the sitemap or get external backlinks. Common culprits include new blog posts not yet added to category archives, one-off landing pages from past campaigns, or dynamically generated URLs outside the navigation. Next, sort descending to find pages with unusually high inbound counts. A blog post with two hundred internal links often signals boilerplate footer or sidebar links inflating the number without adding topical relevance. Compare outbound counts, too: pages linking to fifty other URLs scatter equity; pages linking to none are dead ends that break user flow. Flag these extremes. The goal is a distribution where strategic pages receive proportional attention and every page forwards users deeper into relevant content, not into voids or link farms.
Rank issues by revenue or lead-generation impact first. An orphaned product page or service landing page takes precedence over an orphaned blog post from two years ago. Within the high-value tier, prioritize pages already ranking on page two for competitive keywords—adding internal links can push them onto page one. Next, address pages at crawl depth four or deeper that should be shallower: move them into category hubs or link from pillar content. Low-priority fixes include cosmetic imbalances, like a blog post with three links instead of five, unless it belongs to a cluster you're actively building. Track fixes in a status column—planned, in progress, completed—and assign owners if you work with developers or content teams. Reassess quarterly as new content publishes and old pages drift. The template is a living document, not a one-time checklist.
If you organize content into topical clusters—one pillar page linking to related subtopic posts—use the template to verify completeness. Filter by cluster tag and check that the pillar links to all subtopics and that subtopics link back to the pillar and to sibling posts where contextually appropriate. Missing links weaken the cluster signal. For silo architectures, ensure category pages link down to child pages and that child pages link back up and laterally within the silo, but avoid cross-silo links unless the topics genuinely overlap. Bilingual Canadian sites should confirm that English clusters stay in the English path and French clusters in the French path, with hreflang tags handling language switching rather than internal cross-language links that dilute topical authority. The audit template surfaces these structural breaks before they fragment your site's semantic coherence.
Export a filtered view of high-priority rows—orphans, shallow-link strategic pages, broken cluster connections—and convert each into a discrete task. For orphans, the task is add contextual link from relevant existing page, specifying the source URL and anchor text. For over-linked pages, remove boilerplate or footer links and replace with editorial mentions. For dead ends, add two to four outbound links to related content. Assign each task a target completion date and responsible party. If your CMS is WordPress, these often become simple edits; for custom platforms, they may require developer tickets. Track completion in the status column and re-crawl monthly to confirm changes indexed. The template shifts from diagnostic to operational when every row maps to a concrete action, and your internal linking evolves from accidental to architected.
Screaming Frog and Sitebulb both provide inbound and outbound internal link counts, crawl depth, and URL lists in CSV or Excel format. Export the internal links report, filter by HTML pages, and merge with your analytics or CMS data for strategic tagging. For very large sites, enterprise tools like Botify or DeepCrawl offer similar exports with more granular crawl control.
Prioritize orphans that already have external backlinks or organic impressions in Search Console, because they have latent authority waiting to distribute. Next, fix orphaned product or service pages that align with current business goals. Leave orphaned test pages, outdated campaigns, or duplicate content at the bottom of the queue, or delete them entirely if they serve no purpose.
Quarterly audits catch new orphans from recent publishing and prevent crawl-depth drift as the site grows. After major site migrations, redesigns, or CMS changes, run an audit immediately to verify navigation and template links survived intact. High-velocity content sites—publishing daily—may benefit from monthly spot-checks on new URLs to ensure they integrate into existing clusters.
Yes. Schedule monthly Screaming Frog crawls via command-line mode and export link metrics to a shared folder. Use scripts or tools like Google Sheets integrations to flag new orphans or pages crossing crawl-depth thresholds automatically. Manual review still matters for strategic tagging and cluster assignments, but automation reduces the data-gathering overhead and ensures you catch issues faster.
The template reveals whether your cluster structure actually exists in code or just in your content calendar. By filtering on cluster tags, you see missing pillar-to-subtopic links, orphaned supporting posts, and dead-end pages that break the cluster's semantic web. Fixing these gaps consolidates topical signals and helps Google recognize your depth of coverage on a subject.
Identify three to five existing pages that contextually relate—service pages, related blog posts, category hubs—and add editorial links with descriptive anchor text. Avoid forcing links into unrelated content or footer boilerplate. If the page genuinely lacks natural link opportunities, consider whether it belongs in your navigation, a relevant category, or a pillar hub, and adjust your site architecture accordingly.