How to Crawl Authority Sites for External Links

Gyi Tsakalakis
February 12, 2014

Yesterday I wrote that I like to crawl authority sites for external links. Some of you wondered how to do that. There a bunch a of ways to do this. This is one way.

What is an authority site?

Since we are trying to identify high-quality, authoritative sites for SEO, we turn to Google:

Of course, we aren't disclosing the actual ranking signals used in our algorithms because we don't want folks to game our search results; but if you want to step into Google's mindset, the questions below provide some guidance on how we've been looking at the issue:

  • Would you trust the information presented in this article?
  • Is this article written by an expert or enthusiast who knows the topic well, or is it more shallow in nature?
  • Does the site have duplicate, overlapping, or redundant articles on the same or similar topics with slightly different keyword variations?
  • Would you be comfortable giving your credit card information to this site?
  • Does this article have spelling, stylistic, or factual errors?
  • Are the topics driven by genuine interests of readers of the site, or does the site generate content by attempting to guess what might rank well in search engines?
  • Does the article provide original content or information, original reporting, original research, or original analysis?
  • Does the page provide substantial value when compared to other pages in search results?
  • How much quality control is done on content?
  • Does the article describe both sides of a story?
  • Is the site a recognized authority on its topic?
  • Is the content mass-produced by or outsourced to a large number of creators, or spread across a large network of sites, so that individual pages or sites don’t get as much attention or care?
  • Was the article edited well, or does it appear sloppy or hastily produced?
  • For a health related query, would you trust information from this site?
  • Would you recognize this site as an authoritative source when mentioned by name?
  • Does this article provide a complete or comprehensive description of the topic?
  • Does this article contain insightful analysis or interesting information that is beyond obvious?
  • Is this the sort of page you’d want to bookmark, share with a friend, or recommend?
  • Does this article have an excessive amount of ads that distract from or interfere with the main content?
  • Would you expect to see this article in a printed magazine, encyclopedia or book?
  • Are the articles short, unsubstantial, or otherwise lacking in helpful specifics?
  • Are the pages produced with great care and attention to detail vs. less attention to detail?
  • Would users complain when they see pages from this site?

Obviously, not all of these will apply to every site, page, article, etc. That is why they are called guidelines.

You should also review Google's Search Quality Rating Guidelines.

You should also use the best law firm SEO tool ever created: your brain.

You probably already know what sites are authoritative on your subject matter. A few examples that I usually check out:

  • Government sites
  • Education sites (universities, law schools, etc)
  • Professional organization sites
  • State Bar sites
  • News sites
  • Real blogs

You can probably think of many more.

Once you have a good list of these sites (say 20 or so), you can crawl them for external links.

How to crawl your selected target sites

As previously mentioned, there are a bunch of ways to crawl sites. I prefer Screaming Frog.

Screaming Frog

Simply enter the domain you would like to crawl, click the "External" tab and click start. Screaming Frog will identify sites/pages that your target site is already linking to.

Export the list to a .csv and dive into new link opportunities.

What to do with the crawl data

You should parse your crawl data for:

  • Links to other sites that are similar to yours
  • Broken links
  • Links to images
  • Links to specific pages (as opposed to home page links)

The idea here is to identify places your target site is already linking as an indication that they might also be willing to link to you.

Hint: The fact that links to is not an indication that CNN is likely to link to you.

Generally speaking, the more relevant and local your site is to your law firm, the better opportunities you are likely to uncover.

Once you've identified some good candidates, it's time to start thinking about why the target site might link to you.

Does your site have a great page that could replace a broken link resource?

Has the other site published articles on subjects on which you have expertise?

Do you have something cool on your site that is pretty obviously interesting to the other site's audience?

If the answer is yes to any of these, it might be time to introduce yourself and your site to the people at your target site.

This doesn't mean mass emailing webmasters of a bunch of target sites. That's spam friend.

Instead, introduce yourself as you would in real life.

Do you go up to people at networking events and ask them for links? I hope not. So don't do that online.

Start a conversation. If and when it seems appropriate, suggest something that would:

  • Improve their site (fixing a broken link)
  • Help their audience
  • Help them

This is one of most frictionless ways to acquire links from high-quality sites.

But remember, high-quality sites don't maintain their quality by linking to every Jack and Jill that comes along.

If you're going to approach these folks, you better bring something worth linking to.

Gyi Tsakalakis
Co-Founder of AttorneySync
Notify of
Inline Feedbacks
View all comments

Let's Discuss Your Growth

Helping law firms grow profitably with digital marketing and can prove it.

Barry Conybeare

We are so impressed with AttorneySync and their digital marketing services. We started with an audit of our digital presence and hired them thereafter. Fixes to the website and other platforms quickly followed and we have seen a substantial increase in online traffic, new client calls, and new client contracts. Strongly recommend AttorneySync!

Let's Talk

How can we help?

More From Our Blog

June 8, 2023
On Local Law Firm SEO Rank Tracking and Reporting

Over the years, law firm prospects have sent us reports from just about all of our competitors. Unfortunately, even today, some law firm marketing agencies still mislead their clients via "reporting." One particularly egregious example comes in the form of ranking reports. Which prompted this LinkedIn post. To my surprise, I received a lot of […]

Read More
June 7, 2023
On Attribution

John Wanamaker supposedly said "Half the money I spend on advertising is wasted; the trouble is I don't know which half." In an an effort to figure out "what half is working," attribution was born. Coupled with a transition from traditional, offline ads to digital media, attribution became the holy grail for analyzing advertising spends. But […]

Read More
March 22, 2023
I asked ChatGPT to Recommend a Personal Injury Law Firm

I recently asked ChatGPT, "What are some of the top personal injury law firms in Chicago?? Actually, first I ask "who are some of the top personal injury lawyers in Chicago?" ChatGPT couldn't handle that one, so I modified the prompt. ChatGPT listed five very well-known firms downtown. Can you guess the other four? That's […]

Read More
February 17, 2023
My Take on Google Search's Guidance on AI-Generated Content & Law Firm SEO

If you're like me, you have some degree of AI, ChatGBT, Bard, exhaustion. Now don't get me wrong, this is stuff is remarkable and is changing, well, a lot. But before you hook up the ChatGPT API to your WordPress API and crank out 10,000 pages, here are a few things to think about. Let's […]

Read More
February 16, 2023
Google Adds New SEO Best Practices for Links

If you know me, you know my opinions about links and SEO advice from Google. If you don't, here's the TL;DR: Meh, links! Meaning, all things being equal, links still remain a competitive difference maker for ranking. Take Google's SEO advice with several grains of salt. Google has no economic incentive to help your site […]

Read More
February 1, 2023
Authenticity & Engagement

The best marketing advice I can give you is to be authentic. Of course, you don't find that very helpful in terms of meeting your growth goals. So, you might decide to game the system. As I'm writing this, one of the more popular ways to gain the system is to pay for engagement. This […]

Read More
January 19, 2023
ChatGPT for Legal Marketing

The following post was written by ChatGPT. ChatGPT, developed by OpenAI, is a state-of-the-art language model that can generate human-like text based on a given prompt or context. This technology has the potential to revolutionize the way that businesses, including law firms, market themselves to potential clients. One way that a law firm could use […]

Read More
May 26, 2022
SEO Expectations for Law Firms

How long does SEO take? When can I expect to see results? What results should I expect to see? These are all reasonable questions that we field from lawyers every day. And, like many legal answers, the answer is: It depends. Yes, I know that's not the answer you wanted. But it's the most honest […]

Read More
May 6, 2022
Who Should Do Marketing?

And how much time should they spend doing it? I recently had the privilege of chatting with Tyson, Jim, and Conrad for an upcoming episode of The Maximum Lawyer Podcast. If you're not familiar with The Maximum Lawyer community, you should definitely check it out. Jim asked a really great question about who should do […]

Read More

Let's Discuss

Questions or comments? Let's discuss on social!
linkedin facebook pinterest youtube rss twitter instagram facebook-blank rss-blank linkedin-blank pinterest youtube twitter instagram