Inside the Google Algorithm Leak

Imagine waking up one morning to discover that the secret recipe behind Google's search rankings has been revealed.

That is close to what happened when Rand Fishkin, CEO of SparkToro and co-founder of Moz, published details of a leak of internal Google documentation.

He said, "In the last quarter century, no leak of this magnitude or detail has ever been reported from Google's search division."

The leak was extensive: more than 2,500 pages of internal documentation describing over 14,000 attributes, many of which appear to influence search rankings. It has lifted some of the mystery around Google's algorithms and brought several previously unknown concepts to public attention.

Let's dive into the leak in detail.


What Exactly Happened?

The documents first became public on March 13th, 2024, when thousands of pages from Google's internal Content API Warehouse were published on GitHub by an automated bot known as yoshi-code-bot. Rand Fishkin obtained the material on May 5th, 2024, and made it public on May 27th. The source of the leak was initially unknown but was later revealed to be Erfan Azimi of EA Eagle Digital.

"The leak mostly confirmed a lot of things SEO testers have already known for years. For example, the leak suggests that triggering a peak of spammy anchors over a period of time can result in a penalty," says Ted Kubaitis, founder and CEO of SEO Tool Lab.

However, the leak has also upset many SEO professionals and digital marketers, because it points to factors that Google had previously denied using, such as user data in its search algorithms.

It's important to note that the leaked documentation describes an API for cloud-based document storage and retrieval, not the organic search algorithms themselves. And while the documents point to some ranking factors, they leave out an important detail: how much weight each factor carries in the overall ranking process.

"The attributes aren't ranked in any way. So even getting confirmation that, yes, post-search click behaviour is a factor that impacts your site's ability to rank is only marginally helpful—because we don't know how significant of a factor it actually is," explains marketing expert Phil Stott.


Key Findings from the Google Algorithm Leak

Here are the standout takeaways from the Google algorithm leak:

  • NavBoost

This feature assesses search demand by tracking how often a keyword is searched and how often its search results are clicked. It also distinguishes between long and short clicks and rates queries based on what users are really looking for. For example, if users spend a lot of time on videos or images related to a query, NavBoost will trigger video or image features for that query and similar ones.
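To make the long-click versus short-click idea concrete, here is a minimal sketch in Python. It is not Google's implementation: the `Click` record, the `summarise_clicks` helper, and the 30-second cutoff are all assumptions made for illustration, since the leaked documents do not say how a long click is defined.

```python
from collections import Counter
from dataclasses import dataclass

# Hypothetical cutoff: the leak does not state what counts as a "long" click.
LONG_CLICK_SECONDS = 30.0


@dataclass
class Click:
    query: str
    url: str
    dwell_seconds: float  # time on the page before returning to the results


def summarise_clicks(clicks):
    """Count long and short clicks for each (query, url) pair."""
    counts = {}
    for click in clicks:
        kind = "long" if click.dwell_seconds >= LONG_CLICK_SECONDS else "short"
        counts.setdefault((click.query, click.url), Counter())[kind] += 1
    return counts


# Made-up click log for a single query.
clicks = [
    Click("python tutorial", "example.com/a", 95.0),
    Click("python tutorial", "example.com/a", 4.0),
    Click("python tutorial", "example.com/b", 3.0),
]
summary = summarise_clicks(clicks)
```

In this toy log, the first result collects one long and one short click, while the second collects only a short click, which is the kind of difference a click-based signal could pick up.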


  • Twiddlers

Twiddlers are specialised re-ranking functions that either adjust a document's information retrieval score or change its position in the results directly. They let Google fine-tune search results after the initial scoring.
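As a rough illustration of re-ranking functions, the sketch below chains two small functions over an already scored result list. Everything here is hypothetical: the `Result` fields, the two example twiddlers, and the 0.5 and 0.1 adjustments are invented for the example and do not come from the leaked documents.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Result:
    url: str
    score: float  # initial retrieval score (made-up values below)
    exact_match_domain: bool = False
    satisfied_users: bool = False


def demote_exact_match_domains(results):
    """Hypothetical twiddler: halve the score of exact-match domains."""
    return [replace(r, score=r.score * 0.5) if r.exact_match_domain else r
            for r in results]


def boost_satisfying_results(results):
    """Hypothetical twiddler: small boost where users appear satisfied."""
    return [replace(r, score=r.score + 0.1) if r.satisfied_users else r
            for r in results]


def apply_twiddlers(results, twiddlers):
    """Run each twiddler in order, then re-sort by the adjusted score."""
    for twiddle in twiddlers:
        results = twiddle(results)
    return sorted(results, key=lambda r: r.score, reverse=True)


results = [
    Result("best-cheap-widgets.example", 0.9, exact_match_domain=True),
    Result("widgetreviews.example", 0.8, satisfied_users=True),
    Result("widgets.example", 0.85),
]
reranked = apply_twiddlers(
    results, [demote_exact_match_domains, boost_satisfying_results]
)
```

The page that started in first place ends up last once the demotion is applied, which shows how a re-ranking step can reorder results without touching the original scoring.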


  • Demotions

Your content can be demoted for a range of reasons. These include links that don't match the target page, negative SERP signals that show user dissatisfaction, poor product reviews, irrelevant locations, exact-match domains, and adult content.


  • Change History

According to the leak, Google keeps track of the last 20 changes made to a webpage. So, if you want to make sure that old or outdated content doesn't affect your current rankings, you'll have to update the page more than 20 times.
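One simple way to picture a store that only remembers the last 20 versions is a fixed-size queue. The sketch below uses Python's `collections.deque` with `maxlen=20`. The limit of 20 comes from the article; the `PageHistory` class and its methods are hypothetical and say nothing about how Google actually stores page versions.

```python
from collections import deque

MAX_VERSIONS = 20  # figure from the article; the rest is illustrative


class PageHistory:
    """Keeps only the most recent MAX_VERSIONS versions of a page."""

    def __init__(self):
        self._versions = deque(maxlen=MAX_VERSIONS)

    def record(self, content):
        # When the deque is full, appending silently drops the oldest entry.
        self._versions.append(content)

    def versions(self):
        return list(self._versions)


history = PageHistory()
history.record("outdated copy")
for i in range(1, 21):
    history.record(f"revision {i}")
```

In this model, once more than 20 versions in total have been recorded, the oldest one ("outdated copy") is no longer in the stored history.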


  • Successful Clicks Matter

According to the leaked documents, you need high-quality content and a good user experience to rank well. This is because Google measures user engagement with metrics like badClicks, goodClicks, lastLongestClicks, and unsquashedClicks to evaluate how users interact with your content.


  • SiteAuthority

Google uses a concept called "siteAuthority," which impacts the overall ranking of your site based on the quality of its content. What's interesting to note is that despite publicly acknowledging this in 2011 after the Panda update, Google has since denied having a specific website authority score.


  • Whitelists

Some modules hint that Google keeps whitelists for certain domains, especially for things like elections and COVID-19. For example, there's isElectionAuthority and isCovidLocalAuthority, which show that these domains get extra attention.


  • Brand Matters

Branding significantly influences how Google identifies, sorts, ranks, and filters entities. "If there was one universal piece of advice I had for marketers seeking to improve their organic search rankings and traffic broadly, it would be: 'Build a notable, popular, well-recognised brand in your space, outside of Google search,'" says Rand Fishkin.


So, what does the Google API information mean for SEO professionals and digital marketers? Continue reading here to find out.
