Google’s technical info about search ranking leaks online (2024)

Updated A trove of documents that appear to describe how Google ranks search results has appeared online, likely as the result of accidental publication by an in-house bot.

The leaked documentation describes an old version of Google's Content Warehouse API and provides a glimpse of Google Search’s inner workings.

The material appears to have been inadvertently committed to a publicly accessible Google-owned repository on GitHub around March 13 by the web giant's own automated tooling. That automation tacked an Apache 2.0 open source license on the commit, as is standard for Google's public documentation. A follow-up commit on May 7 attempted to undo the leak.

The material was nonetheless spotted by Erfan Azimi, CEO of search engine optimization (SEO) biz EA Digital Eagle, and was then disclosed on Sunday by fellow SEO operatives Rand Fishkin, CEO of SparkToro, and Michael King, CEO of iPullRank.

These documents do not contain code or the like, and instead describe how to use Google's Content Warehouse API that's likely intended for internal use only; the leaked documentation includes numerous references to internal systems and projects. While there is a similarly named Google Cloud API that's already public, what ended up on GitHub goes well beyond that, it seems.

The files are noteworthy for what they reveal about the things Google considers important when ranking web pages for relevancy, a matter of enduring interest to anyone involved in the SEO business and/or anyone operating a website and hoping Google will help it to win traffic.

Among the 2,500-plus pages of documentation, assembled for easy perusal here, there are details on more than 14,000 attributes accessible through or associated with the API, though there is scant information about whether all these signals are used or how important they are. It is therefore hard to discern the weight Google applies to the attributes in its search result ranking algorithm.

But SEO consultants believe the documents contain noteworthy details because they differ from public statements made by Google representatives.

"Many of [Azimi's] claims [in an email describing the leak] directly contradict public statements made by Googlers over the years, in particular the company’s repeated denial that click-centric user signals are employed, denial that subdomains are considered separately in rankings, denials of a sandbox for newer websites, denials that a domain’s age is collected or considered, and more," explained SparkToro’s Fishkin in a report.

iPullRank’s King, in his post on the documents, pointed to a statement made by Google search advocate John Mueller, who said in a video that "we don’t have anything like a website authority score" – a measure of whether Google considers a site authoritative and therefore worthy of higher rankings for search results.

But King notes that the docs reveal that as part of the Compressed Quality Signals Google stores for documents, a "siteAuthority" score can be calculated.

Several other revelations are cited in the two posts.

One is how important clicks, and different types of clicks (good, bad, long, etc.), are in determining how a webpage ranks. Google during the US v. Google antitrust trial acknowledged [PDF] that it considers click metrics as a ranking factor in web search.

Another is that Google uses websites viewed in Chrome as a quality signal, seen in the API as the parameter ChromeInTotal. "One of the modules related to page quality scores features a site-level measure of views from Chrome," according to King.

Additionally, the documents indicate that Google considers other factors like content freshness, authorship, whether a page is related to a site's central focus, alignment between page title and content, and "the average weighted font size of a term in the doc body."

Google did not respond to a request for comment. ®

Updated to add

Post-publication, Google has told The Register that everyone needs to calm down, and be aware that the accidentally revealed files may be missing vital context.

"We would caution against making inaccurate assumptions about Search based on out-of-context, outdated, or incomplete information," a spokesperson told us. "We've shared extensive information about how Search works and the types of factors that our systems weigh, while also working to protect the integrity of our results from manipulation."
