Author: Olaf Kopp
Reading time: 7 Minutes

Predicting site quality

Topics: , , , , ,

5/5 - (2 votes)

The patent describes methods for predicting site quality scores for new websites using phrase models. It involves generating a phrase model from previously scored sites, where the model maps phrase frequency measures to average site quality scores.

For new sites, relative frequencies of phrases are determined and used with the model to calculate an aggregate site quality score. This predicted score can then be used by search engines to help rank resources from that site in search results.

The complete access to analysis of the patents, research paper, other SEO related documents and use of AI research tools are only for SEO Thought Leader (yearly), SEO Thought Leader (monthly), and SEO Thought Leader basic (yearly) members.

Your advantages:

+ Full analysis of hundreds of well researched active Microsoft and Google patents and research paper.
+ Save a lot of time and get insights in just a few minutes, without having to spend hours analyzing the documents.
+ Get quick exclusive insights about how search engines and Google could work  with easy to understand summaries and analysis.
+ All patents classified by topic for targeted research.
+ New patent summaries and analysis every week. Weekly notification via E-Mail
+ Use all 4 AI Research Tools to gain insights from all documents in the database, the Google API Leak, Antitrust trial documents, the whole google support documents, fundamental knowledge from worlds best SEO experts, Bill Slawskis fundamental research work for the industry, books about semantic search and more in seconds.
+ Gain fundamental insights for your SEO work and become a real thought leader.

Get access to the SEO Research Suite and become a SEO thought leader now!
Already a member? Log in here

COMMENT ARTICLE



  • Behzad Hussain

    10.12.2024, 18:06 Uhr

    Thanks for this nice explanation.
    The Phrase Model generation and weighting is very complex in the Patent but you have explained these factors very nicely and comprehensively.

  • Behzad Hussain

    19.01.2025, 12:58 Uhr

    You mentioned this: “The process starts by identifying n-grams (phrases consisting of 2 to 5 words) within the text of the sites being analyzed. ”

    In the Patent it’s also mentioned that the one gram is also considered. “In other implementations, n-grams of only one length are used.” (It’s mentioned under the exaplanation of Fig. 2, explanation of first step 200.

    I think “only one length” refers here as unigram or 1-gram.

Content from the blog

Overview: Brand Monitoring Tools for LLMO / Generative Engine Optimization

Generative AI assistants like ChatGPT or Claude and AI search engines like Perplexity or Google read more

What is the Google Shopping Graph and how does it work?

The Google Shopping Graph is an advanced, dynamic data structure developed by Google to enhance read more

How Google can personalize search results?

The personalization of search results is one of the last steps in the ranking process read more

LLMO: How do you optimize for the answers of generative AI systems?

As more and more people prefer to ask ChatGPT rather than Google when searching for read more

The dimensions of the Google ranking

The ranking factors at Google have become more and more multidimensional and diverse over the read more

How Google evaluates E-E-A-T? 80+ signals for E-E-A-T

In 2022 I published an overview of E-E-A-T signals for the first time, which Google read more