Author: Olaf Kopp
Reading time: 5 Minutes

Anchor tag indexing in a web crawler system

The Google patent describes a method and system for indexing documents in a collection of linked documents, such as web pages on the internet. It focuses on creating and using anchor maps to index information from other documents that link to a target document, not just the content of the target document itself. Key aspects include:

  • Creating a link log of source documents and their outbound links
  • Generating a sorted anchor map with target documents and lists of source documents linking to them
  • Including annotations like anchor text from the source documents in the anchor map
  • Using the anchor map information when indexing documents to improve search relevance
  • Merging and updating anchor maps over time as new information is crawled
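
The steps listed above can be illustrated with a minimal Python sketch. It is not the patent's implementation: the `Link` record and the function names `build_anchor_map` and `merge_anchor_maps` are hypothetical, and the merge rule (a newer entry from a source replaces older entries from the same source for that target) is a simplifying assumption.

```python
from collections import defaultdict
from typing import NamedTuple


class Link(NamedTuple):
    """One link log record (hypothetical layout): a source page, a target, and the anchor text."""
    source_url: str
    target_url: str
    anchor_text: str


def build_anchor_map(link_log):
    """Invert a link log (source -> target) into a map sorted by target URL."""
    by_target = defaultdict(list)
    for link in link_log:
        # The anchor text is kept as an annotation next to the source URL.
        by_target[link.target_url].append((link.source_url, link.anchor_text))
    # Sort targets, and sources within each target, for deterministic output.
    return {target: sorted(entries) for target, entries in sorted(by_target.items())}


def merge_anchor_maps(older, newer):
    """Merge two anchor maps (assumed rule: newer entries replace older ones from the same source)."""
    merged = {}
    for target in sorted(set(older) | set(newer)):
        fresh = newer.get(target, [])
        fresh_sources = {source for source, _text in fresh}
        kept = [(s, t) for s, t in older.get(target, []) if s not in fresh_sources]
        merged[target] = sorted(kept + fresh)
    return merged
```

Note that this sketch never drops a link that has disappeared from a recrawled source page. A real system would need a rule for that case.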

This approach lets a search engine index useful information about a document from the documents that link to it, even if the target document itself is unavailable or has little text content. It can improve search results by giving the index more context about each document, such as the anchor text other pages use to describe it.
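
To make this concrete, here is a minimal sketch of an index that draws terms from both a document's own text and the anchor text pointing at it. The names `tokenize`, `build_index` and `search`, the whitespace tokenizer, and the `"body"`/`"anchor"` field labels are illustrative assumptions, not details taken from the patent.

```python
from collections import defaultdict


def tokenize(text):
    """Naive lowercase whitespace tokenizer (an assumption for illustration)."""
    return text.lower().split()


def build_index(documents, anchor_map):
    """Build a term -> {(url, field)} index.

    documents:  url -> body text (may be empty or missing for uncrawled targets)
    anchor_map: url -> list of (source_url, anchor_text) pairs
    """
    index = defaultdict(set)
    for url in set(documents) | set(anchor_map):
        for term in tokenize(documents.get(url) or ""):
            index[term].add((url, "body"))
        # Anchor text from linking pages is indexed under the target URL.
        for _source, anchor_text in anchor_map.get(url, []):
            for term in tokenize(anchor_text):
                index[term].add((url, "anchor"))
    return index


def search(index, term):
    """Return the sorted URLs whose body or inbound anchor text contains the term."""
    return sorted({url for url, _field in index.get(term.lower(), set())})
```

With `documents = {"t.example/x": ""}` and an anchor map holding the anchor text "Blue Widgets" for that URL, `search(index, "blue")` still finds the page, although its own body contributed no terms.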

