EntityMap: the Open Standard That Gives AI Systems a Structured View of Your Business

It is a frustrating experience to realize that an AI system is confidently lying about your business. You might find it hallucinating a product name, inventing a member of your executive team, or completely misquoting what your company is actually capable of doing. This happens because of how we have built the web. We designed it for humans to read, using pages, links, and prose.

When an AI model tries to answer a question about your organization, it does not read your site like a person does. It retrieves fragments from various pages and stitches them together using probability. Because the information is scattered across dozens of different URLs, the AI often fills in the gaps with guesses. This is not a flaw in the AI models themselves, but rather a failure of the medium. We are trying to feed structured institutional knowledge into systems through an unstructured web of pages. A useful companion note is Product Feeds Now Belong in SEO Strategy, because it looks at a nearby part of the same system.

From my perspective, this represents a fundamental shift in how we think about digital presence. For years, we focused on making content discoverable for humans. Now, we have to make it understandable for machines that do not browse, but rather retrieve. The tradeoff here is between the ease of publishing a blog post and the effort required to maintain a structured knowledge base. The decision for a business owner is whether they are comfortable leaving their brand reputation to a probabilistic guess or if they want to provide a definitive source of truth.

The EntityMap Proposal

To solve this gap, a new open standard called EntityMap has been proposed. The goal is to allow organizations to publish a single structured file that acts as a map for AI systems. Instead of forcing an AI to guess how your business works by scraping a hundred different pages, this file explicitly declares what the organization knows, how key entities relate to one another, and where the evidence for those claims lives. The same pattern also shows up in Google Lost Its Scraping Case, where the practical question is how the signal becomes visible.

The project is currently in a public consultation phase that runs until June 30, 2026, with a formal launch planned for July 1. During this window, the creators are looking for technical critique and real world testing. They want to hear from developers, SEO specialists, and anyone who builds or manages AI retrieval systems to ensure the standard works in practice.

This is a critical moment because standards are rarely rewritten once they are adopted. If you are involved in the technical side of AI or search, the decision to participate now is about influence. The tradeoff is spending time on a pre launch specification versus trying to fix a broken standard three years from now. It is much easier to shape the architecture of a tool while it is still in consultation than to request a feature update after it has been baked into the ecosystem.

Fitting Into the Existing Standards Landscape

It is important to understand that EntityMap is not intended to replace the tools we already use. It is designed to fill a specific void that sitemap.xml and schema.org were not built to handle. A sitemap tells a crawler which pages exist, and schema.org provides a way to describe what is on a specific page. Neither of these provides a holistic view of an organization's knowledge across an entire domain.

Consider a healthcare provider. Using schema.org, they can mark up a single page about a specific treatment. However, EntityMap allows that provider to declare their core treatment areas, the relationships between those treatments, and the peer reviewed evidence supporting each claim, all while linking to where that evidence resides on the site. Similarly, a SaaS company can use it to explicitly state how their feature X differs from a competitor's version, providing direct links to documentation and case studies as proof.

The expert interpretation here is that we are moving from local metadata to global institutional mapping. Schema.org is a description of a page, but EntityMap is a description of a business. The decision a company needs to make is whether their value proposition is simple enough to be inferred from individual pages or if it is complex enough to require a structured map. For most high stakes industries, like finance or medicine, the risk of AI misinterpretation is too high to rely on page level metadata alone.

The Mechanics of How EntityMap Works

Technically, EntityMap is a JSON file placed at a predictable location on a domain. It is built around three primary components. First are the Entities, which are the named things the business covers, such as products, people, locations, or specific areas of expertise. Second are the Relations, which define how those entities connect. For example, a relation might specify that a certain product improves a specific outcome or that a particular person leads a specific team. This connects with What AI Says About Your Locations when the same signal needs a clearer operating decision.

The third and perhaps most vital component is the Evidence chunks. These are actual passages from the website that support the claims made in the map, linked directly to their source URLs. These chunks include attribution metadata, such as the publisher name and the timestamp of retrieval. This ensures that when an AI extracts this data into a vector database, the chain of evidence remains intact and the original source is not lost.

The specification is kept minimal to encourage adoption, requiring only about 12 fields across three objects. Everything else, such as custom predicates or changelog tracking, is optional. The strategic value here is the evidence chunk. By linking a claim directly to a snippet of text and a URL, you are essentially providing a pre digested set of facts for the AI. The tradeoff is the maintenance burden. You cannot simply set this and forget it, as your evidence chunks must remain accurate as your website content evolves.

Who Should Prioritize This Standard

Several different roles stand to benefit from this approach. For those building Retrieval Augmented Generation (RAG) systems, EntityMap provides cleaner source data, which leads to better reasoning and fewer hallucinations. For SEO professionals, it offers a new way to influence AI visibility that complements traditional content and link building rather than replacing it.

Publishers can use it to protect their attribution. As AI platforms disaggregate content, having a structured file that declares ownership and knowledge helps preserve the link between the AI's answer and the original creator. Finally, any organization concerned about how they are represented in AI generated answers can use this as a tool to assert control over their own narrative.

The standard is published under CC BY 4.0, meaning there is no vendor lock in or subscription fee. This is a significant point because it prevents a single AI company from owning the map of your business. The decision for a leader is whether to remain a passive subject of AI scraping or to become an active provider of structured truth. The tradeoff is a small amount of technical overhead in exchange for a significant increase in brand safety within AI ecosystems.

Contributing to the Project

The current consultation period is intended to be a practical exercise rather than a formality. The project team is looking for specific types of feedback. They want to know if developers who have tried building an EntityMap found the process awkward or if the implementation broke in certain environments. They are also looking for use case validation to see if the standard misses critical needs for specific industries.

One of the most important areas for feedback is the predicate critique. The standard currently defines 24 core predicates, such as IMPROVES, MEASURES, and DEPENDS_ON. These are the semantic abstractions that tell the AI how entities relate. If these predicates do not align with how a specific industry actually works, the standard will fail to be useful for that sector.

From a professional standpoint, the predicates are the most dangerous part of any open standard. If they are too generic, they are useless, but if they are too specific, they become brittle. Anyone contributing should look at these 24 predicates and ask if they accurately describe the logic of their business. The decision to suggest a new predicate or remove an existing one is where the real work of standardization happens.

The Importance of an Open Approach

It is worth noting that this is a proposal coming from within the search and AI community, not a corporate product. It has already been reviewed and endorsed by R.V. Guha, one of the founders of schema.org. This endorsement suggests that the project is aligned with the long term goals of the open web.

The process is divided into phases, starting with this technical review and early implementation. Once the consultation closes, the focus will shift toward wider adoption and research into how the standard impacts AI retrieval across different sectors. Because it is genuinely open, it avoids the pitfalls of proprietary formats that force businesses to pay a toll just to tell an AI who they are.

The value of an open standard is that it creates a common language. If every AI company creates its own way of mapping businesses, we end up with a fragmented web again. The tradeoff is that open standards move slower than proprietary ones because they require consensus. However, the result is a more stable foundation for the entire internet.

Why the Timing is Critical

If you have spent the last few years watching AI systems misrepresent your expertise or the work of your clients, this is the moment to intervene. The barrier to entry is low. You do not need to be a software engineer to review the specification and test it against a real world problem you are facing.

The window for this feedback is short, lasting only 33 days before the adoption phase begins. Once the standard is finalized, the opportunity to shape the core logic of how AI systems view your business will be much smaller. The risk of inaction is that you continue to rely on the hope that an LLM will correctly infer your business model from scattered pages.

Ultimately, the decision is about moving from a defensive posture to an offensive one. Instead of trying to correct AI hallucinations after they happen, you can provide the map that prevents them from occurring in the first place. This is a practical step toward taking ownership of your digital identity in an era where the primary interface for information is no longer a search results page, but a generated answer.

The EntityMap Proposal

Fitting Into the Existing Standards Landscape

The Mechanics of How EntityMap Works

Who Should Prioritize This Standard

Contributing to the Project

The Importance of an Open Approach

Why the Timing is Critical

Related posts

A Practical Way to Use MCP Servers to Speed Up SEO Research

AI Search Can’t Verify Your Business, Here’s How to Fix It

AI Overviews Now Answer Most Local Searches, How to Get Your Business Cited

How To: Optimize Your Small Business for AI Powered Search

Comments