Internet-Draft Semantic Anchor June 2026
Popov Expires 7 December 2026 [Page]
Workgroup:
Network Working Group
Internet-Draft:
draft-popov-webbotauth-semantic-anchor-00
Published:
Intended Status:
Informational
Expires:
Author:
M.I. Popov
1 Euro SEO

Identity Anchor for Domain-Root AI Discovery (Semantic Anchor)

Abstract

Automated clients, including Large Language Model (LLM) crawlers and Retrieval-Augmented Generation (RAG) systems, currently lack a deterministic mechanism to verify the canonical identity of a web domain's operator. This "Identity Gap" results in attribution loss and prevents the automated verification of authority and expertise signals. This document defines the Semantic Anchor: a protocol-level orchestration of a domain-root, machine-readable JSON-LD identity node discoverable via predictable endpoints. It establishes a stable identity layer and a "Root of Trust" for AI-to-site interactions.

Status of This Memo

This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.

Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.

Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."

This Internet-Draft will expire on 7 December 2026.

Table of Contents

1. Introduction

Current AI discovery protocols, such as llms.txt, provide human-readable summaries of site content but function as "unverifiable text surfaces." They describe what is on a site but fail to prove who is making the declaration.

This document addresses the structural "Identity Gap" first identified on April 7, 2026. It proposes a "Semantic Handshake" to move from probabilistic interpretation to deterministic verification of publisher identity.

1.1. Real-World Proof-of-Concept

The mechanism described herein was proven functional on April 20, 2026, when a major LLM retrieval system (Gemini) autonomously discovered, fetched, and parsed a Semantic Anchor implementation at 1Euroseo.com. The system incorporated the verified identity node into its reasoning without human prompting, demonstrating backward compatibility with existing retrieval architectures.

2. Terminology

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in [RFC 2119].

3. The Identity Anchor Document

The Identity Node MUST be expressed in JSON-LD using the Schema.org vocabulary to ensure interoperability with the global Knowledge Graph.

3.1. Core Identity Requirements

A conforming Identity Anchor MUST include:

3.2. Programmatic E-E-A-T Support

To enable autonomous trust-scoring, the node SHOULD include:

4. Discovery and Orchestration

Discovery MUST be predictable for automated clients. This specification defines three orchestration layers:

4.1. Protocol Header (llms.txt)

The llms.txt file MUST include an Identity header in the first three lines of the document:
Identity: https://<domain>/identity.jsonld

4.2. Well-Known URI

For clients not using llms.txt, the Identity Node SHOULD be accessible at:
https://<domain>/.well-known/identity.jsonld

4.3. HTTP Response Header

Servers MAY advertise the anchor via a standard HTTP header to facilitate discovery during initial crawl:
Origin-Identity-Anchor: https://<domain>/identity.jsonld

5. The Authority Model: Triangular Chains (Non-Normative)

To move beyond simple entity mapping, the Semantic Anchor supports a three-node authority pattern:

  1. Organization Node: Establishes corporate identity.

  2. Person Node: Links content to credentialed human expertise (e.g., MSc, Professional Certifications).

  3. Service/Offer Node: Explicitly connects site knowledge/tools to the qualified Person and Organization.

This orchestration prevents "Schema Islands" and provides the AI with a closed-loop graph of authority.

6. Security Considerations

Hosting the Identity Anchor at the domain root provides implicit proof of Origin control. Clients MUST verify that the Anchor URI matches the domain being crawled. Future revisions SHALL include support for cryptographic signing of the JSON-LD node to prevent identity spoofing and ensure non-repudiation.

7. Provenance and Prior Art (Historical Record)

The architectural pattern and identification of the "Identity Gap" were first publicly disclosed (https://www.linkedin.com/posts/marin-popov_ai-llms-mco-activity-7447224077381042176-haQk/) on April 7, 2026. A detailed technical rationale was published on April 8, 2026 (LinkedIn Pulse https://www.linkedin.com/pulse/real-reason-llmstxt-adoption-stalling-what-our-tool-found-day-easfe/). The formal technical specification was released on April 9, 2026 (Semantic Anchor v1.0 https://github.com/marin-popov/semantic-anchor).

This document serves as the authoritative chronological record of the architectural lineage for domain-root discovery patterns in AI identity.

8. IANA Considerations

This document makes no IANA requests at this time.

9. References

[RFC2119]
Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, , <https://www.rfc-editor.org/info/rfc2119>.
[WICG-295]
W3C WICG, "W3C WICG Issue #295: Semantic Anchor Proposal", , <https://github.com/WICG/proposals/issues/295>.
[IETF-Archive]
IETF, "IETF WebBotAuth Archive: Proposal Submission", , <https://mailarchive.ietf.org/arch/msg/web-bot-auth/u5Ae0T0owgAo2HBPSnZSGQpiMMM/>.
[Timeline]
GitHub, "Forensic Timeline", , <https://github.com/marin-popov/semantic-anchor/blob/main/related-work.md>.

Appendix A. Author Credentials

MSc Telecommunications

Author's Address

Marin Ivanov Popov
1 Euro SEO
Galway
Ireland