Open, vendor-neutral authoritative domain data consumed by AI systems, search, and other automated agents
Source: Dev.to
AI systems don’t know who your domain represents
AI assistants are increasingly the first layer between users and websites. People ask chatbots what a site is, who runs it, how to contact it, or whether it’s the “official” source for something.
Today, AI systems infer that information indirectly, from:
- partial crawls
- inconsistent metadata
- third‑party aggregators
- heuristics that usually work, until they don’t
This leads to common failure modes:
- misattribution (the wrong org, product, or contact)
- conflating similarly named domains
- inferring identity from whatever page happened to be crawled
These aren’t ranking problems, rather domain assertion problems.
What’s missing
There is no simple, first‑party, domain‑level place where a domain can say:
- “This domain represents X.”
- “This is the official site.”
- “This is how to contact us.”
Today we have:
schema.org(page‑level semantics)robots.txt(crawler policy)security.txt(security contact)ai.txt(usage policy)
But nothing that is:
- domain‑level
- identity‑focused
- machine‑readable
- self‑hosted
- boring and predictable
Introducing the AI Domain Data Standard (AIDD)
AIDD is a small, open specification for publishing domain‑level identity assertions for AI systems and automated agents. It’s a single JSON document hosted by the domain itself:
https://example.com/.well-known/domain-profile.json
Minimal example
{
"spec": "https://ai-domain-data.org/spec/v0.1",
"name": "Example Corp",
"description": "Open-source infrastructure for X",
"website": "https://example.com",
"contact": "https://example.com/contact"
}
Optional fields
entity_type(aligned with schema.org types)logo- embedded JSON‑LD for interoperability
There’s also an optional DNS TXT fallback for resolvers that can’t fetch HTTPS.
What this is
- first‑party and domain‑controlled
- self‑hosted
- vendor‑neutral
- versioned and schema‑validated
- composable with existing identity and trust systems
What this is not
- an identity provider
- a verification or trust system
- a ranking signal
- a replacement for crawling
Think of it like security.txt, but for domain identity instead of security reporting.
Why this matters
Entity‑resolution literature is clear: identity inference degrades when signals are partial, indirect, or noisy. AIDD doesn’t “fix AI,” but it gives AI systems a clean anchor signal for who a domain claims to represent. Consumers can:
- weigh it
- corroborate it
- or ignore it
The key point is attribution, not truth enforcement.
Tooling (so it’s not just a spec)
- CLI to init / validate / emit records
- Schema validation tests
- Resolver SDK
Integrations
- Next.js:
- WordPress:
- Jekyll:
Online utilities
- Generator:
- Checker:
Everything is open source and MIT licensed.
- Repository:
- Specification:
Who this is for
If you:
- run a site, project, or organization
- build crawlers, agents, or AI ingestion pipelines
- maintain CMS or hosting tooling
- care about clean web metadata
this might be useful. If not, you can ignore it—it’s intentionally small.
What’s next
The current version is deliberately minimal. Future work may explore optional layers like:
- cryptographic signing
- registrar or registry signals
- higher‑assurance identity assertions
The core goal remains the same: a simple, universal, domain‑hosted declaration surface that anyone can publish.
Feedback is extremely welcome; adoption even more so.