Agent Surface Auditor checks to ensure your site is ready for Agents Scans visit (must be)
Source: Dev.to

The Problem
Most discussions about optimizing for agents sound abstract, but in practice agent‑facing failures are plain engineering issues:
- One machine‑readable document links to one canonical path, while another links to an older or duplicated public path.
- Sitemaps forget important execution docs.
- OpenAPI files are reachable but buried behind inconsistent references.
- Instructions are written for humans, not for machine execution.
Introducing the Agent Surface Auditor
We built a standalone Agent Surface Auditor to detect these issues automatically.
What It Checks
- Discovery files such as
robots.txt,sitemap.xml,llms.txt, andllms-full.txt. - Project‑specific docs defined by config (e.g., execution docs or OpenAPI files).
- Execution surfaces for MCP, API docs, OpenAPI, or CLI.
- Content‑quality signals: heading structure, readability, and semantic HTML.
- Canonical drift between repository state and public output.
Common Issues Detected
- Stale public links.
- Multiple canonical entry points.
- Machine‑readable docs missing from sitemap/index layers.
- Content that is technically accessible but too noisy for an agent to use efficiently.
Provider Choices
The auditor lets you select the fetching provider explicitly:
direct-fetchbrowserbasecloudflareauto(quality‑aware fallback)
Choosing the right provider matters because not every site needs browser rendering, and some providers succeed where others do not.
Installation & Usage
Quick Install Path
npx agent-surface-auditor run --mode url --url https://example.com --provider autoAlternative Installation
npx openskills install Citedy/agent-surface-auditorKey Takeaways
Agent‑readiness goes beyond publishing llms.txt. It requires a coherent execution surface:
- One canonical path.
- One clean machine‑readable layer.
- One obvious next action.
This is simultaneously a docs problem, an infrastructure problem, and a product‑reliability problem.
Fact Anchors
- Agents benefit from a stable discovery layer and canonical machine‑readable links.
- Markdown and plain text reduce presentation noise for high‑value technical content.
- MCP, APIs, and CLI workflows are stronger action targets than generic landing pages.
- Public skills should be installable through a direct command or registry flow, not hidden behind manual setup.
- Cross‑file link drift between discovery files, sitemap entries, and execution docs is a concrete failure mode.
- Teams should audit agent‑facing surfaces continuously, not treat them as one‑off marketing assets.