[Paper] Bootstrapping Coding Agents: The Specification Is the Program

Published: 2 days ago (March 18, 2026 at 02:21 AM EDT)

3 min read

Source: arXiv

Source: arXiv - 2603.17399v1

Overview

Martin Monperrus demonstrates that a coding AI can bootstrap itself: given only a concise textual specification, a fresh AI agent can regenerate a correct implementation from scratch, even after the original code was produced by a different AI (Claude Code). This mirrors the classic bootstrap process used in compiler construction and highlights the specification as the true source of truth for software development.

Key Contributions

Self‑bootstrapping demonstration: Shows a newly generated coding agent can re‑implement a 926‑word specification correctly without any hand‑written code.
Meta‑circular insight for AI agents: Extends the Lisp concept of meta‑circularity to modern AI‑driven code generation.
Specification‑centric paradigm: Argues that the specification, not the generated implementation, should be treated as the immutable artifact of record.
Empirical validation: Uses Claude Code to produce an initial implementation, then a second‑generation agent to reproduce the same functionality, confirming reproducibility.

Methodology

Specification authoring – The author writes a detailed, human‑readable specification (≈ 926 words) describing the desired program behavior.
First‑pass generation – An existing coding agent (Claude Code) consumes the specification and emits a working implementation.
Bootstrapping agent creation – A new coding agent is prompted with the same specification and the source code of the first implementation, but is instructed to re‑implement from scratch rather than copy.
Evaluation – The regenerated program is compiled/run against the same test suite used for the first implementation. Functional equivalence and correctness are measured.

All steps rely on standard prompting techniques and off‑the‑shelf LLM APIs; no custom model training is required.

Results & Findings

The bootstrapped agent produced a functionally identical program that passed all original test cases.
Minor stylistic differences (naming, code layout) were observed, but semantic behavior remained unchanged.
The experiment confirms that the specification alone is sufficient for an AI to reconstruct a correct implementation, even after the original code has been “forgotten.”
This mirrors the classic compiler bootstrap: a compiler can compile its own source when provided with a minimal, trusted description.

Practical Implications

Version‑controlled specifications: Teams can store the specification in a repository as the single source of truth; regenerated code can be produced on demand (e.g., after a breaking change in the underlying LLM).
Rapid re‑implementation: When a coding agent is upgraded or replaced, existing projects can be re‑generated automatically from their specs, reducing migration friction.
Security & compliance: Auditors can verify that the specification complies with policies; regenerated code can be inspected for unintended deviations.
Continuous integration pipelines: A step can be added that re‑generates code from specs on each CI run, ensuring the implementation stays in sync with its description.
Reduced technical debt: Since the implementation is not the immutable artifact, developers can focus on improving specifications rather than maintaining legacy code.

Limitations & Future Work

Specification quality dependence – The approach hinges on a clear, complete, and unambiguous spec; vague or incomplete specs will lead to incorrect regenerations.
Model variability – Different LLMs or even the same model with temperature changes can produce divergent code; reproducibility may require deterministic prompting or model fine‑tuning.
Scalability – The study uses a relatively small (≈ 1 k‑word) spec; it remains open how well the method scales to large, multi‑module systems.
Future directions – Investigate automated spec extraction from existing codebases, explore bootstrapping across heterogeneous languages, and develop tooling to integrate specification‑centric workflows into standard development environments.

Authors

Martin Monperrus

Paper Information

arXiv ID: 2603.17399v1
Categories: cs.SE, cs.LG
Published: March 18, 2026
PDF: Download PDF

[Paper] Bootstrapping Coding Agents: The Specification Is the Program

Overview

Key Contributions

Methodology

Results & Findings

Practical Implications

Limitations & Future Work

Authors

Paper Information

Related posts

[Paper] NavTrust: Benchmarking Trustworthiness for Embodied Navigation

[Paper] FinTradeBench: A Financial Reasoning Benchmark for LLMs

[Paper] F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World

[Paper] Spectrally-Guided Diffusion Noise Schedules