[Paper] ATLAS: Active Theory Learning for Automated Science

Published: 3 days ago (June 10, 2026 at 01:52 PM EDT)

2 min read

Source: arXiv

Source: arXiv - 2606.12386v1

Overview

Advancing scientific understanding through mechanistic modeling requires posing the right experimental questions to yield maximally informative data. To automate this pursuit within cognitive science, we introduce ATLAS (Active Theory Learning for Automated Science), an active learning framework for the data-driven discovery of interpretable behavioral models. ATLAS iterates between generating mechanistic hypotheses—instantiated as a diverse ensemble of sparse neural networks (Disentangled RNNs)—and designing experiments that optimally distinguish between them. We test this approach on the problem of recovering reinforcement learning agents from their behavior in bandit tasks. ATLAS designs varied sequences of qualitatively novel experiments with temporal structure tailored to underlying agent characteristics. The models trained on these experiments are evaluated against a comprehensive set of metrics for mechanistic modeling that capture behavioral, structural, and computational similarity. ATLAS achieves a 5-10x improvement in sample efficiency across all metrics compared to random experimentation, and its performance is further validated against expert-designed experiments derived from literature. These in silico results showcase ATLAS’s potential to accelerate human-interpretable insights in cognitive science and other domains where scientific inquiry relies on discovering mechanistic models.

Key Contributions

This paper presents research in the following areas:

cs.LG
cs.AI

Methodology

Please refer to the full paper for detailed methodology.

Practical Implications

This research contributes to the advancement of cs.LG.

Authors

Noémi Éltető
Nathaniel D. Daw
Kimberly L. Stachenfeld
Kevin J. Miller

Paper Information

arXiv ID: 2606.12386v1
Categories: cs.LG, cs.AI
Published: June 10, 2026
PDF: Download PDF

[Paper] ATLAS: Active Theory Learning for Automated Science

Overview

Key Contributions

Methodology

Practical Implications

Authors

Paper Information

Related posts

[Paper] Learning to Reason by Analogy via Retrieval-Augmented Reinforcement Fine-Tuning

[Paper] Mana: Dexterous Manipulation of Articulated Tools

[Paper] SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

[Paper] Understanding Truncated Positional Encodings for Graph Neural Networks