[Paper] My Chemical Harness: Evolutionary Molecular Design over Synthetic Pathways with Large Language Model Agents

Published: (June 8, 2026 at 07:52 PM EDT)
2 min read
Source: arXiv

Source: arXiv - 2606.11256v1

Overview

Designing molecules with target properties is most useful when candidate structures are accompanied by feasible synthetic routes. We introduce My Chemical Harness, a route-native evolutionary framework for goal-directed molecular design in which the search population consists of executable synthetic pathways rather than isolated molecular graphs. Each route is built from purchasable building blocks and reaction templates, executed by deterministic chemistry tools, and scored through task-specific molecular oracles. Large language models (LLMs) are used only as strategy controllers that select high-level preferences over route length, move type, reaction families, motifs, and exploration pressure, while local code performs route construction, validation, deduplication, scoring, selection, and memory updates. This separation lets the LLM guide exploration without allowing it to introduce hallucinated products or unsupported reaction steps. On a soluble epoxide hydrolase proxy task, our LLM agent improves over single pass LLM and deterministic controllers, reaching state-of-the-art performance across the sEH score, synthetic accessibility score, and AiZynthFinder success rate metrics. These results suggest that constrained LLM agents can play a significant role in molecular discovery without requiring training, fine-tuning, or dedicated generative models.

Key Contributions

This paper presents research in the following areas:

  • physics.chem-ph
  • cs.LG
  • cs.NE

Methodology

Please refer to the full paper for detailed methodology.

Practical Implications

This research contributes to the advancement of physics.chem-ph.

Authors

  • César Ojeda
  • Darius A. Faroughy
  • Maryam Karimi
  • Payam Zarrintaj
  • Mir Mehdi Seyedebrahimi
  • Martín Carballo-Pacheco

Paper Information

  • arXiv ID: 2606.11256v1
  • Categories: physics.chem-ph, cs.LG, cs.NE
  • Published: June 8, 2026
  • PDF: Download PDF
0 views
Back to Blog

Related posts

Read more »