[Paper] QBugLM: An Agentic Benchmarking Framework for LLM-based Quantum Software Debugging

Published: 5 days ago (June 5, 2026 at 10:34 AM EDT)

2 min read

Source: arXiv

Source: arXiv - 2606.07314v1

Overview

Quantum software bugs often yield silent, incorrect outputs rather than explicit errors, making them particularly difficult to detect and repair with conventional techniques. Although large language models (LLMs) have shown strong performance on classical software engineering tasks, their ability to debug quantum code remains largely unexplored. To bridge this gap, we propose QBugLM, a multi-agent framework that automates the quantum software debugging pipeline, from taxonomy-driven bug injection to LLM-based detection and repair, and finally to simulation-based validation, for framework-agnostic OpenQASM 3.0 programs. We further conduct a comprehensive case study using QBugLM to benchmark two LLMs, Claude 4.6 Sonnet and Qwen3 Coder Next, across different prompting strategies, bug categories, and quantum programs. Our results show that iterative feedback is critical, as a single retry raises Pass@1 from below 25% to above 80%. Moreover, simpler structured prompting can even outperform Chain-of-Thought and ReAct for reasoning-capable models under fixed-resource constraints. Our work takes initial steps toward benchmarking LLM capabilities for debugging quantum programs and offers practical insights to support future efforts in automated quantum software repair.

Key Contributions

This paper presents research in the following areas:

cs.SE
cs.ET
quant-ph

Methodology

Please refer to the full paper for detailed methodology.

Practical Implications

This research contributes to the advancement of cs.SE.

Authors

An B. B. Pham
Hoa T. Nguyen
Muhammad Usman

Paper Information

arXiv ID: 2606.07314v1
Categories: cs.SE, cs.ET, quant-ph
Published: June 5, 2026
PDF: Download PDF

[Paper] QBugLM: An Agentic Benchmarking Framework for LLM-based Quantum Software Debugging

Overview

Key Contributions

Methodology

Practical Implications

Authors

Paper Information

Related posts

[Paper] Agentic Very Much! Adoption of Coding Agent in New GitHub Projects

[Paper] Is US Defense Acquisition Ready to Acquire AI-Enabled Capabilities? Assessing the DoD Software Acquisition Pathway Through a Scenario-Based Policy Analysis

[Paper] On the Shoulders of Giants: Empowering Automated Smart Contract Auditing via the GiAnt Corpus

[Paper] A Causal Probabilistic Framework for Perception-Informed Closed-Loop Simulation of Autonomous Driving