[Paper] Enhancing Software Engineering Through Closed-Loop Memory Optimization
Source: arXiv - 2606.05646v1
Overview
Large language models (LLMs) have enabled powerful software engineering (SE) agents capable of navigating complex codebases and resolving real-world issues. However, these agents remain fundamentally episodic: they fail to retain, refine, and reuse experiences across tasks, repeatedly reconstructing context from scratch and reproducing similar mistakes. Even with memory support, they offer no remedy for the absence of a principled, task-agnostic \textit{memory utility}, making them difficult to evaluate rigorously or generalize across agents and settings. To tackle these limitations, we introduce \ours, a closed-loop framework for memory augmentation in SE agents. \ours grounds memory utility in \textit{validated downstream impact}, establishing utility as both a task-agnostic \textbf{evaluation benchmark} and an annotation-free \textbf{optimization signal}. Through complementary evaluation on \textit{single-episode} and \textit{cross-episode} memory augmentation, results demonstrate that \ours consistently improves SE agents across settings, achieving absolute gains of up to $\uparrow5.25%$ in success rate and $\uparrow4.63%$ in resolve efficiency, while substantially reducing computational cost by $\geq9.79%$. Our project page: \href{https://xhguo7.github.io/MemOp/}{https://xhguo7.github.io/MemOp/}.
Key Contributions
This paper presents research in the following areas:
- cs.SE
- cs.AI
Methodology
Please refer to the full paper for detailed methodology.
Practical Implications
This research contributes to the advancement of cs.SE.
Authors
- Xuehang Guo
- Zora Zhiruo Wang
- Qingyun Wang
- Graham Neubig
- Xingyao Wang
Paper Information
- arXiv ID: 2606.05646v1
- Categories: cs.SE, cs.AI
- Published: June 4, 2026
- PDF: Download PDF