[Paper] RepoMod-Bench: A Benchmark for Code Repository Modernization via Implementation-Agnostic Testing
The evolution of AI coding agents has shifted the frontier from simple snippet completion to autonomous repository-level engineering. However, evaluating these ...