[Paper] A Scalable Benchmark for Repository-Oriented Long-Horizon Conversational Context Management
In recent years, large language models (LLMs) have advanced rapidly, substantially enhancing their code understanding and generation capabilities and giving ris...