[Paper] How Requirements Quality Makes (or Breaks) Traceability Link Recovery

Published: (June 10, 2026 at 05:16 AM EDT)
2 min read
Source: arXiv

Source: arXiv - 2606.11834v1

Overview

Traceability information between requirements and source code greatly benefits the maintenance of a software system. Since manually establishing trace links is cumbersome and error-prone, previous research explored automated traceability link recovery (TLR) approaches to support this task. However, quality defects in requirements impact subsequent activities such as TLR, yet evidence about this remains scarce. Our objective is to contribute empirical evidence on this impact. At the same time, we aim to understand how the performance of TLR approaches varies given these quality defects. To this end, we annotated 28 types of quality defect in 189 use case descriptions from two datasets. Then, we executed five distinct TLR approaches on the dataset and measured their performance in recovering trace links. Finally, we performed statistical tests to quantify the defects’ effect strength on this performance. Our results show that some quality defects harm TLR performance, e.g., sentences that do not start with noun phrases, while others actually benefit performance, e.g., use cases that include implementation details. Moreover, different types of approaches respond differently to these defects. As a consequence, the performance-optimizing choice of a TLR approach depends on the quality of the dataset.

Key Contributions

This paper presents research in the following areas:

  • cs.SE

Methodology

Please refer to the full paper for detailed methodology.

Practical Implications

This research contributes to the advancement of cs.SE.

Authors

  • Tobias Hey
  • Julian Frattini

Paper Information

  • arXiv ID: 2606.11834v1
  • Categories: cs.SE
  • Published: June 10, 2026
  • PDF: Download PDF
0 views
Back to Blog

Related posts

Read more »