[Paper] IN2P3 Computing Center 2024 Workload Dataset

Published: (June 4, 2026 at 05:20 AM EDT)
2 min read
Source: arXiv

Source: arXiv - 2606.05914v1

Overview

This paper provides and analyzes a dataset detailing the characteristics and execution data of all jobs submitted to the IN2P3 Computing Center (Villeurbanne, France), a national research and support unit of the CNRS, in 2024. The main additional value of this contribution compared to previously available datasets consists in the combination of an extended time interval considered, the inclusion of memory usage data and its recency, on top on improving the diversity of datasets provenance. This allows researchers to simulate and evaluate scheduling algorithms on a real workload over a large time window. Thus, specificities due to seasonal, monthly, and weekly user behaviors can be taken into account, which is not possible with smaller or synthetic datasets. It is composed of 44M jobs submitted by 1k users running on a cluster of a maximum of 312 machines supporting 46k concurrent threads and providing 105To of RAM.

Key Contributions

This paper presents research in the following areas:

  • cs.DC

Methodology

Please refer to the full paper for detailed methodology.

Practical Implications

This research contributes to the advancement of cs.DC.

Authors

  • Guillaume Cochard
  • Bertrand Simon

Paper Information

  • arXiv ID: 2606.05914v1
  • Categories: cs.DC
  • Published: June 4, 2026
  • PDF: Download PDF
0 views
Back to Blog

Related posts

Read more »