[Paper] ROIDS: Robust Outlier-Aware Informed Down-Sampling

Published: 3 months ago (January 27, 2026 at 06:07 AM EST)

2 min read

Source: arXiv

Source: arXiv - 2601.19477v1

Overview

Informed down-sampling (IDS) is known to improve performance in symbolic regression when combined with various selection strategies, especially tournament selection. However, recent work found that IDS’s gains are not consistent across all problems. Our analysis reveals that IDS performance is worse for problems containing outliers. IDS systematically favors including outliers in subsets, which pushes GP towards finding solutions that overfit to outliers.

To address this, we introduce ROIDS (Robust Outlier-Aware Informed Down-Sampling), which excludes potential outliers from the sampling process of IDS. With ROIDS it is possible to keep the advantages of IDS without overfitting to outliers and to compete on a wide range of benchmark problems. This is reflected in our experiments, where ROIDS shows the desired behavior on all studied benchmark problems. ROIDS consistently outperforms IDS on synthetic problems with added outliers as well as on a wide range of complex real‑world problems, surpassing IDS on over 80 % of the real‑world benchmark problems. Moreover, compared to all studied baseline approaches, ROIDS achieves the best average rank across all tested benchmark problems. This robust behavior makes ROIDS a reliable down‑sampling method for selection in symbolic regression, especially when outliers may be included in the data set.

Key Contributions

cs.NE

Methodology

Please refer to the full paper for detailed methodology.

Practical Implications

This research contributes to the advancement of cs.NE.

Authors

Alina Geiger
Martin Briesch
Dominik Sobania
Franz Rothlauf

Paper Information

arXiv ID: 2601.19477v1
Categories: cs.NE
Published: January 27, 2026
PDF: Download PDF

[Paper] ROIDS: Robust Outlier-Aware Informed Down-Sampling

Overview

Key Contributions

Methodology

Practical Implications

Authors

Paper Information

Related posts

[Paper] VideoGPA: Distilling Geometry Priors for 3D-Consistent Video Generation

[Paper] End-to-end Optimization of Belief and Policy Learning in Shared Autonomy Paradigms

[Paper] User Prompting Strategies and Prompt Enhancement Methods for Open-Set Object Detection in XR Environments

[Paper] Decoupled Diffusion Sampling for Inverse Problems on Function Spaces