[Paper] Deep ensemble graph neural networks for probabilistic cosmic-ray direction and energy reconstruction in autonomous radio arrays

Published: 3 days ago (February 26, 2026 at 01:29 PM EST)

5 min read

Source: arXiv

Source: arXiv - 2602.23321v1

Overview

A team of astrophysicists and machine‑learning researchers has built a deep‑ensemble graph neural network (GNN) that can infer the arrival direction and energy of ultra‑high‑energy cosmic rays from the raw voltage traces recorded by autonomous radio antenna arrays. By treating each triggered antenna as a node in a graph and injecting physical priors into the network, they achieve sub‑degree angular precision and a ∼ 16 % energy resolution—far better than previous data‑driven methods—while also providing calibrated uncertainty estimates.

Key Contributions

Graph‑based representation of radio arrays – each antenna becomes a node, with edges encoding spatial relationships, enabling the GNN to naturally exploit the irregular geometry of real‑world detector layouts.
Physics‑aware GNN architecture – custom layers embed known propagation and antenna response physics, reducing the amount of simulated training data needed.
Deep ensemble for probabilistic output – multiple independently trained GNNs are combined to produce both point estimates and well‑calibrated confidence intervals for direction and energy.
State‑of‑the‑art reconstruction performance – angular resolution of 0.092° and electromagnetic energy resolution of 16.4 % on realistic simulated data with noise and background.
Robustness & domain‑shift analysis – systematic tests (noise level variation, antenna failures, simulation‑real gaps) demonstrate how to detect when the model’s predictions may become unreliable.

Methodology

Data preparation – Simulated extensive‑air‑shower events generate voltage waveforms at each antenna. After a trigger, only the antennas that recorded a signal are kept.
Graph construction –
- Nodes: per‑antenna feature vectors (e.g., peak voltage, pulse width, time‑of‑arrival, local noise level).
- Edges: distances and relative orientations between antenna pairs, optionally weighted by expected signal correlation.
Physics‑informed GNN – The network stacks message‑passing layers that respect causal signal propagation (e.g., time‑delay constraints) and includes learnable kernels that mimic antenna gain patterns.
Deep ensemble – Five GNNs are trained from different random seeds and data shuffles. Their outputs are aggregated to produce a mean prediction and a variance that serves as an uncertainty estimate.
Training & loss – A multi‑task loss combines a directional loss (angular distance) and an energy loss (relative error), plus a regularization term that penalizes over‑confident uncertainty predictions (negative log‑likelihood).
Evaluation & robustness checks – Performance is measured on a held‑out simulated set with realistic noise, and stress‑tests are run by altering noise levels, removing antennas, or injecting systematic biases to mimic real‑world domain shifts.

Results & Findings

Metric	Value (simulated, realistic noise)
Angular resolution (68 % containment)	0.092°
Energy resolution (RMS of (Ê‑E)/E)	16.4 %
Uncertainty calibration (coverage of 68 % CI)	≈ 68 % (well‑calibrated)
Training data required (relative to vanilla GNN)	~ 30 % fewer events

Physics priors matter: Adding propagation constraints reduced the required training set size by a factor of three while preserving accuracy.
Ensembling improves reliability: The variance across ensemble members correlates strongly with actual prediction error, enabling automatic flagging of low‑confidence events.
Robustness: When up to 20 % of antennas are randomly disabled, angular error degrades gracefully (< 0.15°), and the ensemble’s uncertainty grows accordingly, signaling the loss of confidence.

Practical Implications

Next‑generation radio cosmic‑ray observatories (e.g., GRAND, IceCube‑Gen2 radio extensions) can deploy fewer antennas or operate with higher noise environments while still achieving precise reconstruction, lowering hardware and maintenance costs.
Real‑time event filtering – The calibrated uncertainties allow an autonomous array to decide on‑the‑fly whether to trigger a deeper read‑out or discard an event, saving bandwidth and storage.
Transferable workflow – The graph‑based pipeline is agnostic to the specific detector geometry, making it applicable to other sparse sensor networks (e.g., seismic arrays, distributed IoT monitoring) where the physical layout is irregular.
Open‑source tooling – The authors release the GNN modules and training scripts, giving developers a ready‑made template for building physics‑aware graph models on custom datasets.

Limitations & Future Work

Simulation‑only validation – Results are demonstrated on high‑fidelity Monte‑Carlo data; real‑world deployment will need extensive domain‑adaptation studies.
Computational load – While inference is fast on a modern GPU, the ensemble (five models) still requires ~ 10 ms per event, which may be a bottleneck for ultra‑high‑throughput arrays.
Energy range – The study focuses on ultra‑high‑energy (> 10¹⁸ eV) showers; extending to lower energies may demand richer feature engineering or larger ensembles.
Future directions proposed by the authors include:
- Incorporating self‑supervised pre‑training on unlabeled real data to bridge the simulation‑real gap.
- Exploring Bayesian GNNs for a more principled uncertainty treatment.
- Scaling the approach to multi‑modal data (e.g., combining radio with surface‑detector or fluorescence measurements).

Bottom line: By marrying graph neural networks with domain knowledge and deep ensembles, the authors deliver a practical, high‑precision solution for reconstructing cosmic‑ray properties from noisy, sparsely sampled radio data—opening the door for smarter, cheaper, and more autonomous astrophysical observatories.*

Authors

Arsène Ferrière
Aurélien Benoit-Lévy
Olivier Martineau-Huynh
Matías Tueros

Paper Information

arXiv ID: 2602.23321v1
Categories: astro-ph.IM, cs.LG
Published: February 26, 2026
PDF: Download PDF

[Paper] Deep ensemble graph neural networks for probabilistic cosmic-ray direction and energy reconstruction in autonomous radio arrays

Overview

Key Contributions

Methodology

Results & Findings

Practical Implications

Limitations & Future Work

Authors

Paper Information

Related posts

[Paper] Model Agreement via Anchoring

[Paper] SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation

[Paper] A Dataset is Worth 1 MB

[Paper] SOTAlign: Semi-Supervised Alignment of Unimodal Vision and Language Models via Optimal Transport