[Paper] Algorithmic and Minimax Complexities in Kernel Bandits

Published: (June 9, 2026 at 01:49 PM EDT)
2 min read
Source: arXiv

Source: arXiv - 2606.11171v1

Overview

Gaussian-process upper confidence bound (GP-UCB) and decision-estimation-coefficient (DEC) methods may appear, at first sight, to belong to different theories. This paper places the two viewpoints in a common algorithmic-information language for frequentist RKHS bandits. GP-UCB fixes an algorithmic, rather than true, Gaussian-process prior and exploits realized-trajectory complexity together with computational tractability, whereas MAMS optimizes a robust class-wide MAIR/DEC envelope. Through the unified MAIR framework and heterogeneous positive-semidefinite algorithmic priors, we generalize both the GP-UCB analysis and the MAMS algorithm, propose a safeguarded master that combines their advantages, and provide a kernel-bandit construction showing that algorithmic complexity can be more informative than class-wide minimax or DEC certificates in overparameterized models. The resulting message is that algorithmic information and class-wide minimax coefficients answer different questions and can lead to different gaps; kernel bandits provide a clean setting in which this distinction becomes mathematically visible.

Key Contributions

This paper presents research in the following areas:

  • cs.LG
  • cond-mat.stat-mech
  • cs.IT
  • math.OC
  • math.ST

Methodology

Please refer to the full paper for detailed methodology.

Practical Implications

This research contributes to the advancement of cs.LG.

Authors

  • Yunbei Xu

Paper Information

  • arXiv ID: 2606.11171v1
  • Categories: cs.LG, cond-mat.stat-mech, cs.IT, math.OC, math.ST
  • Published: June 9, 2026
  • PDF: Download PDF
0 views
Back to Blog

Related posts

Read more »