First Proof

Published: February 7, 2026 at 10:25 AM EST

Source: Hacker News

Abstract

To assess the ability of current AI systems to correctly answer research-level mathematics questions, we share a set of ten math questions which have arisen naturally in the research process of the authors. The questions had not been shared publicly until now; the answers are known to the authors of the questions but will remain encrypted for a short time.

