Evaluating chain-of-thought monitorability

Published: December 18, 2025 at 07:00 AM EST
Source: OpenAI Blog

Overview

OpenAI introduces a new framework and evaluation suite for chain-of-thought monitorability, covering 13 evaluations across 24 environments. Our findings show that monitoring a model’s internal reasoning is far more effective than monitoring outputs alone, offering a promising path toward scalable co…
