monitorability Articles | DocsDigest

Reasoning models struggle to control their chains of thought, and that’s good

OpenAI News / Mar 5, 2026

CoT controllability is very low (≤15.4%) across tested frontier models
Larger models are slightly more controllable, but longer reasoning and post-training reduce control
CoT-Control released: 13k+ tasks for measuring CoT controllability

chain-of-thought monitorability cot-control evaluation safety model-controllability
