Reasoning models struggle to control their chains of thought, and that’s good
OpenAI News / Mar 5, 2026
- CoT controllability is very low (≤15.4%) across tested frontier models
- Larger models are slightly more controllable, but longer reasoning and post-training reduce controllability
- CoT-Control released: 13k+ tasks for measuring CoT controllability