Reasoning models struggle to control their chains of thought, and that’s goodOpenAI News / Mar 5, 2026CoT制御率は極めて低い長い推論や追加訓練で制御性低下現状はCoT監視が有効な防護層chain-of-thoughtcotmonitoringevaluationsafetyrlprompting