Where the goblins came from
OpenAI News / Apr 29, 2026
Reward reinforced the vocabulary
Behavior transferred from Nerdy
Subsided after the reward and data were fixed
reinforcement-learning, reward-modeling, fine-tuning, data-filtering, model-audit, alignment

Improving instruction hierarchy in frontier LLMs
OpenAI News / Mar 10, 2026
IH-Challenge dataset teaches instruction priority
Improves safety steerability and prompt-injection robustness
Maintains usefulness without overrefusal
instruction-hierarchy, reinforcement-learning, prompt-injection, safety, dataset, evaluation, overrefusal