Anthropic co-founder Chris Olah's remarks on Pope Leo XIV's encyclical "Magnifica humanitas"Anthropic News / May 25, 2026外部批判の必要性モデルの不可解な内部状態利益の公平な分配aisafetyinterpretabilitygovernanceethicspolicy
Inside our approach to the Model SpecOpenAI News / Mar 25, 2026行動規範を公開チェーン・オブ・コマンドルーブリックで解釈model-specchain-of-commandsafetydefaultsevaluationinterpretability