Introducing new capabilities to GPT-RosalindOpenAI News / Jun 3, 2026生命科学向けに最適化LifeSciBenchで評価研究プレビュー提供gpt-rosalindlifesciencemedicinal-chemistrygenomicsbenchmarksagentic-codingtool-use
A shared playbook for trustworthy third party evaluationsOpenAI News / May 29, 2026ハーネスを明示主張と証拠を一致脆弱性を予算で検証evaluationharnesssafeguardstool-usecontaminationbudgetablation