OpenAI Blog · Oct 29, 2025
gpt-oss-safeguard technical report
Reviewed by Errol Vogt, Site support technician & online learning analyst · original summary · editorial policy
gpt-oss-safeguard technical report. gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are two open-weight reasoning models post-trained from the gpt-oss models and trained to reason from a provided policy in order to label content under that policy. In this report, we describe gpt-oss-safeguard’s capabilities and provide our baseline safety evaluations on the gpt-oss-safeguard models, using the underlying gpt-oss models as a baseline. For more information about the development and architecture of the underlying gpt-oss models, see… This update is relevant for small-office operators tracking changes in their tools.
Operator takeaway: For operators: review whether 'gpt-oss-safeguard technical report' affects your current setup before relying on it in production.
ai