OpenAI Blog · Oct 29, 2025

gpt-oss-safeguard technical report

Reviewed by Errol Vogt, Site support technician & online learning analyst · original summary · editorial policy

gpt-oss-safeguard technical report. gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are two open-weight reasoning models post-trained from the gpt-oss models and trained to reason from a provided policy in order to label content under that policy. In this report, we describe gpt-oss-safeguard’s capabilities and provide our baseline safety evaluations on the gpt-oss-safeguard models, using the underlying gpt-oss models as a baseline. For more information about the development and architecture of the underlying gpt-oss models, see… This update is relevant for small-office operators tracking changes in their tools.

Operator takeaway: For operators: review whether 'gpt-oss-safeguard technical report' affects your current setup before relying on it in production.

Read the original at OpenAI Blog →

gpt-oss-safeguard technical report

Zapier SDK: Connect your code files to thousands of actions

How agents are transforming work

The 8 best AI presentation makers in 2026