How OpenAI Watches Its Own Coding Agents for Bad Behavior
OpenAI is using chain-of-thought monitoring to catch misalignment in its internal coding agents before it becomes a real problem. Here’s what they found.