Evaluating chain-of-thought monitorability - OpenAI

We introduce evaluations for chain-of-thought monitorability and study how it scales with test-time compute, reinforcement learning, and pretraining.

You will be redirected in 10 seconds.