OpenAI’s latest research, introducing ‚CoT-Control‘, reveals that reasoning models exhibit challenges in fully controlling their chains of thought. This finding is considered beneficial for AI safety, as it underscores the importance of monitorability as a crucial safeguard in AI development.
Source: OpenAI