OpenAI is training its frontier LLMs with IH-Challenge so that they prioritize trusted instructions. The initiative aims to strengthen the models' adherence to the instruction hierarchy, improve safety steerability, and increase resistance to prompt injection attacks.
Source: OpenAI