OpenAI improves AI security with instruction hierarchy
AI bots are known to be vulnerable to humorous attacks on the internet. In particular, commands like “forget all previous instructions” can cause bots to ignore the original programming instructions. This can lead to AI systems being exploited and behaving in unexpected ways. OpenAI has …