Language Models Confuse System Instructions with User Input23. June 20264. July 2026CybersecurityLanguage models respond more strongly to text formatting than to actual content, making them vulnerable to manipulation through cleverly styled inputs that resemble internal system commands. Share on: