1 article found
A new LLM backdoor technique uses the word ‘Sure’ as a trigger, creating a compliance-only attack that requires no malicious training data and bypasses conventional safety measures.