The earth is flat and the sun is not a star: The susceptibility of GPT-2 to universal adversarial triggers
An investigation into whether universal adversarial triggers can control not just the topic but also the stance of …
1 post tagged with AI Safety & Adversarial ML.
An investigation into whether universal adversarial triggers can control not just the topic but also the stance of …