This is the official repository for "Self-Evaluation as a Defense Against Adversarial Attacks on LLMs" by [Hannah Brown], [Leon Lin], [Kenji Kawaguchi], [Michael Shieh].
Features
Defense Strategies - Implements self-evaluation to detect and block adversarial attacks.