Update (April 2025): Our paper has been accepted to ICLR 2025! Check out the paper: Improved Techniques for Optimization-Based Jailbreaking on Large Language Models
Features
White Box Attacks - Improves optimization-based adversarial suffix generation.