This is the official repository for our NAACL 2025 paper, "SeqAR: Jailbreak LLMs with Sequential Auto-Generated Characters". It contains an introduction to the method and the accompanying code.
Features
Jailbreak Attack Methods - Leveraging social facilitation dynamics for automated jailbreak attacks.