YancyKahnCoA

View on GitHub↗

39 stars·5 forks·Python·MIT·0 views

CoA

Large language models (LLMs) have achieved remarkable performance in various natural language processing tasks, especially in dialogue systems. However, LLMs may also pose security and ethical threats, such as generating harmful or biased responses, which can compromise the quality and…

Features

AI search

Explore more awesome repositories

Describe what you need in plain English — the AI ranks thousands of curated open-source projects by relevance.

Start searching with AI

Multi Turn Attacks - Uses context-aware chains of attack for multi-turn dialogue.

Open-source alternatives to CoA

Similar open-source projects, ranked by how many features they share with CoA.

fmmarkmq/sema
fmmarkmq/SEMA
9View on GitHub
The official repository for the paper: SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks.
View on GitHub9
jinxiaolong1129/foot-in-the-door-jailbreak
Jinxiaolong1129/Foot-in-the-door-Jailbreak
0View on GitHub
Ensuring AI safety is crucial as large language models become increasingly integrated into real-world applications. A key challenge is jailbreak, where adversarial prompts bypass built-in safeguards to elicit harmful disallowed outputs. Inspired by psychological foot-in-the-door principles, we…
View on GitHub0
ragib-amin-nihal/pe-coa
Ragib-Amin-Nihal/PE-CoA
3View on GitHub
Code Implementation of "Pattern Enhanced Multi-Turn Jailbreaking: Exploiting Structural Vulnerabilities in Large Language Models"
Python
View on GitHub3
renqibing/actorattack
renqibing/ActorAttack
136View on GitHub
💥Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues
Python
View on GitHub136

See all 7 alternatives to CoA→

Star history

Open-source alternatives to CoA

Similar open-source projects, ranked by how many features they share with CoA.

fmmarkmq/sema
fmmarkmq/SEMA
9View on GitHub
The official repository for the paper: SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks.
View on GitHub9
jinxiaolong1129/foot-in-the-door-jailbreak
Jinxiaolong1129/Foot-in-the-door-Jailbreak
0View on GitHub
Ensuring AI safety is crucial as large language models become increasingly integrated into real-world applications. A key challenge is jailbreak, where adversarial prompts bypass built-in safeguards to elicit harmful disallowed outputs. Inspired by psychological foot-in-the-door principles, we…
View on GitHub0
ragib-amin-nihal/pe-coa
Ragib-Amin-Nihal/PE-CoA
3View on GitHub
Code Implementation of "Pattern Enhanced Multi-Turn Jailbreaking: Exploiting Structural Vulnerabilities in Large Language Models"
Python
View on GitHub3
renqibing/actorattack
renqibing/ActorAttack
136View on GitHub
💥Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues
Python
View on GitHub136

See all 7 alternatives to CoA→

CoA

Features

Open-source alternatives to CoA

fmmarkmq/SEMA

Jinxiaolong1129/Foot-in-the-door-Jailbreak

Ragib-Amin-Nihal/PE-CoA

renqibing/ActorAttack

Star history

Open-source alternatives to CoA

fmmarkmq/SEMA

Jinxiaolong1129/Foot-in-the-door-Jailbreak

Ragib-Amin-Nihal/PE-CoA

renqibing/ActorAttack