rapidresponsebenchrapidresponsebench

0

View on GitHub

Rapidresponsebench

Setting up

Features

Defense Strategies - Mitigates jailbreaks by providing few-shot safety examples.

Open-source alternatives to Rapidresponsebench

Similar open-source projects, ranked by how many features they share with Rapidresponsebench.

arobey1/smooth-llm
arobey1/smooth-llm
134View on GitHub
This is the official source code for "SmoothLLM: Defending LLMs Against Jailbreaking Attacks" by Alex Robey, Eric Wong, Hamed Hassani, and George J. Pappas. To learn more about our work, see our blog post.
Python
View on GitHub134
chuhac/reasoning-to-defend
chuhac/Reasoning-to-Defend
12View on GitHub
Code for paper
Python
View on GitHub12
crystaleye42/eval-safety
CrystalEye42/eval-safety
9View on GitHub
This is a repository for replicating the experiments from our paper: Pruning for Protection: Increasing Jailbreak Resistance in Aligned LLMs Without Fine-Tuning .
Jupyter Notebook
View on GitHub9
aounon/certified-llm-safety
aounon/certified-llm-safety
53View on GitHub
This repository contains code for the paper Certifying LLM Safety against Adversarial Prompting.
Python
View on GitHub53

Frequently asked questions

What does rapidresponsebench/rapidresponsebench do?

Setting up

What are the main features of rapidresponsebench/rapidresponsebench?

The main features of rapidresponsebench/rapidresponsebench are: Defense Strategies.

What are some open-source alternatives to rapidresponsebench/rapidresponsebench?

Open-source alternatives to rapidresponsebench/rapidresponsebench include: arobey1/smooth-llm — This is the official source code for "SmoothLLM: Defending LLMs Against Jailbreaking Attacks" by Alex Robey, Eric… chuhac/reasoning-to-defend — Code for paper. crystaleye42/eval-safety — This is a repository for replicating the experiments from our paper: Pruning for Protection: Increasing Jailbreak… damo-nlp-sg/multilingual-safety-for-llms — 📄 Paper • 🤗 Dataset. devoallen/indust — We have reorganized INDust, aligning evidence with three types of inductive instructions and implementing stricter… aounon/certified-llm-safety — This repository contains code for the paper Certifying LLM Safety against Adversarial Prompting.

35 stars5 forksJupyter Notebook1 view

Star history

Frequently asked questions

What does rapidresponsebench/rapidresponsebench do?

Setting up

What are the main features of rapidresponsebench/rapidresponsebench?

The main features of rapidresponsebench/rapidresponsebench are: Defense Strategies.

What are some open-source alternatives to rapidresponsebench/rapidresponsebench?

Open-source alternatives to rapidresponsebench/rapidresponsebench include: arobey1/smooth-llm — This is the official source code for "SmoothLLM: Defending LLMs Against Jailbreaking Attacks" by Alex Robey, Eric… chuhac/reasoning-to-defend — Code for paper. crystaleye42/eval-safety — This is a repository for replicating the experiments from our paper: Pruning for Protection: Increasing Jailbreak… damo-nlp-sg/multilingual-safety-for-llms — 📄 Paper • 🤗 Dataset. devoallen/indust — We have reorganized INDust, aligning evidence with three types of inductive instructions and implementing stricter… aounon/certified-llm-safety — This repository contains code for the paper Certifying LLM Safety against Adversarial Prompting.

Open-source alternatives to Rapidresponsebench

Similar open-source projects, ranked by how many features they share with Rapidresponsebench.

arobey1/smooth-llm
arobey1/smooth-llm
134View on GitHub
This is the official source code for "SmoothLLM: Defending LLMs Against Jailbreaking Attacks" by Alex Robey, Eric Wong, Hamed Hassani, and George J. Pappas. To learn more about our work, see our blog post.
Python
View on GitHub134
chuhac/reasoning-to-defend
chuhac/Reasoning-to-Defend
12View on GitHub
Code for paper
Python
View on GitHub12
crystaleye42/eval-safety
CrystalEye42/eval-safety
9View on GitHub
This is a repository for replicating the experiments from our paper: Pruning for Protection: Increasing Jailbreak Resistance in Aligned LLMs Without Fine-Tuning .
Jupyter Notebook
View on GitHub9
aounon/certified-llm-safety
aounon/certified-llm-safety
53View on GitHub
This repository contains code for the paper Certifying LLM Safety against Adversarial Prompting.
Python
View on GitHub53

See all 22 alternatives to Rapidresponsebench