Updated 2024-10-11: The SAU method has now been integrated into BackdoorBench. Users can directly obtain and utilize our method within the platform for testing defenses against backdoor attacks.
Welcome to the official repository for the NeurIPS 2023 paper titled "Shared Adversarial Unlearning: Backdoor Mitigation by Unlearning Shared Adversarial Examples". This project introduces a novel defense mechanism against malicious backdoor attacks in machine learning models.
In this paper, we address the challenge of purifying a backdoored model using a small clean dataset. By linking backdoor risk with adversarial risk, we derive a new upper bound for backdoor risk, primarily focusing on the risk posed by shared adversarial examples (SAEs) between the backdoored and purified models. This leads us to formulate a novel bi-level optimization problem for backdoor mitigation through adversarial training techniques. To tackle this problem, we propose Shared Adversarial Unlearning (SAU). SAU first generates SAEs and then unlearns them, so that they are correctly classified by the purified model and/or classified differently by the two models, thereby reducing the backdoor effect. Our experiments across various benchmarks and network architectures demonstrate that SAU achieves state-of-the-art performance in defending against backdoors.
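To make the idea concrete, here is a minimal, simplified sketch of one SAU-style iteration in PyTorch. It is not the code in `defense/sau.py`: `backdoored_model`, `purified_model`, the hyperparameters, and the exact unlearning objective below are illustrative placeholders (the paper derives its objective from a bound on the shared adversarial risk).

```python
# Simplified illustration of the SAU idea, not the repository's implementation.
# The backdoored model is a frozen copy; only the purified model is updated.
import torch
import torch.nn.functional as F

def generate_shared_adv_examples(backdoored_model, purified_model, x, y,
                                 eps=8 / 255, alpha=2 / 255, steps=5):
    """PGD-style search for perturbations that attack both models at once."""
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        loss = (F.cross_entropy(backdoored_model(x + delta), y)
                + F.cross_entropy(purified_model(x + delta), y))
        loss.backward()
        with torch.no_grad():
            delta += alpha * delta.grad.sign()
            delta.clamp_(-eps, eps)
            delta.grad.zero_()
    return (x + delta).detach()

def sau_step(backdoored_model, purified_model, optimizer, x, y, lam=1.0):
    """One unlearning step: keep clean accuracy, break agreement on shared AEs."""
    x_adv = generate_shared_adv_examples(backdoored_model, purified_model, x, y)
    optimizer.zero_grad()
    clean_loss = F.cross_entropy(purified_model(x), y)
    with torch.no_grad():
        pred_b = backdoored_model(x_adv).argmax(dim=1)
    logits_p = purified_model(x_adv)
    # Shared adversarial examples: both models are fooled and agree on the same
    # wrong label; push the purified model toward the true label on those inputs.
    shared = (pred_b != y) & (logits_p.argmax(dim=1) == pred_b)
    adv_loss = F.cross_entropy(logits_p[shared], y[shared]) if shared.any() else 0.0
    (clean_loss + lam * adv_loss).backward()
    optimizer.step()
```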
To start working with this project:
- Clone the repository:

  ```bash
  git clone https://github.com/shawkui/Shared_Adversarial_Unlearning.git
  cd Shared_Adversarial_Unlearning
  ```

- Install dependencies:

  ```bash
  bash sh/install.sh
  ```
Simulate an attack scenario using the command:
```bash
python attack/badnet.py --save_folder_name badnet_demo
```
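For context, a BadNets-style attack poisons a small fraction of the training set by stamping a fixed trigger patch on images and relabeling them to an attacker-chosen target class; the script above handles the poisoning and training for you. The sketch below only illustrates the poisoning idea with hypothetical names and is not BackdoorBench's implementation:

```python
# Illustrative BadNets-style poisoning (hypothetical helper, not BackdoorBench code).
import torch

def poison_batch(images, labels, target_class=0, poison_rate=0.1, patch_size=3):
    """images: float tensor [N, C, H, W] in [0, 1]; labels: long tensor [N]."""
    images, labels = images.clone(), labels.clone()
    n_poison = int(poison_rate * images.size(0))
    idx = torch.randperm(images.size(0))[:n_poison]
    images[idx, :, -patch_size:, -patch_size:] = 1.0  # white square trigger, bottom-right corner
    labels[idx] = target_class                        # relabel to the attack target class
    return images, labels
```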
After setting up the attack scenario, apply the SAU defense with:
```bash
python defense/sau.py --result_file badnet_demo
```
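BackdoorBench reports the defense metrics (clean accuracy and attack success rate) in the corresponding record folder. If you want to sanity-check a purified model yourself, a generic evaluation loop looks like the sketch below; `purified_model`, `clean_test_loader`, and `poisoned_test_loader` are placeholders you would need to supply.

```python
# Generic evaluation sketch (placeholder names, not tied to the repository's API).
import torch

@torch.no_grad()
def accuracy(model, loader, device="cuda"):
    model.eval()
    correct, total = 0, 0
    for x, y in loader:
        x, y = x.to(device), y.to(device)
        correct += (model(x).argmax(dim=1) == y).sum().item()
        total += y.numel()
    return correct / total

# Clean accuracy: higher is better.
# acc = accuracy(purified_model, clean_test_loader)
# Attack success rate: fraction of triggered inputs classified as the attack
# target (use a loader whose labels are the target class); lower is better.
# asr = accuracy(purified_model, poisoned_test_loader)
```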
If you find our work valuable and use it in your research, please cite our paper using the following BibTeX entry:
```bibtex
@inproceedings{wei2023shared,
  title={Shared Adversarial Unlearning: Backdoor Mitigation by Unlearning Shared Adversarial Examples},
  author={Wei, Shaokui and Zhang, Mingda and Zha, Hongyuan and Wu, Baoyuan},
  booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
  year={2023},
  url={https://openreview.net/forum?id=zqOcW3R9rd}
}
```
Our code is built upon BackdoorBench, "BackdoorBench: A Comprehensive Benchmark of Backdoor Learning". If you find their work useful, consider giving them a star.
For any inquiries or feedback, feel free to open an issue or reach out via email at [email protected].