BackdoorBox: A Python Toolbox for Backdoor Learning
Abstract
Third-party resources (, samples, backbones, and pre-trained models) are usually involved in the training of deep neural networks (DNNs), which brings backdoor attacks as a new training-phase threat. In general, backdoor attackers intend to implant hidden backdoor in DNNs, so that the attacked DNNs behave normally on benign samples whereas their predictions will be maliciously changed to a pre-defined target label if hidden backdoors are activated by attacker-specified trigger patterns. To facilitate the research and development of more secure training schemes and defenses, we design an open-sourced Python toolbox that implements representative and advanced backdoor attacks and defenses under a unified and flexible framework. Our toolbox has four important and promising characteristics, including consistency, simplicity, flexibility, and co-development. It allows researchers and developers to easily implement and compare different methods on benchmark or their local datasets. This Python toolbox, namely \texttt{BackdoorBox}, is available at \url{https://github.com/THUYimingLi/BackdoorBox}.
Cite
@article{arxiv.2302.01762,
title = {BackdoorBox: A Python Toolbox for Backdoor Learning},
author = {Yiming Li and Mengxi Ya and Yang Bai and Yong Jiang and Shu-Tao Xia},
journal= {arXiv preprint arXiv:2302.01762},
year = {2023}
}
Comments
BackdoorBox V0.1. The first two authors contributed equally to this toolbox. 13 pages