The AttackBench framework aims to fairly compare gradient-based attacks based on their security evaluation curves. To this end, we derive a process involving five distinct stages, as depicted below.
AttackBench limits the number of forward and backward queries to the model, such that all attacks are compared within a given maximum query budget.
This step establishes a common ground, so that every attack runs under the same assumptions, advantages, and limitations.
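As a rough illustration of how such a budget can be tracked, the sketch below wraps a PyTorch classifier and counts forward and backward passes; the class, attribute, and parameter names are hypothetical and do not reflect AttackBench's actual interface.

```python
import torch
import torch.nn as nn


class QueryCountingModel(nn.Module):
    """Counts the forward and backward queries an attack spends on a model.

    Hypothetical sketch: a benchmark could stop an attack, or truncate its
    results, once forward + backward queries exceed the shared budget.
    """

    def __init__(self, model: nn.Module, max_queries: int = 1_000):
        super().__init__()
        self.model = model
        self.max_queries = max_queries
        self.forward_queries = 0
        self.backward_queries = 0

    def budget_exhausted(self) -> bool:
        return self.forward_queries + self.backward_queries >= self.max_queries

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        self.forward_queries += 1
        if x.requires_grad:
            # each backward pass through this input counts as one backward query
            x.register_hook(self._count_backward)
        return self.model(x)

    def _count_backward(self, grad: torch.Tensor) -> torch.Tensor:
        self.backward_queries += 1
        return grad
```

An attack loop could check `budget_exhausted()` between iterations, or the benchmark could simply compare all attacks at the same recorded query counts.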
We then run the attacks against the selected models individually and collect the performance metrics of interest in our analysis: perturbation size, execution time, and query usage.
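For concreteness, a single attack run can be summarized by exactly these three quantities; the sketch below is one hypothetical way to time a run and record them (the names and call convention are ours, not the benchmark's).

```python
import time
from dataclasses import dataclass

import torch


@dataclass
class RunMetrics:
    """Quantities collected for one attack run (illustrative)."""
    perturbation_size: float
    execution_time: float
    queries: int


def run_and_measure(attack, model, x, y, norm_p=float("inf")) -> RunMetrics:
    # `attack` is assumed to return adversarial examples for the batch (x, y);
    # `forward_queries`/`backward_queries` come from a counting wrapper such
    # as the one sketched above.
    start = time.perf_counter()
    x_adv = attack(model, x, y)
    elapsed = time.perf_counter() - start
    delta = (x_adv - x).flatten(1).norm(p=norm_p, dim=1)  # per-sample norms
    return RunMetrics(
        perturbation_size=delta.median().item(),
        execution_time=elapsed,
        queries=model.forward_queries + model.backward_queries,
    )
```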
Finally, we compare the attacks through our local optimality metric, which quantifies how close an attack is to the optimal solution, and we aggregate these results into a global optimality score used to rank the attacks.
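To convey the intuition behind these metrics (this is an illustrative sketch, not the exact formula from the paper), one can turn each attack's per-sample perturbation norms into a security evaluation curve, build an "optimal" curve from the per-sample best norms found by any attack, and score the attack by how small the gap between the two areas is.

```python
import numpy as np


def security_curve(norms: np.ndarray, epsilons: np.ndarray) -> np.ndarray:
    """Robust accuracy as a function of the perturbation budget: a sample
    still counts as robust at budget eps if the smallest adversarial
    perturbation found for it exceeds eps (np.inf marks a failed attack)."""
    return np.array([(norms > eps).mean() for eps in epsilons])


def _auc(y: np.ndarray, x: np.ndarray) -> float:
    """Trapezoidal area under the curve y(x)."""
    return float(np.sum((y[1:] + y[:-1]) * np.diff(x)) / 2.0)


def optimality_score(attack_norms: np.ndarray,
                     best_norms: np.ndarray,
                     epsilons: np.ndarray) -> float:
    """Illustrative optimality in [0, 1]: 1 when the attack matches the
    per-sample best norms found by any attack, 0 when it never succeeds.
    This sketches the intuition only, not AttackBench's exact definition."""
    auc_attack = _auc(security_curve(attack_norms, epsilons), epsilons)
    auc_best = _auc(security_curve(best_norms, epsilons), epsilons)
    auc_worst = _auc(np.ones_like(epsilons, dtype=float), epsilons)
    return 1.0 - (auc_attack - auc_best) / max(auc_worst - auc_best, 1e-12)
```

Averaging such per-model scores over models and datasets would then give a single global figure per attack.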
In numbers, AttackBench spans 2 datasets, 9 models, 6 libraries, 20 distinct attacks, 102 implementations, and 815 comparisons.
We perform an extensive experimental analysis that compares 20 attacks (listed below), retrieving their original implementations and collecting the other implementations available in popular adversarial attack libraries.
We empirically test a total of 102 implementations, re-evaluating them in terms of runtime, success rate, and perturbation distance, as well as with our newly introduced optimality metrics.
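Comparing many implementations of the same attack requires adapting them to a common call signature; the sketch below shows one hypothetical way to register wrapped implementations under (library, attack) keys. All names are ours and the body is a placeholder, not the benchmark's actual wrappers.

```python
from typing import Callable, Dict, Tuple

import torch

# Common signature every wrapped implementation exposes: it receives a model
# and a labelled batch, and returns adversarial examples of the same shape.
Attack = Callable[[torch.nn.Module, torch.Tensor, torch.Tensor], torch.Tensor]

# Hypothetical registry keyed by (library, attack name).
REGISTRY: Dict[Tuple[str, str], Attack] = {}


def register(library: str, attack_name: str) -> Callable[[Attack], Attack]:
    """Decorator that adds a wrapped implementation to the registry."""
    def decorator(fn: Attack) -> Attack:
        REGISTRY[(library, attack_name)] = fn
        return fn
    return decorator


@register("original-repo", "pgd-linf")
def pgd_from_original_repo(model, x, y):
    # Placeholder: call the authors' original implementation here and
    # translate its arguments and outputs to the common signature.
    raise NotImplementedError
```

A benchmark loop would then iterate over `REGISTRY`, measuring each entry with the same models, budgets, and metrics described above.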
While implementing AttackBench, we collected additional insights, including sub-optimal implementations, attacks returning incorrect results, and errors in the source code that prevent attacks from completing their runs.
These insights could prompt a thorough re-evaluation of the state of the art, as incorrect evaluations might have inflated the results reported in published work.
@article{CinaRony2024AttackBench,
  author  = {Antonio Emanuele Cinà and Jérôme Rony and Maura Pintor and Luca Demetrio and Ambra Demontis and Battista Biggio and Ismail Ben Ayed and Fabio Roli},
  title   = {AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples},
  journal = {arXiv},
  year    = {2024},
  eprint  = {2404.19460},
}