CISPA
File(s) not publicly available

"What's in the box?!": Deflecting Adversarial Attacks by Randomly Deploying Adversarially-Disjoint Models

conference contribution
posted on 2023-11-29, 18:17 authored by Sahar Abdelnabi, Mario Fritz
Machine learning models are now widely deployed in real-world applications. However, adversarial examples have long been considered a real threat to such models. While numerous defenses aiming to improve robustness have been proposed, many have been shown to be ineffective. As these vulnerabilities are still nowhere near being eliminated, we propose an alternative deployment-based defense paradigm that goes beyond the traditional white-box and black-box threat models. Instead of training and deploying a single partially robust model, one could train a set of models that share the same functionality yet are adversarially disjoint, with minimal attack transferability between them. These models could then be randomly and individually deployed, such that access to one of them minimally affects the others. Our experiments on CIFAR-10 and a wide range of attacks show that we achieve significantly lower attack transferability across our disjoint models compared to a baseline of ensemble diversity. In addition, compared to an adversarially trained set, we achieve a higher average robust accuracy while maintaining accuracy on clean examples.
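
The two ideas in the abstract, attack transferability between models and random deployment of one model from a set, can be illustrated with a minimal sketch. This is not the authors' training or evaluation procedure: the names `Model`, `transfer_rate`, and `deploy_randomly` are hypothetical, the "models" are toy threshold classifiers on a scalar input, and the adversarial examples are hand-picked rather than produced by a real attack.

```python
import random
from typing import Callable, List, Sequence, Tuple

# Hypothetical type: a model maps a scalar input to a class label.
Model = Callable[[float], int]


def transfer_rate(source: Model, target: Model,
                  adv_examples: Sequence[Tuple[float, int]]) -> float:
    """Among examples that fool `source`, return the fraction that also fool `target`."""
    fooled_source = [(x, y) for x, y in adv_examples if source(x) != y]
    if not fooled_source:
        return 0.0
    fooled_both = sum(1 for x, y in fooled_source if target(x) != y)
    return fooled_both / len(fooled_source)


def deploy_randomly(models: List[Model], rng: random.Random) -> Model:
    """Pick one model uniformly at random for a single deployment."""
    return rng.choice(models)


# Toy stand-ins (assumption): threshold classifiers with different decision boundaries.
def model_a(x: float) -> int:
    return int(x > 0.5)


def model_b(x: float) -> int:
    return int(x > 0.2)


# Hand-picked (input, true label) pairs that all fool model_a.
adv = [(0.4, 1), (0.45, 1), (0.6, 0), (0.1, 1)]

rate_a_to_b = transfer_rate(model_a, model_b, adv)
deployed = deploy_randomly([model_a, model_b], random.Random(0))
```

In this toy setting, a lower `transfer_rate` between two models means that an attacker who has access to one of them gains less against the other, which is the property the abstract calls adversarial disjointness.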

Preferred Citation

Sahar Abdelnabi and Mario Fritz. "What's in the box?!": Deflecting Adversarial Attacks by Randomly Deploying Adversarially-Disjoint Models. In: Moving Target Defense Workshop (MTD). 2021.

Primary Research Area

  • Trustworthy Information Processing

Name of Conference

Moving Target Defense Workshop (MTD)

Legacy Posted Date

2021-12-07

Open Access Type

  • Gold

BibTeX

@inproceedings{cispa_all_3526,
  title = {"What's in the box?!": Deflecting Adversarial Attacks by Randomly Deploying Adversarially-Disjoint Models},
  author = {Abdelnabi, Sahar and Fritz, Mario},
  booktitle = {Moving Target Defense Workshop (MTD)},
  year = {2021},
}
