CISPA
Browse

Provably Cost-Sensitive Adversarial Defense via Randomized Smoothing

Download (1.83 MB)
conference contribution
posted on 2025-10-07, 09:11 authored by Yuan Xin, Dingfan Chen, Michael Backes, Xiao ZhangXiao Zhang
As ML models are increasingly deployed in critical applications, robustness against adversarial perturbations is crucial. While numerous defenses have been proposed to counter such attacks, they typically assume that all adversarial transformations are equally important, an assumption that rarely aligns with real-world applications. To address this, we study the problem of robust learning against adversarial perturbations under cost-sensitive scenarios, where the potential harm of different types of misclassifications is encoded in a cost matrix. Our solution introduces a provably robust learning algorithm to certify and optimize for cost-sensitive robustness, building on the scalable certification framework of randomized smoothing. Specifically, we formalize the definition of cost-sensitive certified radius and propose our novel adaptation of the standard certification algorithm to generate tight robustness certificates tailored to any cost matrix. In addition, we design a robust training method that improves certified cost-sensitive robustness without compromising model accuracy. Extensive experiments on benchmark datasets, including challenging ones unsolvable by existing methods, demonstrate the effectiveness of our certification algorithm and training method across various cost-sensitive scenarios.

History

Primary Research Area

  • Trustworthy Information Processing

Name of Conference

International Conference on Machine Learning (ICML)

CISPA Affiliation

  • Yes

BibTeX

@conference{Xin:Chen:Backes:Zhang:2025, title = "Provably Cost-Sensitive Adversarial Defense via Randomized Smoothing", author = "Xin, Yuan" AND "Chen, Dingfan" AND "Backes, Michael" AND "Zhang, Xiao", year = 2025, month = 7 }

Usage metrics

    Categories

    No categories selected

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC