
Conference Paper: Benign Overfitting in Adversarially Robust Linear Classification

Title: Benign Overfitting in Adversarially Robust Linear Classification
Authors: Chen, Jinghui; Cao, Yuan; Gu, Quanquan
Issue Date: 1-Aug-2023
Abstract

"Benign overfitting", where classifiers memorize noisy training data yet still achieve a good generalization performance, has drawn great attention in the machine learning community. To explain this surprising phenomenon, a series of works have provided theoretical justification in over-parameterized linear regression, classification, and kernel methods. However, it is not clear if benign overfitting still occurs in the presence of adversarial examples, i.e., examples with tiny and intentional perturbations to fool the classifiers. In this paper, we show that benign overfitting indeed occurs in adversarial training, a principled approach to defend against adversarial examples. In detail, we prove the risk bounds of the adversarially trained linear classifier on the mixture of sub-Gaussian data under ℓp adversarial perturbations. Our result suggests that under moderate perturbations, adversarially trained linear classifiers can achieve the near-optimal standard and adversarial risks, despite overfitting the noisy training data. Numerical experiments validate our theoretical findings.


Persistent Identifier: http://hdl.handle.net/10722/338361

 

DC Field: Value

dc.contributor.author: Chen, Jinghui
dc.contributor.author: Cao, Yuan
dc.contributor.author: Gu, Quanquan
dc.date.accessioned: 2024-03-11T10:28:17Z
dc.date.available: 2024-03-11T10:28:17Z
dc.date.issued: 2023-08-01
dc.identifier.uri: http://hdl.handle.net/10722/338361
dc.description.abstract: "Benign overfitting", where classifiers memorize noisy training data yet still achieve good generalization performance, has drawn great attention in the machine learning community. To explain this surprising phenomenon, a series of works has provided theoretical justification in over-parameterized linear regression, classification, and kernel methods. However, it is not clear whether benign overfitting still occurs in the presence of adversarial examples, i.e., examples with tiny, intentional perturbations designed to fool the classifiers. In this paper, we show that benign overfitting indeed occurs in adversarial training, a principled approach to defend against adversarial examples. Specifically, we prove risk bounds for the adversarially trained linear classifier on a mixture of sub-Gaussian data under ℓp adversarial perturbations. Our result suggests that under moderate perturbations, adversarially trained linear classifiers can achieve near-optimal standard and adversarial risks, despite overfitting the noisy training data. Numerical experiments validate our theoretical findings.
dc.language: eng
dc.relation.ispartof: 39th Conference on Uncertainty in Artificial Intelligence (UAI 2023), 1-3 August 2023, Pittsburgh, Pennsylvania
dc.title: Benign Overfitting in Adversarially Robust Linear Classification
dc.type: Conference_Paper
dc.description.nature: preprint
