AROID: Improving Adversarial Robustness Through Online Instance-Wise Data Augmentation

Li, Lin; Qiu, Jianing; Spratling, Michael

doi:10.1007/s11263-024-02206-4

AROID: Improving Adversarial Robustness Through Online Instance-Wise Data Augmentation

Open access
Published: 24 August 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

International Journal of Computer Vision Aims and scope Submit manuscript

AROID: Improving Adversarial Robustness Through Online Instance-Wise Data Augmentation

Download PDF

290 Accesses
Explore all metrics

Abstract

Deep neural networks are vulnerable to adversarial examples. Adversarial training (AT) is an effective defense against adversarial examples. However, AT is prone to overfitting which degrades robustness substantially. Recently, data augmentation (DA) was shown to be effective in mitigating robust overfitting if appropriately designed and optimized for AT. This work proposes a new method to automatically learn online, instance-wise, DA policies to improve robust generalization for AT. This is the first automated DA method specific for robustness. A novel policy learning objective, consisting of Vulnerability, Affinity and Diversity, is proposed and shown to be sufficiently effective and efficient to be practical for automatic DA generation during AT. Importantly, our method dramatically reduces the cost of policy search from the 5000 h of AutoAugment and the 412 h of IDBH to 9 h, making automated DA more practical to use for adversarial robustness. This allows our method to efficiently explore a large search space for a more effective DA policy and evolve the policy as training progresses. Empirically, our method is shown to outperform all competitive DA methods across various model architectures and datasets. Our DA policy reinforced vanilla AT to surpass several state-of-the-art AT methods regarding both accuracy and robustness. It can also be combined with those advanced AT methods to further boost robustness. Code and pre-trained models are available at: https://github.com/TreeLLi/AROID.

Using the Strongest Adversarial Example to Alleviate Robust Overfitting

One Size Does NOT Fit All: Data-Adaptive Adversarial Training

One radish, One hole: Specific adversarial training for enhancing neural network’s robustness

Article 07 June 2021

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Deep neural networks (DNNs) are well known to be vulnerable to infinitesimal yet highly malicious artificial perturbations in their input, i.e., adversarial examples (Szegedy et al., 2014). The lack of robustness cause a crisis of security and trustworthiness for applications built on DNNs and thus hinders their further deployment in real world applications especially in the critical domains like healthcare (Qiu et al., 2023). Thus far, adversarial training (AT) has been the most effective defense against adversarial attacks (Athalye et al., 2018). AT is typically formulated as a min-max optimization problem:

$$\begin{aligned} \arg \min _{\varvec{\theta }} \mathbb {E}[\arg \max _{\varvec{\delta }} \mathcal {L}(\varvec{x}+\varvec{\delta }; \varvec{\theta })] \end{aligned}$$

(1)

where the inner maximization searches for the perturbation $\varvec{\delta }$ to maximize the loss, while the outer minimization searches for the model parameters $\varvec{\theta }$ to minimize the loss on the perturbed examples.

One major issue of AT is that it is prone to overfitting (Rice et al., 2020; Wong et al., 2020). Unlike in standard training (ST), overfitting in AT, a.k.a. robust overfitting (Rice et al., 2020), significantly impairs adversarial robustness. Many efforts (Li & Spratling, 2023b; Wu et al., 2020; Dong et al., 2022; Liu et al., 2023; Liu & Satoh, 2023) have been made to understand robust overfitting and mitigate its effect. One promising solution is data augmentation (DA), which is a common technique to prevent ST from overfitting. However, many studies (Rice et al., 2020; Wu et al., 2020; Gowal et al., 2021; Rebuffi et al., 2021) have revealed that advanced DA methods, originally proposed for ST, often fail to improve adversarial robustness. Therefore, DA is usually combined with other regularization techniques such as Stochastic Weight Averaging (SWA) (Rebuffi et al., 2021), Consistency regularization (Tack et al., 2022) and Separate Batch Normalization (Addepalli, Jain, and Radhakrishnan, 2022) to improve its effectiveness. However, recent work (Li & Spratling, 2023c) demonstrated that DA alone can significantly improve AT if it has strong diversity and well-balanced hardness. This suggests that ST and AT may require different DA strategies, especially in terms of hardness. It is thus necessary to design DA schemes dedicated to AT.

IDBH (Li & Spratling, 2023c) is the latest DA scheme specifically designed for AT. Despite its impressive robust performance, IDBH employs a heuristic search method to manually optimize the DA. This search process requires a complete AT for every sampled policy, which induces prohibitive computational cost and scales poorly to large datasets and models. Hence, when the computational budget is limited, the hyperparameters for IDBH might be found using a reduced search space^{Footnote 1} and by employing a smaller model, leading to compromised performance.

Another issue is that IDBH, in common with other conventional DA methods such as AutoAugment (Cubuk et al., 2019) and TrivialAugment (Müller & Hutter, 2021), applies the same strategy to all samples in the dataset throughout training. The distinctions between different training samples, and between the model checkpoints at different stages of training, are neglected. We hypothesize that different data samples at the same stage of training, as well as the same sample at the different stages of training, demand different DAs. Hence, we conjecture that an improvement in robustness could be realized by customizing DA for data samples and training stages.

To address the above issues, this work proposes a bi-level optimization framework (see Fig. 1) to automatically learn Adversarial Robustness by Online Instance-wise Data-augmentation (AROID). To the best of our knowledge, AROID is the first automated DA method specific to adversarial robustness. AROID employs a multi-head DNN-based policy model to map a data sample to a DA policy (see Fig. 2). This DA policy is defined as a sequence of pre-defined transformations applied with strength determined by the output of the policy model. This policy model is optimized, alongside the training of the target model, towards three novel objectives to achieve a target level of hardness and diversity. DA policies, therefore, are customized for each data instance and evolve with the target network as training progresses. This in practice produces a more globally optimal DA policy and thus benefits robustness. Importantly, the proposed policy learning objectives, in contrast to the conventional ones like validation accuracy (Cubuk et al., 2019), do not reserve a subset of the training data for validation and do not rely on prohibitively expensive inner loops for training the target model to evaluate the rewards of the sampled policies. The former ensures the entire training set is available for training to avoid potential data scarcity. The latter enables policy optimization to be much more efficient and scalable so that it is more practical for AT. Compared to IDBH in particular, this allows our approach to explore a larger space of DAs. Taking an example of optimizing the DA for CIFAR10 and PRN18, AROID took 9 h using an A100 GPU, IDBH took 412 h using an A100 GPU, and AutoAugment took 5000 h using a P100 GPU (Hataya et al., 2020).

Extensive experiments show that AROID outperforms all competitive DA methods across various datasets and model architectures while being more efficient than the previous best method (IDBH). AROID achieves state-of-the-art robustness for DA methods on the standard benchmarks. Besides, AROID outperforms, regarding accuracy and robustness, state-of-the-art AT methods. It also complements such robust training methods and can be combined with them to improve robustness further.

2 Related Work

Robust training. To mitigate overfitting in AT, many methods other than DA, have been previously proposed. One line of works, IGR (Ross & Doshi-Velez, 2018), CURE (Moosavi-Dezfooli et al., 2019), AdvLC (Li & Spratling, 2023b), discovered a connection between adversarial vulnerability and the smoothness of input loss landscape, and promoted robustness by smoothing the input loss landscape. Meanwhile, Wu et al. (2020) and Chen et al. (2021) found that robust generalization can be improved by a flat weight loss landscape and proposed AWP and SWA, respectively, to smooth the weight loss landscape during AT. RWP (Yu et al., 2022) and SEAT (Wang & Wang, 2022) were later proposed to further refine AWP and SWA, respectively, to increase robustness. SCARL (Kuang et al., 2023) incorporated semantic information into adversarial training. IBD (Kuang et al., 2023) distilled prior knowledge from a robust pre-trained model to enhance adversarial robustness. Many works, including MART (Wang et al., 2020), LAS-AT (Jia et al., 2022), ISEAT (Li & Spratling, 2023a), considered the difference between individual training instances and improved AT through regularizing in an instance-wise manner. Our proposed approach is also instance-wise, but contrary to existing methods tackles robust overfitting via DA instead of robust regularization. As shown in Sect. 4.5, it works well alone and, more importantly, complements the above techniques.

Data augmentation for ST. Although DA has been a common practice in many fields, we only review vision-based DA in this section as it is most related to our work. In computer vision, DA can be generally categorized as: basic, composite and mixup. Basic augmentations refer to a series of image transformations that can be applied independently. They mainly include crop-based (Random Crop (He et al., 2016a), Cropshift (Li & Spratling, 2023c), etc.), color-based (Brightness, Contrast, etc.), geometric-based (Rotation, Shear, etc.) and dropout-based (Cutout (DeVries & Taylor, 2017), Random Erasing (Zhong et al., 2020), etc.) transformations. Composite augmentations denote the composition of basic augmentations. Augmentations are composed into a single policy/schedule usually through two ways: interpolation (Hendrycks et al., 2020; Wang et al., 2021) and sequencing (Cubuk et al., 2019, 2020; Müller & Hutter, 2021). MixUp (Zhang et al., 2017), and analogous works like CutMix (Yun et al., 2019), can be considered as a special case of interpolation-based composition, which combines a pair of different images, instead of augmentations, as well as their labels to create a new image and its label.

Composite augmentations by design have many hyperparameters to optimize. Most previous works, as well as the pioneering AutoAugment (Cubuk et al., 2019), tackled this issue using automated machine learning (AutoML). DA policies were optimized towards maximizing validation accuracy (Cubuk et al., 2019; Lin et al., 2019; Li et al., 2020; Liu et al., 2021), maximizing training loss (Zhang et al., 2020) or matching the distribution density between the original and augmented data (Lim et al., 2019; Hataya et al., 2020). Optimization here is particularly challenging since DA operations are usually non-differentiable. Major solutions seek to estimate the gradient of DA learning objective w.r.t. the policy generator or DA operations using, e.g., policy gradient methods (Cubuk et al., 2019; Zhang et al., 2020; Lin et al., 2019) or reparameterization trick (Li et al., 2020; Hataya et al., 2020). Alternative optimization techniques include Bayesian optimization (Lim et al., 2019) and population-based training (Ho et al., 2019). Noticeably, several works like RandAugment (Cubuk et al., 2020) and TrivialAugment (Müller & Hutter, 2021) found that if the augmentation space and schedule were appropriately designed, competitive results could be achieved using a simple hyperparameter grid search or fixed hyperparameters. This implies that in ST these advanced yet complicated methods may not be necessary. However, it remains an open question if simple search can still match these advanced optimization methods in AT. Besides, instance-wise DA strategy was also explored in Cheung and Yeung (2022); Miao et al. (2023) for ST. Our method is the first automated DA approach specific for AT. We follow the line of policy gradient methods to enable learning DA policies. A key distinction here is that our policy learning objective is designed to guide the learning of DA policies towards improved robustness for AT, while the objective of the above methods is to increase accuracy for ST.

3 Method

We propose a method to automatically learn DA alongside AT to improve robust generalization. An instance-wise DA policy is produced by a policy model and learned by optimizing the policy model towards three novel objectives. Updating of the policy model and the target model (the one being adversarially trained for the target task) alternates throughout training (the policy model is updated every K updates of the target model), yielding an online DA strategy. This online, instance-adaptive, strategy produces different augmentations for different data instances at different stages of training.

The following notation is used. $\varvec{x} \in \mathbb {R}^d$ is a d-dimensional sample whose ground truth label is y. $\varvec{x}_i$ refers to i-th sample in a dataset. The model is parameterized by $\varvec{\theta }$. $\mathcal {L}(\varvec{x}, y; \varvec{\theta })$ or $\mathcal {L}(\varvec{x}; \varvec{\theta })$ for short denotes the predictive loss evaluated with $\varvec{x}$ w.r.t. the model $\varvec{\theta }$ (Cross-Entropy loss was used in all experiments). $\rho (\varvec{x}; \varvec{\theta })$ computes the adversarial example of $\varvec{x}$ w.r.t. the model $\varvec{\theta }$. $p_i(\varvec{x}; \varvec{\theta })$ or $p_i$ for short refers to the output of the Softmax function applied to the final layer of the model, i.e., the probability at i-th logit given the input $\varvec{x}$.

3.1 Modeling the DA Policy

Following the design of IDBH (Li & Spratling, 2023c) and TrivialAugment (Müller & Hutter, 2021), DA is implemented using four types of transformations: flip, crop, color/shape and dropout applied in order. We implement flip using HorizontalFlip, crop using Cropshift (Li & Spratling, 2023c), dropout using Erasing^{Footnote 2} (Zhong et al., 2020), and color/shape using a set of operations including Color, Sharpness, Brightness, Contrast, Autocontrast, Equalize, Shear (X and Y), Rotate, Translate (X and Y), Solarize and Posterize. A dummy operation, Identity, is included in each augmentation group to allow data to pass through unchanged. More details including the complete augmentation space are described in Section A.

To customize the DA applied to each data instance individually, a policy model parameterized by $\varvec{\theta }_{plc}$, is used to produce a DA policy conditioned on the input data (see Fig. 2). The policy model employs a DNN backbone to extract features from the data, and multiple, parallel, linear prediction heads on the top of the extracted features to predict the policy. The policy model used in this work has four heads corresponding to the four types of DA described above. The output of a head is converted into a multinomial distribution where each logit represents a pre-defined sub-policy, i.e., an augmentation operation associated with a strength/magnitude (e.g. ShearX, 0.1). Different magnitudes of the same operation are represented by different logits, so that each has its own chance of being sampled. A particular sequence of sub-policies to apply to the input image are selected based on the probabilities encoded in the four heads of the policy network.

3.2 Objectives for Learning the Data Augmentation Policy

The policy model is trained using three novel objectives: (adversarial) Vulnerability, Affinity and Diversity. These objectives are designed to learn data augmentations with strong diversity and appropriate hardness: requirements that have been shown to be effective for adversarial training (Li & Spratling, 2023c).

3.2.1 Motivation

Intuitively, enhancing the diversity and hardness of data augmentation should help mitigate robust overfitting by increasing the complexity of the training data. Specifically, enhanced diversity increases the number of distinct data augmentations applied during training and expands the effective training set size (Gontijo-Lopes et al., 2021). Increasing hardness raises the difficulty level of the augmented data for the model to learn (adversarially), thereby reducing (robust) overfitting. However, if the hardness exceeds the level that the training model can fit, accuracy and even robustness will decline, despite the reduction in robust overfitting. Therefore, to maximize performance, hardness should be carefully adjusted to balance between reducing robust overfitting and improving overall performance. The optimal level of hardness should therefore be tailored to different models and training settings.

Understanding what kind of data augmentation is effective for adversarial training is not the focus of the current work so we refer the reader to (Li & Spratling, 2023c) for a formal quantitative definition of diversity and hardness, along with extensive experimental evidence supporting the above reasoning.

3.2.2 Objectives

Vulnerability measures the loss variation caused by adversarial perturbation on the augmented data w.r.t. the target model:

$$\begin{aligned} \mathcal {L}_{vul}(\varvec{x}; \varvec{\theta }_{plc})&= \mathcal {L}(\rho (\varvec{\hat{x}}; \varvec{\theta }_{tgt}); \varvec{\theta }_{tgt}) - \mathcal {L}(\varvec{\hat{x}}; \varvec{\theta }_{tgt}) \nonumber \\ \text {where}\ \varvec{\hat{x}}&= \Phi (\varvec{x}; S(\varvec{\theta }_{plc}(\varvec{x}))) \end{aligned}$$

(2)

$\Phi (\varvec{x}; S(\varvec{\theta }_{plc}(\varvec{x})))$ augments $\varvec{x}$ by $S(\varvec{\theta }_{plc}(\varvec{x}))$, the augmentations sampled from the output distribution of policy model conditioned on $\varvec{x}$, so $\varvec{\hat{x}}$ is the augmented data. A larger Vulnerability indicates that $\varvec{x}$ becomes more vulnerable to adversarial attack after DA. A common belief about the relationship between training data and robustness is that AT benefits from adversarially hard samples.^{Footnote 3} (Madry et al., 2018; Li & Spratling, 2023c). From a geometric perspective, maximizing Vulnerability encourages the policy model to project data into the previously less-robustified space.

Nevertheless, the maximization of Vulnerability, if not constrained, would likely favor those augmentations producing samples far away from the original distribution. Training with such augmentations was observed to degrade accuracy and even robustness when accuracy is overly reduced (Li & Spratling, 2023c). Therefore, Vulnerability should be maximized while the distribution shift caused by augmentation is constrained:

$$\begin{aligned} arg\max _{\varvec{\theta }_{plc}}\ \mathcal {L}_{vul}(\varvec{x}; \varvec{\theta }_{plc})\ \ \text {s.t.}\ ds(\varvec{x}, \varvec{\hat{x}}) \le D \end{aligned}$$

(3)

where $ds(\cdot )$ measures the distribution shift between two samples and D is a constant. Directly solving Eq. (3) is intractable, so we convert it into an unconstrained optimization problem by adding a penalty on the distribution shift as:

$$\begin{aligned} arg\max _{\varvec{\theta }_{plc}}\ \mathcal {L}_{vul}(\varvec{x}; \varvec{\theta }_{plc}) - \lambda \cdot ds(\varvec{x}, \varvec{\hat{x}}) \end{aligned}$$

(4)

where $\lambda $ is a hyperparameter and a larger $\lambda $ corresponds to a tighter constraint on distribution shift, i.e., smaller D. Distribution shift is measured using a variant of the Affinity metric (Gontijo-Lopes et al., 2021):

$$\begin{aligned} ds(\varvec{x}, \varvec{\hat{x}}) = \mathcal {L}_{aft}(\varvec{x}; \varvec{\theta }_{plc}) = \mathcal {L}(\varvec{\hat{x}}; \varvec{\theta }_{aft}) - \mathcal {L}(\varvec{x}; \varvec{\theta }_{aft}) \end{aligned}$$

(5)

Affinity captures the loss variation caused by DA w.r.t. a model $\varvec{\theta }_{aft}$ (called the affinity model): a model pre-trained on the original data (i.e., without any data augmentation). Affinity increases as the augmentation proposed by the policy network makes data harder for the affinity model to correctly classify. By substituting Eq. (5) into Eq. (4), we obtain an adjustable Hardness objective:

$$\begin{aligned} \mathcal {L}_{hrd}(\varvec{x}; \varvec{\theta }_{plc}) = \mathcal {L}_{vul}(\varvec{x}; \varvec{\theta }_{plc}) - \lambda \cdot \mathcal {L}_{aft}(\varvec{x}; \varvec{\theta }_{plc}) \end{aligned}$$

(6)

This encourages the DA produced by the policy model to be at a level of hardness defined by $\lambda $ (larger values of $\lambda $ corresponding to lower hardness). Ideally, $\lambda $ should be tuned to ensure the distribution shift caused by DA is sufficient to benefit robustness while not being so severe as to harm accuracy.

Last, we introduce a Diversity objective to promote diverse DA. Diversity enforces a relaxed uniform distribution prior over the logits of the policy model, i.e., the output augmentation distribution:

$$\begin{aligned} \mathcal {L}_{div}^h (\varvec{x}) = \frac{1}{C} \left[ - \sum _i^{p_i^h < l} \log (p_i^h) + \sum _j^{p_j^h > u} \log (p_j^h)\right] \end{aligned}$$

(7)

C is the total count of logits violating either lower (l), or upper (u) limits and h is the index of the prediction head. Intuitively speaking, the Diversity loss penalizes overly small and large probabilities, helping to constrain the distribution to lie in a pre-defined range (l, u). As l and u approach the mean probability, the enforced prior becomes closer to a uniform distribution, which corresponds to a highly diverse DA policy. Diversity encourages the policy model to avoid the over-exploitation of certain augmentations and to explore other candidate augmentations. Note that Diversity is applied to the color/shape head in a hierarchical way: type-wise and strength-wise inside each type of augmentation.

Combining the above three objectives together, the policy model is trained to optimize:

$$\begin{aligned} arg \min _{\varvec{\theta }_{plc}} \ -\mathbb {E}_{i \in B} \mathcal {L}_{hrd}(\varvec{x}_i) + \beta \cdot \mathbb {E}_{h \in H} \mathcal {L}_{div}^h (\varvec{x}; \varvec{\theta }_{plc}) \end{aligned}$$

(8)

where B is the batch size and $\beta $ trades-off hardness against diversity. $\mathcal {L}_{div}^h$ is calculated across instances in a batch, so no need for averaging over B like $\mathcal {L}_{hrd}$.

3.2.3 Mechanism

The Vulnerability objective is computed using feedback on adversarial vulnerability, measured by the variation in loss caused by adversarial perturbations, from the target model. The policy model learns from this feedback to determine which types and magnitudes of data augmentation (DA) elevates the adversarial vulnerability of augmented data. This learning raises the likelihood of applying such augmentations to the training data, thereby resulting in increased hardness. Meanwhile, the Affinity objective is employed to limit DA’s hardness to a level that does not compromise performance. Additionally, the Diversity objective prevents the over-reliance on specific DA methods, promoting exploration across a diverse spectrum of augmentation techniques. Together, these three objectives dictate the appropriate DA for each training sample.

3.3 Optimization

The entire training is a bi-level optimization process (Algorithm 1): the target and policy models are updated alternately. This online training strategy adapts the policy model to the varying demands for DA from the target model at the different stages of training. The target model is optimized using AT with the augmentation sampled from the policy model:

$$\begin{aligned} arg \min _{\varvec{\theta }_{tgt}} \mathcal {L}(\rho (\Phi (\varvec{x}; S(\varvec{\theta }_{plc}(\varvec{x})));\varvec{\theta }_{tgt}); \varvec{\theta }_{tgt}) \end{aligned}$$

(9)

After every K updates of the target model, the policy model is updated using the gradients of the policy learning loss as follows:

$$\begin{aligned} \frac{(8)}{\partial \varvec{\theta }_{plc}} = - \frac{\partial \mathbb {E}_{i \in B} \mathcal {L}_{hrd}(\varvec{x}_i)}{\partial \varvec{\theta }_{plc}} + \beta \frac{\mathbb {E}_{h \in H} \mathcal {L}_{div}^h (\varvec{x})}{\partial \varvec{\theta }_{plc}} \end{aligned}$$

(10)

The latter can be derived directly, while the former $\frac{\partial \mathcal {L}_{hrd}}{\partial \varvec{\theta }_{plc}}$ cannot because the involved augmentation operations are non-differentiable. To estimate these gradients, we apply the REINFORCE algorithm (Williams, 1992) with baseline trick to reduce the variance of gradient estimation. It first samples T augmentations, named trajectories, in parallel from the policy model and then computes the real Hardness value, $\mathcal {L}_{hrd}^{(t)}$, using Eq. (6) independently on each trajectory t. The gradients are estimated (see Section B for derivation) as follows:

$$\begin{aligned} \frac{1}{B\cdot T}\sum _{i=1}^B\sum _{t=1}^T \sum _{h=1}^H \frac{\partial \log (p_{(t)}^h(\varvec{x}_i))}{\partial \varvec{\theta }_{plc}} [\mathcal {L}_{hrd}^{(t)}(\varvec{x}_i) - \tilde{\mathcal {L}_{hrd}}] \end{aligned}$$

(11)

$p_{(t)}^h$ is the probability of the sampled sub-policy at the h-th head and $\tilde{\mathcal {L}_{hrd}}=\frac{1}{T}\sum _{t=1}^T \mathcal {L}_{hrd}^{(t)}(\varvec{x}_i)$ is the mean $\mathcal {L}_{hrd}$ (the baseline used in the baseline trick) averaged over the trajectories. Algorithm 2 illustrates one iteration of updating the policy model. Note that, when one model is being updated, backpropagation is blocked through the other. The affinity model, used in calculating the Affinity metric, is fixed throughout training.

3.4 Modes of Application

AROID can be used in two modes: online and offline. In the online mode, the policy and target models are jointly trained so that the policy model has to be retrained every time a new target model is trained. This adapts the DA policy to the target model on-the-fly which improves effectiveness but adds the extra cost of policy learning to that of adversarial training. In the offline mode, the training of policy and target models are separate phases. A policy model is trained in advance (using online AROID), a step that is analogous to the hyperparameter optimization of other DA methods. This pre-trained policy model is then subsequently used to train a new target model. Specifically, at each epoch of training the target network a policy network checkpoint, saved at the corresponding epoch when using online AROID, is used to sample DA policies for training the target model. When AROID is deployed in this offline mode, we refer to it as AROID-T, as it involves the transfer of the policy model. The standard mode of application is online, which we refer to simply as AROID.

3.5 Efficiency

The efficiency of AROID is dependent on the mode. The cost of AROID is composed of two parts: policy learning and DA sampling. Policy learning can be one-time expense if AROID is used in offline mode. DA sampling requires only one forward pass of the policy model, which can be negligible because the policy model can be much smaller than the target model without hurting the performance. Therefore, AROID in offline mode is roughly as efficient as other regular DA methods.

In online mode, in the worst case, AROID adds about 43.6% extra computation to baseline AT (see calculation in Section C) when $T=8$ and $K=5$. This is less than the overhead 52.5% of the state-of-the-art AT method LAS-AT (Jia et al., 2022) and substantially less than the search cost of IDBH and AutoAugment (compared in Sect. 4.4). Furthermore, we observed that AROID can still achieve robustness higher than other competitors with a much smaller policy model (Sect. 4.13.3), reduced T and increased K (Sect. 4.4) for improved efficiency. For example, setting $T=4$ and $K=20$, the overhead is only about 10% compared to baseline AT.

Another efficiency concern, as for all other deep learning methods, is hyperparameter optimization. We discuss below how this can be done efficiently so that AROID can be easily adapted to a new setting. First, as shown in Sect. 4.13.1, most of our hyperparameters can transfer well among different training settings, so that only a light tuning is needed to achieve reasonably good performance for new setting. In most cases, only $\lambda $ needs to be tuned. Second, hyperparameter optimization can be accelerated by first searching with a cheap setting, such as $K=20$ and $T=4$, and then transferring the found values to the final setting, i.e., $K=5$ and $T=8$. Note that our hyperparameter tuning process is not different from others.

Table 1 The performance of various DA methods

AROID: Improving Adversarial Robustness Through Online Instance-Wise Data Augmentation

Abstract

Similar content being viewed by others

Using the Strongest Adversarial Example to Alleviate Robust Overfitting

One Size Does NOT Fit All: Data-Adaptive Adversarial Training

One radish, One hole: Specific adversarial training for enhancing neural network’s robustness

Explore related subjects

1 Introduction

2 Related Work

3 Method

3.1 Modeling the DA Policy

3.2 Objectives for Learning the Data Augmentation Policy

3.2.1 Motivation

3.2.2 Objectives

3.2.3 Mechanism

3.3 Optimization

3.4 Modes of Application

3.5 Efficiency

4 Experiments

4.1 Benchmarking DA on Adversarial Robustness

4.2 Offline Versus Online AROID

4.3 Mitigating Robust Overfitting

4.4 Comparison of Policy Search Costs

4.5 Comparison with State-of-the-Art Robust Training Methods

4.6 Generalization to Alternative AT Methods

4.7 Combining with Extra Data

4.8 Generalization to ImageNet

4.9 Performance on Common Corruption Datasets

4.10 Robustness Evaluation with More Attacks

4.11 Data Scaling Versus Model Scaling

4.12 Enlarging Policy Search Space

4.13 Ablation Study

4.13.1 Hyperparameters

4.13.2 Policy Learning Objectives

4.13.3 Policy Model Architecture

4.13.4 Uniform Sampling

4.14 Analysis of Learned DA Policies

4.14.1 Progression of Policy Learning Objectives

4.14.2 Visualization of Learned DA Policies

4.14.3 Visualization of Augmented Data Samples

5 Conclusions

Availability of data and material

Code availability

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethics approval

Consent to participate

Consent for publication

Additional information

Publisher's Note

Appendices

Appendix A DA Search Space

Appendix B Derivation

Appendix C Efficiency Analysis

Appendix D Experimental Set-ups

1.1 D.1 Configuration of AROID

1.2 D.2 Configuration of Compared DA Methods

1.3 D.3 Configuration of Compared State-of-the-art Robust Training Methods

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation