Executing non-targeted attacks using the Fast Gradient Sign Method (FGSM) and Iterative Fast Gradient Sign Method (I-FGSM) attack algorithms on proxy networks resulted in the attacked model achieving an accuracy of 0.00%.
FGSM stands out for its simplicity and efficiency, showcasing the susceptibility of machine learning models to subtle, meticulously crafted perturbations in input data. This underscores the importance of enhancing the robustness and security of machine learning systems, motivating researchers to develop more resilient models and defenses against adversarial attacks like FGSM.
-
Given an input sample
$x$ and a trained machine learning model$f$ , FGSM computes the gradient of the model's loss function$J(f(x), y)$ with respect to the input$x$ , where$y$ represents the true label of$x$ . -
Rather than adjusting the model's parameters, FGSM directly alters the input data
$x$ in the direction of the gradient of the loss function with respect to$x$ , while restricting the perturbation to a small step (scaled by a small epsilon value) to prevent excessive distortion. -
The perturbed input, denoted as
$x'$ , is calculated using the formula:
Here,
- Finally, the adversarial example
$x'$ is fed into the model, which is likely to misclassify it due to the minor perturbation aimed at maximizing the loss.
- Initialize
$x'$ with the original benign image$x$ . - Construct a loop corresponding to the number of iterations for iterative processing.
- In each iteration, apply FGSM with
$\epsilon = \alpha$ to obtain a new$x'$ , then constrain the new$x'$ within the range$[x-\epsilon, x+\epsilon]$ , where$\alpha$ represents the step size.
- Simultaneously target multiple proxy models.
- Aggregate the variance and bias of models (Delving into Transferable Adversarial Examples and Black-box Attacks).
- Employ Ensemble Attack (Query-Free Adversarial Transfer via Undertrained Surrogates).
The CIFAR-10 dataset serves as the primary data source for this project, accessible here.
For detailed implementation and usage instructions, please refer to the provided code.