Adversarial Attack

Executing non-targeted attacks using the Fast Gradient Sign Method (FGSM) and Iterative Fast Gradient Sign Method (I-FGSM) attack algorithms on proxy networks resulted in the attacked model achieving an accuracy of 0.00%.

Attack Algorithm

FGSM

FGSM stands out for its simplicity and efficiency, showcasing the susceptibility of machine learning models to subtle, meticulously crafted perturbations in input data. This underscores the importance of enhancing the robustness and security of machine learning systems, motivating researchers to develop more resilient models and defenses against adversarial attacks like FGSM.

Given an input sample $x$ and a trained machine learning model $f$, FGSM computes the gradient of the model's loss function $J(f(x), y)$ with respect to the input $x$, where $y$ represents the true label of $x$.
Rather than adjusting the model's parameters, FGSM directly alters the input data $x$ in the direction of the gradient of the loss function with respect to $x$, while restricting the perturbation to a small step (scaled by a small epsilon value) to prevent excessive distortion.
The perturbed input, denoted as $x'$, is calculated using the formula:

$$x' = x + \epsilon \cdot \text{sign}(\nabla_x J(f(x), y))$$

Here, $\epsilon$ denotes a small constant indicating the magnitude of the perturbation, and $\text{sign}(\cdot)$ returns the sign of its argument.

Finally, the adversarial example $x'$ is fed into the model, which is likely to misclassify it due to the minor perturbation aimed at maximizing the loss.

I-FGSM

Initialize $x'$ with the original benign image $x$.
Construct a loop corresponding to the number of iterations for iterative processing.
In each iteration, apply FGSM with $\epsilon = \alpha$ to obtain a new $x'$, then constrain the new $x'$ within the range $[x-\epsilon, x+\epsilon]$, where $\alpha$ represents the step size.

Attack Technique

Simultaneously target multiple proxy models.
Aggregate the variance and bias of models (Delving into Transferable Adversarial Examples and Black-box Attacks).
Employ Ensemble Attack (Query-Free Adversarial Transfer via Undertrained Surrogates).

Dataset

The CIFAR-10 dataset serves as the primary data source for this project, accessible here.

For detailed implementation and usage instructions, please refer to the provided code.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Adversarial_Attack.ipynb		Adversarial_Attack.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Adversarial Attack

Attack Algorithm

FGSM

I-FGSM

Attack Technique

Dataset

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Adversarial Attack

Attack Algorithm

FGSM

I-FGSM

Attack Technique

Dataset

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages