A package implementing the NEURAL FINGERPRINTS FOR ADVERSARIAL ATTACK DETECTION

Neural Fingerprints for Adversarial Attack Detection

Neural Fingerprints is a method for detecting adversarial attacks on deep neural networks. This project implements the key components of the Neural Fingerprints approach as described in the paper (link to paper).

The module consists of two main components:

Data base creator - Generates a database of activations from the last layers of a neural network model, for both original images and adversarial examples.
Fingerprints armor - Implements the Neural Fingerprints technique to detect potential adversarial attacks by analyzing the activations in the database.

Data base creator

The Data Base Creator module contains scripts and modules for building a database of activations, taken from the last layers of a neural network model, for original images and the adversarial examples generated from them. The database is used to test the adversarial fingerprint protection mechanism.

Input Structure

The input should be structured as an ImageNet dataset:

images/
├── category_1/
│   ├── image_1.JPEG
│   ├── image_2.JPEG
│   └── ...
├── category_2/
│   ├── image_1.JPEG
│   ├── image_2.JPEG
│   └── ...
└── ...
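As a minimal sketch of how such a folder could be enumerated, the helper below maps each category folder to its image files. The function name list_images_by_category is hypothetical and not part of the package; it assumes images use the .JPEG suffix shown in the layout above.

```python
from pathlib import Path


def list_images_by_category(root):
    """Map each category folder name to its sorted list of .JPEG files."""
    root = Path(root)
    images = {}
    for category_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        # Assumption: only files with the .JPEG suffix count as images.
        files = sorted(f for f in category_dir.iterdir() if f.suffix == ".JPEG")
        images[category_dir.name] = files
    return images
```

Calling list_images_by_category("images") on the tree above would return a dict keyed by category_1, category_2, and so on.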

Output Format

The output will be pickle files containing the activations of the last X layers of the model for the original images and the adversarial examples generated from them.

Example directory structure:

database/
├── category_1/
│   ├── orig
│   │   ├── image_1.pkl
│   │   ├── image_2.pkl
│   │   └── ...
│   ├── attack
│   │   ├── image_1_0_to_1_ifgsm_3_01.pkl
│   │   ├── image_2_0_to_31_ifgsm_1_01.pkl
│   │   └── ...
├── category_2/
│   ├── orig
│   │   ├── image_1.pkl
│   │   ├── image_2.pkl
│   │   └── ...
│   ├── attack
│   │   ├── image_1_0_to_1_ifgsm_96_01.pkl
│   │   ├── image_2_0_to_78_ifgsm_9_01.pkl
│   │   └── ...
└── ...
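A database laid out this way could be read back with a short loop such as the sketch below. The function iter_database is hypothetical, and the assumption that each .pkl file holds a single pickled object is not confirmed by the package; the 0/1 label follows the orig/attack folder split.

```python
import pickle
from pathlib import Path


def iter_database(root):
    """Yield (category, y, path, activations) for every pickle under root.

    y is 0 for files under orig/ and 1 for files under attack/.
    """
    for category_dir in sorted(p for p in Path(root).iterdir() if p.is_dir()):
        for subdir, y in (("orig", 0), ("attack", 1)):
            folder = category_dir / subdir
            if not folder.is_dir():
                continue
            for pkl_path in sorted(folder.glob("*.pkl")):
                with pkl_path.open("rb") as fh:
                    # Assumption: one pickled object per file.
                    activations = pickle.load(fh)
                yield category_dir.name, y, pkl_path, activations
```

Only unpickle files you generated yourself, since pickle.load can execute arbitrary code from an untrusted file.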

Supported Attacks: The module supports generating adversarial examples using Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD) attacks.

Files

attack.py

This module contains the Attack class, which provides methods for performing adversarial attacks on input tensors. It includes methods for Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD) attacks.
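To illustrate the idea behind FGSM, here is a self-contained sketch on a toy logistic-regression model in plain Python. It is not the package's Attack class: the function fgsm_logistic and its parameters are assumptions chosen so that the input gradient can be written in closed form.

```python
import math


def sign(v):
    """Return -1, 0 or 1 depending on the sign of v."""
    return (v > 0) - (v < 0)


def fgsm_logistic(x, y, w, b, eps):
    """One FGSM step against a toy logistic-regression model.

    For p = sigmoid(w.x + b) and binary cross-entropy loss, the gradient of
    the loss with respect to the input is (p - y) * w.
    """
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    p = 1.0 / (1.0 + math.exp(-z))
    grad = [(p - y) * wi for wi in w]
    # FGSM: shift every input coordinate by eps in the direction that raises the loss.
    return [xi + eps * sign(gi) for xi, gi in zip(x, grad)]
```

PGD can be viewed as repeating such steps with a smaller step size and projecting the result back into an eps-ball around the original input after each step.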

get_last_layers_activations.py

This module contains the ModelLastLayers class, which is used to extract activations from the last X FC layers of a model. It registers hooks to capture the activations during the forward pass.

db_creator.py

The main script to create the DB. It uses the ModelLastLayers class to extract activations and the Attack class to generate adversarial examples.

Example

See db_creator_test for an example of how to run the db_creator file.

Fingerprints Armor

The folder fingerprints_armor contains scripts and modules that implement the Neural Fingerprints method to detect adversarial attacks. The goal is to create fingerprints such that they can be used by the following methods: vote, anomaly, and ll_ratio to detect patterns that match an adversarial attack.

The fingerprint method is implemented in SingleClassFingerprintsArmor, which finds the fingerprints for each class and uses them to detect adversarial attacks on that class.

The input is the output of the Data Base Creator. The module provides the following functions:

vote: Predicted class labels based on the voting mechanism.
anomaly (likelihood): Anomaly score for each data point based on likelihood anomaly detection.
likelihood_ratio: Likelihood ratio value used to determine if a data point is an adversarial attack or not.

These functions take as input a DataFrame containing the activations from the last X layers of the model, recorded after a suspicious image has been run through it, and return detection results indicating whether the image has been attacked.
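The general idea of a likelihood-ratio score can be sketched on a single scalar feature. This is a toy illustration under stated assumptions, not the package's algorithm: it fits one Gaussian to values from original images and one to values from attacked images, and the function name fit_likelihood_ratio and the direction of the ratio (attack over original) are hypothetical choices.

```python
from statistics import NormalDist


def fit_likelihood_ratio(clean_values, attack_values):
    """Return a function x -> p_attack(x) / p_clean(x) for two fitted Gaussians."""
    clean = NormalDist.from_samples(clean_values)
    attack = NormalDist.from_samples(attack_values)

    def ratio(x):
        # Values far from the clean data can make clean.pdf(x) underflow to 0.0.
        return attack.pdf(x) / clean.pdf(x)

    return ratio
```

With this convention, a ratio above 1 means the value is better explained by the attack distribution than by the original one.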

Files

fingerprints_io.py

This module handles the input and output operations for processing the activations database.

fingerprints.py

The main script that implements the activation fingerprint detection algorithm to identify potential adversarial attacks on neural networks. It uses statistical methods for fingerprint analysis.

Example

See fingerprints_armor_test for an example of how to run the fingerprints file and get the results.

In this test, each class tested will result in a CSV file with the following structure:

File Naming Convention: Each CSV file is named after the class being analyzed (e.g., class_name.csv).
File Structure: The CSV file contains the following columns:

y	vote	anomaly	ll_ratio	cls
0	0	0.2	1.5	class_1
1	1	0.3	2.1	class_1
0	0	0.1	1.2	class_1
1	1	0.4	2.5	class_1
...	...	...	...	...

Each row represents a data point, with the corresponding values for the actual label, predicted class label (vote), anomaly score, and likelihood ratio.

Aggregated Results:

After running the tests for all classes, the individual CSV files are concatenated into a single DataFrame (data). The aggregated results provide a comparison across multiple classes, showing how the defense mechanism performed in terms of voting, anomaly detection, and likelihood ratio.
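The aggregation step could look roughly like the sketch below. It uses only the standard library and returns a list of row dicts instead of the DataFrame mentioned above. The function name load_results is hypothetical, and it assumes comma-delimited files with a header row y,vote,anomaly,ll_ratio,cls.

```python
import csv
from pathlib import Path


def load_results(results_dir):
    """Read every per-class CSV in results_dir into one list of row dicts."""
    rows = []
    for csv_path in sorted(Path(results_dir).glob("*.csv")):
        with csv_path.open(newline="") as fh:
            for row in csv.DictReader(fh):
                # Convert the numeric columns; cls stays a string.
                rows.append({
                    "y": int(row["y"]),
                    "vote": int(row["vote"]),
                    "anomaly": float(row["anomaly"]),
                    "ll_ratio": float(row["ll_ratio"]),
                    "cls": row["cls"],
                })
    return rows
```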

Metric Columns:

vote: Based on the voting mechanism (e.g., majority vote from multiple fingerprint models).
anomaly: Anomaly scores that help identify unusual data points.
ll_ratio: Likelihood ratio for detecting adversarial examples.
Ground Truth (y): The ground truth label (0 for original data and 1 for attacked data) to compare against the predicted values.

Confusion Matrix:

For each metric (vote, anomaly, and ll_ratio), the confusion matrix is calculated. The confusion matrix compares the predicted values (from vote, anomaly, or ll_ratio) against the actual ground truth (y). This helps evaluate the performance of the adversarial defense mechanism.

Confusion Matrix Structure:

	Predicted: 0	Predicted: 1
True: 0	True Negative (TN)	False Positive (FP)
True: 1	False Negative (FN)	True Positive (TP)

Where:

True Negative (TN): The number of original data points classified correctly as original.
False Positive (FP): The number of original data points classified incorrectly as attacked.
False Negative (FN): The number of attacked data points classified incorrectly as original.
True Positive (TP): The number of attacked data points classified correctly as attacked.
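The four counts can be computed directly from the ground-truth column and a binary prediction column. The sketch below follows the matrix layout above (rows are ground truth, columns are predictions, 1 means attacked); the function name confusion_counts is hypothetical.

```python
def confusion_counts(y_true, y_pred):
    """Return (tn, fp, fn, tp) for binary labels where 1 means attacked."""
    tn = fp = fn = tp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == 0 and pred == 0:
            tn += 1  # original, classified as original
        elif truth == 0 and pred == 1:
            fp += 1  # original, flagged as attacked
        elif truth == 1 and pred == 0:
            fn += 1  # attacked, missed
        elif truth == 1 and pred == 1:
            tp += 1  # attacked, detected
    return tn, fp, fn, tp
```

For continuous metrics such as anomaly or ll_ratio, the values would first have to be turned into 0/1 predictions with a threshold.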

Quantile-Based Decision:

For each of the metrics (vote, anomaly, ll_ratio), the output is analyzed using a threshold set to the 1% quantile of that metric's values on the attack data, so roughly 99% of those attack values lie at or above it. If a prediction for a particular metric exceeds this threshold, it is classified as an adversarial attack. This helps in assessing the robustness of the defense mechanism at different thresholds.
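A quantile-based decision of this kind can be sketched as follows. The helper names quantile and flag_attacks are hypothetical, and the linear-interpolation rule is an assumption; the package may compute its quantile differently.

```python
def quantile(values, q):
    """Linear-interpolation quantile of values for 0 <= q <= 1."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("values must not be empty")
    pos = q * (len(ordered) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


def flag_attacks(scores, attack_scores, q=0.01):
    """Flag every score strictly above the q-quantile of the attack scores."""
    threshold = quantile(attack_scores, q)
    return [score > threshold for score in scores]
```

Here attack_scores would hold one metric's values for known attacked samples, and scores the values to be classified.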

Configuration and Settings

The project uses a settings approach to manage configurations, allowing you to set paths, thresholds, and parameters directly in the code. This means you don’t have to specify configurations through command-line arguments each time you run the script, which simplifies setup and execution.

Open the settings script and adjust the values as needed.
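A settings script in this style might look like the sketch below. Every name and default here is hypothetical and only illustrates the approach: the two paths reuse the container mount points from the Docker section, and the layer count is a placeholder.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class Settings:
    """Hypothetical settings container; all field names are illustrative."""

    images_dir: Path = Path("/image_net_data")
    database_dir: Path = Path("/neural_fingerprints_db")
    num_last_layers: int = 3       # placeholder for the "last X layers"
    attack_quantile: float = 0.01  # 1% quantile used for the decision threshold


# Scripts would import this object instead of parsing command-line arguments.
SETTINGS = Settings()
```

Keeping the values in one frozen object means a run is configured by editing a single file, and accidental changes at runtime raise an error.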

Setup

Docker

To ensure a consistent and reproducible environment, a Dockerfile is provided. The Docker setup installs all necessary dependencies and sets up the environment for running the scripts.

Building the Docker Image

To build the Docker image, run the following command in the root directory of the repository:

docker build -t neural-fingerprints .

Running the Docker Container

To run the Docker container, use the following command:

docker run -v /path/to/imagenet:/image_net_data/ -v /path/to/output:/neural_fingerprints_db/ -it neural-fingerprints

Replace /path/to/imagenet with the path to your ImageNet dataset and /path/to/output with the path where you want to save the output DB.

Running Without Docker

If you prefer not to use Docker, you can run the project directly on your local machine by setting up a Python environment and installing the required dependencies. Follow the steps below to get started.

Clone the Repository: Start by cloning the repository to your local machine:

git clone https://github.com/yourusername/neural-fingerprints.git
cd neural-fingerprints

Set Up a Virtual Environment: It's recommended to create a virtual environment to isolate the project dependencies. You can do this with venv (or virtualenv if preferred).

For Linux/macOS:

python3 -m venv venv
source venv/bin/activate

For Windows:

python -m venv venv
.\venv\Scripts\activate

Install Dependencies: Once the virtual environment is activated, install the required dependencies using the requirements.txt file:

pip install -r requirements.txt

This will install all the necessary Python libraries, including any dependencies needed for adversarial attack detection.

Running the Project: After the dependencies are installed, you can run the project scripts directly.

To create the activations database (original and adversarial examples), use:

python src/db_creator/db_creator.py

To run the Neural Fingerprints defense on the database and detect adversarial attacks:

python src/fingerprints_armor/fingerprints.py

This will process the activations from the database and output detection results to the specified directory.

Project details

This version

1.0.1

Nov 6, 2024

Download files

Source Distribution

fingerprints_armor-1.0.1.tar.gz (22.2 kB)

Uploaded Nov 6, 2024 Source

Built Distribution

fingerprints_armor-1.0.1-py3-none-any.whl (17.5 kB)

Uploaded Nov 6, 2024 Python 3

File details

Details for the file fingerprints_armor-1.0.1.tar.gz.

File metadata

Download URL: fingerprints_armor-1.0.1.tar.gz
Upload date: Nov 6, 2024
Size: 22.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.10.12

File hashes

Hashes for fingerprints_armor-1.0.1.tar.gz
Algorithm	Hash digest
SHA256	`2ac0f2733801baee1cf90392597c245d65527ae7bbb8212170e46760aa987698`
MD5	`a9038a7c3d232a9131c3420f36bc3c5f`
BLAKE2b-256	`670c2e9492437fcd6b6d702b2a2dc9661980c9ee366463aeca6b9f835759ac98`

File details

Details for the file fingerprints_armor-1.0.1-py3-none-any.whl.

File metadata

Download URL: fingerprints_armor-1.0.1-py3-none-any.whl
Upload date: Nov 6, 2024
Size: 17.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.10.12

File hashes

Hashes for fingerprints_armor-1.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9de4a24a92b3f52dd8d1b4e5d88703e646aad3ca86af4bd60fd34d97accd37a8`
MD5	`8a1244a02766ae7e3e451faa268ec925`
BLAKE2b-256	`3412102713ef6c12bfb759e7b4a0af676957b3e9442dc84d5ad2cb6dcc006903`
