Static Features Extraction Engine
Project description
Static Features Extraction Engine
This project allows the user to extract static features from Windows PE files, which have been proven effective for malware family classification.
Specifically, the list of the chosen features and the extraction process itself adhere to the work proposed in the paper: Decoding the Secrets of Machine Learning in Malware Classification: A Deep Dive into Datasets, Feature Extraction, and Model Performance.
The project was carried out as part of my Master's thesis: Clustering Windows Malware using Static Features and Concept Drift Detection.
Prerequisites
Make sure you have a running and active version of Docker.
Usage
- Configure the Docker Compose file by providing the following information:
MALWARE_DIR_PATH: the path where all the PE files are stored. The directory should group malwares based on their family, so it should contain $n$ subdirectories where $n$ is the number of families;VT_REPORTS_PATH: the path of the VirusTotal reports. Each line of this file should be a separate json containing a report of a single PE file;MERGE_DATASET_PATH: the path of the dataset that will be produced containing[SHA256, family, submission-date]of each file, starting from the VT reports file;FINAL_DATASET_DIR: directory path where the final dataset with the extracted features will be stored.
- Deploy the engine to start the extraction process:
docker compose up -d
Authors
- Luca Fabri
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dts_cdd_wdis-1.2.0.tar.gz.
File metadata
- Download URL: dts_cdd_wdis-1.2.0.tar.gz
- Upload date:
- Size: 27.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.12.3 Linux/6.11.0-1015-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
da280c69b23a02defafb7c41ce36f0eeab3c4a0b94c7d4501aa36e18e44addb2
|
|
| MD5 |
5632ec0a310eacfc12a8d98106eaebd2
|
|
| BLAKE2b-256 |
4f5c5de673a470ea36c1642db419018429f8955895815b43107f9c11dcbe23e8
|
File details
Details for the file dts_cdd_wdis-1.2.0-py3-none-any.whl.
File metadata
- Download URL: dts_cdd_wdis-1.2.0-py3-none-any.whl
- Upload date:
- Size: 36.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.12.3 Linux/6.11.0-1015-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
95633bbf222f21717c76d027966cbc2d93bdde855abdb5ec5dfe02a8dc7fe8a1
|
|
| MD5 |
380b1ce577c18302e6a6d84090d62ee0
|
|
| BLAKE2b-256 |
f454c35a0d10a4478c84dfe6d7beec3232ec6f921ab4c457804bc771828f1f7c
|