Generate datasets amd models based on vulnerabilities data from Vulnerability-Lookup.
Project description
VulnTrain
VulnTrain offers a suite of commands to generate diverse AI datasets and train models using comprehensive vulnerability data from Vulnerability-Lookup. It harnesses over one million JSON records from all supported advisory sources to build high-quality, domain-specific models.
Additionally, data from the vulnerability-lookup:meta container, including enrichment sources such as vulnrichment and Fraunhofer FKIE,
is incorporated to enhance model quality.
Check out the datasets and models on Hugging Face:
For more information about the use of AI in Vulnerability-Lookup, please refer to the user manual.
Usage
Install VulnTrain:
$ pipx install VulnTrain
Three types of commands are available:
- Dataset generation: Create and prepare datasets.
- Model training: Train models using the prepared datasets.
- Model validation: Assess the performance of trained models (validations, benchmarks, etc.).
Check out the documentation for more information.
How to cite
Bonhomme, C., & Dulaunoy, A. (2025). VLAI: A RoBERTa-Based Model for Automated Vulnerability Severity Classification (Version 1.4.0) [Computer software]. https://doi.org/10.48550/arXiv.2507.03607
@misc{bonhomme2025vlai,
title={VLAI: A RoBERTa-Based Model for Automated Vulnerability Severity Classification},
author={Cédric Bonhomme and Alexandre Dulaunoy},
year={2025},
eprint={2507.03607},
archivePrefix={arXiv},
primaryClass={cs.CR}
}
License
VulnTrain is licensed under GNU General Public License version 3
Copyright (c) 2025 Computer Incident Response Center Luxembourg (CIRCL)
Copyright (C) 2025 Cédric Bonhomme - https://github.com/cedricbonhomme
Copyright (C) 2025 Léa Ulusan - https://github.com/3LS3-1F
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vulntrain-2.1.0.tar.gz.
File metadata
- Download URL: vulntrain-2.1.0.tar.gz
- Upload date:
- Size: 258.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c9ee533f8c196e3c7b4082949a664384c0b686940e15cdfbb8dbb1d87e9ffadd
|
|
| MD5 |
c1c13761aa9eb9706e3e496fc972ee37
|
|
| BLAKE2b-256 |
63e44a609cbb8af793fa111215576459ad4c62017ed244cc372e1158a5e9fe06
|
Provenance
The following attestation bundles were made for vulntrain-2.1.0.tar.gz:
Publisher:
release.yml on vulnerability-lookup/VulnTrain
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
vulntrain-2.1.0.tar.gz -
Subject digest:
c9ee533f8c196e3c7b4082949a664384c0b686940e15cdfbb8dbb1d87e9ffadd - Sigstore transparency entry: 707158035
- Sigstore integration time:
-
Permalink:
vulnerability-lookup/VulnTrain@d4cbd7c1e8318002b791ee8fd8b56598ebda7ce1 -
Branch / Tag:
refs/tags/v2.1.0 - Owner: https://github.com/vulnerability-lookup
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@d4cbd7c1e8318002b791ee8fd8b56598ebda7ce1 -
Trigger Event:
release
-
Statement type:
File details
Details for the file vulntrain-2.1.0-py3-none-any.whl.
File metadata
- Download URL: vulntrain-2.1.0-py3-none-any.whl
- Upload date:
- Size: 267.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9c6cf24204e15254234d137b50911d23367b0b482dda014bac781aadf10cec67
|
|
| MD5 |
cfca3760c01277e6e694c171d2ccd5e7
|
|
| BLAKE2b-256 |
8a9ebb1929c4c3055384ac9b2aac75db2a48f438e30faad51375c5ef5d36d0b2
|
Provenance
The following attestation bundles were made for vulntrain-2.1.0-py3-none-any.whl:
Publisher:
release.yml on vulnerability-lookup/VulnTrain
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
vulntrain-2.1.0-py3-none-any.whl -
Subject digest:
9c6cf24204e15254234d137b50911d23367b0b482dda014bac781aadf10cec67 - Sigstore transparency entry: 707158036
- Sigstore integration time:
-
Permalink:
vulnerability-lookup/VulnTrain@d4cbd7c1e8318002b791ee8fd8b56598ebda7ce1 -
Branch / Tag:
refs/tags/v2.1.0 - Owner: https://github.com/vulnerability-lookup
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@d4cbd7c1e8318002b791ee8fd8b56598ebda7ce1 -
Trigger Event:
release
-
Statement type: