
A small Python script that enriches Vulnerability STIX objects with other intel


Arango CVE Processor


Before you get started

Arango CVE Processor (ACVEP) is built into Vulmatch, which also handles the download of the CVE objects that ACVEP needs in order to work. As such, Vulmatch is probably better suited to what you're looking for.

tl;dr

A small script that enriches CVEs with data from other sources, with all data stored as STIX 2.1 objects.

Overview

Here at DOGESEC we work with a lot of CVE data across our products. cve2stix generates core STIX 2.1 Vulnerability objects from CVE data.

However, we have lots of other sources (EPSS, KEV, ATT&CK...) that we want to enrich this data with.

We built Arango CVE Processor to handle the generation and maintenance of these enrichments.

In short, Arango CVE Processor is a script that:

  1. reads the ingested CVE STIX data in ArangoDB
  2. creates STIX objects to represent the relationships between CVE and other datasets
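
To make step 2 concrete, here is a minimal sketch of what a STIX 2.1 relationship object linking a CVE to another dataset can look like when built as a plain dictionary. It is an illustration only, not the script's actual code: the `make_relationship` helper, the `related-to` relationship type, the `weakness` object type and both IDs are assumptions made for this example.

```python
import json
import uuid
from datetime import datetime, timezone


def make_relationship(source_ref, target_ref, relationship_type, description):
    """Build a minimal STIX 2.1 relationship object (SRO) as a plain dict."""
    # STIX timestamps are UTC, here trimmed to millisecond precision.
    now = datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%S.%f")[:-3] + "Z"
    return {
        "type": "relationship",
        "spec_version": "2.1",
        "id": f"relationship--{uuid.uuid4()}",
        "created": now,
        "modified": now,
        "relationship_type": relationship_type,
        "source_ref": source_ref,
        "target_ref": target_ref,
        "description": description,
    }


# Hypothetical IDs, generated here for illustration only.
cve_id = f"vulnerability--{uuid.uuid4()}"
cwe_id = f"weakness--{uuid.uuid4()}"

# "related-to" is an assumed relationship type, not necessarily the one the script uses.
sro = make_relationship(cve_id, cwe_id, "related-to", "Example CVE to CWE link")
print(json.dumps(sro, indent=2))
```
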


Usage

Install the script

# clone the latest code
git clone https://github.com/muchdogesec/arango_cve_processor
# create a venv
cd arango_cve_processor
python3 -m venv arango_cve_processor-venv
source arango_cve_processor-venv/bin/activate
# install requirements
pip3 install -r requirements.txt

Configuration options

Arango CVE Processor has various settings that are defined in an .env file.

To create a template for the file:

cp .env.example .env

To see more information about how to set the variables, and what they do, read the .env.markdown file.
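
If you want to check your .env file before running the script, something like the following sketch may help. It parses simple KEY=VALUE lines and reports keys that are empty. The key names below are placeholders invented for this example; the real variable names are documented in the .env.markdown file, and this is not how the script itself loads its settings.

```python
import tempfile
from pathlib import Path


def read_env_file(path):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    settings = {}
    for raw in Path(path).read_text().splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        settings[key.strip()] = value.strip().strip("'\"")
    return settings


def missing_settings(settings, required):
    """Return the required keys that are absent or empty."""
    return [key for key in required if not settings.get(key)]


# Demo against a throwaway file; EXAMPLE_* keys are placeholders, not real settings.
with tempfile.TemporaryDirectory() as tmp:
    env_path = Path(tmp) / ".env"
    env_path.write_text(
        "# placeholder keys\n"
        "EXAMPLE_HOST_URL='http://127.0.0.1:8529'\n"
        "EXAMPLE_PASSWORD=\n"
    )
    settings = read_env_file(env_path)
    print(missing_settings(settings, ["EXAMPLE_HOST_URL", "EXAMPLE_PASSWORD"]))
```
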

Run

python3 arango_cve_processor.py \
    MODE \
    [MODE OPTIONS]

The following modes are available:

  • cve-cwe
    • links vulnerability objects to CWE objects
  • cve-capec (requires cve-cwe to be run first)
    • links vulnerability objects to CAPEC objects
  • cve-attack (requires cve-capec to be run first)
    • links vulnerability objects to ATT&CK objects
  • cve-epss
    • creates/updates report objects linked to CVEs, representing one or more EPSS scores for the time range run
  • cve-kev (requires cve-cwe to be run first)
    • creates/updates report objects linked to CVEs, representing CISA KEV data
  • cve-vulncheck-kev (requires cve-cwe to be run first)
    • creates/updates report objects linked to CVEs, representing VulnCheck KEV data
  • cpematch
    • creates/updates grouping objects (and linked software objects) representing CPE Matches tied to CPEs
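
Because some modes rely on others having been run first, it can help to work out a safe run order before starting. The sketch below encodes the dependencies from the list above and uses the standard library's `graphlib` (Python 3.9+) to order them. The `MODE_DEPENDENCIES` mapping and `run_order` helper are names made up for this example; the script does not expose them.

```python
from graphlib import TopologicalSorter

# mode -> modes that must be run first, as stated in the list above.
MODE_DEPENDENCIES = {
    "cve-cwe": set(),
    "cve-capec": {"cve-cwe"},
    "cve-attack": {"cve-capec"},
    "cve-epss": set(),
    "cve-kev": {"cve-cwe"},
    "cve-vulncheck-kev": {"cve-cwe"},
    "cpematch": set(),
}


def run_order(wanted):
    """Return the wanted modes plus their prerequisites, in a valid run order."""
    needed = set()
    stack = list(wanted)
    while stack:
        mode = stack.pop()
        if mode not in needed:
            needed.add(mode)
            stack.extend(MODE_DEPENDENCIES[mode])
    # TopologicalSorter expects a mapping of node -> predecessors.
    graph = {mode: MODE_DEPENDENCIES[mode] for mode in needed}
    return list(TopologicalSorter(graph).static_order())


print(run_order(["cve-attack"]))
# ['cve-cwe', 'cve-capec', 'cve-attack']
```
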

Options vary by mode; however, the following are available in all modes:

  • --database (required): the ArangoDB database name where the objects you want to link are found. It must contain the collections nvd_cve_vertex_collection and nvd_cve_edge_collection
  • --ignore_embedded_relationships (optional): boolean, if true is passed, no embedded relationships will be generated. Default is false. Embedded relationships are a stix2arango feature in which STIX SROs are also created for the _ref and _refs properties inside each object (e.g. if a _ref property is identity--1234, an SRO is created between the object with the _ref property and identity--1234). See the stix2arango docs for more detail if required; this option is essentially a wrapper for the same --ignore_embedded_relationships setting implemented by stix2arango
  • --ignore_embedded_relationships_sro (optional): boolean, if true is passed, no embedded relationships will be generated from SRO objects (type = relationship). Default is false
  • --ignore_embedded_relationships_smo (optional): boolean, if true is passed, no embedded relationships will be generated from SMO objects (type = marking-definition, extension-definition, language-content). Default is false
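
To illustrate what embedded relationships are and what the SRO and SMO flags switch off, here is a simplified sketch. It is not stix2arango's implementation: the `embedded_relationships` function is hypothetical, the generated objects omit properties such as created and modified, and using the property name as the relationship type is an assumption.

```python
import uuid

SMO_TYPES = {"marking-definition", "extension-definition", "language-content"}


def embedded_relationships(obj, ignore_sro=False, ignore_smo=False):
    """Yield one simplified relationship per value found in *_ref / *_refs properties."""
    if ignore_sro and obj["type"] == "relationship":
        return
    if ignore_smo and obj["type"] in SMO_TYPES:
        return
    for key, value in obj.items():
        if key.endswith("_ref"):
            targets = [value]
        elif key.endswith("_refs"):
            targets = list(value)
        else:
            continue
        for target in targets:
            yield {
                "type": "relationship",
                "spec_version": "2.1",
                "id": f"relationship--{uuid.uuid4()}",
                "relationship_type": key,  # assumption: the real naming may differ
                "source_ref": obj["id"],
                "target_ref": target,
            }


# Shortened, illustrative IDs (real STIX IDs use full UUIDs).
vulnerability = {
    "type": "vulnerability",
    "id": "vulnerability--1234",
    "created_by_ref": "identity--1234",
    "object_marking_refs": ["marking-definition--a", "marking-definition--b"],
}
existing_sro = {
    "type": "relationship",
    "id": "relationship--5678",
    "source_ref": "vulnerability--1234",
    "target_ref": "weakness--9999",
}

print(len(list(embedded_relationships(vulnerability))))                  # 3
print(len(list(embedded_relationships(existing_sro))))                   # 2
print(len(list(embedded_relationships(existing_sro, ignore_sro=True))))  # 0
```

The last two calls show why a separate SRO flag is useful: a relationship object itself carries source_ref and target_ref, so it would otherwise produce further embedded relationships.
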

To see the options available for each mode you can run with the help flag (-h), e.g.,

python3 arango_cve_processor.py \
  cve-epss -h
python3 arango_cve_processor.py \
  cve-cwe -h
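
The mode-plus-options layout maps naturally onto argparse sub-commands, which is also what makes a per-mode -h possible. The sketch below shows one way such a command line could be wired up. It is an assumption about the shape of the interface, not the script's actual parser, and it only includes options that appear in this README.

```python
import argparse


def str_to_bool(value):
    """Convert 'true' / 'false' style strings into booleans."""
    lowered = value.lower()
    if lowered in {"true", "1", "yes"}:
        return True
    if lowered in {"false", "0", "no"}:
        return False
    raise argparse.ArgumentTypeError(f"expected true or false, got {value!r}")


def build_parser():
    # Options shared by every mode live on a parent parser.
    common = argparse.ArgumentParser(add_help=False)
    common.add_argument("--database", required=True)
    for flag in (
        "--ignore_embedded_relationships",
        "--ignore_embedded_relationships_sro",
        "--ignore_embedded_relationships_smo",
    ):
        common.add_argument(flag, type=str_to_bool, default=False)

    parser = argparse.ArgumentParser(prog="arango_cve_processor.py")
    modes = parser.add_subparsers(dest="mode", required=True)

    # Only two modes are sketched here; each adds its own options.
    cve_cwe = modes.add_parser("cve-cwe", parents=[common])
    cve_cwe.add_argument("--modified_min")

    cve_epss = modes.add_parser("cve-epss", parents=[common])
    cve_epss.add_argument("--start_date")
    cve_epss.add_argument("--end_date")
    return parser


args = build_parser().parse_args([
    "cve-epss",
    "--database", "vulmatch_database",
    "--start_date", "2024-01-01",
    "--ignore_embedded_relationships", "true",
])
print(args.mode, args.database, args.ignore_embedded_relationships)
```
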

Examples

Process CVE -> CWE relationships for all CVEs modified after 2024-02-01

python3 arango_cve_processor.py \
  cve-cwe \
  --database vulmatch_database \
  --modified_min 2024-02-01 \
  --ignore_embedded_relationships true \
  --ignore_embedded_relationships_sro true \
  --ignore_embedded_relationships_smo true

Get all EPSS scores for CVEs for each day in 2024

python3 arango_cve_processor.py \
  cve-epss \
  --database vulmatch_database \
  --start_date 2024-01-01 \
  --end_date 2024-12-31 \
  --ignore_embedded_relationships true \
  --ignore_embedded_relationships_sro true \
  --ignore_embedded_relationships_smo true

Update all CPE Matches modified after 2024-02-01

python3 arango_cve_processor.py \
  cpematch \
  --database vulmatch_database \
  --updated_after 2024-02-01 \
  --ignore_embedded_relationships true \
  --ignore_embedded_relationships_sro true \
  --ignore_embedded_relationships_smo true
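
For long time ranges you may prefer to run cve-epss in smaller chunks, for example one month at a time. The sketch below only builds and prints the commands, reusing the flags from the EPSS example above. The `month_ranges` and `epss_command` helpers are made up for this example, and treating --end_date as inclusive is an assumption.

```python
import calendar
import shlex
import sys
from datetime import date


def month_ranges(year):
    """Yield (first_day, last_day) for each month of the given year."""
    for month in range(1, 13):
        last_day = calendar.monthrange(year, month)[1]
        yield date(year, month, 1), date(year, month, last_day)


def epss_command(database, start, end):
    """Build the argument list for one cve-epss run."""
    return [
        sys.executable, "arango_cve_processor.py",
        "cve-epss",
        "--database", database,
        "--start_date", start.isoformat(),
        "--end_date", end.isoformat(),  # assumption: the end date is inclusive
    ]


commands = [epss_command("vulmatch_database", start, end) for start, end in month_ranges(2024)]
for command in commands:
    print(shlex.join(command))

# To actually execute a command, pass it to subprocess.run(command, check=True).
```
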

Backfilling data

stix2arango contains a set of utility scripts that can be used to backfill all the data sources required by this script.

How it works

If you would like to know how the logic of this script works in detail, please consult the /docs directory.

Useful supporting tools

Support

Minimal support provided via the DOGESEC community.

License

Apache 2.0.

