Skip to main content

stix2arango is a command line tool that takes a group of STIX 2.1 objects in a bundle and inserts them into ArangoDB. It can also handle updates to existing objects in ArangoDB imported in a bundle.

Project description

stix2arango

codecov

Overview

stix2arango is a command line tool that takes a group of STIX 2.1 objects in a bundle and inserts them into ArangoDB. It can also handle updates to existing objects in ArangoDB imported in a bundle.

  1. STIX 2.1 bundle entered
  2. User chooses database/collection names (stix2arango creates as needed)
  3. stix2arango inserts objects (or updates them) and then generates any relationships between them

tl;dr

stix2arango

Watch the demo.

Usage

Install the script

# clone the latest code
git clone https://github.com/muchdogesec/stix2arango
# create a venv
cd stix2arango
python3 -m venv stix2arango-venv
source stix2arango-venv/bin/activate
# install requirements
pip3 install .

Note, the installation assumes ArangoDB is already installed locally.

You can install ArangoDB here. stix2arango is compatible with both the Enterprise and Community versions.

Configuration options

stix2arango has various settings that are defined in an .env file.

To create a template for the file:

cp .env.example .env

To see more information about how to set the variables, and what they do, read the .env.markdown file.

Run

python3 stix2arango.py \
	--file PATH/TO/STIX.json \
	--database NAME \
	--collection NAME \
	--stix2arango_note SOMETHING \
	--ignore_embedded_relationships BOOLEAN

Where;

  • --file (required): is the path to the valid STIX 2.1 bundle .json file
  • --database (required): is the name of the Arango database the objects should be stored in.
  • --create_db (default true): If database does not exist, stix2arango will create it. You can set to false to stop this behaviour (and avoid the risk of incorrect DBs being created). Generally setting to false is a good idea if you know the databases exist. This setting will only work if the Arango user being used to authenticate has permissions to create new databases.
  • --collection (required): is the name of the Arango collection in the database specified the objects should be stored in. If the collection does not exist, stix2arango will create it
  • --stix2arango_note (optional): Will be stored under the _stix2arango_note custom attribute in ArangoDB. Useful as can be used in AQL. a-z characters only. Max 24 chars.
  • --ignore_embedded_relationships (optional, boolean): if true passed, this will stop ANY embedded relationships from being generated. This applies for all object types (SDO, SCO, SRO, SMO). If you want to target certain object types see ignore_embedded_relationships_sro and ignore_embedded_relationships_sro flags. Default isfalse`
  • --ignore_embedded_relationships_sro (optional, boolean): if true passed, will stop any embedded relationships from being generated from SRO objects (type = relationship). Default is false
  • --ignore_embedded_relationships_smo (optional, boolean): if true passed, will stop any embedded relationships from being generated from SMO objects (type = marking-defirnition, extension-definition, language-content). Default is false
  • --include_embedded_relationships_attributes (optional, stix _ref or _refs attribute): if you only want to create embedded relationships from certain keys (attributes) in a STIX object you can pass a list of attributes here. e.g. object_refs created_by_ref . In this example, embedded relationships to all objects listed in object_refs and objects in created_by_ref will be created between source (the objects that house these attibutes) and destinations (the objects listed as values for these attributes)
  • --is_large_file (pass flag): Use this mode when the bundle is very large (>100mb), this will chunk the input into multiple files before loading into memory.

For example, using the MITRE ATT&CK Enterprise bundle;

python3 stix2arango.py \
	--file cti_knowledge_base_store/mitre-attack-enterprise/enterprise-attack-15_1.json \
	--database stix2arango_demo \
	--collection demo_1 \
	--stix2arango_note v15.1 \
	--ignore_embedded_relationships true \
	--is_large_file

If you want to include embedded relationships as edges in the ArangoDB collection, you would run;

python3 stix2arango.py \
	--file cti_knowledge_base_store/mitre-attack-enterprise/enterprise-attack-15_1.json \
	--database stix2arango_demo \
	--collection demo_2 \
	--stix2arango_note v15.1 \
	--ignore_embedded_relationships false \
	--is_large_file

If you want to include embedded relationships for created_by_ref and object_marking_refs attibutes collection, you would run;

python3 stix2arango.py \
	--file cti_knowledge_base_store/mitre-attack-enterprise/enterprise-attack-15_1.json \
	--database stix2arango_demo \
	--collection demo_2 \
	--stix2arango_note v15.1 \
	--include_embedded_relationships_attributes object_refs created_by_ref \
	--is_large_file

A note on embedded relationships

stix2arango can handle all embedded references to other STIX objects under _ref and _refs properties in a STIX object when --ignore_embedded_relationships is set to false.

e.g.

        {
            "type": "x-mitre-tactic",
            "spec_version": "2.1",
            "id": "x-mitre-tactic--b977ad29-eb0c-5f09-bb2f-6d3f23e2a175",
            "created_by_ref": "identity--8700e156-6ce9-5090-8589-f9d0aef7bdb7",

For the above object a STIX SRO would be generated by stix2arango with source_ref=x-mitre-tactic--b977ad29-eb0c-5f09-bb2f-6d3f23e2a175, target_ref=identity--8700e156-6ce9-5090-8589-f9d0aef7bdb7 and relationship_type=created-by.

Quickstart

We store a lot of STIX data from popular knowledgebases on Cloudflare R2.

This is a useful resource to quickly populate STIX data using stix2arango if you want to see what it can do.

To populate this data for yourself, check out utilities/README.md for import scripts.

Useful supporting tools

Support

Minimal support provided via the DOGESEC community.

License

Apache 2.0.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

stix2arango-1.1.9.tar.gz (5.3 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

stix2arango-1.1.9-py3-none-any.whl (23.7 kB view details)

Uploaded Python 3

File details

Details for the file stix2arango-1.1.9.tar.gz.

File metadata

  • Download URL: stix2arango-1.1.9.tar.gz
  • Upload date:
  • Size: 5.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for stix2arango-1.1.9.tar.gz
Algorithm Hash digest
SHA256 91161797d929aa9e19d08b2e9cd6e81eea005ae676c137c2d6b6fc783b1ed7d3
MD5 79f219bc146d91067eaf7094866729a3
BLAKE2b-256 0e372817e4d5417f5eb4b608311f5e1d3ce6d1f2c50511b92eb2ca7b3b39b5d2

See more details on using hashes here.

Provenance

The following attestation bundles were made for stix2arango-1.1.9.tar.gz:

Publisher: create-release.yml on muchdogesec/stix2arango

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file stix2arango-1.1.9-py3-none-any.whl.

File metadata

  • Download URL: stix2arango-1.1.9-py3-none-any.whl
  • Upload date:
  • Size: 23.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for stix2arango-1.1.9-py3-none-any.whl
Algorithm Hash digest
SHA256 eb074aad0a93fe61dd7bbae1af8a48e26015a688d82727e6af0cbbd3f71960d7
MD5 034b32151afb0d963e9e1d6430af78eb
BLAKE2b-256 77c22d8797469b1a5285ad0519dea098538985f74f0f122222b462ac831a171b

See more details on using hashes here.

Provenance

The following attestation bundles were made for stix2arango-1.1.9-py3-none-any.whl:

Publisher: create-release.yml on muchdogesec/stix2arango

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page