Skip to main content

This python tools helps managing DBMS benchmarking experiments in a Kubernetes-based HPC cluster environment. It enables users to configure hardware / software setups for easily repeating tests over varying configurations.

Project description

Maintenance GitHub release PyPI version .github/workflows/draft-pdf.yml Documentation Status

Benchmark Experiment Host Manager (Bexhoma)

Orchestrating cloud-native DBMS benchmarking experiments on Kubernetes.

Bexhoma manages benchmark experiments of Database Management Systems (DBMS) in a Kubernetes-based cluster environment. It lets you configure hardware and software setups and repeat tests across varying configurations — all with a single command.

Benchmarks: YCSB · TPC-H · TPC-DS · TPC-C (HammerDB and Benchbase)

Tested DBMSs: Citus (Hyperscale), Clickhouse, CockroachDB, Dragonfly, Exasol, IBM DB2, MariaDB, MariaDB Columnstore, MemSQL (SingleStore), MonetDB, MySQL, OmniSci (HEAVY.AI), Oracle DB, PostgreSQL, SQL Server, SAP HANA, TiDB, TimescaleDB, Vertica, YugabyteDB

Tested cloud platforms: Amazon Web Services · Google Cloud · Microsoft Azure · IBM Cloud · Oracle Cloud · Minikube

Bexhoma acts as the orchestrator [2] for distributed parallel benchmarking experiments. The basic workflow is: start a containerised DBMS, deploy monitoring, import data, run benchmarks, and shut everything down — with a single command. A more advanced workflow plans a sequence of such experiments, runs them as a batch, and joins results for comparison.

Bexhoma also supports scaling out data-loading and benchmarking drivers to simulate cloud-native environments, as described in [4]. See example results from A Cloud-Native Adoption of Classical DBMS Performance Benchmarks and Tools.

Documentation · Issue tracker

Installation

  1. Install the package:

    pip install bexhoma
    

    This requires Java and build tools.

  2. Ensure kubectl is installed and configured for a running Kubernetes cluster (e.g. Minikube) with dynamic PVC provisioning.

  3. Copy and adjust the cluster configuration:

    cp k8s-cluster.config cluster.config
    

    Set the context name, namespace, and cluster name. Ensure resultfolder points to an existing directory on your local filesystem. See the full configuration guide.

  4. Optionally adjust the shared volume manifests before the first run:

    All other components (monitoring, message queue, evaluator) are deployed automatically on first use.

Optional: Install a conda environment: conda create -n "bexhoma" python=3.14.2 ipython

Optional: Install Jupyter for notebooks: conda install -c conda-forge jupyter

Quickstart

Each command below installs PostgreSQL, loads data, runs the benchmark, and prints a summary. Monitor progress with bexperiments status while running.

YCSB

bexhoma ycsb run -ms 1 -tr -sf 1 --workload a -dbms PostgreSQL \
  -tb 16384 -nlp 1 -nlt 64 -nlf 4 -nbp 1,8 -nbt 64 -nbf 2,3

Loads 1 million rows, then runs workload A (50% reads / 50% updates, 1 million operations) targeting 32,768 ops/s (16,384 × 2) and 49,152 ops/s (16,384 × 3). Each target is tested with 64 driver threads as a single pod and again split across 8 pods.

See more details at https://bexhoma.readthedocs.io/en/latest/Example-YCSB.html

TPC-C (HammerDB)

bexhoma hammerdb run -ms 1 -tr -sf 16 -sd 5 -dbms PostgreSQL \
  -nlt 16 -nbp 1,2 -nbt 16

Loads a TPC-C schema with 16 warehouses using 16 loader threads, then benchmarks for 5 minutes with 16 virtual users — first in a single pod, then split across 2 pods.

See more details at https://bexhoma.readthedocs.io/en/latest/Example-HammerDB.html

TPC-C (Benchbase)

bexhoma benchbase run -ms 1 -tr -sf 16 -sd 5 -dbms PostgreSQL \
  -nbp 1,2 -nbt 16 -nbf 16 -tb 1024

Loads a TPC-C schema with 16 warehouses, then benchmarks for 5 minutes targeting 16,384 ops/s (1,024 × 16) with 16 threads — first in a single pod, then split across 2 pods.

See more details at https://bexhoma.readthedocs.io/en/latest/Example-Benchbase.html

TPC-H

bexhoma tpch run -ms 1 -dbms PostgreSQL

Loads 1 GB of TPC-H data (scale factor 1) and runs all 22 decision-support queries with a single client.

See more details at https://bexhoma.readthedocs.io/en/latest/Example-TPC-H.html

TPC-DS

bexhoma tpcds run -ms 1 -dbms PostgreSQL

Loads 1 GB of TPC-DS data (scale factor 1) and runs all 99 decision-support queries with a single client.

See more details at https://bexhoma.readthedocs.io/en/latest/Example-TPC-DS.html

Architecture

Bexhoma deploys Prometheus for monitoring and cAdvisor for resource metrics alongside every DBMS container. For analytical benchmarks the dbmsbenchmarker package [3] is used as query executor and evaluator [1,2]. For transactional workloads, HammerDB, Benchbase, and YCSB act as drivers [4].

See the images/ folder for more details.

Contributing

We welcome contributions of all kinds — code, documentation, bug reports, and more. Please first read the contribution guide.

If you find a bug, please open an issue on the issue tracker and include:

  • Operating system and architecture
  • Python version
  • Bexhoma version (or git commit / date)
  • Full traceback

References

If you use Bexhoma in work contributing to a scientific publication, please cite [2] or [1]:

[1] Erdelt P.K. (2021). A Framework for Supporting Repetition and Evaluation in the Process of Cloud-Based DBMS Performance Benchmarking. TPCTC 2020, LNCS 12752, Springer.
https://doi.org/10.1007/978-3-030-84924-5_6

[2] Erdelt P.K. (2022). Orchestrating DBMS Benchmarking in the Cloud with Kubernetes. TPCTC 2021, LNCS 13169, Springer.
https://doi.org/10.1007/978-3-030-94437-7_6

[3] Erdelt P.K., Jestel J. (2022). DBMS-Benchmarker: Benchmark and Evaluate DBMS in Python. JOSS 7(79), 4628.
https://doi.org/10.21105/joss.04628

[4] Erdelt P.K. (2024). A Cloud-Native Adoption of Classical DBMS Performance Benchmarks and Tools. TPCTC 2023, LNCS 14247, Springer.
https://doi.org/10.1007/978-3-031-68031-1_9

[5] Erdelt P.K., Rabl T. (2026). Benchmarking Multi-Tenant Architectures in PostgreSQL. EDBT 2026.
https://doi.org/10.48786/edbt.2026.46

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bexhoma-0.9.11.tar.gz (178.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bexhoma-0.9.11-py3-none-any.whl (210.7 kB view details)

Uploaded Python 3

File details

Details for the file bexhoma-0.9.11.tar.gz.

File metadata

  • Download URL: bexhoma-0.9.11.tar.gz
  • Upload date:
  • Size: 178.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for bexhoma-0.9.11.tar.gz
Algorithm Hash digest
SHA256 0951d88701c759639d574d90092db7a6e55f29415e9ba57f21851d49f52f4b63
MD5 3d3eefbb4ef0dcf338fd901cf416acbe
BLAKE2b-256 6f3a6224f782ac78a8f82a561f6f3f03926a6ac76544d823cf19fd69f58e59ae

See more details on using hashes here.

File details

Details for the file bexhoma-0.9.11-py3-none-any.whl.

File metadata

  • Download URL: bexhoma-0.9.11-py3-none-any.whl
  • Upload date:
  • Size: 210.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for bexhoma-0.9.11-py3-none-any.whl
Algorithm Hash digest
SHA256 99e4b339461fbb968651198c3bdfb3a8d6cdc4730c0fc21d9630113744fbff73
MD5 72dafc248b8b5b9b6ac4dddba6f2b816
BLAKE2b-256 a27bc3fda61801a888392bae4a7465b3d8f322b96557996e199c1765f9547e55

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page