Skip to main content

An experiment framework for Root Cause Analysis

Project description

rcabench-platform

An experiment framework for Root Cause Analysis (RCA), supporting fast development of RCA algorithms and their evaluation on various datasets.

Specifications

Development Guide

Requirements

Operating System

This project is primarily developed and tested on Ubuntu 24.04 LTS or later versions. Other Linux distributions and macOS environments should be compatible with minimal configuration adjustments.

Windows is not officially supported. While some functionality may work in Windows environments (especially through WSL), we cannot guarantee full compatibility or provide dedicated support.

Toolchain

Toolchain Version
uv ^0.7.13
just ^1.21.0
Docker Engine *
Docker Compose *

IDE

Recommended setup

Git

Commit Message

We follow the Conventional Commits specification.

Branching Strategy

When you are developing a new feature, create a new branch from main and name it according to the following convention:

<your github id>/feat/<feature-name>

This branch prefixed with your github id, is your own working branch. You can force-push to it freely. Do anything you want in this branch.

When you are done with the feature, create a pull request to main and invite other developers to review your code. If the code is approved, it will be merged into main. Then you can start a new branch from main and continue your work.

The main branch is the default branch for this repository. main is protected and should not be used for development. Before merging any changes into main, ensure that the following conditions are met:

  • The branch is up to date with main.
  • The branch is free of merge conflicts.
  • The basic checks passed successfully.
  • The changes will not break other developers' workflow.

We requires a linear commit history. Please use git rebase to keep your branch up to date with main. When merging your branch into main, use git merge --ff-only to ensure a fast-forward merge. This will keep the commit history clean and linear.

Workflow

Download source code

git clone git@github.com:LGU-SE-Internal/rcabench-platform.git
cd rcabench-platform

Run basic checks

just dev

If the basic checks pass, then your python environment is ready for development.

Local development services

docker compose up -d
docker compose down

We have the following localhost services running in the background:

  • neo4j: for graph visualization

Link datasets

Mount JuiceFS to your machine:

sudo juicefs mount redis://10.10.10.38:6379/1 /mnt/jfs -d --cache-size=1024

See infra/README.md for more details.

Link the datasets to the project directory:

mkdir -p data
cd data
ln -s /mnt/jfs/rcabench-platform-v2 ./

Docker image

rcabench-platform:

./scripts/docker.sh build
./scripts/docker.sh run
./scripts/docker.sh push

clickhouse_dataset:

cd docker/clickhouse_dataset
./cli.sh build
./cli.sh push

Commands

Self test

Test if the environment is set up correctly:

./main.py self test

Prepare inputs

./cli/prepare_inputs.py --help

Make rcaeval

./cli/make_rcaeval.py --help

Make rcabench

./cli/make_rcabench.py --help

Example calls:

mkdir -p /dev/shm/make
TMP=/dev/shm/make LOGURU_COLORIZE=0 POLARS_MAX_THREADS=16 ./cli/make_rcabench.py run --parallel=8 >temp/a.log 2>&1

The example call runs 8 parallel processes with 16 polars threads each, using memory storage as the temporary directory. It is tested on a VM with 128 cores and 192 GiB of RAM.

./cli/make_rcabench.py make-filtered
./cli/make_rcabench.py make-with-issues
./cli/make_rcabench.py merge-conclusion
./cli/make_rcabench.py query-fault-types rcabench
./cli/make_rcabench.py query-fault-types rcabench_filtered
./cli/make_rcabench.py query-fault-types rcabench_with_issues

Evaluation

./main.py eval --help
./main.py eval show-algorithms
./main.py eval show-datasets

Example calls:

./main.py eval single traceback-A7 rcabench_filtered ts3-ts-route-plan-service-request-delay-59s2q4
LOGURU_LEVEL=INFO ./main.py eval batch -d rcaeval_re2_tt -a random -a baro -a nsigma -a traceback-A7 --use-cpus=112 --clear >temp/a.log 2>&1
LOGURU_LEVEL=INFO ./main.py eval batch -d rcabench_filtered -a random -a baro -a nsigma -a traceback-A7 --use-cpus=112 --clear >temp/a.log 2>&1
./main.py eval perf-report rcaeval_re2_tt
./main.py eval perf-report rcabench_filtered

Notebooks

Edit the SDG Visualization notebook:

./notebooks/sdg.py

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rcabench_platform-0.2.3.tar.gz (43.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rcabench_platform-0.2.3-py3-none-any.whl (65.8 kB view details)

Uploaded Python 3

File details

Details for the file rcabench_platform-0.2.3.tar.gz.

File metadata

  • Download URL: rcabench_platform-0.2.3.tar.gz
  • Upload date:
  • Size: 43.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.7.13

File hashes

Hashes for rcabench_platform-0.2.3.tar.gz
Algorithm Hash digest
SHA256 ec7459ee6893d0cbb42ef333f77fded91cd5091f1a23a1a862cc7b2650e56862
MD5 0d42f55dd2055b2b82dac58f58f159ac
BLAKE2b-256 62c0e563c01ac3000b3fbf38b27e083012454c0fdcad520f85fb203ce0159771

See more details on using hashes here.

File details

Details for the file rcabench_platform-0.2.3-py3-none-any.whl.

File metadata

File hashes

Hashes for rcabench_platform-0.2.3-py3-none-any.whl
Algorithm Hash digest
SHA256 98968c5afb5de7c5036dd0aada6d828d49797958707a8d213fcc33992d5086b3
MD5 d20c13e5ef7c9993a60f3344728efa78
BLAKE2b-256 0d420e62f065f56a72387b058a0b7b947e5c9c882603bb9841490455cbd8f51c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page