A package to extract the causal graph from continuous tabular data.

These details have not been verified by PyPI

Project links

Homepage

Project description

logo

CausalExplain - A library to infer causal-effect relationships from tabular data

'CausalExplain' is a library that implements methods to extract the causal graph, from tabular data, specifically the ReX method, and other compared methods like GES, PC, FCI, LiNGAM, CAM, and NOTEARS.

This repository contains the implementation of ReX and all necessary tools to reproduce the results presented in our accompanying paper. ReX supports diverse data generation processes, including non-linear and additive noise models, and has demonstrated robust performance on synthetic and real-world datasets.

About ReX

ReX is a causal discovery method that leverages machine learning (ML) models coupled with explainability techniques, specifically Shapley values, to identify and interpret significant causal relationships among variables. Comparative evaluations on synthetic datasets comprising tabular data reveal that ReX outperforms state-of-the-art causal discovery methods across diverse data generation processes, including non-linear and additive noise models. Moreover, ReX was tested on the Sachs single-cell protein-signaling dataset, achieving a precision of 0.952 and recovering key causal relationships with no incorrect edges. Taking together, these results showcase ReX's effectiveness in accurately recovering true causal structures while minimizing false positive predictions, its robustness across diverse datasets, and its applicability to real-world problems. By combining ML and explainability techniques with causal discovery, ReX bridges the gap between predictive modeling and causal inference, offering an effective tool for understanding complex causal structures.

ReX Schema

Our experimental results, conducted on five families of synthetic datasets with varying complexity, demonstrate that REX consistently recovers true causal relationships with high precision while minimizing false positives and orientation errors, comparing favorably to existing methods. Additionally, REX was tested on the Sachs single-cell protein-signaling dataset (Sachs et al., 2005), achieving a competitive performance with no false positives and recovering important causal relationships. This further validates the applicability of REX to real-world datasets, highlighting its robustness across different types of data.

Prerequisites without Docker

Operating System: Linux or macOS
Environment Manager: PyEnv or Conda
Programming Language: Python 3.10.12 or higher
Hardware: CPU

Installation

The project can be installed using pip:

$ pip install causalexplain

Data

The datasets used in the paper and the examples can be generated using the generators module, which is also part of this library. In case you want to reproduce results from the articles that we used as reference, you can find the datasets in the data folder.

Executing `causalexplain`

Option 1: Command Line

To run causalexplain on your data, you can use the causalexplain command:

$ python -m causalexplain
   ___                      _                 _       _
  / __\__ _ _   _ ___  __ _| | _____  ___ __ | | __ _(_)_ __
 / /  / _` | | | / __|/ _` | |/ _ \ \/ / '_ \| |/ _` | | '_ \
/ /__| (_| | |_| \__ \ (_| | |  __/>  <| |_) | | (_| | | | | |
\____/\__,_|\__,_|___/\__,_|_|\___/_/\_\ .__/|_|\__,_|_|_| |_|
                                       |_|
usage: causalexplain [-h] -d DATASET [-m {rex,pc,fci,ges,lingam,cam,notears}]
                   [-t TRUE_DAG] [-l LOAD_MODEL] [-T THRESHOLD] [-u UNION]
                   [-i ITERATIONS] [-b BOOTSTRAP] [-r REGRESSOR] [-S SEED]
                   [-s [SAVE_MODEL]] [-n] [-v] [-q] [-o OUTPUT]

that will present you with a menu to choose the dataset you want to use, the method you want to use to infer the causal graph, and the hyperparameters you want to use.

The minimum required to run causalexplain is a dataset file in CSV format, with the first row containing the names of the variables, and the rest of the rows containing the values of the variables. The method selected by default is ReX, but you can also choose between PC, FCI, GES, LiNGAM, CAM, NOTEARS. At the end of the execution, the edges of the plausible causal graph will be displayed along with the metrics obtained, if the true dag is provided (argument -t).

Option 2: Notebook

In case you want to run causalexplain from your code in a notebook, you can use the GraphDiscovery class. The following example shows how to use the GraphDiscovery class to train a model on a dataset using ReX method:

from causalexplain import GraphDiscovery

experiment = GraphDiscovery(
   experiment_name='my_experiment',
   model_type='rex',
   csv_filename='data.csv',
   dot_filename='true_graph.dot')

# Run the experiments
experiment.run()

To load a model from a file, you can use the load method of the GraphDiscovery class:

from causalexplain import GraphDiscovery

experiment = GraphDiscovery()
experiment.load("/path/to/model.pkl")

This can be useful if you want to train a model on a dataset and then use it to predict causal graphs on other datasets, or train a model on different batches.

Once a model has been trained or loaded, you can plot the resulting DAG, save the trained model to a file, or export the predicted causal graph to a DOT file.

# Plot the resulting DAG
experiment.plot()

# Save the trained model to a file
experiment.save("/path/to/model.pkl")

To export the predicted causal graph to a DOT file, you can use the export method of the GraphDiscovery class:

experiment.export("/path/to/my_predicted_graph.dot")

Output

The output of causalexplain is typically a graph with the edges of the plausible causal graph and the metrics obtained from the evaluation of the causal graph against the true DAG. These results are printed to the console, unless the '-o' option is specified, in which case the DAG is saved to a file in DOT format. Metrics are printed only if the true DAG is provided.

Example CLI commands

The following command illustrates how to run causalexplain on the toy dataset using the ReX method:

$ python -m causalexplain -d /path/to/toy_dataset.csv -t /path/to/toy_dataset.dot

The same command can be used to run causalexplain on the toy dataset using the CAM method:

$ python -m causalexplain -d /path/to/toy_dataset.csv -m cam -t /path/to/toy_dataset.dot

For more information on command line options, run causalexplain -h or go to the Quickstart section in the documentation.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.9.4

Apr 25, 2026

0.9.1

Jan 19, 2026

0.9.0

Jan 19, 2026

0.8.0

Jan 4, 2026

0.7.0

Dec 29, 2025

0.6.2

Nov 22, 2025

0.6.2b0 pre-release

Nov 22, 2025

0.6.1

May 12, 2025

0.6.1b0 pre-release

May 12, 2025

0.6.0

May 12, 2025

This version

0.5.4b0 pre-release

Dec 26, 2024

0.5.4a0 pre-release

Dec 26, 2024

0.5.3

Dec 18, 2024

0.5.2

Dec 18, 2024

0.5.1

Dec 18, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

causalexplain-0.5.4b0.tar.gz (216.5 kB view details)

Uploaded Dec 26, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

causalexplain-0.5.4b0-py3-none-any.whl (259.0 kB view details)

Uploaded Dec 26, 2024 Python 3

File details

Details for the file causalexplain-0.5.4b0.tar.gz.

File metadata

Download URL: causalexplain-0.5.4b0.tar.gz
Upload date: Dec 26, 2024
Size: 216.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.10.15

File hashes

Hashes for causalexplain-0.5.4b0.tar.gz
Algorithm	Hash digest
SHA256	`8bbeecef982720638486603313ba40d409b50157dbd1fc73d0b7c9a1cd00a0ff`
MD5	`ba706049f01c117bf78b5215e927db2b`
BLAKE2b-256	`1501bfaef0b14be03d53c5f4f376a88d00ed727decb96efa65be72a09b7036a3`

See more details on using hashes here.

File details

Details for the file causalexplain-0.5.4b0-py3-none-any.whl.

File metadata

Download URL: causalexplain-0.5.4b0-py3-none-any.whl
Upload date: Dec 26, 2024
Size: 259.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.10.15

File hashes

Hashes for causalexplain-0.5.4b0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cc298ab2157149c26572ef10c4da48b2f9511cb4e0d914559a90f558c5284722`
MD5	`d894c415a57db734d938a8927bb6768b`
BLAKE2b-256	`3a1ae78df0cacb58dac036efd47449c7b9b27f55d2fe828c66f2b938c08e2621`

See more details on using hashes here.

causalexplain 0.5.4b0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

CausalExplain - A library to infer causal-effect relationships from tabular data

About ReX

Prerequisites without Docker

Installation

Data

Executing `causalexplain`

Option 1: Command Line

Option 2: Notebook

Output

Example CLI commands

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

causalexplain 0.5.4b0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

CausalExplain - A library to infer causal-effect relationships from tabular data

About ReX

Prerequisites without Docker

Installation

Data

Executing causalexplain

Option 1: Command Line

Option 2: Notebook

Output

Example CLI commands

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Executing `causalexplain`