
The BASSA algorithm, as presented in the paper "Sparse Linear Bayesian Models for Organic Chemistry".

Project description

BASSA: Bayesian Analysis with Spike-and-Slab Arrangements

Overview

Most chemical datasets are small and high-dimensional, making deep learning impractical. Linear regression remains interpretable and effective, but feature selection is critical. Traditional methods pick a single “best” model, overlooking the fact that multiple plausible models may exist.

BASSA combines Bayesian spike-and-slab regression with a filtering method to efficiently discover and organize many valid regression models. This reveals diverse interpretations hidden in chemical data without overcommitting to a single solution.
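For intuition, a spike-and-slab prior places each coefficient either exactly at zero (the "spike") or in a broad Gaussian (the "slab"), with a Bernoulli indicator deciding whether the feature enters the model. A minimal sketch of drawing from such a prior, using hypothetical hyperparameters and not the package's own internals:

```python
import numpy as np

rng = np.random.default_rng(0)

n_features = 10
inclusion_prob = 0.3   # hypothetical prior inclusion probability
slab_std = 2.0         # hypothetical slab standard deviation

# Bernoulli indicator: is the feature included in the model?
gamma = rng.random(n_features) < inclusion_prob

# coefficient: exactly 0 ("spike") if excluded, Gaussian ("slab") if included
beta = np.where(gamma, rng.normal(0.0, slab_std, n_features), 0.0)

print(gamma.sum(), "features included")
print(beta)
```

A posterior sampler over this prior visits many different subsets of features, which is what lets BASSA collect and organize multiple plausible models rather than one.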


Installation

pip install bassa-reg

Example Use

# assuming the classes are importable from the top-level bassa_reg package (import path may differ)
import os
from bassa_reg import Bassa, SpikeAndSlabConfigurations, SpikeAndSlabPriors, SpikeAndSlabRegression

priors = SpikeAndSlabPriors()
config = SpikeAndSlabConfigurations(sampler_iterations=5000)
abs_dir = os.path.dirname(os.path.abspath(__file__))

# x_train and y_train are your feature matrix and target vector
regression = SpikeAndSlabRegression(x_train, y_train, priors, config,
                                    project_path=abs_dir, experiment_name="demo")
regression.run()

bassa = Bassa(model=regression)
bassa.run()

Results

After running both the spike-and-slab regression and BASSA, results are saved in the specified project directory.
The main output is the bassa_plot.png file, which represents the models chosen by BASSA.

This chart visualizes the different models found by BASSA, with their feature combinations and performance metrics.
Key additional outputs include:

Markov Chain Visualization

The Markov chain visualization shows how the sampler explores different models over iterations.
It is sorted by feature inclusion frequency, so the most commonly selected features appear first.
Precise feature inclusion frequencies are also provided in a separate file named feature_stats.csv.
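A file like feature_stats.csv can be inspected directly to rank features. A short sketch using an inline stand-in for the file; the actual column names in feature_stats.csv may differ:

```python
import csv
import io

# stand-in for the contents of feature_stats.csv (column names are assumed)
csv_text = """feature,inclusion_frequency
logP,0.91
dipole,0.64
homo_lumo_gap,0.12
"""

rows = list(csv.DictReader(io.StringIO(csv_text)))
rows.sort(key=lambda r: float(r["inclusion_frequency"]), reverse=True)

for r in rows:
    print(r["feature"], r["inclusion_frequency"])
```

To read the real file, replace the StringIO stand-in with `open(".../feature_stats.csv")`.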

Survival Process Plot

The survival plot, accompanied by the survival_table.csv file, illustrates the survival process of models over iterations.

This is an auxiliary output that helps understand how models persist or change and is used to generate the upset chart.

Additional Data

The meta_data.csv file records the Spike-and-Slab regression run: the number of iterations and other configuration details, along with metrics of the regression's performance on the training data.

Prediction on New Data

To make predictions on new data, create a TestSet object and pass it to the regression.

# assuming the classes are importable from the top-level bassa_reg package (import path may differ)
import os
from bassa_reg import SpikeAndSlabConfigurations, SpikeAndSlabPriors, SpikeAndSlabRegression, TestSet

priors = SpikeAndSlabPriors()
config = SpikeAndSlabConfigurations(sampler_iterations=5000)
abs_dir = os.path.dirname(os.path.abspath(__file__))

test_set = TestSet(x_test=x_test,
                   y_test=y_test,
                   samples_per_y=100,
                   iterations=200)

regression = SpikeAndSlabRegression(x_train,
                                    y_train,
                                    priors,
                                    config,
                                    test_set=test_set,
                                    project_path=abs_dir,
                                    experiment_name="demo")

regression.run()

The sampler will run an extra number of iterations, set by the iterations parameter of the TestSet object.
In each of these iterations, the sampler draws samples_per_y values of y for every sample in the test set.
The predicted value for each test sample is the average of these draws.
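The averaging step can be pictured as follows, using random stand-in draws rather than the sampler's actual output: each test point accumulates iterations × samples_per_y predictive draws, and the prediction is their mean.

```python
import numpy as np

rng = np.random.default_rng(1)

iterations = 200      # extra sampler iterations (TestSet.iterations)
samples_per_y = 100   # draws of y per iteration (TestSet.samples_per_y)
n_test = 5            # number of test samples

# stand-in for the posterior predictive draws collected by the sampler
y_draws = rng.normal(loc=3.0, scale=1.0, size=(iterations, samples_per_y, n_test))

# predicted value per test sample: average over all draws
y_pred = y_draws.mean(axis=(0, 1))
print(y_pred.shape)
```

With 200 × 100 = 20,000 draws per test point, the Monte Carlo error of each averaged prediction is small relative to the predictive spread.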

Continuing a Previous Run

To continue a previous run, first set save_samples=True on the SpikeAndSlabConfigurations object.
Then load the previous run with a SpikeAndSlabLoader and pass it to the new SpikeAndSlabRegression object.

# assuming the classes are importable from the top-level bassa_reg package (import path may differ)
import os
from bassa_reg import SpikeAndSlabConfigurations, SpikeAndSlabLoader, SpikeAndSlabPriors, SpikeAndSlabRegression

priors = SpikeAndSlabPriors()

config = SpikeAndSlabConfigurations(sampler_iterations=5000,
                                    save_meta_data=True,
                                    save_samples=True)

abs_dir = os.path.dirname(os.path.abspath(__file__))
regression = SpikeAndSlabRegression(x=x_train,
                                    y=y_train,
                                    priors=priors,
                                    config=config,
                                    project_path=abs_dir,
                                    experiment_name="example_run")

regression.run()

# load the saved experiment and continue sampling from where it stopped
loader = SpikeAndSlabLoader(path=f"{abs_dir}/{regression.full_experiment_name}")

regression = SpikeAndSlabRegression(x=x_train,
                                    y=y_train,
                                    priors=priors,
                                    config=config,
                                    project_path=abs_dir,
                                    experiment_name="example_run",
                                    load_experiment=loader)

regression.run()

Choosing Priors

TBD

BASSA Thresholds

TBD
