Data Analysis and Visualization using Bootstrap-Coupled Estimation.
DABEST-Python
Recent Version Update
On 22 March 2024, we officially released DABEST Version Ondeh (v2024.03.29). This new version provides several new features and includes performance improvements.
- New Paired Proportion Plot: This feature builds upon the existing proportional analysis capabilities by introducing advanced aesthetics and clearer visualization of changes in proportions between different groups, inspired by the informative nature of Sankey Diagrams. It’s particularly useful for studies that require detailed examination of how proportions shift in paired observations.
- Customizable Swarm Plot: Enhancements allow for tailored swarm plot aesthetics, notably the adjustment of swarm sides to produce asymmetric swarm plots. This customization enhances data representation, making visual distinctions more pronounced and interpretations clearer.
- Standardized Delta-delta Effect Size: We added a new metric akin to a Hedges’ g for delta-delta effect size, which allows comparisons between delta-delta effects generated from metrics with different units.
- Miscellaneous Improvements: This version also includes a broad range of miscellaneous enhancements, including bug fixes, bootstrapping speed improvements, new templates for raising issues, and updated unit tests. These improvements are designed to streamline the user experience, increase the software’s stability, and expand its versatility. By addressing user feedback and identified issues, DABEST continues to refine its functionality and reliability.
Contents
- About
- Installation
- Usage
- How to cite
- Bugs
- Contributing
- Acknowledgements
- Testing
- DABEST in other languages
About
DABEST is a package for Data Analysis using Bootstrap-Coupled ESTimation.
Estimation statistics is a simple framework that avoids the pitfalls of significance testing. It employs familiar statistical concepts such as means, mean differences, and error bars. More importantly, it focuses on the effect size of one’s experiment or intervention, rather than succumbing to a false dichotomy engendered by P values.
An estimation plot comprises two key features.
- It presents all data points as a swarm plot, ordering each point to display the underlying distribution.
- It illustrates the effect size as a bootstrap 95% confidence interval on a separate but aligned axis.
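To make the second feature concrete, here is a minimal sketch of a bootstrap 95% confidence interval for a mean difference, written with the Python standard library only. It uses the plain percentile method for simplicity, which is an assumption of this sketch and not necessarily how DABEST computes its intervals. The function name `bootstrap_mean_diff_ci`, its parameters, and the sample data are all hypothetical.

```python
import random
import statistics


def bootstrap_mean_diff_ci(control, test, resamples=5000, ci=95, seed=12345):
    """Percentile bootstrap CI for mean(test) - mean(control).

    Illustrative only: resamples each group with replacement, records the
    difference of means, and reads the CI bounds off the sorted differences.
    """
    rng = random.Random(seed)
    diffs = []
    for _ in range(resamples):
        c = rng.choices(control, k=len(control))  # resample with replacement
        t = rng.choices(test, k=len(test))
        diffs.append(statistics.fmean(t) - statistics.fmean(c))
    diffs.sort()
    alpha = (100 - ci) / 200  # tail probability on each side
    low = diffs[int(alpha * resamples)]
    high = diffs[int((1 - alpha) * resamples) - 1]
    observed = statistics.fmean(test) - statistics.fmean(control)
    return observed, low, high


# Made-up measurements for two independent groups.
control = [2.1, 2.5, 1.9, 2.3, 2.0, 2.4, 2.2, 2.6]
test = [3.0, 2.8, 3.4, 3.1, 2.9, 3.3, 3.2, 2.7]

diff, low, high = bootstrap_mean_diff_ci(control, test)
print(f"mean difference = {diff:.2f}, 95% CI [{low:.2f}, {high:.2f}]")
```

The interval, not a P value, is the quantity of interest here: it conveys both the size of the effect and the precision with which it was estimated.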
DABEST powers estimationstats.com, allowing everyone access to high-quality estimation plots.
Installation
This package is tested on Python 3.8 and later. We highly recommend the Anaconda distribution of Python, which makes the dependencies easy to obtain.
You can install this package via pip.
To install, at the command line run
pip install dabest
You can also clone this repo locally.
Then, navigate to the cloned repo in the command line and run
pip install .
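After either install route, one way to confirm that the package is visible to the current interpreter is to query its installed version through the standard library. This is a small sketch, not part of DABEST; the helper name `installed_version` is hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version string of `package`, or None if absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


found = installed_version("dabest")
if found:
    print(f"dabest {found} is installed")
else:
    print("dabest is not installed in this environment")
```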
Usage
import pandas as pd
import dabest
# Load the iris dataset. This step requires internet access.
iris = pd.read_csv("https://github.com/mwaskom/seaborn-data/raw/master/iris.csv")
# Load the above data into `dabest`.
iris_dabest = dabest.load(data=iris, x="species", y="petal_width",
                          idx=("setosa", "versicolor", "virginica"))
# Produce a Cumming estimation plot.
iris_dabest.mean_diff.plot();
Please refer to the official tutorial for more useful code snippets.
How to cite
Moving beyond P values: Everyday data analysis with estimation plots
Joses Ho, Tayfun Tumkaya, Sameer Aryal, Hyungwon Choi, Adam Claridge-Chang
Nature Methods 2019, 1548-7105. 10.1038/s41592-019-0470-3
Paywalled publisher site; Free-to-view PDF
Bugs
Please report any bugs on the issue page.
Contributing
All contributions are welcome; please read the Guidelines for contributing first.
We also have a Code of Conduct to foster an inclusive and productive space.
A wish list for new features
If you have specific comments or ideas for new features that you would like to share with us, please read the Guidelines for contributing, then create a new issue using the Feature request template or create a new post in our Google Group.
Acknowledgements
We would like to thank alpha testers from the Claridge-Chang lab: Sangyu Xu, Xianyuan Zhang, Farhan Mohammad, Jurga Mituzaitė, and Stanislav Ott.
Testing
To test DABEST, you need to install pytest and nbdev.
- Run pytest in the root directory of the source distribution. This runs the test suite in the folder dabest/tests/mpl_image_tests.
- Run nbdev_test in the root directory of the source distribution. This runs the value assertion tests in the folder dabest/tests.
The test suite ensures that the bootstrapping functions and the plotting functions perform as expected.
For detailed information, please refer to the test folder.
DABEST in other languages
DABEST is also available in R (dabestr) and Matlab (DABEST-Matlab).
File details
Details for the file dabest-2024.3.29.tar.gz.
File metadata
- Download URL: dabest-2024.3.29.tar.gz
- Upload date:
- Size: 75.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | 9fb51da4f5c8670bdd078a2b8b7094908508223b77764a9ba337cb5e75549666
MD5 | 12400cfa03f0222e55dfd48ee7329c21
BLAKE2b-256 | 70b202930f2ceb21351a7e608510cc598a03d5f78eb36aad48a0f731097b8949
File details
Details for the file dabest-2024.3.29-py3-none-any.whl.
File metadata
- Download URL: dabest-2024.3.29-py3-none-any.whl
- Upload date:
- Size: 77.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | aab609b8d1b7ec79ded9ddc3f83b439cb017db678a32fce40c9c5da2630c7eda
MD5 | e70e6d461ef06f2679d5bba706d3d20b
BLAKE2b-256 | 286e958b01b3bb9789c7b6dc47fbbee34e4884a62135a39dc5c36ef82a1e69e8