A Python library for EDA, including visualizations, directory management, data preprocessing, reporting, and more.
Project description
Welcome to EDA Toolkit, a collection of utility functions designed to streamline your exploratory data analysis (EDA) tasks. This repository offers tools for directory management, some data preprocessing, reporting, visualizations, and more, helping you efficiently handle various aspects of data manipulation and analysis.
Prerequisites
Before you install eda_toolkit
, ensure your system meets the following requirements:
Python
: Version3.7.4
or higher is required to runeda_toolkit
.
Additionally, eda_toolkit
depends on the following packages, which will be automatically installed when you install eda_toolkit
:
jinja2
: version3.1.4
(Exact version required)matplotlib
: version3.5.3
or higher, but capped at3.9.2
nbformat
: version4.2.0
or higher, but capped at5.10.4
numpy
: version1.21.6
or higher, but capped at2.1.0
pandas
: version1.3.5
or higher, but capped at2.2.2
plotly
: version5.18.0
or higher, but capped at5.24.0
scikit-learn
: version1.0.2
or higher, but capped at1.5.2
seaborn
: version0.12.2
or higher, but capped below0.13.0
xlsxwriter
: version3.2.0
(Exact version required)scipy
: version1.7.3
or higher, but capped at1.11.1
💾 Installation
To install eda_toolkit
, simply run the following command in your terminal:
pip install eda_toolkit
📄 Official Documentation
https://lshpaner.github.io/eda_toolkit
🌐 Authors' Websites
🙏 Acknowledgements
We would like to express our deepest gratitude to Dr. Ebrahim Tarshizi, our mentor during our time in the University of San Diego M.S. Applied Data Science Program. His unwavering dedication and mentorship played a pivotal role in our academic journey, guiding us to successfully graduate from the program and pursue successful careers as data scientists.
We also extend our thanks to the Shiley-Marcos School of Engineering at the University of San Diego for providing an exceptional learning environment and supporting our educational endeavors.
⚖️ License
eda_toolkit
is distributed under the MIT License. See LICENSE for more information.
🛟 Support
If you have any questions or issues with eda_toolkit
, please open an issue on this GitHub repository.
📚 Citing eda_toolkit
If you use eda_toolkit
in your research or projects, please consider citing it.
@software{shpaner_2024_13162633,
author = {Shpaner, Leonid and
Gil, Oscar},
title = {EDA Toolkit},
month = aug,
year = 2024,
publisher = {Zenodo},
version = {0.0.11a2},
doi = {10.5281/zenodo.13162633},
url = {https://doi.org/10.5281/zenodo.13162633}
}
🔖 References
-
Hunter, J. D. (2007). Matplotlib: A 2D Graphics Environment. Computing in Science & Engineering, 9(3), 90-95. https://doi.org/10.1109/MCSE.2007.55
-
Kohavi, R. (1996). Census Income. UCI Machine Learning Repository. https://doi.org/10.24432/C5GP7S.
-
Pace, R. Kelley, & Barry, R. (1997). Sparse Spatial Autoregressions. Statistics & Probability Letters, 33(3), 291-297. https://doi.org/10.1016/S0167-7152(96)00140-X.
-
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., & Duchesnay, E. (2011). Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research, 12, 2825-2830. http://jmlr.org/papers/v12/pedregosa11a.html.
-
Waskom, M. (2021). Seaborn: Statistical Data Visualization. Journal of Open Source Software, 6(60), 3021. https://doi.org/10.21105/joss.03021.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file eda_toolkit-0.0.11a2.tar.gz
.
File metadata
- Download URL: eda_toolkit-0.0.11a2.tar.gz
- Upload date:
- Size: 40.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 78a7ed6ef0740354619bee901fb5be3b73e9f825e70de0c0cd9ca5b294b51ea7 |
|
MD5 | 9b25aab892bf25efef10ec7df22964d1 |
|
BLAKE2b-256 | dc86161410afd91873692ad1423d0d53e9685986abdf30e0afbac16b182b8af0 |
File details
Details for the file eda_toolkit-0.0.11a2-py3-none-any.whl
.
File metadata
- Download URL: eda_toolkit-0.0.11a2-py3-none-any.whl
- Upload date:
- Size: 35.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f7dceb14ee0d846894db4f72d023a4fba224cb25b12deffd14f96d11246b31f0 |
|
MD5 | d4d5fad47a34f1ecfe20c24897b9f42b |
|
BLAKE2b-256 | 80ffaf87ba0b88e208c5b1827ec7077c9d6185987f05b53495c4f8270e6bf382 |