Various code to aid in data science projects for tasks involving data cleaning, ETL, EDA, NLP, viz, feature engineering, feature selection, model validation, etc.
Project description
data-science-toolbox
Various code to aid in data science projects for tasks involving data cleaning, ETL, EDA, NLP, viz, feature engineering, feature selection, model training and validation etc.
Project Organization
├── README.md
├── data_science_toolbox <- Project source code
│ │
│ ├── gists <- Code gists with commonly used code (change to root
│ │ directory, connect to database, profile data, etc)
│ ├── io <- Code for input/output utilities
│ ├── etl <- For building reproducible ETL pipelines, including data
│ │ checks and transformers
│ ├── ml <- Machine Learning utility code (feature engineering, etc)
│ ├── pandas <- Pandas related utility code
│ │ ├── analysis
│ │ ├── cleaning
│ │ ├── engineering
│ │ ├── text
│ │ ├── datetime
│ │ ├── optimization
│ │ └── profiling
│ ├── project_utils.py <- For project specific utilities
│ │
│ ├── text <- Code for dealing with text. Includes distributed loading of text corpus,
│ │ entity statement extraction, sentiment analysis, pii removal etc.
│ └── __init__.py <- Makes data_science_toolbox a Python module
├── tests
├── LICENSE
├── poetry.lock
└── pyproject.toml
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
data_science_toolbox-0.1.2.tar.gz
(66.3 kB
view hashes)
Built Distribution
Close
Hashes for data_science_toolbox-0.1.2.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 979aa16c49cd72b1c78cda85b4e6895f3b64d7f120ffe98f235087194d7795ca |
|
MD5 | 8827636a108d493ab826437831aeece9 |
|
BLAKE2b-256 | 7f68826a96f4aa14e143b48aeb77d3cfae8fa777b3b663767cc5acbc85f0e459 |
Close
Hashes for data_science_toolbox-0.1.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 323f08d9ff60b97500523a4ba17583ac77e1a14e3c141604e429f6315b47a02a |
|
MD5 | 08b54dd6f5ae333689a546e79e9a09ae |
|
BLAKE2b-256 | 064740f2d8c01c17f50b64e4669dd5a39106f6a792a780a42cfcd4335a8a3965 |