Various code to aid in data science projects for tasks involving data cleaning, ETL, EDA, NLP, viz, feature engineering, feature selection, model validation, etc.
Project description
data-science-utils
Various code to aid in data science projects for tasks involving data cleaning, ETL, EDA, NLP, viz, feature engineering, feature selection, etc.
Project Organization
├── README.md <- The top-level README for developers using this project.
├── gists <- Code gists with commonly used code (change to root
│ directory, connect to database, profile data, etc)
├── io <- Code for input/output utilities
├── etl <- For building reproducible ETL pipelines, including data
│ checks and transformers
├── ml <- Machine Learning utility code (feature engineering, etc)
├── pandas <- Pandas related utility code
│ ├── analysis
│ ├── cleaning
│ ├── engineering
│ ├── text
│ ├── datetime
│ ├── optimization
│ └── profiling
├── text <- Code for dealing with text. Includes distributed loading of text corpus,
│ entity statement extraction, sentiment analysis, etc.
├── __init__.py <- Makes data_science_utils a Python module
├── project_utils.py <- For project specific utilities
└── LICENSE
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
data_science_toolbox-0.1.1.tar.gz
(77.1 kB
view hashes)
Built Distribution
Close
Hashes for data_science_toolbox-0.1.1.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 05046abfad0f057014a5a1b18b0e8c77839039c556c1eace0a43efc129a51396 |
|
MD5 | f12d24cd6c3b117255a86db2b7ee3147 |
|
BLAKE2b-256 | f42e638b43c7662c72b7b7e7349c3e59e547c8988eb9c22aa20348a33add26dc |
Close
Hashes for data_science_toolbox-0.1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 97f9695d84fccac82853c0d5830eaa354e2448c3fc43f1fd039f88b5a26c0219 |
|
MD5 | 097e02aa247b4d6995945e63f151bfbe |
|
BLAKE2b-256 | 9d77bece679cb6165f8ed2e73ba4ae646c9c03410aa56d4f644135de8b5553b3 |