Auxiliary functions to clean pandas data frames
Project description
Pywrangle
Library for Python data wrangling to streamline string cleaning, identifying missing data, and tracking dataframe changes. Available on PyPI here
Install
- Python 3.6+
- numpy
- pandas
To install pywrangle, use pip:
pip install pywrangle
Import
Per convention with Python Analysis modules, import pywrangle as follows:
>>> import pywrangle as pw
String cleaning
def clean_str_columns(df: object, col_strcase_tuple: tuple) -> df:
Master function to clean string columns using col_strcase_tuple key.
Missing Data
print_nulls_per_col(df) -> None:
Calculates number of null values in each column and prints result.
Dataframe changes
History
version = "0.2.40"
- refactored code for clarity
- added display info to print_df_changes
version = "0.2.1"
- Created init file for function imports
- Documentation on importing pywrangle
- Added numpy as required package.
- Changed package requirements to greater than or equal to.
version = "0.0.1"
- Init
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pywrangle-0.2.49.tar.gz
(7.7 kB
view details)
File details
Details for the file pywrangle-0.2.49.tar.gz
.
File metadata
- Download URL: pywrangle-0.2.49.tar.gz
- Upload date:
- Size: 7.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/49.5.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 442294f1d1aaccfd9ad422b73e83057de3ca33d01a3a7100b44f0c91fcf890a7 |
|
MD5 | fb25a763235d2f8282ec812ccd2c4096 |
|
BLAKE2b-256 | 334b7ab4969ef11919c9cc4cbd32de97fbac86ee5ed31c11216a6c32501d8a4a |