A module designed to make your data preprocessing experience easier.
Project description
irdatacleaning
This Python package is designed to make Artificial Intelligence accessible by starting with the data cleaning stage.
DataCorrelation:
This module lets you view the correlation values of your dataset, helping you prevent simple errors.
DataCorrelation(df = pandas dataframe)
df: the dataset you would like to evaluate.
Correlationmatrix(): the method you call to view which columns have correlation relationships.
LookingAtCorr(): the method that actually makes the changes to your dataset. It returns a pandas DataFrame.
Check(): calls both Correlationmatrix() and LookingAtCorr() for you. It also returns a pandas DataFrame.
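To illustrate the kind of check described above, here is a minimal sketch in plain pandas. It is not the package's implementation: the helper names `correlated_pairs` and `drop_correlated`, the 0.9 threshold, and the rule of dropping the second column of each pair are all assumptions made for this example.

```python
import pandas as pd


def correlated_pairs(df, threshold=0.9):
    """Return (col_a, col_b, r) for numeric column pairs with |r| >= threshold.

    Hypothetical helper for illustration; not part of irdatacleaning.
    """
    corr = df.select_dtypes("number").corr()
    cols = list(corr.columns)
    pairs = []
    for i, a in enumerate(cols):
        for b in cols[i + 1:]:
            r = corr.loc[a, b]
            if abs(r) >= threshold:
                pairs.append((a, b, float(r)))
    return pairs


def drop_correlated(df, threshold=0.9):
    """Return a new DataFrame without the second column of each flagged pair."""
    to_drop = {b for _, b, _ in correlated_pairs(df, threshold)}
    return df.drop(columns=sorted(to_drop))


df = pd.DataFrame({
    "height_cm": [150.0, 160.0, 170.0, 180.0],
    "height_in": [59.1, 63.0, 66.9, 70.9],  # same measurement in inches
    "age": [30, 22, 41, 35],
})

print(correlated_pairs(df))   # the two height columns are flagged
reduced = drop_correlated(df)
print(list(reduced.columns))
```

With the package itself, the description above suggests the equivalent step would be creating `DataCorrelation(df=df)` and calling `Check()`; the exact import path is not given here.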
DataDiscovery:
This class is designed to help you evaluate your data so that you can get an idea of what needs to change in the dataset. The best way to use it is to create an instance of the class, which automates everything.
DataDiscovery(df)
df: any pandas DataFrame you wish to evaluate.
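The description does not say what the class reports, so the following is only a sketch of the sort of per-column overview a data discovery step commonly produces, written in plain pandas. The function name `summarize` and the choice of columns in the report are assumptions, not the package's API.

```python
import pandas as pd


def summarize(df):
    """Build a per-column overview: dtype, missing values, distinct values.

    Hypothetical helper for illustration; not part of irdatacleaning.
    """
    return pd.DataFrame({
        "dtype": df.dtypes.astype(str),
        "missing": df.isna().sum(),
        "unique": df.nunique(),  # NaN/None are not counted as a value
    })


df = pd.DataFrame({
    "city": ["Oslo", "Lima", None, "Oslo"],
    "temp_c": [4.5, 19.0, 22.1, None],
})

report = summarize(df)
print(report)
```

A report like this points at the columns that need attention before modelling, for example those with missing values or with a non-numeric dtype that will need encoding.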
Encoder:
This class is designed to make encoding your data simple. Its input variables are:
df: a pandas DataFrame.
type: set to ONEHOTENCODER by default. If you wish to use OrdinalEncoder, set type to ordinalencoder.
You can then call the check method to make the corrections. This method returns a pandas DataFrame. If you wish to compare the returned value to the original dataset, you may call copy.
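The difference between the two encoding modes can be sketched in plain pandas as follows. This is an illustration under assumptions, not the package's code: the function `encode` and its `kind` parameter are hypothetical stand-ins for the class and its `type` argument, and ordinal codes are assigned here in sorted category order.

```python
import pandas as pd


def encode(df, kind="onehotencoder"):
    """Encode non-numeric columns; the input DataFrame is left untouched.

    Hypothetical helper for illustration; not part of irdatacleaning.
    """
    out = df.copy()
    cat_cols = [c for c in out.columns
                if not pd.api.types.is_numeric_dtype(out[c])]
    kind = kind.lower()
    if kind == "onehotencoder":
        # One indicator column per category value.
        return pd.get_dummies(out, columns=cat_cols)
    if kind == "ordinalencoder":
        # One integer code per category, in sorted category order.
        for c in cat_cols:
            out[c] = out[c].astype("category").cat.codes
        return out
    raise ValueError(f"unknown encoder kind: {kind!r}")


df = pd.DataFrame({"color": ["red", "green", "red"], "size": [1, 2, 3]})

onehot = encode(df)                           # adds color_green, color_red
ordinal = encode(df, kind="ordinalencoder")   # color becomes integer codes
print(onehot)
print(ordinal)
```

One-hot encoding widens the table by one column per category, while ordinal encoding keeps a single column but implies an order between categories, which is worth keeping in mind when choosing between the two.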
Built Distribution
Hashes for irdatacleaning-2021.0.4-py3-none-any.whl
Algorithm | Hash digest |
---|---|
SHA256 | 94ffd15c714e0b95fbbd018e170a36bd3656f53022524bae859398b356c453ab |
MD5 | 6dfdea5d674a4f529d04670b90d23c7b |
BLAKE2b-256 | 638053864d080d1a7989cd9a1463d6a1d9b029430153d3c2e2fbe710aceacd5e |