No project description provided
Project description
how to use this package:
#after you download the package from PyPi
#to import this package import YasinDataPrepKit as dtk
#your can read a csv, json and Excel files and in order to do #that your need to do the next:
#first you need to make an object and call the ReadingData class obj = dtk.ReadingData(r”Your file absolute path”)
#make sure to always use (r””)when reading your data for #correctly read your file path
#after this your call the read() function like this obj.read()
#after you have done these steps correctly then the rest is easy #you can do many function for data summary for instance:
#to find mean of int and float columns you use print(obj.calculate_mean())
#to find the maximum print(obj.max_value())
#for handling missing values you can use either the remove or #impute methods print(obj.handle_missing_values(‘remove’)) print(obj.handle_missing_values(‘impute’))
#the ‘remove’ and ‘impute’ is you specifying the strategy you #want to use to handle missing values
#there is also a function for encoding using one hot encoding print(obj.encode_categorical_data())
#here is a list for all functions other than the ones above that you can use from this package
#to calculate sum print(obj.calculate_sum())
#to calculate minimum print(obj.min_value())
#to calculate median print(obj.median_value())
#to calculate variance print(obj.var_value())
#to calculate standard deviation print(obj.std_deviation())
#to calculate correlation coefficient print(obj.cor_coefficient())
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for yasDataPrepKit-0.4.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0cfd8dc3d30f42cc6af1de60499a5e36375810fcfb334f4aef223e78301d55b6 |
|
MD5 | cb9195d7bef96f252151bcadef2fc5ad |
|
BLAKE2b-256 | 2d6fcc98b84c851121a6c016b207b04910480c637ae46da1ca2e45ad70945751 |