Python library for reproducible data analytics of dataframes, text and images
Project description
makedalytics
Currrent version: A python library for data analytics on data including text and/or images with an emphasis on scientific reproducibility. There is only a dependancy on pip (although some versions of pip may have dependancies). Please note the library is currently evolving, and I am testing new functions. At present the tested functional functions of makedalytics are the following:
general functions
remind_ds_libraries():
This function takes no arguments and reminds what libraries you may want to import by printing them.
packages_to():
This function takes no arguments and displays what your system and packages are, and writes them to a file called "requirementscontext.txt"
check_requirements(test2list):
Function takes arguments of a list e.g. ['absl-py==0.7.0','aiofiles==0.4.0','altair==4.1.1'] of packages you will have as requirements as test2list. Returns which you are missing in your machine environment. Missing may be a difference of version number.
biopsy_df(start_row,end_row,column, df):
Function takes arguments of starting row (start row), end row and column or [columns] listed in []. Returns only specified area of dataframe.
understand_df(df):
This function takes the dataframe as an argument, and produces information on number of columns, rows, data types, existance of nulls etc.
pristine(df,axis_to_zap, modify_index):
Takes arguments of dataframe(df), 'columns' or 'rows' (axis to zap) and modify_index which can be set to true or false. Returns a cleaned dataframe without columns or rows that have nulls.
text and NLP related functions
show_duplicates(df):
If there are duplicates it will show and tell.
tag_text(text, column):
Shows you where (what row) certain text is in a columns. Works like nuclear medicine tagging.
wordish_count(str):
Counts split up elements of a string. In the case ofan English sentance that is a count for each word with some counting of certain punctuation..
image related functions
create_matrix(width, height, default_element):
Helper function to create matrices width' x
height' y
fills with default element which can be set to a number or a boolean (True or False)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Hashes for makedalytics-0.0.9-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 780c155500a8a273cc533c33ae171b750cb73e26107269c8390380f76b4e21b7 |
|
MD5 | cfde8b56b9c578ce74da85c818b01f39 |
|
BLAKE2b-256 | 901bf7765ecde99e7929785375046102228f9408650ce4a66198ed0fda54716a |