Python library for reproducible data analytics of dataframes, text and images

Project description

makedalytics

Current version: DOI

A Python library for data analytics on data including text and/or images, with an emphasis on scientific reproducibility. The only dependency is pip (although some versions of pip may have dependencies of their own). Please note that the library is still evolving, and I am testing new functions. At present, the tested, working functions of makedalytics are the following:

general functions

remind_ds_libraries():

This function takes no arguments and prints a reminder of the libraries you may want to import.

packages_to():

This function takes no arguments, displays your system and installed packages, and writes them to a file called "requirementscontext.txt".
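
As a rough illustration of the idea (not the library's actual implementation), a snapshot like this can be built from the standard library alone. The function name `write_requirements_context` and the exact file layout are assumptions made for this sketch.

```python
import platform
from importlib import metadata
from pathlib import Path


def write_requirements_context(path="requirementscontext.txt"):
    """Hypothetical sketch: record the interpreter, platform and installed packages."""
    lines = [
        f"python=={platform.python_version()}",
        f"platform: {platform.platform()}",
    ]
    # One "name==version" pin per installed distribution, sorted for stable diffs.
    pins = sorted(
        f"{dist.metadata.get('Name')}=={dist.version}"
        for dist in metadata.distributions()
        if dist.metadata.get("Name")
    )
    lines.extend(pins)
    text = "\n".join(lines) + "\n"
    Path(path).write_text(text, encoding="utf-8")
    print(text)  # display the same content that was written
    return lines
```

Keeping the pins sorted means two snapshots taken on different machines can be compared line by line.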

check_requirements(test2list):

Takes a list of the packages you will have as requirements (test2list), e.g. ['absl-py==0.7.0','aiofiles==0.4.0','altair==4.1.1']. Returns the ones that are missing from your machine environment. A package may be reported as missing because its version number differs.
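
A minimal sketch of this kind of check, assuming every entry is pinned as `name==version`. The helper name `find_missing` is hypothetical, and the code uses the standard library's `importlib.metadata` rather than whatever makedalytics does internally.

```python
from importlib import metadata


def find_missing(requirements):
    """Hypothetical sketch: return the pins that are absent or at another version."""
    missing = []
    for pin in requirements:
        name, _, wanted = pin.partition("==")
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The package is not installed at all.
            missing.append(pin)
            continue
        if wanted and installed != wanted:
            # Installed, but at a different version than the pin asks for.
            missing.append(pin)
    return missing
```

Calling `find_missing(['absl-py==0.7.0', 'altair==4.1.1'])` would return whichever of the two pins does not match the current environment exactly.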

biopsy_df(start_row,end_row,column, df):

Takes a starting row (start_row), an end row (end_row), a column or a list of columns given in [] (column), and a dataframe (df). Returns only the specified area of the dataframe.
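
The same kind of slice can be sketched directly in pandas. The helper `slice_frame` below is hypothetical, and it assumes positional rows with an exclusive end row, as in ordinary Python slicing. The description above does not say whether biopsy_df includes the end row.

```python
import pandas as pd


def slice_frame(df, start_row, end_row, columns):
    """Hypothetical sketch: rows start_row..end_row-1 of the chosen column(s)."""
    if isinstance(columns, str):
        columns = [columns]  # accept a single column name as well as a list
    return df.iloc[start_row:end_row][columns]


df = pd.DataFrame({"a": [1, 2, 3, 4], "b": [5, 6, 7, 8]})
subset = slice_frame(df, 1, 3, "b")  # rows 1 and 2 of column "b"
```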

understand_df(df):

This function takes the dataframe as an argument and reports information such as the number of columns and rows, the data types, and the existence of nulls.

pristine(df,axis_to_zap, modify_index):

Takes a dataframe (df), 'columns' or 'rows' (axis_to_zap), and modify_index, which can be set to True or False. Returns a cleaned dataframe without the columns or rows that have nulls.
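
In plain pandas, this kind of cleaning might look like the sketch below. The name `drop_nulls` is hypothetical, and the reading of modify_index as "renumber the remaining rows from 0" is an assumption, since the description above does not spell it out.

```python
import pandas as pd


def drop_nulls(df, axis_to_zap, modify_index=False):
    """Hypothetical sketch: drop every row (or column) that contains a null."""
    if axis_to_zap not in ("rows", "columns"):
        raise ValueError("axis_to_zap must be 'rows' or 'columns'")
    axis = 0 if axis_to_zap == "rows" else 1
    cleaned = df.dropna(axis=axis)
    if modify_index and axis_to_zap == "rows":
        # Assumed meaning of modify_index: renumber the remaining rows 0..n-1.
        cleaned = cleaned.reset_index(drop=True)
    return cleaned


df = pd.DataFrame({"a": [1.0, None, 3.0], "b": [1, 2, 3]})
by_rows = drop_nulls(df, "rows", modify_index=True)
by_columns = drop_nulls(df, "columns")
```

Here `by_rows` keeps the two complete rows, and `by_columns` keeps only column "b".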

text and NLP related functions

show_duplicates(df):

If the dataframe contains duplicates, it shows them and tells you so.

tag_text(text, column):

Shows you where (in what row) certain text appears in a column. Works like nuclear medicine tagging.
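
One way to sketch this lookup with pandas, assuming the column is passed as a `Series` and the text is matched literally rather than as a regular expression. The helper name `rows_containing` is hypothetical and is not the library's own code.

```python
import pandas as pd


def rows_containing(text, column):
    """Hypothetical sketch: index labels of the rows whose value contains `text`."""
    # astype(str) lets the search run even when the column holds non-string values.
    mask = column.astype(str).str.contains(text, regex=False)
    return column.index[mask].tolist()


notes = pd.Series(["chest pain", "headache", "chest x-ray"])
hits = rows_containing("chest", notes)
```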

wordish_count(str):

Counts the elements produced by splitting a string. In the case of an English sentence, that is roughly a count of the words, with certain punctuation also counted.
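
A minimal sketch of a whitespace-based count, assuming the split is on whitespace. Under that assumption, punctuation that stands apart from a word is counted as its own element, while punctuation attached to a word is not. The name `count_elements` is hypothetical.

```python
def count_elements(text):
    """Hypothetical sketch: number of whitespace-separated pieces in a string."""
    return len(text.split())


attached = count_elements("Hello, world")          # the comma stays attached to "Hello"
separated = count_elements("The scan was clear .")  # the lone "." counts as an element
```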

image related functions

create_matrix(width, height, default_element):

Helper function that creates a matrix of the given width and height, filled with default_element, which can be set to a number or a boolean (True or False).
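
As an illustration only, such a matrix can be built as a list of lists. The sketch assumes `height` rows of `width` elements each, and the name `make_matrix` is hypothetical.

```python
def make_matrix(width, height, default_element):
    """Hypothetical sketch: `height` rows, each holding `width` copies of the default."""
    # Build every row separately so that rows do not share the same list object.
    return [[default_element for _ in range(width)] for _ in range(height)]


grid = make_matrix(3, 2, False)
grid[0][0] = True  # changes only the first row
```

The nested comprehension is used instead of `[[default_element] * width] * height`, because that shortcut makes every row the same list, so a change to one row would appear in all of them.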


Source Distributions

No source distribution files available for this release.

Built Distribution

makedalytics-0.0.9-py3-none-any.whl (16.4 kB)

Uploaded Python 3
