Skip to main content

Python library for reproducible data analytics of dataframes, text and images

Project description

makedalytics

Currrent version: DOI A python library for data analytics on data including text and/or images with an emphasis on scientific reproducibility. There is only a dependancy on pip (although some versions of pip may have dependancies). Please note the library is currently evolving, and I am testing new functions. At present the tested functional functions of makedalytics are the following:

general functions

remind_ds_libraries():

This function takes no arguments and reminds what libraries you may want to import by printing them.

packages_to():

This function takes no arguments and displays what your system and packages are, and writes them to a file called "requirementscontext.txt"

check_requirements(test2list):

Function takes arguments of a list e.g. ['absl-py==0.7.0','aiofiles==0.4.0','altair==4.1.1'] of packages you will have as requirements as test2list. Returns which you are missing in your machine environment. Missing may be a difference of version number.

biopsy_df(start_row,end_row,column, df):

Function takes arguments of starting row (start row), end row and column or [columns] listed in []. Returns only specified area of dataframe.

understand_df(df):

This function takes the dataframe as an argument, and produces information on number of columns, rows, data types, existance of nulls etc.

pristine(df,axis_to_zap, modify_index):

Takes arguments of dataframe(df), 'columns' or 'rows' (axis to zap) and modify_index which can be set to true or false. Returns a cleaned dataframe without columns or rows that have nulls.

text and NLP related functions

show_duplicates(df):

If there are duplicates it will show and tell.

tag_text(text, column):

Shows you where (what row) certain text is in a columns. Works like nuclear medicine tagging.

wordish_count(str):

Counts split up elements of a string. In the case ofan English sentance that is a count for each word with some counting of certain punctuation..

image related functions

create_matrix(width, height, default_element):

Helper function to create matrices width' x height' y fills with default element which can be set to a number or a boolean (True or False)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

makedalytics-0.0.9-py3-none-any.whl (16.4 kB view details)

Uploaded Python 3

File details

Details for the file makedalytics-0.0.9-py3-none-any.whl.

File metadata

  • Download URL: makedalytics-0.0.9-py3-none-any.whl
  • Upload date:
  • Size: 16.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/0.0.0 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.28.1 CPython/3.7.1

File hashes

Hashes for makedalytics-0.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 780c155500a8a273cc533c33ae171b750cb73e26107269c8390380f76b4e21b7
MD5 cfde8b56b9c578ce74da85c818b01f39
BLAKE2b-256 901bf7765ecde99e7929785375046102228f9408650ce4a66198ed0fda54716a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page