Python library for reproducible data analytics of dataframes, text and images
Project description
makedalytics
Current version: A Python library for data analytics on data including text and/or images, with an emphasis on scientific reproducibility. The only dependency is pip (although some versions of pip may have dependencies of their own). Please note that the library is still evolving, and I am testing new functions. At present, the tested, working functions of makedalytics are the following:
general functions
remind_ds_libraries():
This function takes no arguments and prints a reminder of the libraries you may want to import.
packages_to():
This function takes no arguments, displays your system and installed packages, and writes them to a file called "requirementscontext.txt".
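To illustrate the idea of recording the environment, here is a minimal standard-library sketch. It is not the makedalytics implementation; the name `snapshot_environment`, its `path` parameter, and the exact layout of the output file are assumptions made for illustration.

```python
import platform
import sys
from importlib import metadata


def snapshot_environment(path="requirementscontext.txt"):
    """Hypothetical helper (not the library's code): record the interpreter,
    the platform and the installed packages, then write them to `path`."""
    lines = [
        f"# python {sys.version.split()[0]}",
        f"# platform {platform.platform()}",
    ]
    pins = set()
    for dist in metadata.distributions():
        name = dist.metadata["Name"]
        if name:  # skip distributions with broken metadata
            pins.add(f"{name}=={dist.version}")
    # One "name==version" pin per line, sorted case-insensitively.
    lines.extend(sorted(pins, key=str.lower))
    with open(path, "w", encoding="utf-8") as fh:
        fh.write("\n".join(lines) + "\n")
    return lines
```

Calling `snapshot_environment()` with no arguments would write the file into the current working directory; keeping that file alongside an analysis makes it easier to rebuild the same environment later.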
check_requirements(test2list):
Takes a list of the packages you require (test2list), e.g. ['absl-py==0.7.0','aiofiles==0.4.0','altair==4.1.1']. Returns the ones that are missing from your machine environment. A package also counts as missing if it is installed with a different version number.
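The comparison described above can be sketched with the standard library alone. This is an illustration under stated assumptions, not the library's code: `missing_requirements` and its optional `installed` mapping are hypothetical names, and only exact `name==version` pins are handled.

```python
from importlib import metadata


def missing_requirements(required, installed=None):
    """Hypothetical helper (not the library's code): return the pins in
    `required` that are absent or installed at a different version."""
    if installed is None:
        # Build {lowercased name: version} from the current environment.
        installed = {}
        for dist in metadata.distributions():
            name = dist.metadata["Name"]
            if name:
                installed[name.lower()] = dist.version
    missing = []
    for pin in required:
        name, _, version = pin.partition("==")
        found = installed.get(name.strip().lower())
        # Missing if not installed, or if a version is pinned and differs.
        if found is None or (version and found != version.strip()):
            missing.append(pin)
    return missing


# Example with an explicit, made-up environment so the result is predictable.
fake_env = {"absl-py": "0.8.0", "altair": "4.1.1"}
wanted = ["absl-py==0.7.0", "aiofiles==0.4.0", "altair==4.1.1"]
print(missing_requirements(wanted, installed=fake_env))
```

In the example, absl-py is reported because the version differs and aiofiles because it is not installed at all, which matches the "missing may be a difference of version number" behaviour described above.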
biopsy_df(start_row,end_row,column, df):
Takes a starting row (start_row), an end row (end_row), a column or a list of columns in [] (column), and the dataframe (df). Returns only the specified area of the dataframe.
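A rough pandas equivalent of cutting out such an area is shown below. The helper name `slice_block` is hypothetical, and the sketch assumes positional rows with an exclusive end row; whether biopsy_df includes the end row is not stated here.

```python
import pandas as pd


def slice_block(start_row, end_row, columns, df):
    """Hypothetical stand-in (not the library's code): return rows
    start_row..end_row-1 of the requested column(s)."""
    # Accept either a single column name or a list of names.
    cols = [columns] if isinstance(columns, str) else list(columns)
    return df.iloc[start_row:end_row][cols]


df = pd.DataFrame(
    {"id": [1, 2, 3, 4], "name": list("abcd"), "score": [0.1, 0.2, 0.3, 0.4]}
)
block = slice_block(1, 3, ["name", "score"], df)
print(block)
```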
understand_df(df):
This function takes the dataframe as an argument and reports the number of columns and rows, the data types, the existence of nulls, etc.
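The kind of overview described above can be approximated with a few plain pandas calls. This is a sketch, not the output format of understand_df; the helper name `summarize` and the dictionary keys are assumptions.

```python
import pandas as pd


def summarize(df):
    """Hypothetical stand-in (not the library's code): collect basic
    structural facts about a dataframe."""
    return {
        "rows": df.shape[0],
        "columns": df.shape[1],
        "dtypes": df.dtypes.astype(str).to_dict(),
        "nulls_per_column": df.isna().sum().to_dict(),
    }


sample = pd.DataFrame({"a": [1, None, 3], "b": ["x", "y", "z"]})
summary = summarize(sample)
for key, value in summary.items():
    print(key, value)
```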
pristine(df,axis_to_zap, modify_index):
Takes a dataframe (df), 'columns' or 'rows' (axis_to_zap), and modify_index, which can be set to true or false. Returns a cleaned dataframe without the columns or rows that have nulls.
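In plain pandas, this kind of cleaning maps onto `DataFrame.dropna`. The sketch below uses a hypothetical helper, `drop_nulls`, and it assumes that modify_index means "renumber the row index after rows are dropped"; the library may interpret that flag differently.

```python
import pandas as pd


def drop_nulls(df, axis_to_zap, modify_index=False):
    """Hypothetical stand-in (not the library's code): drop the rows or
    columns that contain at least one null."""
    if axis_to_zap not in ("rows", "columns"):
        raise ValueError("axis_to_zap must be 'rows' or 'columns'")
    axis = 0 if axis_to_zap == "rows" else 1
    cleaned = df.dropna(axis=axis)
    # Assumption: modify_index renumbers rows 0..n-1 after rows are removed.
    if modify_index and axis == 0:
        cleaned = cleaned.reset_index(drop=True)
    return cleaned


raw = pd.DataFrame({"a": [1.0, None, 3.0], "b": [1, 2, 3]})
by_rows = drop_nulls(raw, "rows", modify_index=True)
by_columns = drop_nulls(raw, "columns")
print(by_rows)
print(by_columns)
```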
text and NLP related functions
show_duplicates(df):
If there are duplicates in the dataframe, it shows them and tells you.
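For comparison, duplicate rows can be surfaced in plain pandas with `DataFrame.duplicated`. The helper name `find_duplicate_rows` is hypothetical, and the sketch assumes that "duplicates" means fully identical rows.

```python
import pandas as pd


def find_duplicate_rows(df):
    """Hypothetical stand-in (not the library's code): print and return
    every row that has at least one identical twin."""
    dupes = df[df.duplicated(keep=False)]
    if dupes.empty:
        print("No duplicate rows found.")
    else:
        print(f"{len(dupes)} rows belong to duplicate groups:")
        print(dupes)
    return dupes


records = pd.DataFrame({"a": [1, 1, 2], "b": ["x", "x", "y"]})
duplicates = find_duplicate_rows(records)
```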
tag_text(text, column):
Shows you where (in what row) certain text appears in a column. Works like nuclear medicine tagging.
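A minimal sketch of locating text in a column follows. It assumes the column is passed as a pandas Series and that matching is a literal, case-sensitive substring search; `rows_containing` is a hypothetical name, not part of makedalytics.

```python
import pandas as pd


def rows_containing(text, column):
    """Hypothetical stand-in (not the library's code): return the index
    labels of the rows whose value contains `text`."""
    # regex=False makes this a literal substring match.
    mask = column.astype(str).str.contains(text, regex=False)
    return column.index[mask].tolist()


reports = pd.Series(["chest x-ray", "ct head", "x-ray wrist"])
hits = rows_containing("x-ray", reports)
print(hits)
```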
wordish_count(str):
Counts the split-up elements of a string. For an English sentence, that is a count of each word, with some counting of certain punctuation.
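One way to see why punctuation can end up in the count is to split on whitespace, as in the sketch below. This assumes whitespace splitting, which may not be exactly what wordish_count does; `count_tokens` is a hypothetical name.

```python
from collections import Counter


def count_tokens(sentence):
    """Hypothetical stand-in (not the library's code): split on whitespace
    and return the total number of pieces plus a per-piece tally."""
    tokens = sentence.split()
    return len(tokens), Counter(tokens)


total, tally = count_tokens("The cat sat - the end .")
print(total)
print(tally)
```

With whitespace splitting, a free-standing "-" or "." counts as its own element, while punctuation attached to a word (as in "end.") stays part of that word.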
image related functions
create_matrix(width, height, default_element):
Helper function that creates a matrix of width x by height y and fills it with a default element, which can be set to a number or a boolean (True or False).
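As a sketch of such a helper, a nested list comprehension is enough. The name `make_matrix` is hypothetical, and the layout (height rows, each holding width elements) is an assumption about how create_matrix arranges its result.

```python
def make_matrix(width, height, default_element):
    """Hypothetical stand-in (not the library's code): build `height` rows
    of `width` elements, each set to `default_element`."""
    # A fresh list per row, so changing one row never affects another.
    return [[default_element for _ in range(width)] for _ in range(height)]


grid = make_matrix(3, 2, 0)
mask = make_matrix(2, 2, False)
grid[0][0] = 255  # only the first row changes
print(grid)
print(mask)
```

Building each row separately avoids the common pitfall of `[[0] * width] * height`, where every row is the same list object.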
Built Distribution
File details
Details for the file makedalytics-0.0.9-py3-none-any.whl.
File metadata
- Download URL: makedalytics-0.0.9-py3-none-any.whl
- Upload date:
- Size: 16.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/0.0.0 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.28.1 CPython/3.7.1
File hashes
Algorithm | Hash digest
---|---
SHA256 | 780c155500a8a273cc533c33ae171b750cb73e26107269c8390380f76b4e21b7
MD5 | cfde8b56b9c578ce74da85c818b01f39
BLAKE2b-256 | 901bf7765ecde99e7929785375046102228f9408650ce4a66198ed0fda54716a