Skip to main content

An extensible package to automate and enhance the manipulation of tabular/matrix data without giving up performance.

Project description

License

matrixb

matrixb is a Python library to provide a single interface to handle common data input, cleaning, and analysis features for data science datasets. The name MatrixB arise from a play on several interpretations - Matrix, Beta (Experimental tools to use with matrices) and Matrix, Version B (Enhancements on 2-D matrix processing).

In general, you might consider matrixb as occupying a middlespace between the speed and efficiency of using native 2-D array matrices in Python and sophisticated statistical datastructures such as dataframes in Pandas and R. It provides a singular interface to load in csv, xls, xlsx, and ods files, automatically cleans common mistakes (accidentally adding a space to the end of a text string, or changing the capitalization structure of column names between two different data files) and allows tremendous flexibility in programmer-assisted cleaning and analysis, in part by leveraging the tools in the pydatacleaner package. If you need more functionality than matrixb, you likely should be using Pandas. There isn't a lot of overhead in matrixb (at least that can't be eliminated with a switch passed into the constructor), and so if it is too heavy for you, you probably need to custom write your own processing.

Distribution

Project Status

Currently, matrixb is functional but shallowly vetted condition and should be considered alpha software. Some features that were implemented in the past may have been broken with more recent refactoring, and test coverage is still limited. Your mileage may vary.

Code comments of NOTE and TODO indicate known shortcomings that may be useful to you. The interface will likely change in future versions.

If you wish to rely on features of this package, I am likely more than willing to accommodate and to incorporate sensible design improvements or, in some cases, changes.

Installation

Use the package manager pip to install matrixb.

pip install matrixb

Usage

Many examples of usage are available in the main test files included in the t/ subdirectory.

import matrixb

**EXAMPLES coming soon**

Contributing

Contributions are collaboration is welcome. For major changes, please contact me in advance to discuss.

Please make sure to update tests for any contribution, as appropriate.

Author

Kevin Crouse. Copyright, 2019.

License

Apache 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

matrixb-0.0.1.1.tar.gz (25.7 kB view details)

Uploaded Source

Built Distribution

matrixb-0.0.1.1-py3-none-any.whl (35.6 kB view details)

Uploaded Python 3

File details

Details for the file matrixb-0.0.1.1.tar.gz.

File metadata

  • Download URL: matrixb-0.0.1.1.tar.gz
  • Upload date:
  • Size: 25.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.20.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.5

File hashes

Hashes for matrixb-0.0.1.1.tar.gz
Algorithm Hash digest
SHA256 7e4576b034ddc6938a5f6059896c2c8179cac0d12623a57a2fd991eae2e0924a
MD5 fa2e4cf9bcadb027b0b66335a7ddbdb0
BLAKE2b-256 f17ab0a48d7df98dd3a04acf967413e340035fdd1fdd698b0ed72f1baabfbe8d

See more details on using hashes here.

File details

Details for the file matrixb-0.0.1.1-py3-none-any.whl.

File metadata

  • Download URL: matrixb-0.0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 35.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.20.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.5

File hashes

Hashes for matrixb-0.0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 710b3773e6d0478c84dfc9023f9bb97764d034648b2bd929bca435a72d0d39bd
MD5 cabc0eeef4c286e67b97f24a17c57628
BLAKE2b-256 d8651bda7b77082e7a03b42117ce64707246dcd50d30571ba0c80e44c1de56f5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page