Data dictionary functionality for pandas data frames
Data Dictionary for pandas
The data dictionary contains at least the following columns:
- Data Set: Used together with Field when mapping, to select the column that is renamed to Name.
- Field: Column name of the data frame to map to Name.
- Name: Column name that is unique throughout the data dictionary.
- Description: Description of the column name. This can be used to provide additional information when displaying the data frame.
- Type: Type the column should be cast to.
- Format: Format to use when values need to be converted to a string representation. The format string has to be a Python format string such as {:.0f}%
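To make the Format column concrete, here is a minimal sketch of how a Python format string such as {:.0f}% turns values into strings. It uses only plain pandas and str.format, not this package's API; the column name Completion and the sample values are made up for illustration.

```python
import pandas as pd

# Hypothetical data: a percentage column stored as floats.
df = pd.DataFrame({"Completion": [12.3, 87.6]})

# A Format value is an ordinary Python format string.
fmt = "{:.0f}%"

# Apply the format to each value to get its string representation.
formatted = df["Completion"].map(fmt.format)
print(formatted.tolist())  # ['12%', '88%']
```

Whether the package applies the format in exactly this way is an assumption; the sketch only shows what such a format string produces.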
The data dictionary can either be loaded from a CSV file (example data dictionary) or from a data frame.
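The following sketch illustrates the idea with plain pandas only: it reads a small data dictionary from CSV text and uses the Data Set, Field, Name and Type columns to rename and cast a data frame. It does not use this package's API. The CSV content, the sales data set, the sample data frame and the type names int and float are all assumptions made for illustration.

```python
import io

import pandas as pd

# Hypothetical data dictionary in CSV form, using the columns described above.
csv_text = """Data Set,Field,Name,Description,Type,Format
sales,qty,Quantity,Number of units sold,int,{:d}
sales,price,Unit Price,Price per unit,float,{:.2f}
"""
data_dict = pd.read_csv(io.StringIO(csv_text))

# Hypothetical raw data frame whose column names match the Field values.
raw = pd.DataFrame({"qty": ["3", "5"], "price": ["1.5", "2.25"]})

# Select the entries for one data set, then build Field -> Name and Name -> Type maps.
entries = data_dict[data_dict["Data Set"] == "sales"]
rename_map = dict(zip(entries["Field"], entries["Name"]))
type_map = dict(zip(entries["Name"], entries["Type"]))

# Rename the columns, then cast each one to the type given in the data dictionary.
mapped = raw.rename(columns=rename_map).astype(type_map)
print(mapped)
```

Keeping the mapping in a table rather than in code means the same renaming and casting rules can be reused across data frames; the package's own loading and mapping functions are described in its documentation.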
Installation
Using pip
You can install using the pip package manager by running:
pip install pandas-datadict
Alternatively, you can install directly from GitHub:
pip install https://github.com/177arc/pandas-datadict/archive/master.zip
From source
Download the source code by cloning the repository or by pressing Download ZIP on this page. Then navigate to the project directory and run:
python setup.py install
Usage
For usage guidance and to test the package interactively, see the Usage Jupyter Notebook.
Documentation
For the code documentation, please visit the documentation on GitHub Pages.
Contributing
- Fork the repository on GitHub.
- Run the tests with python -m pytest tests/ to confirm they all pass on your system. If the tests fail, try to find out why. If you can't work it out yourself, don't hesitate to create an issue on GitHub, contact me on Discord, or send an email to py@177arc.net.
- Either create your feature and then write tests for it, or do this the other way around.
- Run all tests again with python -m pytest tests/ to confirm that everything still passes, including your newly added test(s).
- Create a pull request for the main repository's master branch.