Data dictionary functionality for Pandas data frames

# Data Dictionary for Pandas

The `DataDict` class in this package maps the columns of different pandas data frames into a consistent namespace, ensures that the columns comply with the data types specified in the data dictionary, and describes the data.

The data dictionary consists of at least the following columns:

* Data Set: Used in combination with Field when mapping, to rename the column to Name.
* Field: Column name in the data frame that is mapped to Name.
* Name: Column name that is unique throughout the data dictionary.
* Description: Description of the column. This can be used to provide additional information when displaying the data frame.
* Type: Type the column should be cast to.
* Format: Format to use when values need to be converted to a string representation. The format string has to be a Python format string such as `{:.0f}%`.
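To make the role of each column concrete, here is a minimal sketch that uses plain pandas only. It does not use the package's `DataDict` API, whose method names are not documented here. The data set name `sales`, the example fields, and the helper functions `apply_data_dict` and `format_values` are all hypothetical and exist only for illustration.

```python
import pandas as pd

# Hypothetical data dictionary with the six columns described above.
data_dict = pd.DataFrame(
    [
        ("sales", "cust_id", "customer_id", "Unique customer identifier", "int", "{:d}"),
        ("sales", "rev", "revenue", "Revenue in USD", "float", "{:.2f}"),
        ("sales", "growth_pct", "growth", "Year-over-year growth", "float", "{:.0f}%"),
    ],
    columns=["Data Set", "Field", "Name", "Description", "Type", "Format"],
)


def apply_data_dict(df, data_dict, data_set):
    """Rename the columns of df from Field to Name and cast them to Type.

    Illustrative helper, not part of the package.
    """
    rows = data_dict[data_dict["Data Set"] == data_set]
    # Only map fields that are actually present in the data frame.
    rows = rows[rows["Field"].isin(df.columns)]
    renamed = df.rename(columns=dict(zip(rows["Field"], rows["Name"])))
    return renamed.astype(dict(zip(rows["Name"], rows["Type"])))


def format_values(df, data_dict):
    """Return a copy of df with every value rendered through its Format string.

    Illustrative helper, not part of the package.
    """
    formats = dict(zip(data_dict["Name"], data_dict["Format"]))
    out = pd.DataFrame(index=df.index)
    for col in df.columns:
        # Columns without a dictionary entry fall back to plain str formatting.
        fmt = formats.get(col, "{}")
        out[col] = df[col].map(fmt.format)
    return out


# Raw data as it might arrive from a source system: everything is a string.
raw = pd.DataFrame(
    {"cust_id": ["1", "2"], "rev": ["10.5", "20"], "growth_pct": ["12.4", "7.6"]}
)
clean = apply_data_dict(raw, data_dict, "sales")
pretty = format_values(clean, data_dict)
```

Here `clean` carries the names and types from the dictionary, while `pretty` holds the string representation produced by the Format column, for example `12%` for a growth value of `12.4`.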

The data dictionary can either be loaded from a CSV file or from a data frame.
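As a rough illustration of the two sources, the sketch below reads a data dictionary from CSV text with `pandas.read_csv` and builds the same dictionary from an existing data frame. It again relies on plain pandas rather than the package's own loader, and the constant `REQUIRED_COLUMNS`, the sample CSV content, and the helper `validate_data_dict` are assumptions made for this example.

```python
import io

import pandas as pd

# Columns the data dictionary is expected to contain (see the list above).
REQUIRED_COLUMNS = ["Data Set", "Field", "Name", "Description", "Type", "Format"]

# Hypothetical CSV content; in practice this would be a file on disk.
CSV_TEXT = """Data Set,Field,Name,Description,Type,Format
sales,cust_id,customer_id,Unique customer identifier,int,{:d}
sales,rev,revenue,Revenue in USD,float,{:.2f}
"""


def validate_data_dict(data_dict):
    """Raise ValueError if a required column is missing (illustrative helper)."""
    missing = [c for c in REQUIRED_COLUMNS if c not in data_dict.columns]
    if missing:
        raise ValueError(f"Data dictionary is missing columns: {missing}")
    return data_dict


# Source 1: a CSV file. Reading every column as a string keeps the
# Type and Format values exactly as they were written.
from_csv = validate_data_dict(pd.read_csv(io.StringIO(CSV_TEXT), dtype=str))

# Source 2: a data frame that already exists in memory.
from_frame = validate_data_dict(pd.DataFrame(from_csv.to_dict("records")))
```

Validating the columns up front means a malformed dictionary fails immediately instead of producing a confusing error later during mapping or casting.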

Download files

Source Distribution

pandas-datadict-0.0.1.tar.gz (4.5 kB)

Uploaded Source

Built Distribution

pandas_datadict-0.0.1-py3-none-any.whl (6.5 kB)

Uploaded Python 3
