
Attribute Name Validator


attribute-name-validator


This package provides a CLI for column name analysis. It also acts as a library for the catalog of approved abbreviations and the codified column naming rules.

Given a text file of the column names used in the physical data layer for an entity, the CLI generates a report based on the Data Modeling and Naming Standards defined within this repository.

See ATTRIBUTE NAMING GUIDELINES AND ANALYSIS REPORT USAGE for more on the issues that arise when naming columns, and on how this package can be used to get feedback on column names and help standardize them.

See CONTRIBUTING.md for information pertaining to development of this package.

Install

Install the package from GitHub using pip:

pip3 install git+https://github.com/siddartham/attribute-name-validator.git

Usage

The report from each execution includes the catalog of class words and acronyms. This lets you see the source data set on which the report was based and look up approved acronyms.

Work Flow


$ anv -h
usage: attribute-name-validator [-h] [-l] [--write-to-text-files] target_path

This command generates report on naming analysis from either a list of column name files in a folder,
or a given column names file, by looking up the words used to the column names in a local Abbreviation catalog
 that comes with the installation of package. On the execution of this command, you will also have a reports
folder created with CATALOG.xlsx and COLUMN_NAMING_GUIDELINES_AND_ANALYSIS_REPORT_USAGE.html files,
which have more information related to the working of the tool

To add a set of Class Words Abbreviations and Acronyms, as exceptions beyond current enterprise guidelines
create ncna.ini file and add exceptions under respective sections, as shown below

[additional-catalog]
acronyms = LOB
class-word-abbreviations = IN, IN3, CM3, LB

positional arguments:
  target_path           Path to file with column names or folder with column name files to analyze

optional arguments:
  -h, --help            show this help message and exit
  -l, --log             This flag, if present, shows logs of execution in sys.stdout.
  --write-to-text-files, -wttf
                        This flag, if present, also creates text files of reports under their respective entity specific folder.
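The `[additional-catalog]` section shown in the help text uses standard INI syntax, so it can be checked with Python's `configparser` before running the tool. The following is a minimal sketch, assuming the values are comma-separated as in the example; the function name `read_additional_catalog` is hypothetical, and this is not necessarily how the package itself reads `ncna.ini`.

```python
import configparser


def read_additional_catalog(ini_text):
    """Parse ncna.ini-style text into lists of extra acronyms and abbreviations."""
    parser = configparser.ConfigParser()
    parser.read_string(ini_text)

    def split_values(option):
        # Assumption: values are comma-separated, as in the help text example.
        raw = parser.get("additional-catalog", option, fallback="")
        return [item.strip() for item in raw.split(",") if item.strip()]

    return {
        "acronyms": split_values("acronyms"),
        "class-word-abbreviations": split_values("class-word-abbreviations"),
    }


# Same content as the example in the help text.
EXAMPLE_INI = """
[additional-catalog]
acronyms = LOB
class-word-abbreviations = IN, IN3, CM3, LB
"""

catalog = read_additional_catalog(EXAMPLE_INI)
print(catalog)
```

A missing section or option yields an empty list rather than an error, which makes the sketch usable as a quick sanity check on a hand-written file.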

Provide a relative or absolute path to a file listing the column names to be used for the entity, one column name per line. To analyze several entities at once, provide a path to a folder of such files.
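An input file in this format can be produced with a few lines of Python. This is a sketch only: the column names are hypothetical and not checked against the naming standards, the helper `write_column_file` is not part of the package, and naming the file after the entity (here `TABLE_NEW`) is an assumption based on the report names in the listings in this section.

```python
import tempfile
from pathlib import Path


def write_column_file(path, column_names):
    """Write one column name per line and return the path written."""
    target = Path(path)
    target.write_text("\n".join(column_names) + "\n", encoding="utf-8")
    return target


# Hypothetical column names, for illustration only.
columns = ["CUST_ID", "CUST_NM", "ORDER_DT"]

# A temporary folder keeps the example from touching the working directory.
work_dir = Path(tempfile.mkdtemp())
column_file = write_column_file(work_dir / "TABLE_NEW", columns)
print(column_file.read_text(encoding="utf-8"))
```

The resulting path is what would be passed to `anv` as `target_path`.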

$ anv <PATH_TO_<ENTITY_NAME>>
$ ls reports/
ATTRIBUTE_NAMING_GUIDELINES_AND_REPORT_USAGE.html        CATALOG.xlsx                            TABLE_NEW_REPORT.xlsx
$ anv <PATH_TO_ENTITY_FOLDER>
$ ls reports/
ATTRIBUTE_NAMING_GUIDELINES_AND_REPORT_USAGE.html       ENTITY_1_REPORT.xlsx                                    ENTITY_3
CATALOG.xlsx                                            ENTITY_2                                                ENTITY_3_REPORT.xlsx
ENTITY_1                                                ENTITY_2_REPORT.xlsx
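The listings above suggest that each entity gets a report named `<ENTITY>_REPORT.xlsx` in the `reports` folder. Assuming that naming pattern holds, a short script can collect the reports per entity after a run. The helper `find_entity_reports` is hypothetical and not part of the package; the demo below uses empty placeholder files in a temporary folder rather than real reports.

```python
import tempfile
from pathlib import Path

REPORT_SUFFIX = "_REPORT.xlsx"


def find_entity_reports(reports_dir):
    """Map entity name to report path for every *_REPORT.xlsx file in a folder."""
    reports = {}
    for path in sorted(Path(reports_dir).glob("*" + REPORT_SUFFIX)):
        # Strip the suffix to recover the entity name.
        entity = path.name[: -len(REPORT_SUFFIX)]
        reports[entity] = path
    return reports


# Demo: mimic the folder listing with empty placeholder files.
demo_dir = Path(tempfile.mkdtemp())
for name in [
    "CATALOG.xlsx",
    "ATTRIBUTE_NAMING_GUIDELINES_AND_REPORT_USAGE.html",
    "ENTITY_1_REPORT.xlsx",
    "ENTITY_2_REPORT.xlsx",
    "ENTITY_3_REPORT.xlsx",
]:
    (demo_dir / name).touch()

entity_reports = find_entity_reports(demo_dir)
print(sorted(entity_reports))
```

The catalog and guidelines files are skipped because they do not end in `_REPORT.xlsx`.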

TASK LOG

See TASKLOG.md for information pertaining to the development history of the package and the next set of tasks.

