Project description
Library that adds a column of normalized company names to an Excel file
About the problem:
Excel files often contain misspelled or non-normalized company names. When such a file contains at least a column named organization, saving and processing its data as-is puts data integrity at risk. This library processes the Excel file and adds a new column named canonical_name.
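To make the idea concrete, here is a minimal sketch of how an organization value could be mapped to a canonical name. It is not the library's implementation: the matching technique (Python's standard difflib), the function name add_canonical_names, the cutoff value, and the use of plain dictionaries instead of a real Excel file are all assumptions made for illustration.

```python
import difflib


def add_canonical_names(rows, canonicals, cutoff=0.6):
    """Return copies of the rows with an extra 'canonical_name' key.

    Hypothetical helper: each row is a dict standing in for one Excel row
    that has an 'organization' column.
    """
    result = []
    for row in rows:
        # Compare in upper case, since the canonical names are upper case.
        name = str(row.get("organization", "")).upper().strip()
        # Assumption: fuzzy matching via difflib; the closest match wins.
        matches = difflib.get_close_matches(name, canonicals, n=1, cutoff=cutoff)
        new_row = dict(row)
        new_row["canonical_name"] = matches[0] if matches else None
        result.append(new_row)
    return result


rows = [
    {"organization": "Microsoft Technology Licensing LLC"},
    {"organization": "Micron Tecnology Inc"},
    {"organization": "Unrelated Bakery"},
]
canonicals = ["MICROSOFT TECHNOLOGY LICENSING", "MICRON TECHNOLOGY"]

normalized = add_canonical_names(rows, canonicals)
for row in normalized:
    print(row["organization"], "->", row["canonical_name"])
```

In this sketch, names that are not close enough to any canonical name get None rather than a forced match, which keeps unrelated organizations from being mislabeled.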
Technologies used in this library
How to use the library
This library can be used in two ways: by installing it locally through Poetry, or by installing it from the PyPI repository. Both ways are described below.
Observations: This document assumes that you are familiar with Poetry and Python virtual environments, and that Poetry is already installed on your machine.
First Way - Installing locally using Poetry:
1 - Run the command below to open a shell inside the virtualenv created by Poetry
poetry shell
2 - Run the command below so that Poetry installs the library locally
poetry install
Second Way - Installing using pip:
- Run the command below to install via pip
pip install normalize-companies-names
Executing the library
1 - Run the command below to see the information about the library
normalize --help
Result:
Usage: normalize [OPTIONS]
Options:
-c, --canonicals TEXT Canonicals companies names separated by comma. (e.g 'MICROSOFT TECHNOLOGY LICENSING,MICRON TECHNOLOGY,DELTA TECHNOLOGY,ELTA TECHNOLOGY') [required]
-i, --input_filepath TEXT Path to the Excel file that need to be processed. [required]
-o, --output_filepath TEXT Path to save the processed Excel file. [required]
--help Show this message and exit.
2 - Run the command below to process a file and save the processed result
normalize -c MICROSOFT,MICRON,ELTA,DELTA -i ./data/patent-records.xlsx -o ./data/
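When the canonical names contain spaces, as in the --help example, the value passed to -c has to be quoted in the shell. The sketch below shows one way to assemble the command from a Python list and print a correctly quoted command line. The helper name build_normalize_command is hypothetical; only the -c, -i and -o options come from the help output above.

```python
import shlex


def build_normalize_command(canonicals, input_filepath, output_filepath):
    """Build the argument list for the normalize CLI (hypothetical helper)."""
    return [
        "normalize",
        "-c", ",".join(canonicals),  # canonical names are comma-separated
        "-i", input_filepath,
        "-o", output_filepath,
    ]


command = build_normalize_command(
    ["MICROSOFT", "MICRON", "ELTA", "DELTA"],
    "./data/patent-records.xlsx",
    "./data/",
)

# shlex.join adds shell quoting only where it is needed.
print(shlex.join(command))
```

The resulting list can be passed as-is to subprocess.run, in which case no shell quoting is needed because each argument is delivered separately.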
Hashes for normalize_companies_names-1.0.1.tar.gz
Algorithm | Hash digest
---|---
SHA256 | 671473dbcea6fbee32217509800f1a131bb41a6e2d0a91bec835f0ba77aa3f7c
MD5 | c3d22c8bce1dddea8a642bc664cb1c59
BLAKE2b-256 | b1f3f21c91c51e48f1c111392c36aaa57c54fe687431e33835d863ac9cf09305
Hashes for normalize_companies_names-1.0.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 07c541b7fc6fc71b7194b5ad9f92a10f76985a8e2d31b3b7bc09bd80272d2e56
MD5 | 08181bbe0a4606eb99dcb3a047497a43
BLAKE2b-256 | 70ea340b6c8cf398f7fbdd7b5798c481c67246cc940e8bfb9e494e45b1b14203
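A downloaded file can be checked against the published digests with Python's standard hashlib module. The following is a minimal sketch: the function names and the local file path in the usage comment are assumptions, and the expected value is the SHA256 digest listed above for normalize_companies_names-1.0.1.tar.gz.

```python
import hashlib

# SHA256 digest published above for normalize_companies_names-1.0.1.tar.gz
EXPECTED_SHA256 = "671473dbcea6fbee32217509800f1a131bb41a6e2d0a91bec835f0ba77aa3f7c"


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Return True when the file's SHA256 digest equals the expected one."""
    return sha256_of_file(path) == expected.lower()


# Usage (assumes the archive was downloaded to the current directory):
# matches_published_hash("normalize_companies_names-1.0.1.tar.gz")
```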