Python library and webapp for searching standard industry and product classifiers
Python library and webapp for matching against standard industry and product classifiers. Comes with NAICS, GS1, and USEEIO built-in.
Install using pip or conda:
conda -c conda-forge -c cmutel perdu
pip install perdu
As a webapp:
As a library:
import perdu perdu.search_useeio("plastic toy")
Perdu uses whoosh as the search engine. When you first import it, Perdu will import the three built-in catalogues in around one minute.
Currently, the only possibility to upload data to the web interface is via CSV, with the first column being the item name or title, and the second (optional) column being the item description. See
perdu.test.fixtures for examples.
Adding other catalogues
See the files in
perdu.extraction for examples on how to extract data from PDFs (NAICS), XML (GS1), and JSON (USEEIO). Each search catalogue will have its own schema, but Perdu expects these schemas to have at least the columns
code (see examples in
perdu.searching). New catalogues will need to have suitable functions provided in
In addition to the default search method used in the web interface, Perdu also offers search corrections (
search_corrector_useeio) and disjunction maximization (
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
|Filename, size||File type||Python version||Upload date||Hashes|
|Filename, size perdu-0.2.1.tar.gz (624.8 kB)||File type Source||Python version None||Upload date||Hashes View|