Skip to main content

Financial crime domain data validation and normalization library.

Project description

rigour

Data cleaning and validation functions for processing various types of text emanating and describing the business world. This applies to human and company names, language, territory and country codes, corporate and tax identifiers, etc.

The underlying idea is that handling these sorts of descriptors is easy on first glance, but reveals a dizzying set of complexity when carried into production. This is why rigour consolidates implementations that have already met some edge cases and are well-tested.

Installing rigour

You can grab the latest release from PyPI:

pip install -U rigour

Usage & documentation

See: https://rigour.followthemoney.tech/

Acknowledgements

The address formatting database contained in rigour/data/addresses/formats.yml is derived from worldwide.yml in the OpenCageData address-formatting repository. It is used to format addresses according to customs in the country that is been encoded.

rigour consolidates and includes a set of older Python libraries into a single codebase: languagecodes, pantomime, fingerprints. The development of these libraries was funded by OCCRP as part of the Aleph software project.

License

MIT. See LICENSE.

Project details


Release history Release notifications | RSS feed

This version

1.8.0

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rigour-1.8.0-py3-none-any.whl (4.1 MB view details)

Uploaded Python 3

File details

Details for the file rigour-1.8.0-py3-none-any.whl.

File metadata

  • Download URL: rigour-1.8.0-py3-none-any.whl
  • Upload date:
  • Size: 4.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rigour-1.8.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4511466e46e83a67c54caafc70bdfef4e38e2f5456e112f75110e1a601e2b5d5
MD5 7c893142e95e81844cd4ed59a4458cbe
BLAKE2b-256 1085b428c15f71196a17b6441fe5c467bc8ba4fcdfa14c534a603048fc9eec11

See more details on using hashes here.

Provenance

The following attestation bundles were made for rigour-1.8.0-py3-none-any.whl:

Publisher: build.yml on opensanctions/rigour

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page