Skip to main content

Financial crime domain data validation and normalization library.

Project description

rigour

Data cleaning and validation functions for processing various types of text emanating and describing the business world. This applies to human and company names, language, territory and country codes, corporate and tax identifiers, etc.

The underlying idea is that handling these sorts of descriptors is easy on first glance, but reveals a dizzying set of complexity when carried into production. This is why rigour consolidates implementations that have already met some edge cases and are well-tested.

Installing rigour

You can grab the latest release from PyPI:

pip install -U rigour

Usage & documentation

See: https://rigour.followthemoney.tech/

Acknowledgements

The address formatting database contained in rigour/data/addresses/formats.yml is derived from worldwide.yml in the OpenCageData address-formatting repository. It is used to format addresses according to customs in the country that is been encoded.

rigour consolidates and includes a set of older Python libraries into a single codebase: languagecodes, pantomime, fingerprints. The development of these libraries was funded by OCCRP as part of the Aleph software project.

License

MIT. See LICENSE.

Project details


Release history Release notifications | RSS feed

This version

1.7.4

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rigour-1.7.4-py3-none-any.whl (4.1 MB view details)

Uploaded Python 3

File details

Details for the file rigour-1.7.4-py3-none-any.whl.

File metadata

  • Download URL: rigour-1.7.4-py3-none-any.whl
  • Upload date:
  • Size: 4.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for rigour-1.7.4-py3-none-any.whl
Algorithm Hash digest
SHA256 e3f6e9f27b98ee37bf45134b70c6bfa36211613117a0eb8713e6acc1632bb533
MD5 729c3c9e6653c34ad67365f2aab2b467
BLAKE2b-256 3032acbc2d7dfdd799b3a9d6100277e69b220d7ef6e34be4d93bf6ad7c16f157

See more details on using hashes here.

Provenance

The following attestation bundles were made for rigour-1.7.4-py3-none-any.whl:

Publisher: build.yml on opensanctions/rigour

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page