Financial crime domain data validation and normalization library.

rigour

Data cleaning and validation functions for processing various types of text emanating from and describing the business world. This covers human and company names, language, territory and country codes, corporate and tax identifiers, etc.

The underlying idea is that handling these sorts of descriptors looks easy at first glance, but reveals a dizzying amount of complexity when carried into production. This is why rigour consolidates implementations that have already met some edge cases and are well-tested.
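
As a rough illustration of that point, here is a minimal sketch of how a first-glance name comparison breaks on ordinary input. It uses only the Python standard library and is not rigour's API; `naive_key` and `careful_key` are hypothetical helpers invented for this example, and a production implementation would need to handle far more cases.

```python
import unicodedata


def naive_key(name: str) -> str:
    """First-glance approach: trim and lowercase (hypothetical helper)."""
    return name.strip().lower()


def careful_key(name: str) -> str:
    """Hypothetical helper: strip accents, fold case, collapse whitespace."""
    # NFKD splits accented letters into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFKD", name)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # casefold() is more aggressive than lower(); "ß" becomes "ss".
    return " ".join(stripped.casefold().split())


a = "Société  Générale"  # accents and a doubled space
b = "SOCIETE GENERALE"

print(naive_key(a) == naive_key(b))      # False
print(careful_key(a) == careful_key(b))  # True
```

Even this version still ignores transliteration between scripts, legal form suffixes and name part ordering, which is the kind of complexity the paragraph above refers to.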

Installing rigour

You can grab the latest release from PyPI:

pip install -U rigour
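
To confirm which release ended up in the current environment, a small check along these lines may help. It relies only on the standard library's `importlib.metadata`; `installed_version` is a hypothetical helper name, not something rigour provides.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# Prints the version string if rigour is installed, otherwise None.
print(installed_version("rigour"))
```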

Usage & documentation

See: https://rigour.followthemoney.tech/

Acknowledgements

The address formatting database contained in rigour/data/addresses/formats.yml is derived from worldwide.yml in the OpenCageData address-formatting repository. It is used to format addresses according to the customs of the country being encoded.

rigour consolidates a set of older Python libraries into a single codebase: languagecodes, pantomime, fingerprints. The development of these libraries was funded by OCCRP as part of the Aleph software project.

License

MIT. See LICENSE.

Project details


This version

1.7.3

Download files

Source Distributions

No source distribution files available for this release.

Built Distribution

rigour-1.7.3-py3-none-any.whl (4.1 MB)

Uploaded Python 3

File details

Details for the file rigour-1.7.3-py3-none-any.whl.

File metadata

  • Download URL: rigour-1.7.3-py3-none-any.whl
  • Upload date:
  • Size: 4.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for rigour-1.7.3-py3-none-any.whl

Algorithm    Hash digest
SHA256       549972a30c1f7fba40bf5f99d9c4aa0190b71b605e263c34b4eb54655aa846aa
MD5          8680f6cf7e4d8ea972d5aefb216f2342
BLAKE2b-256  3d162c4d153be2a90ee0d8296756884229c43f26b3c36a3a26d15171439a89ff
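
The published digests can be used to check a manually downloaded wheel. The following is a minimal sketch using the standard library's `hashlib`; the helper names and the assumption that the wheel sits in the current working directory are illustrative only.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for rigour-1.7.3-py3-none-any.whl.
EXPECTED_SHA256 = "549972a30c1f7fba40bf5f99d9c4aa0190b71b605e263c34b4eb54655aa846aa"


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large files are not read into memory at once."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: Path, expected: str = EXPECTED_SHA256) -> bool:
    """Compare the file's digest with the expected hex string."""
    return sha256_of(path) == expected.lower()


# Assumed download location; adjust to wherever the wheel was saved.
wheel = Path("rigour-1.7.3-py3-none-any.whl")
if wheel.exists():
    print("OK" if matches_published_hash(wheel) else "MISMATCH")
```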

Provenance

The following attestation bundles were made for rigour-1.7.3-py3-none-any.whl:

Publisher: build.yml on opensanctions/rigour

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
