Skip to main content

Data standard, storage and retrieval for structured and unstructured FollowTheMoney data

Project description

Docs ftm-lakehouse on pypi PyPI Downloads PyPI - Python Version Python test and package pre-commit Coverage Status AGPLv3+ License Pydantic v2

ftm-lakehouse

ftm-lakehouse provides a data standard and archive storage for leaked data, private and public document collections. The concepts and implementations are originally inspired by mmmeta and Aleph's servicelayer archive.

ftm-lakehouse acts as a multi-tenant storage and retrieval mechanism for structured entity data, documents and their metadata. It provides a high-level interface for generating and sharing document collections and importing them into various search and analysis platforms, such as OpenALeph, ICIJ Datashare or Liquid Investigations

Installation

Requires python 3.11 or later.

pip install ftm-lakehouse

Documentation

openaleph.org/lib/ftm-lakehouse

Development

This package is using poetry for packaging and dependencies management, so first install it.

Clone this repository to a local destination.

Within the repo directory, run

poetry install --with dev

This installs a few development dependencies, including pre-commit which needs to be registered:

poetry run pre-commit install

Before creating a commit, this checks for correct code formatting (isort, black) and some other useful stuff (see: .pre-commit-config.yaml)

Testing

ftm-lakehouse uses pytest as the testing framework.

make test

License and Copyright

ftm-lakehouse, (c) 2024 investigativedata.io

ftm-lakehouse, (c) 2025 Data and Research Center – DARC

ftm-lakehouse is licensed under the AGPLv3 or later license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ftm_lakehouse-0.1.0.tar.gz (44.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ftm_lakehouse-0.1.0-py3-none-any.whl (57.0 kB view details)

Uploaded Python 3

File details

Details for the file ftm_lakehouse-0.1.0.tar.gz.

File metadata

  • Download URL: ftm_lakehouse-0.1.0.tar.gz
  • Upload date:
  • Size: 44.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.13.5 Linux/6.12.57+deb13-amd64

File hashes

Hashes for ftm_lakehouse-0.1.0.tar.gz
Algorithm Hash digest
SHA256 ffdb71929ee9f91df0c14c47d5b9e2e110324adad142947782b862bfd7d93f54
MD5 1e8d619f46d35dce506592dd6c09ae09
BLAKE2b-256 cc384ea7eb4b3d87ed3560deb5a9555b196fe3e870f82083e2bfdeca96afe835

See more details on using hashes here.

File details

Details for the file ftm_lakehouse-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: ftm_lakehouse-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 57.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.13.5 Linux/6.12.57+deb13-amd64

File hashes

Hashes for ftm_lakehouse-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f4a97c4a0f9cf9d75f23ba049fba031fbe58d1b80d3e4ae88ece276ae5660637
MD5 eb9b1eb1b8547913e41bc9ffb3c20845
BLAKE2b-256 3c892a74db8067fe9240701eb9cfba2f8cf6818e5e8185e0a9aa5e5d49f6fa05

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page