Skip to main content

Data standard, storage and retrieval for structured and unstructured FollowTheMoney data

Project description

Docs ftm-lakehouse on pypi PyPI Downloads PyPI - Python Version Python test and package pre-commit Coverage Status AGPLv3+ License Pydantic v2

ftm-lakehouse

ftm-lakehouse provides a data standard and archive storage for leaked data, private and public document collections. The concepts and implementations are originally inspired by mmmeta and Aleph's servicelayer archive.

ftm-lakehouse acts as a multi-tenant storage and retrieval mechanism for structured entity data, documents and their metadata. It provides a high-level interface for generating and sharing document collections and importing them into various search and analysis platforms, such as OpenALeph, ICIJ Datashare or Liquid Investigations

Installation

Requires python 3.11 or later.

pip install ftm-lakehouse

Documentation

openaleph.org/docs/lib/ftm-lakehouse

Development

This package is using poetry for packaging and dependencies management, so first install it.

Clone this repository to a local destination.

Within the repo directory, run

poetry install --with dev

This installs a few development dependencies, including pre-commit which needs to be registered:

poetry run pre-commit install

Before creating a commit, this checks for correct code formatting (isort, black) and some other useful stuff (see: .pre-commit-config.yaml)

Testing

ftm-lakehouse uses pytest as the testing framework.

make test

License and Copyright

ftm-lakehouse, (c) 2024 investigativedata.io

ftm-lakehouse, (c) 2025 Data and Research Center – DARC

ftm-lakehouse is licensed under the AGPLv3 or later license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ftm_lakehouse-0.2.0.tar.gz (57.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ftm_lakehouse-0.2.0-py3-none-any.whl (77.6 kB view details)

Uploaded Python 3

File details

Details for the file ftm_lakehouse-0.2.0.tar.gz.

File metadata

  • Download URL: ftm_lakehouse-0.2.0.tar.gz
  • Upload date:
  • Size: 57.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.3.2 CPython/3.13.5 Linux/6.12.63+deb13-amd64

File hashes

Hashes for ftm_lakehouse-0.2.0.tar.gz
Algorithm Hash digest
SHA256 34c32c97e0a0226893c626b1c3d2a103d68a13b2de071158f5e19b70a41c716b
MD5 fc62810fa5e73b6cf980df1961b4edaa
BLAKE2b-256 8c798fbc52e2856663b724b97430edd54061c5f0ab08fd05d02d96099c82f525

See more details on using hashes here.

File details

Details for the file ftm_lakehouse-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: ftm_lakehouse-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 77.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.3.2 CPython/3.13.5 Linux/6.12.63+deb13-amd64

File hashes

Hashes for ftm_lakehouse-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ad3fe9ca6cc2d3d3f8f0376184da7ed143c4dfd6ff2e5c784778c0ce9473b338
MD5 5ef139f0c8138fefbf3c5c340bae1cbd
BLAKE2b-256 3ee625d4f277011ebde898b8ae61858a7638b2cee0d0355cd89febc4379cb28e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page