Skip to main content

Data standard, storage and retrieval for structured and unstructured FollowTheMoney data

Project description

Docs ftm-lakehouse on pypi PyPI Downloads PyPI - Python Version Python test and package pre-commit Coverage Status AGPLv3+ License Pydantic v2

ftm-lakehouse

ftm-lakehouse provides a data standard and archive storage for leaked data, private and public document collections. The concepts and implementations are originally inspired by mmmeta and Aleph's servicelayer archive.

ftm-lakehouse acts as a multi-tenant storage and retrieval mechanism for structured entity data, documents and their metadata. It provides a high-level interface for generating and sharing document collections and importing them into various search and analysis platforms, such as OpenALeph, ICIJ Datashare or Liquid Investigations

Installation

Requires python 3.11 or later.

pip install ftm-lakehouse

Documentation

openaleph.org/docs/lib/ftm-lakehouse

Development

This package is using poetry for packaging and dependencies management, so first install it.

Clone this repository to a local destination.

Within the repo directory, run

poetry install --with dev

This installs a few development dependencies, including pre-commit which needs to be registered:

poetry run pre-commit install

Before creating a commit, this checks for correct code formatting (isort, black) and some other useful stuff (see: .pre-commit-config.yaml)

Testing

ftm-lakehouse uses pytest as the testing framework.

make test

License and Copyright

ftm-lakehouse, (c) 2024 investigativedata.io

ftm-lakehouse, (c) 2025 Data and Research Center – DARC

ftm-lakehouse is licensed under the AGPLv3 or later license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ftm_lakehouse-0.3.0.tar.gz (80.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ftm_lakehouse-0.3.0-py3-none-any.whl (111.3 kB view details)

Uploaded Python 3

File details

Details for the file ftm_lakehouse-0.3.0.tar.gz.

File metadata

  • Download URL: ftm_lakehouse-0.3.0.tar.gz
  • Upload date:
  • Size: 80.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.3.2 CPython/3.13.5 Linux/6.12.73+deb13-amd64

File hashes

Hashes for ftm_lakehouse-0.3.0.tar.gz
Algorithm Hash digest
SHA256 6f3b05c6099937f2469b19ad439fb2c5bc1349d4e7793c29c596ccd74b275094
MD5 5f1ac04d83a6e7b6c04adfc6711a3c8f
BLAKE2b-256 c53e6742a2e6c6533b27629c0fd1e7cbfc6a49b932de5cdfe56cf1b139f248aa

See more details on using hashes here.

File details

Details for the file ftm_lakehouse-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: ftm_lakehouse-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 111.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.3.2 CPython/3.13.5 Linux/6.12.73+deb13-amd64

File hashes

Hashes for ftm_lakehouse-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f6cc75c14e75eacf2d0c7e8552e852af01618eb3f5c51fe2af7481c8d9d685d9
MD5 5455c61dbfb0bce7360f42d310260062
BLAKE2b-256 6464351e466a8aae741d150fa105e7483011b7e78e63723793c83b6c98d8dad5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page