Skip to main content

Data standard, storage and retrieval for structured and unstructured FollowTheMoney data

Project description

Docs ftm-lakehouse on pypi PyPI Downloads PyPI - Python Version Python test and package pre-commit Coverage Status AGPLv3+ License Pydantic v2

ftm-lakehouse

ftm-lakehouse provides a data standard and archive storage for leaked data, private and public document collections. The concepts and implementations are originally inspired by mmmeta and Aleph's servicelayer archive.

ftm-lakehouse acts as a multi-tenant storage and retrieval mechanism for structured entity data, documents and their metadata. It provides a high-level interface for generating and sharing document collections and importing them into various search and analysis platforms, such as OpenALeph, ICIJ Datashare or Liquid Investigations

Installation

Requires python 3.11 or later.

pip install ftm-lakehouse

Documentation

openaleph.org/lib/ftm-lakehouse

Development

This package is using poetry for packaging and dependencies management, so first install it.

Clone this repository to a local destination.

Within the repo directory, run

poetry install --with dev

This installs a few development dependencies, including pre-commit which needs to be registered:

poetry run pre-commit install

Before creating a commit, this checks for correct code formatting (isort, black) and some other useful stuff (see: .pre-commit-config.yaml)

Testing

ftm-lakehouse uses pytest as the testing framework.

make test

License and Copyright

ftm-lakehouse, (c) 2024 investigativedata.io

ftm-lakehouse, (c) 2025 Data and Research Center – DARC

ftm-lakehouse is licensed under the AGPLv3 or later license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ftm_lakehouse-0.1.1.tar.gz (44.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ftm_lakehouse-0.1.1-py3-none-any.whl (57.0 kB view details)

Uploaded Python 3

File details

Details for the file ftm_lakehouse-0.1.1.tar.gz.

File metadata

  • Download URL: ftm_lakehouse-0.1.1.tar.gz
  • Upload date:
  • Size: 44.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.13.5 Linux/6.12.57+deb13-amd64

File hashes

Hashes for ftm_lakehouse-0.1.1.tar.gz
Algorithm Hash digest
SHA256 94e7321c6f031826a4787f5fd067ef94bbd1ef949f0a9be072f7241c59b767cb
MD5 0ef94e5ffcdd300047d14bc4ae4cba97
BLAKE2b-256 7cf9401c24405aef5a7a216fb7c166691deb17563b19b8da76966baef035a7a7

See more details on using hashes here.

File details

Details for the file ftm_lakehouse-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: ftm_lakehouse-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 57.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.13.5 Linux/6.12.57+deb13-amd64

File hashes

Hashes for ftm_lakehouse-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 7306c558dd8a88616b1ceb0c4147bbf04a1a431e7bb1c1980a413c8b6c403bff
MD5 abdbd2e803431c0db7cbcf648c6bbeec
BLAKE2b-256 cace41dd39942c8d792152cd6bb9276270a3748aec8e4217686a8722b657ea5f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page