Friedrich Lindenberg

Username: pudo

45 projects

followthemoney-integrate

Last released on

followthemoney-enrich

Last released on

followthemoney

Last released on

balkhash

Last released on

Cloud storage library to store raw and structured data from different datasets in a data lake

memorious

Last released on

A minimalistic, recursive web crawling library for Python.

urlnormalizer

Last released on

Normalize URLs. Mostly useful for deduplicating HTTP URLs.

servicelayer

Last released on

Basic remote service functions for alephdata components

alephclient

Last released on

Command-line client for Aleph API

normality

Last released on

Micro-library to normalize text strings

fingerprints

Last released on

A library to generate entity fingerprints.

languagecodes

Last released on

A library that normalises language codes

pdflib

Last released on

Python bindings for Poppler

followthemoney-util

Last released on

ingestors

Last released on

Ingestors extract useful information in a structured standard format.

dataset

Last released on

Toolkit for Python-based database access.

banal

Last released on

Commons of banal micro-functions for Python.

pantomime

Last released on

MIME type normalisation and labels.

storagelayer

Last released on

Content-addressable storage for aleph and memorious

pgcsv

Last released on

CSV to Postgres data puncher.

apikit

Last released on

A set of utility functions for RESTful Flask applications.

countrynames

Last released on

A library to map country names to ISO codes.

celestial

Last released on

MIME type processing tools.

babbage

Last released on

A light-weight analytical engine for OLAP processing

exactitude

Last released on

A library with real-world data parsers.

morphium

Last released on

Tools for scrapers on morph.io

datafreeze

Last released on

Export data from a SQL database to a set of file formats.

messytables

Last released on

Parse messy tabular data in various formats

cronosparser

Last released on

Parser for CronosPro / CronosPlus database files.

krauler

Last released on

A minimalistic, recursive web crawling library for Python.

typecast

Last released on

Convert types in source data.

jsonmapping

Last released on

Map flat data to structured JSON via a mapping.

sparqlquery

Last released on

SPARQL query builder, fork of sparqlquery

metafolder

Last released on

Store a bunch of documents alongside basic metadata

extractors

Last released on

Wrapper script for data extractors.

jsongraph

Last released on

Library for data integration using a JSON/RDF object graph.

mqlparser

Last released on

Parser for MQL queries

jtssql

Last released on

Generate database tables based on JSON Table Schema

thready

Last released on

Simple wrapper for threaded execution.

fiscalmodel

Last released on

Reference data for fiscal data classification

spendb

Last released on

SpenDB

googlesheets

Last released on

Simply read and write Google Spreadsheets from Python

osvalidate

Last released on

OpenSpending Model/Data Validation

restpager

Last released on

A RESTful pager class for Flask

scrapekit

Last released on

Light-weight tools for web scraping

pynomenklatura

Last released on

Client library for nomenklatura, make record linkages on the web.
