17 projects
banal
Commons of banal micro-functions for Python.
ingestors
Ingestors extract useful information in a structured standard format.
alephclient
Command-line client for Aleph API
memorious
A minimalistic, recursive web crawling library for Python.
pdflib
python bindings for poppler
normality
Micro-library to normalize text strings
storagelayer
Content-addressable storage for aleph and memorious
exactitude
A library with real-world data parsers.
qt5reactor
Twisted Qt Integration
fingerprints
A library to generate entity fingerprints.
urlnormalizer
Normalize URLs. Mostly useful for deduplicating HTTP URLs.
countrynames
A library to map country names to ISO codes.
cronosparser
Parser for CronosPro / CronosPlus database files.
typecast
Convert types in source data.
localshare
localshare: A commandline utility to share files over local network.
qt5reactor-fork
Twisted Qt Integration