Split International Aid Transparency Initiative (IATI) XML activity files.

Project description

Parsing a very large XML file into an in-memory DOM can consume enough memory to bring down even a large server. This streaming library is optimised to handle very large IATI activity reports and split them into smaller XML documents that a system can import individually. It can read a report either from a local file or directly from a (public) URL. Memory usage stays roughly constant regardless of the source-file size, so processing XML files of 100 MB or more is reasonable even on a resource-constrained system. The output files are written to a user-specified directory.
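The streaming approach described above can be illustrated with a minimal sketch that uses only the Python standard library. This is not iatisplit's own code or API: the function name `split_activities`, the `max_activities` parameter and the output file naming are hypothetical, chosen only for illustration. It assumes the usual IATI layout of one `iati-activities` root element containing `iati-activity` children.

```python
import os
import xml.etree.ElementTree as ET


def split_activities(source, output_dir, max_activities=100):
    """Stream `source` (a path or a binary file-like object) and write XML
    files holding at most `max_activities` activities each.

    Hypothetical helper for illustration; returns the paths written.
    """
    os.makedirs(output_dir, exist_ok=True)
    paths = []        # output files written so far
    batch = []        # completed <iati-activity> elements awaiting output
    root = None       # the source <iati-activities> element
    root_attrib = {}  # copy of the root attributes (version, etc.)

    def flush():
        # Write the current batch as a standalone document, then drop it.
        if not batch:
            return
        out_root = ET.Element("iati-activities", root_attrib)
        out_root.extend(batch)
        path = os.path.join(output_dir, "activities-%04d.xml" % (len(paths) + 1))
        ET.ElementTree(out_root).write(path, encoding="utf-8", xml_declaration=True)
        paths.append(path)
        batch.clear()

    for event, elem in ET.iterparse(source, events=("start", "end")):
        if root is None:
            # The first event is the start of the root element.
            root = elem
            root_attrib = dict(elem.attrib)
        elif event == "end" and elem.tag == "iati-activity":
            batch.append(elem)
            if len(batch) >= max_activities:
                flush()
            # Detach processed children from the source root so the
            # in-memory tree never grows beyond the current batch.
            root.clear()

    flush()  # write any remaining activities
    return paths
```

Because `iterparse` also accepts a file-like object, the same function could in principle be given the response object returned by `urllib.request.urlopen` for a public URL. Memory use in this sketch is bounded by `max_activities`, not by the size of the source file.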

Download files


Filename: iatisplit-0.2.tar.gz (7.9 kB)
File type: Source
Python version: None
