Skip to main content

Big dada: Download data from lots of open data platforms.

Project description

This goes through all the datasets I know about.

for dataset in pluplusch():
    print(dataset)

You can tune it a bit with the parameters.

pluplusch(catalogs = ['http://data.enseignementsup-recherche.gouv.fr'], proxies = {},
          cache_dir = '/lockers/tlevine_vol/dadawarehouse.thomaslevine.com/big/pluplusch')

Add these test cases:

https://data.cityofnewyork.us, niuh-hrin

If you want to save the data catalog metadata every so often, you might write a crontab that looks like this.

@weekly pluplusch

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pluplusch-0.0.4.tar.gz (5.9 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page