Skip to main content

Access to National Library of the Netherlands datasets

Project description

=================================================================
KB python API: Access to National Library of the Netherlands data
=================================================================

KB-Python-API is a simple API for Python, the API provides easy access to free and CC-BY-NC-ND_ datasets provided by the National Library of the Netherlands (KB).

It relies on the back-end infrastructure of the KB which consists of an SRU and OAI-MPH service. The KB Python API makes it easy to interact with historical data,
for more information on the provided datasets and data-rights take a look at the DataServices page of the KB (http://www.kb.nl/bronnen-zoekwijzers/dataservices-en-apis).

Currently only the OAI part is implemented.

OAI example
===========
>>> from kb.nl.api import oai
>>> from kb.nl.helpers import alto_to_text
>>> oai.list_sets()
['ANP', 'BYVANCK', 'DPO', 'SGD']
>>> records = oai.list_records("ANP")
>>> records.identifiers[:3]
['anp:1937:10:01:1', 'anp:1937:10:01:2', 'anp:1937:10:01:3']
>>> len(oai.resumptiontoken)
42
>>> record = oai.get(records.identifiers[0])
>>> alto_record = record.alto
>>> alto_to_text(alto_records[0]).split("\\n")[1][:27]
u' RADIO 1 van 1 Ootober 1937'
>>> image_record = record.image
>>> len(image_record)
721035

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kb-0.1.tar.gz (4.1 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page