Skip to main content

Common interface to multiple library catalogues and bibliographical databases

Project description

PyPI Version Tests Read the Docs

edpop-explorer

Common Python interface to multiple library catalogues and heritage databases, including a commandline tool for exploration.

About

There are many library catalogues and heritage databases nowadays that have a public API or a downloadable dataset, which makes it possible to use their data in other applications. However, despite the great and longstanding efforts to harmonize library data with standards such as Marc21, BIBFRAME and authority files such as FAST, there is still too much variety in standards to combine data from different APIs and downloadable datasets right away. edpop-explorer aims to provide a common interface to multiple catalogues to both querying and results, while keeping the original data available at all times.

edpop-explorer was created as part of the EDPOP project, an international network that stimulates innovative research on European popular print culture. In this project, a virtual research environment (VRE) is being created to gather and annotate metadata of items in popular print culture. The VRE will rely on edpop-explorer.

[1] https://edpop.wp.hum.uu.nl/

Install

Please note that edpop-explorer is still under active development and that while it might be useful, the public API is not yet stable.

edpop-explorer can easily be installed from PyPI:

# pip install edpop-explorer

The Python API will then be available for import via the edpop_explorer package. The commandline tool can be run using the edpopx command. (On Windows it may be that the edpopx command does not become available on the path. In that case you can also run it using the command python -m edpop_explorer.)

EDPOP Explorer comes with a number of pre-installed readers. Most of these readers connect to external APIs. Please take into account that there is always a chance that some readers are (temporarily) not available or that the public interfaces have changed. In the latter case, you are welcome to file an issue or create a fix.

A limited number of pre-installed readers do not work with external APIs but with pre-downloaded databases. Where possible, these databases are automatically downloaded the first time. In case of the USTC reader, an automatic download is not provided but the database file may be obtained from the project team. If this database is not available, an exception will be raised with an indication as to where to put the database file.

Basic usage

Python API

A basic search in the Heritage of the Printed Book database of CERL (HPB) looks like this:

>>> from edpop_explorer.readers import HPBReader
>>> reader = HPBReader()
>>> reader.set_query("gruninger")
>>> reader.fetch()  # Start fetching, fetch 10 at a time
>>> reader.number_of_results  # Total number of results for query
2134
>>> reader.number_fetched  # Number of results that have been fetched so far
10
>>> record = reader.records[0]  # A Record object
>>> title = record.title  # A Field object 
>>> print(title)
The book of the Mainyo-i-khard
>>> graph = record.to_graph()  # Get an rdflib graph for this record
>>> print(graph.serialize())  # Get turtle serialization
[] a edpoprec:BibliographicalRecord ;
    edpoprec:dating [ a edpoprec:Field ;
            edpoprec:originalText "1871" ] ;
    edpoprec:fromCatalog <https://dhstatic.hum.uu.nl/edpop-explorer/catalogs/hpb> ;
    edpoprec:identifier "UkWE.01.B25967" ;
    edpoprec:publicURL "http://hpb.cerl.org/record/UkWE.01.B25967" ;
    edpoprec:publisherOrPrinter [ a edpoprec:Field ;
            edpoprec:originalText "Carl Grüninger, Augustenstrasse 7 ; Messrs. Trübner and Co., 60 Paternoster Row" ] ;
    edpoprec:title [ a edpoprec:Field ;
            edpoprec:originalText "The book of the Mainyo-i-khard" ] .

Commandline tool

Start the programme from the command line using the edpopx command:

$ edpopx

To perform a search on a database, give the name of the database followed by the query you want to perform, such as:

# hpb gruninger

Before executing the query, EDPOP Explorer will show the way the query is transformed before calling the external API. In many cases, including HPB, the transformed query is exactly the same as the user-inputted query. After performing the query, you will see the number of results and a summary of the first ten results. To load more results, use the next command:

# next

The results are numbered. Use the show command to see the contents of a particular record (use showrdf to see RDF and showraw to see the original record data converted to YAML):

# show 8

To exit, type Ctrl+D or use the quit command:

# quit

Development

For development purposes, clone the repository and use the --editable option, and install the optional development dependencies too:

# pip install --editable '.[dev]'

This way, the source code will be read directly from the original directory when running the application and changes will have immediate effect.

Run unit test using pytest:

# pytest

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

edpop_explorer-0.10.1.tar.gz (46.0 kB view details)

Uploaded Source

Built Distribution

edpop_explorer-0.10.1-py3-none-any.whl (58.9 kB view details)

Uploaded Python 3

File details

Details for the file edpop_explorer-0.10.1.tar.gz.

File metadata

  • Download URL: edpop_explorer-0.10.1.tar.gz
  • Upload date:
  • Size: 46.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.12.0

File hashes

Hashes for edpop_explorer-0.10.1.tar.gz
Algorithm Hash digest
SHA256 861f8b105570b8720aad4ee7218fbe785deb91d39fd3935fed1f6155bc9dc18c
MD5 8d9ff179d3326e88a43beb26323ede2b
BLAKE2b-256 9bb1f5a65b29fc86e92659e5fd253af85ad8fd04cd057dbea11e65ad6e40b805

See more details on using hashes here.

File details

Details for the file edpop_explorer-0.10.1-py3-none-any.whl.

File metadata

File hashes

Hashes for edpop_explorer-0.10.1-py3-none-any.whl
Algorithm Hash digest
SHA256 a8e4d21c71d45e64b6d5fa1d792d6ea9ba1d8353dc0a19d09e475fa2aa2dd337
MD5 9c12f487665b55ee3ff7fcb0e7b7c5a5
BLAKE2b-256 a8a729a7a893d56fcd9556a3dac1ad95d70e8b90f380a2f500f41fcc8873a51f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page