Skip to main content

A Scythe for harvesting OAI-PMH repositories.

Project description

oaipmh-scythe: A Scythe for harvesting OAI-PMH repositories.

Welcome to oaipmh-scythe, an updated and modernized version of the original sickle, now with additional features and ongoing maintenance.

CI pre-commit.ci status ci coverage codeql
Docs docs
Package pypi-version pypi-python-versions all-downloads monthly-downloads
Meta doi OpenSSF Scorecard hatch ruff mypy License

oaipmh-scythe is a lightweight OAI-PMH client library written in Python. It has been designed for conveniently retrieving data from OAI interfaces the Pythonic way:

from oaipmh_scythe import Scythe

with Scythe("https://zenodo.org/oai2d") as scythe:
  records = scythe.list_records()
  next(records)
# <Record oai:zenodo.org:4574771>

Features

  • Easy harvesting of OAI-compliant interfaces
  • Support for all six OAI verbs
  • Convenient object representations of OAI items (records, headers, sets, ...)
  • Automatic de-serialization of Dublin Core-encoded metadata payloads to Python dictionaries
  • Option for ignoring deleted items

Requirements

Python >= 3.10

oaipmh-scythe is built with:

  • httpx for issuing HTTP requests
  • lxml for parsing XML responses

Installation

You can install oaipmh-scythe via pip from PyPI:

python -m pip install oaipmh-scythe

Documentation

The documentation is made with Material for MkDocs and is hosted by GitHub Pages.

Similar Projects

There are a couple of similar projects available on PyPI and GitHub, e.g. via the topics oai-pmh and oai-pmh-client. Among them are these implementations in Python:

Project Description Last commit
sickle oaipmh-scythe is a fork of sickle last-commit
pyoai sickle was inspired by pyoai last-commit
pyoaiharvester oai-pmh harvester CLI last-commit
ddblabs-ometha oai-pmh harvester with CLI and TUI last-commit
oai-harvest uses pyoai internally last-commit
oai-pmh-harvester uses sickle internally last-commit

There are also similar projects available in Java and PHP.

Acknowledgments

This is a fork of sickle which was originally written by Mathias Loesch.

License

oaipmh-scythe is distributed under the terms of the BSD license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

invenio_oaipmh_scythe-0.13.0.tar.gz (29.1 kB view details)

Uploaded Source

Built Distribution

invenio_oaipmh_scythe-0.13.0-py3-none-any.whl (21.8 kB view details)

Uploaded Python 3

File details

Details for the file invenio_oaipmh_scythe-0.13.0.tar.gz.

File metadata

File hashes

Hashes for invenio_oaipmh_scythe-0.13.0.tar.gz
Algorithm Hash digest
SHA256 5944eb6f7d23e4011ae9fa80f0050b39e9b9596e96e9d47b07c0e8205a6804a9
MD5 5d8f12671acac9464ae9260eff959a67
BLAKE2b-256 35e31b433c9fcdd699ffda4ce01d469b1322b9effdb882864e619ce2d486325b

See more details on using hashes here.

File details

Details for the file invenio_oaipmh_scythe-0.13.0-py3-none-any.whl.

File metadata

File hashes

Hashes for invenio_oaipmh_scythe-0.13.0-py3-none-any.whl
Algorithm Hash digest
SHA256 271e8221be63aee083e89d0bdc22abffa7212cf372881b016a58b457ee21f4e3
MD5 f91f92cb69115c1677ed54a57934cfce
BLAKE2b-256 bc72d1e66dad1e94a8d97de5393ede094f00c14201e078b083744889f65a7c0b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page