Skip to main content

Modular mapping of METS/MODS data to EDM (Europeana Data Model), providing comprehensive transformation of METS/MODS XML structures into Pydantic-based EDM classes and properties.

Project description

mets_to_edm

mets_to_edm is a Python library for converting METS/MODS XML records into Europeana Data Model (EDM) records. It can be used both as a library and from the command line and has a basic Mapping that should work well for most cases. But the library also provides a flexible mapping layer to override certain parts.

Features

  • Converts METS/MODS XML to EDM using edmlib
  • Easily extensible: override mapping methods to customize output
  • CLI and Python API

Installation

Install via Poetry (recommended):

poetry install

Or with pip (if you have all dependencies):

pip install .

Usage

As a Python Library

from mets_to_edm import MetsToEdmMapper
from lxml import etree

# Parse your METS/MODS XML file
xml_tree = etree.parse("example.xml")

# Convert to an EDM record
edmlib_record = MetsToEdmMapper.process_record(xml_tree)

# Serialize to EDM XML
edm_xml = edmlib_record.serialize()
print(edm_xml)

From the Command Line

python -m mets_to_edm example.xml "Provider Name" [--data-provider "Data Provider"]
  • "Provider Name": the institution name to be filled in as edm:provider (the aggregator providing the data to europeana)
  • "Data Provider": the institution name to be filled in as edm:dataProvider (the Organisation where the data originates from). Optional as it will otherwise be extracted from the amdSec using XPath "mets:rightsMD/mets:mdWrap/mets:xmlData/dv:rights/dv:owner"

Customizing the Mapping

To change how specific fields are mapped, subclass MetsToEdmMapper and override the relevant class methods. For example, to change how titles are extracted: You can override any method such as:

  • get_titles
  • get_descriptions
  • get_publishers
  • get_types
  • get_languages
  • ...and more (see mets_to_edm/mapper.py for all available hooks)

Example: Overriding the Data Provider

class MyMapper(MetsToEdmMapper):
	@classmethod
	def get_data_provider(cls, dmd_sec, amd_sec, default=None):
		return "My Custom Data Provider"

# Usage:
# edmlib_record = MyCustomMapper.process_record(tree)

For more examples have a look at the examples directory.

Further Information

  • See the source code in mets_to_edm/mapper.py for all overridable methods and mapping logic.
  • For questions or contributions, open an issue or pull request.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mets_to_edm-0.2.2.tar.gz (11.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mets_to_edm-0.2.2-py3-none-any.whl (11.7 kB view details)

Uploaded Python 3

File details

Details for the file mets_to_edm-0.2.2.tar.gz.

File metadata

  • Download URL: mets_to_edm-0.2.2.tar.gz
  • Upload date:
  • Size: 11.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.0 CPython/3.12.9 Linux/6.14.0-1017-azure

File hashes

Hashes for mets_to_edm-0.2.2.tar.gz
Algorithm Hash digest
SHA256 c9d358583f384d387381c7c6ec072ca074e0628ae5cccd5af8529ccafc0ac403
MD5 debbc5e8ce3ac4c9d364b2a6d3cd15ce
BLAKE2b-256 0917bb294ea73b1b4c7f8570f1e8b1fb74db186808e063f06b7eed981d79ce79

See more details on using hashes here.

File details

Details for the file mets_to_edm-0.2.2-py3-none-any.whl.

File metadata

  • Download URL: mets_to_edm-0.2.2-py3-none-any.whl
  • Upload date:
  • Size: 11.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.0 CPython/3.12.9 Linux/6.14.0-1017-azure

File hashes

Hashes for mets_to_edm-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 c82a5c8a5b76edec81498b97656e123216c36685f346b641f85751ecdc17d5a0
MD5 060ab563d00e24b8ad9b039c2a9daa96
BLAKE2b-256 49e1374a5d9a494e8cedb2babc2a6dd62f043c3f39cbf19ea54ef555807d5b2d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page