Modular mapping of METS/MODS data to EDM (Europeana Data Model), providing comprehensive transformation of METS/MODS XML structures into Pydantic-based EDM classes and properties.
Project description
mets_to_edm
mets_to_edm is a Python library for converting METS/MODS XML records into Europeana Data Model (EDM) records.
It can be used both as a library and from the command line and has a basic Mapping that should work well for most cases.
But the library also provides a flexible mapping layer to override certain parts.
Features
- Converts METS/MODS XML to EDM using edmlib
- Easily extensible: override mapping methods to customize output
- CLI and Python API
Installation
Install via Poetry (recommended):
poetry install
Or with pip (if you have all dependencies):
pip install .
Usage
As a Python Library
from mets_to_edm import MetsToEdmMapper
from lxml import etree
# Parse your METS/MODS XML file
xml_tree = etree.parse("example.xml")
# Convert to an EDM record
edmlib_record = MetsToEdmMapper.process_record(xml_tree)
# Serialize to EDM XML
edm_xml = edmlib_record.serialize()
print(edm_xml)
From the Command Line
python -m mets_to_edm example.xml "Provider Name" [--data-provider "Data Provider"]
"Provider Name": the institution name to be filled in as edm:provider (the aggregator providing the data to europeana)"Data Provider": the institution name to be filled in as edm:dataProvider (the Organisation where the data originates from). Optional as it will otherwise be extracted from the amdSec using XPath "mets:rightsMD/mets:mdWrap/mets:xmlData/dv:rights/dv:owner"
Customizing the Mapping
To change how specific fields are mapped, subclass MetsToEdmMapper and override the relevant class methods. For example, to change how titles are extracted:
You can override any method such as:
get_titlesget_descriptionsget_publishersget_typesget_languages- ...and more (see
mets_to_edm/mapper.pyfor all available hooks)
Example: Overriding the Data Provider
class MyMapper(MetsToEdmMapper):
@classmethod
def get_data_provider(cls, dmd_sec, amd_sec, default=None):
return "My Custom Data Provider"
# Usage:
# edmlib_record = MyCustomMapper.process_record(tree)
For more examples have a look at the examples directory.
Further Information
- See the source code in
mets_to_edm/mapper.pyfor all overridable methods and mapping logic. - For questions or contributions, open an issue or pull request.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mets_to_edm-0.2.3.tar.gz.
File metadata
- Download URL: mets_to_edm-0.2.3.tar.gz
- Upload date:
- Size: 11.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.0 CPython/3.12.9 Linux/6.17.0-1008-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fa416318ceb0cbc7782533168deb2c12991aebb95dfa1a412a6382d19f74c31c
|
|
| MD5 |
f0580fab70759975876a8a679dcd932b
|
|
| BLAKE2b-256 |
f005a36ba3fa93e6861b1a07e7b698e85117340f89702a879c3e4c82327ed0f7
|
File details
Details for the file mets_to_edm-0.2.3-py3-none-any.whl.
File metadata
- Download URL: mets_to_edm-0.2.3-py3-none-any.whl
- Upload date:
- Size: 11.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.0 CPython/3.12.9 Linux/6.17.0-1008-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d80633ff3914edc75f3ea066b5d3e447f493e1145aa990599243f46dda2d0483
|
|
| MD5 |
9df1cf18b0864a3ef501fa431640d52a
|
|
| BLAKE2b-256 |
b549334785d9f99e09a9d8a70b08a33ec8724f5a8956b3c498d516ee48a8fd02
|