Skip to main content

RDF to Python object mapper

Project description

plastron-rdf

RDF-to-Python mapping framework

How it works

An RDF resource class encapsulates a subject URI, an RDF graph, a set of inserts and deletes made to that graph, and a mapping from particular RDF predicates to Python attributes, for easy access and manipulation of the RDF graph.

The mapping is created by declaring class attributes on the RDF resource class using one of two descriptor classes: DataProperty or ObjectProperty. Use a DataProperty when all the values of its predicate will be RDF literals. Use an ObjectProperty when the values will be URIs.

Each descriptor takes one required argument, and several optional ones. The required argument predicate is the URI of the RDF predicate for this mapping. The optional arguments are:

  • required: boolean, defaults to False
  • repeatable: boolean, defaults to False. If it is False, generally it means that a resource is not valid if it has more than a single triple using the same predicate. The exception to this is for DataProperty attributes. A resource may have multiple triples using the same predicate and still be considered valid as long as each Literal value has a different language code.
  • validate: callable returning a boolean, used for additional custom validation beyond the required and repeatable built-ins.

For DataProperty attributes, you can also specify:

  • datatype: URI of the data type of this field. This has two functions:

    1. Used in conjunction with the predicate URI to select the triples to return the values of for this attribute. This allows you to reuse the same predicate URI for different attributes. Note that a DataProperty with datatype=None (the default) will only return values without a datatype. Note that because of the way RDF literals are defined, an attribute cannot have a non-None datatype and a language code.

    2. When adding values to this attribute, if they have no datatype, this datatype is added to them before they are added to the resource's graph.

from rdflib import URIRef

from plastron.rdfmapping.descriptors import DataProperty
from plastron.rdfmapping.resources import RDFResourceBase

# an RDF resource class inherits from the RDFResourceBase abstract class,
# or another subclass thereof
class Book(RDFResourceBase):
    title = DataProperty(
        predicate=URIRef('http://purl.org/dc/terms/title'),
        required=True,
    )
    author = DataProperty(
        predicate=URIRef('http://purl.org/dc/elements/1.1/creator'),
        required=True,
        repeatable=True,
    )
    publication_date = DataProperty(
        predicate=URIRef('http://purl.org/dc/terms/published'),
        datatype=URIRef('http://id.loc.gov/datatypes/edtf/EDTF'),
    )

When one of these attributes is accessed from an instance, it returns either an RDFDataProperty or an RDFObjectProperty object.

>>> book = Book()
>>> book.title
<plastron.rdfmapping.descriptors.RDFDataProperty object at ...>

Using that object, you can manipulate and query its values:

>>> book.title = 'Good Omens'  # set the title
>>> str(book.title)            # get it back as a string
'Good Omens'
>>> len(book.title)            # get the number of values for this attribute
1
>>> book.title.add(            # add a second value
...     Literal('The Nice and Accurate Prophecies of Agnes Nutter')
... )
>>> len(book.title)            # see the length change
2
>>> book.title.remove(         # remove the second value
...     Literal('The Nice and Accurate Prophecies of Agnes Nutter')
... )
>>> len(book.title)            # and the length changes back again
1
>>> book.title.clear()         # clear all the values
>>> len(book.title)
0
>>> book.title = 'Good Omens'  # back where we started

You can check the validity of the resource as a whole, or of each of its individual properties. Checking the resource as a whole returns a ValidationResultsDict object of attribute names mapped to ValidationResult objects, while checking each property returns a ValidationResult, either ValidationSuccess (which evaluates to True in a boolean context) or ValidationFailure (which evaluates to False).

>>> book.is_valid       # this is False, because we have not added any authors
False
>>> book.label.is_valid
<plastron.rdfmapping.properties.ValidationSuccess object at ...>
>>> book.author.is_valid
<plastron.rdfmapping.properties.ValidationFailure object at ...>
>>> bool(book.author.is_valid)  # bool-ifies to False
False
>>> str(book.author.is_valid)   # stringifies to a validation message
'is required'

The ValidationResultsDict is a subclass of a regular dictionary, with added methods to get the items that are successes and the items that are failures.

>>> results = book.validate()
>>> results.keys()
dict_keys(['title', 'author', 'date'])
>>> results.values()
dict_values([<plastron.rdfmapping.properties.ValidationSuccess object at ...>,
 <plastron.rdfmapping.properties.ValidationFailure object at ...>,
 <plastron.rdfmapping.properties.ValidationSuccess object at ...>])
>>> {name: str(result) for name, result in results if isinstance(result, ValidationFailure)}
{'author': 'is required'}
>>> [name for name, result in results if isinstance(result, ValidationSuccess)]
['title', 'date']

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

plastron_rdf-4.6.2.tar.gz (14.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

plastron_rdf-4.6.2-py3-none-any.whl (12.8 kB view details)

Uploaded Python 3

File details

Details for the file plastron_rdf-4.6.2.tar.gz.

File metadata

  • Download URL: plastron_rdf-4.6.2.tar.gz
  • Upload date:
  • Size: 14.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.10

File hashes

Hashes for plastron_rdf-4.6.2.tar.gz
Algorithm Hash digest
SHA256 2544055d5cbe559d51e45a14f727fe32bd94844ebf75b9536f3b6268b3067354
MD5 83406e61c5ddc19b0f88772562b40d73
BLAKE2b-256 86ce30bde6755b25d8010c4beda8a5e3a179fbd38bb5d11d3ea207fed7369c70

See more details on using hashes here.

File details

Details for the file plastron_rdf-4.6.2-py3-none-any.whl.

File metadata

  • Download URL: plastron_rdf-4.6.2-py3-none-any.whl
  • Upload date:
  • Size: 12.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.10

File hashes

Hashes for plastron_rdf-4.6.2-py3-none-any.whl
Algorithm Hash digest
SHA256 64a6be2bc52052342e0fce8086271e8839692d6fecefeea30070c89b70e5072d
MD5 c93301f4600d19715bd2d723c9f799d2
BLAKE2b-256 7744698959e250023093b3e0eed34cb842092d3ffefa4d3aceb86b1bb22ade8f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page