An XML Schema validator and decoder
Project description
This is an implementation of XML Schema for Python (supports versions 2.7 and Python 3.3+).
This library arises from the needs of a solid Python layer for processing XML Schema based files for MaX (Materials design at the Exascale) European project. A significant problem is the encoding and the decoding of the XML data files produced by different simulation software. Another important requirement is the XML data validation, in order to put the produced data under control. The lack of a suitable alternative for Python in the schema-based decoding of XML data has led to build this library. Obviously this library can be useful for other cases related to XML Schema based processing, not only for the original scope.
For more information read the xmlschema documentation on readthedocs.
Features
The xmlschema library includes the following features:
Full XSD 1.0 support
Building of XML schema objects from XSD files
Validation of XML instances against XSD schemas
Decoding of XML data into Python data structures
An XPath based API for finding schema’s elements and attributes
Support of XSD validation modes
Installation
You can install the library with pip in a Python 2.7 or Python 3.3+ environment:
pip install xmlschema
The library uses the Python’s ElementTree XML library and doesn’t require additional packages. The library includes also the schemas of the XML Schema standards for working offline and to speed-up the building of schema instances.
Usage
Import the library and then create a schema instance using the path of the file containing the schema as argument:
>>> import xmlschema
>>> my_schema = xmlschema.XMLSchema('xmlschema/tests/cases/examples/vehicles/vehicles.xsd')
The schema can be used to validate XML documents:
>>> my_schema.is_valid('xmlschema/tests/cases/examples/vehicles/vehicles.xml')
True
>>> my_schema.is_valid('xmlschema/tests/cases/examples/vehicles/vehicles-1_error.xml')
False
>>> my_schema.validate('xmlschema/tests/cases/examples/vehicles/vehicles-1_error.xml')
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/home/brunato/Development/projects/xmlschema/xmlschema/validators/xsdbase.py", line 393, in validate
raise error
xmlschema.validators.exceptions.XMLSchemaValidationError: failed validating <Element '{http://example.com/vehicles}cars' at 0x7f8032768458> with XsdGroup(model='sequence').
Reason: character data between child elements not allowed!
Schema:
<xs:sequence xmlns:xs="http://www.w3.org/2001/XMLSchema">
<xs:element maxOccurs="unbounded" minOccurs="0" name="car" type="vh:vehicleType" />
</xs:sequence>
Instance:
<vh:cars xmlns:vh="http://example.com/vehicles">
NOT ALLOWED CHARACTER DATA
<vh:car make="Porsche" model="911" />
<vh:car make="Porsche" model="911" />
</vh:cars>
Using a schema you can also decode the XML documents to nested dictionaries, with values that match to the data types declared by the schema:
>>> import xmlschema
>>> from pprint import pprint
>>> xs = xmlschema.XMLSchema('xmlschema/tests/cases/examples/collection/collection.xsd')
>>> pprint(xs.to_dict('xmlschema/tests/cases/examples/collection/collection.xml'))
{'@xsi:schemaLocation': 'http://example.com/ns/collection collection.xsd',
'object': [{'@available': True,
'@id': 'b0836217462',
'author': {'@id': 'PAR',
'born': '1841-02-25',
'dead': '1919-12-03',
'name': 'Pierre-Auguste Renoir',
'qualification': 'painter'},
'estimation': Decimal('10000.00'),
'position': 1,
'title': 'The Umbrellas',
'year': '1886'},
{'@available': True,
'@id': 'b0836217463',
'author': {'@id': 'JM',
'born': '1893-04-20',
'dead': '1983-12-25',
'name': 'Joan Miró',
'qualification': 'painter, sculptor and ceramicist'},
'position': 2,
'title': None,
'year': '1925'}]}
License
This software is distributed under the terms of the MIT License. See the file ‘LICENSE’ in the root directory of the present distribution, or http://opensource.org/licenses/MIT.
Roadmap
Validated XML data encoding
XSD 1.1
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for xmlschema-0.9.23-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5966b5acf38aeed30bcd26d3965585154a6ad22d3d10141c3dd6a32906a2e385 |
|
MD5 | ab4457ba796d07eacee4b9ab0f7e12c7 |
|
BLAKE2b-256 | 7bca87aa08b2b4afce34cb397635a87d5b2475a41e317796a1856f7209a09f52 |