Skip to main content

Wrapper for ``lxml`` trees which serializes to string upon iteration.

Project description

This package provides a wrapper for lxml trees which serializes to string on iteration, but otherwise makes the tree available in an attribute.

The primary for this is WSGI middleware which may avoid needless XML parsing and serialization.

Usage

It’s recommend to use the lazy decorator on your application method. This allows you to return an lxml tree object, which is then automatically turned into an XMLSerializer.

>>> from repoze.xmliter import lazy
>>> @lazy
... def application(environ, start_response)
...     return some_lxml_tree

You may provide a serializer function, which will be used when the XMLSerializer is eventually iterated over (i.e. when the response is rendered):

>>> @lazy(serializer=lxml.html.tostring)
... def application(environ, start_response)
...     return some_lxml_tree

Middleware can use isinstance to test if the result is an XML iterable:

>>> from repoze.xmliter.serializer import XMLSerializer
>>> isinstance(result, XMLSerializer)

In this case, the middleware can simply access the tree attribute of the result.

There are two convenience methods which can be used to parse a WSGI iterable of strings and build an XMLSerializer object, but avoids re-building the serializer if the input iterable is already an instance of XMLSerializer:

>>> from repoze.xmliter.utils import getXMLSerializer
>>> result = getXMLSerializer(result)

Or, if you are parsing HTML:

>>> from repoze.xmliter.utils import getHTMLSerializer
>>> result = getHTMLSerializer(result)

If result is not an XMLSerializer, it will be parsed using a feed parser, turned into an lxml tree, and wrapped up in an XMLSerializer, which is returned.

Changelog

0.6.1 (2022-01-14)

  • Fixed tests with lxml 4.7.1 or higher. Fixes issue 8. [maurits]

0.6 - 2014-09-21

  • Python 3 compatibility [Lennart Regebro]

0.5 - 2012-01-25

  • Add __len__ to serializer to help WSGI servers. [Laurence]

  • Serializer should iter the entire string in one go. [Laurence]

0.4 - 2011-06-16

  • Ensure trailing space is removed when replacing doctype with empty string. [Laurence]

0.3 - 2011-06-03

  • Add doctype option to replace doctype on serialization. [Laurence]

0.2 - 2010-09-11

  • Use document encoding by default. (This fixes test failure on Ubuntu 10.04.) [Laurence]

  • Defer to xsl:output settings when serializing an XSLResultTree. [Laurence]

  • Turn off pretty printing by default for HTML to avoid affecting rendering on the browser. [Laurence]

0.1 - 2010-04-21

  • Initial release

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

repoze.xmliter-0.6.1.tar.gz (12.5 kB view details)

Uploaded Source

File details

Details for the file repoze.xmliter-0.6.1.tar.gz.

File metadata

  • Download URL: repoze.xmliter-0.6.1.tar.gz
  • Upload date:
  • Size: 12.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.18.4 setuptools/42.0.2 requests-toolbelt/0.8.0 tqdm/4.19.6 CPython/2.7.17

File hashes

Hashes for repoze.xmliter-0.6.1.tar.gz
Algorithm Hash digest
SHA256 3682ac26dc38ea21b73eb877d2e612b6c671275ffe3fcf6ed15317b209f97cb2
MD5 b1ba94347c7c4ecdd0c3092ee25a5458
BLAKE2b-256 60703a0e82929bfe771248b628986202d983f372ad6819d05ea71755b228a145

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page