Skip to main content

Powerful and Pythonic XML processing library combining libxml2/libxslt with the ElementTree API.

Project description

lxml is a Pythonic, mature binding for the libxml2 and libxslt libraries. It provides safe and convenient access to these libraries using the ElementTree API.

It extends the ElementTree API significantly to offer support for XPath, RelaxNG, XML Schema, XSLT, C14N and much more.

To contact the project, go to the project home page or see our bug tracker at https://launchpad.net/lxml

In case you want to use the current in-development version of lxml, you can get it from the github repository at https://github.com/lxml/lxml . Note that this requires Cython to build the sources, see the build instructions on the project home page. To the same end, running easy_install lxml==dev will install lxml from https://github.com/lxml/lxml/tarball/master#egg=lxml-dev if you have an appropriate version of Cython installed.

After an official release of a new stable series, bug fixes may become available at https://github.com/lxml/lxml/tree/lxml-3.2 . Running easy_install lxml==3.2bugfix will install the unreleased branch state from https://github.com/lxml/lxml/tarball/lxml-3.2#egg=lxml-3.2bugfix as soon as a maintenance branch has been established. Note that this requires Cython to be installed at an appropriate version for the build.

3.2.0 (2013-04-28)

Features added

Bugs fixed

  • LP#690319: Leading whitespace could change the behaviour of the string parsing functions in lxml.html.
  • LP#599318: The string parsing functions in lxml.html are more robust in the face of uncommon HTML content like framesets or missing body tags. Patch by Stefan Seelmann.
  • LP#712941: I/O errors while trying to access files with paths that contain non-ASCII characters could raise UnicodeDecodeError instead of properly reporting the IOError.
  • LP#673205: Parsing from in-memory strings disabled network access in the default parser and made subsequent attempts to parse from a URL fail.
  • LP#971754: lxml.html.clean appends ‘nofollow’ to ‘rel’ attributes instead of overwriting the current value.
  • LP#715687: lxml.html.clean no longer discards scripts that are explicitly allowed by the user provided whitelist. Patch by Christine Koppelt.

Other changes

Release history Release notifications

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
lxml-3.2.0.tar.gz (3.3 MB) Copy SHA256 hash SHA256 Source None
lxml-3.2.0.win32-py2.6.exe (3.2 MB) Copy SHA256 hash SHA256 Windows Installer 2.6
lxml-3.2.0.win32-py2.7.exe (3.2 MB) Copy SHA256 hash SHA256 Windows Installer 2.7
lxml-3.2.0.win32-py3.2.exe (3.2 MB) Copy SHA256 hash SHA256 Windows Installer 3.2
lxml-3.2.0.win32-py3.3.exe (3.2 MB) Copy SHA256 hash SHA256 Windows Installer 3.3
lxml-3.2.0.win32-py3.4.exe (3.2 MB) Copy SHA256 hash SHA256 Windows Installer 3.4
lxml-3.2.0.win-amd64-py2.6.exe (3.4 MB) Copy SHA256 hash SHA256 Windows Installer 2.6
lxml-3.2.0.win-amd64-py2.7.exe (3.4 MB) Copy SHA256 hash SHA256 Windows Installer 2.7
lxml-3.2.0.win-amd64-py3.2.exe (3.4 MB) Copy SHA256 hash SHA256 Windows Installer 3.2
lxml-3.2.0.win-amd64-py3.3.exe (3.4 MB) Copy SHA256 hash SHA256 Windows Installer 3.3
lxml-3.2.0.win-amd64-py3.4.exe (3.4 MB) Copy SHA256 hash SHA256 Windows Installer 3.4

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page