Sprout is a common Python library of reusable components, developed at Infrae.
Sprout, while mainly XML related, does not have a functional goal as such. Its goals instead are organizational:
- Enable code reuse between projects, such as Silva and Infrae’s topicmaps implementation
- Allow us to write modern, pure Python code without external dependencies – it only depends on the Python standard library.
- Allow us to write solid code, covered by a large unit test suite. The lack of external dependencies and focus on modern code makes this easier.
While Sprout is at present mainly intended for use within Infrae, the code should be general enough for use in your own projects as well.
Sprout’s focus is currently mainly XML related. It features:
- sprout.saxext, a library that makes writing SAX-based code easier.
- htmlsubset, a system to easily create HTML-ish subsets that are secure and robust against invalid user input.
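For readers unfamiliar with SAX, the following is a minimal sketch of the kind of event-by-event code that a helper library such as sprout.saxext is meant to simplify. It uses only the Python standard library (xml.sax.saxutils.XMLGenerator), not Sprout's own API, which is not shown here. The namespace URI, the element name and the build_document function are hypothetical and chosen for illustration only.

```python
import io
from xml.sax.saxutils import XMLGenerator
from xml.sax.xmlreader import AttributesNSImpl

# Hypothetical namespace URI, used for illustration only.
NS = "http://example.com/ns"


def build_document():
    """Emit a small namespaced document as raw SAX events and return the XML text."""
    out = io.StringIO()
    gen = XMLGenerator(out, encoding="utf-8")

    gen.startDocument()
    # Prefix mappings are declared before the element that uses them ...
    gen.startPrefixMapping("ex", NS)
    attrs = AttributesNSImpl(
        {(None, "lang"): "en"},    # (namespace, local name) -> value
        {(None, "lang"): "lang"},  # (namespace, local name) -> qualified name
    )
    gen.startElementNS((NS, "greeting"), "ex:greeting", attrs)
    # Text content is passed through characters(); the generator escapes it.
    gen.characters("hello & welcome")
    gen.endElementNS((NS, "greeting"), "ex:greeting")
    # ... and released after that element has been closed.
    gen.endPrefixMapping("ex")
    gen.endDocument()
    return out.getvalue()


if __name__ == "__main__":
    print(build_document())
```

Every element needs matching start and end calls, and the namespace prefix has to be declared and released by hand. This bookkeeping is the part that a helper layer on top of SAX can take over.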
- Add two missing methods for prefix mapping on the handler, in order to follow the SAX handler protocol.
- The xmlexport handler now takes arguments and keyword arguments that it passes along to child handlers.
- Add the missing characters method to the xmlexport handler, in order to follow the SAX protocol.
- Remove deprecated silvasubset.
- Add support to register producers with adapters.
- saxify now has a validate option to verify the validity of the incoming XML before doing anything.
- Update and clean tests.
- Fix thread safety in xml importer.
- Use builtin set instead of the deprecated sets module (when available).
- Package extension as an egg.
- When title attributes are added to <a> tags, they are now recognized but nothing is done with them. Previously, the presence of a ‘title’ attribute corrupted the link.
- no changes logged.
- Added support for hex entities (e.g. &#xA0; for a non-breaking space) to html2sax.
- Added support for limiting HTML, taking safe, well-formed snippets out of larger HTML strings. Can deal with HTML that is not well-formed.
- Set sprout up as a Python package in a much simpler and less wrong way.
- sax2html handles empty attributes in HTML now, so that <option selected> gets turned into <option selected="selected">.
- In the Silva subset, accept nested <b>, <i> and <a href=""> in those.
- Allow optional filtering handler that events get passed through before import happens.
- In the Silva subset, treat \n as <br>.
- In html2sax, we’re now making sure that any tags that cannot be singletons in HTML are kept open, by adding an extra space character event.
- Initial public release.
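Two of the HTML-related entries above (hex entities and empty attributes) can be illustrated with the standard library alone. The sketch below is not Sprout's html2sax or sax2html implementation. It assumes Python 3's html.parser, and the AttributeNormalizer class and the normalize function are hypothetical names introduced for this example.

```python
from html.parser import HTMLParser
from xml.sax.saxutils import escape, quoteattr


class AttributeNormalizer(HTMLParser):
    """Re-serialize HTML so that valueless attributes get an explicit value."""

    def __init__(self):
        # convert_charrefs=True decodes character references, including
        # hexadecimal ones, before handle_data() is called.
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_starttag(self, tag, attrs):
        rendered = []
        for name, value in attrs:
            if value is None:
                # <option selected> arrives as ("selected", None);
                # repeat the name as the value, as XML requires one.
                value = name
            rendered.append(" %s=%s" % (name, quoteattr(value)))
        self.parts.append("<%s%s>" % (tag, "".join(rendered)))

    def handle_endtag(self, tag):
        self.parts.append("</%s>" % tag)

    def handle_data(self, data):
        # Re-escape &, < and > so the output stays well-formed.
        self.parts.append(escape(data))


def normalize(html):
    """Return the HTML snippet with explicit attribute values."""
    parser = AttributeNormalizer()
    parser.feed(html)
    parser.close()
    return "".join(parser.parts)


if __name__ == "__main__":
    print(normalize("<option selected>a&#xA0;b</option>"))
```

For the input shown, the start tag comes out as <option selected="selected"> and the hex entity is decoded to a real non-breaking space character in the text.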