Python HTML/XML parser for simple web scraping.
What is it?
DHTMLParser is a lightweight HTML/XML parser created for one purpose - picking selected tags out of the DOM quickly and easily.
It can be very useful when you need to write your own “guerrilla” API for some webpage, or a scraper.
If you want, you can also use it to create HTML/XML documents more easily than by joining strings.
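To make the "picking selected tags" task concrete, here is a minimal, library-independent sketch. It deliberately uses only Python's standard html.parser module and is NOT DHTMLParser's API; the names TagPicker, pick and SAMPLE are made up for this illustration. It shows roughly how much hand-written state tracking the task needs without a DOM-style parser.

```python
from html.parser import HTMLParser


class TagPicker(HTMLParser):
    """Collect the text content of every element with a given tag name.

    Hypothetical helper for illustration; not part of DHTMLParser.
    """

    def __init__(self, wanted):
        super().__init__()
        self.wanted = wanted
        self.depth = 0    # > 0 while we are inside a wanted element
        self.found = []   # text of each outermost matched element

    def handle_starttag(self, tag, attrs):
        if tag == self.wanted:
            if self.depth == 0:
                self.found.append("")  # start collecting a new match
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.wanted and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth > 0:
            self.found[-1] += data


def pick(html, tag):
    """Return the stripped text of every `tag` element in `html`."""
    picker = TagPicker(tag)
    picker.feed(html)
    picker.close()
    return [text.strip() for text in picker.found]


SAMPLE = (
    "<html><body><h1>Title</h1><p>First</p>"
    "<div><p>Second <b>bold</b></p></div></body></html>"
)

print(pick(SAMPLE, "p"))  # ['First', 'Second bold']
```

A DOM-oriented parser such as DHTMLParser aims to replace this kind of event bookkeeping with direct lookups on a parsed tree; see the module documentation for its actual calls.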
Documentation
Full module documentation can be found here: http://pydhtmlparser.rtfd.org
Changelog
2.0.2
Fixed bugs in .isAlmostEqual().
2.0.1
Fixed bugs in .match().
Fixed broken links in documentation.
2.0.0
Rewritten, refactored, and split into multiple files.
Added unittest coverage of almost 100% of the code.
Added better selector methods (.wfind(), .match()).
Added Sphinx documentation.
Fixed a lot of bugs.