Skip to main content

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

Project description

https://img.shields.io/travis/scrapy/parsel.svg https://img.shields.io/pypi/v/parsel.svg

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

Features

  • Extract text using CSS or XPath selectors

  • Regular expression helper methods

Example:

>>> from parsel import Selector
>>> sel = Selector(text=u"""<html>
        <body>
            <h1>Hello, Parsel!</h1>
            <ul>
                <li><a href="http://example.com">Link 1</a></li>
                <li><a href="http://scrapy.org">Link 2</a></li>
            </ul
        </body>
        </html>""")
>>>
>>> sel.css('h1::text').extract_first()
u'Hello, Parsel!'
>>>
>>> sel.css('h1::text').re('\w+')
[u'Hello', u'Parsel']
>>>
>>> for e in sel.css('ul > li'):
        print(e.xpath('.//a/@href')).extract_first()
http://example.com
http://scrapy.org

History

0.9.1 (2015-07-30)

  • Setup Sphinx build and docs structure

  • Build universal wheels

  • Rename some leftovers from package extraction

0.9.0 (2015-07-30)

  • First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parsel-0.9.1.tar.gz (26.5 kB view details)

Uploaded Source

Built Distribution

parsel-0.9.1-py2.py3-none-any.whl (7.7 kB view details)

Uploaded Python 2Python 3

File details

Details for the file parsel-0.9.1.tar.gz.

File metadata

  • Download URL: parsel-0.9.1.tar.gz
  • Upload date:
  • Size: 26.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for parsel-0.9.1.tar.gz
Algorithm Hash digest
SHA256 e8d840e60372078a15362662db4e00913246d485830ba55e700d022ab636a114
MD5 2cd7ee30604e40e2199057d570558224
BLAKE2b-256 09c948fe7a8231724e3978828fa7ab0822bb890084240a23e510050ca7722f44

See more details on using hashes here.

File details

Details for the file parsel-0.9.1-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for parsel-0.9.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 227fd50aeb2177cde6efffcc6e79c80ccd9edcfafc0ea227a44f21b6454d2db8
MD5 e1f390ae10a814d25f8f28a3fa56163a
BLAKE2b-256 0b99ac360aa55a0f6ee32d06cbab1f44893f8dc76965f820525591ddb4b36c58

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page