Skip to main content

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

Project description

https://img.shields.io/travis/scrapy/parsel.svg https://img.shields.io/pypi/v/parsel.svg

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

Features

  • Extract text using CSS or XPath selectors

  • Regular expression helper methods

Example:

>>> from parsel import Selector
>>> sel = Selector(text=u"""<html>
        <body>
            <h1>Hello, Parsel!</h1>
            <ul>
                <li><a href="http://example.com">Link 1</a></li>
                <li><a href="http://scrapy.org">Link 2</a></li>
            </ul
        </body>
        </html>""")
>>>
>>> sel.css('h1::text').extract_first()
u'Hello, Parsel!'
>>>
>>> sel.css('h1::text').re('\w+')
[u'Hello', u'Parsel']
>>>
>>> for e in sel.css('ul > li'):
        print(e.xpath('.//a/@href')).extract_first()
http://example.com
http://scrapy.org

History

0.9.5 (2015-08-11)

  • Support for extending SelectorList

0.9.4 (2015-08-10)

  • Try workaround for travis-ci/dpl#253

0.9.3 (2015-08-07)

  • Add base_url argument

0.9.2 (2015-08-07)

  • Rename module unified -> selector and promoted root attribute

  • Add create_root_node function

0.9.1 (2015-08-04)

  • Setup Sphinx build and docs structure

  • Build universal wheels

  • Rename some leftovers from package extraction

0.9.0 (2015-07-30)

  • First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parsel-0.9.5.tar.gz (27.0 kB view details)

Uploaded Source

Built Distribution

parsel-0.9.5-py2.py3-none-any.whl (8.0 kB view details)

Uploaded Python 2Python 3

File details

Details for the file parsel-0.9.5.tar.gz.

File metadata

  • Download URL: parsel-0.9.5.tar.gz
  • Upload date:
  • Size: 27.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for parsel-0.9.5.tar.gz
Algorithm Hash digest
SHA256 ece2ecad5015baf1e288ed074c60ea1a81dddb4739c171938841948818109ddf
MD5 9f5817236e5f97d4f4f2122728a5969f
BLAKE2b-256 be3a61ea74a06cfa0f5774aaa9f43fbf9287edff60579da1767c1f208083487a

See more details on using hashes here.

File details

Details for the file parsel-0.9.5-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for parsel-0.9.5-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 276ca1f22c804b9e979e9e2a47353c0ac4bf6967b5462ea6f056676ae64400b9
MD5 f29211e4aa6b09583a34965f9663bcd2
BLAKE2b-256 710b40e082df9df96a6d2462b7c629cf72c2b4c48cc2c431f1e5bb1a5a6fc98d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page