selectolax·PyPI

Fast HTML5 parser with CSS selectors.

Maintainers

rushter

Project description

A fast HTML5 parser with CSS selectors, built on the Modest and Lexbor engines.

Installation

From PyPI using pip:

pip install selectolax

If installation fails due to compilation errors, you may need to install Cython:

pip install selectolax[cython]

This usually happens when you try to install an outdated version of selectolax on a newer version of Python.
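
To confirm that the installation succeeded, a quick check from Python can help. The sketch below uses only the standard library; the helper name `installed_version` is made up for illustration and is not part of selectolax.

```python
from importlib import metadata, util


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    # find_spec returns None when a top-level module cannot be imported.
    if util.find_spec(package) is None:
        return None
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        # Importable, but without distribution metadata (e.g. a source checkout).
        return None


version = installed_version("selectolax")
print("selectolax is not installed" if version is None else f"selectolax {version}")
```

If the check reports that the package is missing after a failed build, the `selectolax[cython]` extra described above is the first thing to try.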

Development version from GitHub:

git clone --recursive https://github.com/rushter/selectolax
cd selectolax
pip install -r requirements_dev.txt
python setup.py install

How to compile selectolax while developing:

make clean
make dev

Basic examples

Here are some basic examples to get you started with selectolax:

Parsing HTML and extracting text:

In [1]: from selectolax.parser import HTMLParser
   ...:
   ...: html = """
   ...: <h1 id="title" data-updated="20201101">Hi there</h1>
   ...: <div class="post">Lorem Ipsum is simply dummy text of the printing and typesetting industry. </div>
   ...: <div class="post">Lorem ipsum dolor sit amet, consectetur adipiscing elit.</div>
   ...: """
   ...: tree = HTMLParser(html)

In [2]: tree.css_first('h1#title').text()
Out[2]: 'Hi there'

In [3]: tree.css_first('h1#title').attributes
Out[3]: {'id': 'title', 'data-updated': '20201101'}

In [4]: [node.text() for node in tree.css('.post')]
Out[4]:
['Lorem Ipsum is simply dummy text of the printing and typesetting industry. ',
 'Lorem ipsum dolor sit amet, consectetur adipiscing elit.']

Using advanced CSS selectors:

In [1]: from selectolax.parser import HTMLParser
   ...:
   ...: html = "<div><p id=p1><p id=p2><p id=p3><a>link</a><p id=p4><p id=p5>text<p id=p6></div>"
   ...: selector = "div > :nth-child(2n+1):not(:has(a))"

In [2]: for node in HTMLParser(html).css(selector):
   ...:     print(node.attributes, node.text(), node.tag)
   ...:     print(node.parent.tag)
   ...:     print(node.html)
   ...:
{'id': 'p1'}  p
div
<p id="p1"></p>
{'id': 'p5'} text p
div
<p id="p5">text</p>

Detailed overview

Available backends

Selectolax supports two backends: Modest and Lexbor. By default, all examples use the Modest backend. The two backends offer nearly identical features, but there are still some differences.

As of 2024, the preferred backend is Lexbor. The Modest backend is still available for compatibility reasons, but the underlying C library that it relies on is no longer maintained.

To use Lexbor, import LexborHTMLParser and use it in the same way as HTMLParser.

In [1]: from selectolax.lexbor import LexborHTMLParser

In [2]: html = """
   ...: <title>Hi there</title>
   ...: <div id="updated">2021-08-15</div>
   ...: """

In [3]: parser = LexborHTMLParser(html)
In [4]: parser.root.css_first("#updated").text()
Out[4]: '2021-08-15'

Simple Benchmark

The benchmark extracts the title, links, scripts, and a meta tag from the main pages of the top 754 domains. See examples/benchmark.py for more information.

Package	Time
Beautiful Soup (html.parser)	61.02 sec.
lxml / Beautiful Soup (lxml)	9.09 sec.
html5_parser	16.10 sec.
selectolax (Modest)	2.94 sec.
selectolax (Lexbor)	2.39 sec.
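
The relative differences are easier to read as ratios. The short sketch below takes the timings from the table above and expresses each one as a multiple of the Lexbor result; it only does arithmetic on the published numbers and does not rerun the benchmark.

```python
# Timings in seconds, copied from the benchmark table.
timings_sec = {
    "Beautiful Soup (html.parser)": 61.02,
    "lxml / Beautiful Soup (lxml)": 9.09,
    "html5_parser": 16.10,
    "selectolax (Modest)": 2.94,
    "selectolax (Lexbor)": 2.39,
}

baseline = timings_sec["selectolax (Lexbor)"]

# How many times longer each package took than selectolax (Lexbor).
slowdown = {name: round(t / baseline, 1) for name, t in timings_sec.items()}

for name, ratio in sorted(slowdown.items(), key=lambda kv: kv[1]):
    print(f"{name}: {ratio}x")
```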

License

Modest engine - LGPL2.1
selectolax - MIT

Release history

Version	Date
0.3.30 (this version)	Jun 15, 2025
0.3.29	Apr 30, 2025
0.3.28	Feb 18, 2025
0.3.27	Dec 9, 2024
0.3.26	Nov 11, 2024
0.3.25	Oct 25, 2024
0.3.24	Oct 9, 2024
0.3.23	Oct 9, 2024
0.3.21	Mar 10, 2024
0.3.20	Feb 9, 2024
0.3.19	Jan 27, 2024
0.3.18	Jan 21, 2024
0.3.17	Oct 17, 2023
0.3.16	Jul 27, 2023
0.3.15	Jul 18, 2023
0.3.14	May 25, 2023
0.3.13	Apr 29, 2023
0.3.12	Jan 1, 2023
0.3.11	Sep 21, 2022
0.3.10	Sep 19, 2022
0.3.8	Aug 4, 2022
0.3.7	Mar 21, 2022
0.3.6	Nov 17, 2021
0.3.5	Oct 7, 2021
0.3.4	Sep 21, 2021
0.3.2	Aug 28, 2021
0.3.1	Aug 22, 2021
0.2.14	Aug 12, 2021
0.2.13	Jul 10, 2021
0.2.12	Jun 15, 2021
0.2.11	Apr 16, 2021
0.2.10	Dec 19, 2020
0.2.9	Nov 1, 2020
0.2.7	Aug 15, 2020
0.2.6	Aug 4, 2020
0.2.4	Jun 19, 2020
0.2.3	Nov 16, 2019
0.2.1	Sep 1, 2019
0.2.0	Aug 31, 2019
0.1.13	Apr 30, 2019
0.1.12	Apr 5, 2019
0.1.11	Apr 1, 2019
0.1.10	Feb 13, 2019
0.1.9	Jan 10, 2019
0.1.8	Sep 30, 2018
0.1.7	Aug 28, 2018
0.1.6	May 14, 2018
0.1.4	Feb 28, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

File	Size	Uploaded	Type
selectolax-0.3.30.tar.gz	4.7 MB	Jun 15, 2025	Source

Built Distributions

File	Size	Uploaded	Python	Platform
selectolax-0.3.30-cp313-cp313-win_arm64.whl	1.7 MB	Jun 15, 2025	CPython 3.13	Windows ARM64
selectolax-0.3.30-cp313-cp313-win_amd64.whl	1.8 MB	Jun 15, 2025	CPython 3.13	Windows x86-64
selectolax-0.3.30-cp313-cp313-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl	5.9 MB	Jun 15, 2025	CPython 3.13	manylinux: glibc 2.17+ x86-64, manylinux: glibc 2.28+ x86-64
selectolax-0.3.30-cp313-cp313-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl	5.8 MB	Jun 15, 2025	CPython 3.13	manylinux: glibc 2.17+ ARM64, manylinux: glibc 2.28+ ARM64
selectolax-0.3.30-cp313-cp313-macosx_11_0_arm64.whl	2.1 MB	Jun 15, 2025	CPython 3.13	macOS 11.0+ ARM64
selectolax-0.3.30-cp313-cp313-macosx_10_13_x86_64.whl	2.1 MB	Jun 15, 2025	CPython 3.13	macOS 10.13+ x86-64
selectolax-0.3.30-cp312-cp312-win_arm64.whl	1.7 MB	Jun 15, 2025	CPython 3.12	Windows ARM64
selectolax-0.3.30-cp312-cp312-win_amd64.whl	1.8 MB	Jun 15, 2025	CPython 3.12	Windows x86-64
selectolax-0.3.30-cp312-cp312-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl	6.0 MB	Jun 15, 2025	CPython 3.12	manylinux: glibc 2.17+ x86-64, manylinux: glibc 2.28+ x86-64
selectolax-0.3.30-cp312-cp312-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl	5.9 MB	Jun 15, 2025	CPython 3.12	manylinux: glibc 2.17+ ARM64, manylinux: glibc 2.28+ ARM64
selectolax-0.3.30-cp312-cp312-macosx_11_0_arm64.whl	2.1 MB	Jun 15, 2025	CPython 3.12	macOS 11.0+ ARM64
selectolax-0.3.30-cp312-cp312-macosx_10_13_x86_64.whl	2.1 MB	Jun 15, 2025	CPython 3.12	macOS 10.13+ x86-64
selectolax-0.3.30-cp311-cp311-win_arm64.whl	1.7 MB	Jun 15, 2025	CPython 3.11	Windows ARM64
selectolax-0.3.30-cp311-cp311-win_amd64.whl	1.8 MB	Jun 15, 2025	CPython 3.11	Windows x86-64
selectolax-0.3.30-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl	5.8 MB	Jun 15, 2025	CPython 3.11	manylinux: glibc 2.17+ x86-64, manylinux: glibc 2.28+ x86-64
selectolax-0.3.30-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl	5.7 MB	Jun 15, 2025	CPython 3.11	manylinux: glibc 2.17+ ARM64, manylinux: glibc 2.28+ ARM64
selectolax-0.3.30-cp311-cp311-macosx_11_0_arm64.whl	2.1 MB	Jun 15, 2025	CPython 3.11	macOS 11.0+ ARM64
selectolax-0.3.30-cp311-cp311-macosx_10_9_x86_64.whl	2.1 MB	Jun 15, 2025	CPython 3.11	macOS 10.9+ x86-64
selectolax-0.3.30-cp310-cp310-win_arm64.whl	1.8 MB	Jun 15, 2025	CPython 3.10	Windows ARM64
selectolax-0.3.30-cp310-cp310-win_amd64.whl	1.8 MB	Jun 15, 2025	CPython 3.10	Windows x86-64
selectolax-0.3.30-cp310-cp310-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl	5.6 MB	Jun 15, 2025	CPython 3.10	manylinux: glibc 2.17+ x86-64, manylinux: glibc 2.28+ x86-64
selectolax-0.3.30-cp310-cp310-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl	5.6 MB	Jun 15, 2025	CPython 3.10	manylinux: glibc 2.17+ ARM64, manylinux: glibc 2.28+ ARM64
selectolax-0.3.30-cp310-cp310-macosx_11_0_arm64.whl	2.1 MB	Jun 15, 2025	CPython 3.10	macOS 11.0+ ARM64
selectolax-0.3.30-cp310-cp310-macosx_10_9_x86_64.whl	2.1 MB	Jun 15, 2025	CPython 3.10	macOS 10.9+ x86-64
selectolax-0.3.30-cp39-cp39-win_arm64.whl	1.8 MB	Jun 15, 2025	CPython 3.9	Windows ARM64
selectolax-0.3.30-cp39-cp39-win_amd64.whl	1.8 MB	Jun 15, 2025	CPython 3.9	Windows x86-64
selectolax-0.3.30-cp39-cp39-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl	5.6 MB	Jun 15, 2025	CPython 3.9	manylinux: glibc 2.17+ x86-64, manylinux: glibc 2.28+ x86-64
selectolax-0.3.30-cp39-cp39-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl	5.6 MB	Jun 15, 2025	CPython 3.9	manylinux: glibc 2.17+ ARM64, manylinux: glibc 2.28+ ARM64
selectolax-0.3.30-cp39-cp39-macosx_11_0_arm64.whl	2.1 MB	Jun 15, 2025	CPython 3.9	macOS 11.0+ ARM64
selectolax-0.3.30-cp39-cp39-macosx_10_9_x86_64.whl	2.1 MB	Jun 15, 2025	CPython 3.9	macOS 10.9+ x86-64
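
The wheel filenames above encode the interpreter and platform each file targets, following the standard wheel naming convention (name, version, Python tag, ABI tag, platform tag, separated by hyphens). The sketch below splits a filename into those parts; `parse_wheel_filename` here is a simplified illustration written for this page, and the `packaging` library provides a more complete parser under the same name.

```python
def parse_wheel_filename(filename):
    """Split a wheel filename into its hyphen-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file: {filename}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        name, version, python_tag, abi_tag, platform_tag = parts
        build = None
    elif len(parts) == 6:
        # An optional build tag may appear between the version and Python tag.
        name, version, build, python_tag, abi_tag, platform_tag = parts
    else:
        raise ValueError(f"unexpected wheel filename layout: {filename}")
    return {
        "name": name,
        "version": version,
        "build": build,
        "python": python_tag,
        "abi": abi_tag,
        # Several compatible platform tags can be joined with dots.
        "platforms": platform_tag.split("."),
    }


info = parse_wheel_filename(
    "selectolax-0.3.30-cp313-cp313-manylinux2014_x86_64."
    "manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl"
)
print(info["python"], info["platforms"])
```

In practice pip selects the matching wheel automatically, so this is mainly useful when downloading files by hand.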

File details

Details for the file selectolax-0.3.30.tar.gz.

File metadata

Download URL: selectolax-0.3.30.tar.gz
Upload date: Jun 15, 2025
Size: 4.7 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for selectolax-0.3.30.tar.gz
Algorithm	Hash digest
SHA256	`2a1790ae1d88adfe54845627de62390301107cad331f63e2020f90ffe5a56a4a`
MD5	`08c9696075342ec575df91f321f09c4c`
BLAKE2b-256	`e7179e9bb8fbf653689d57055d44700eec783e966b5ace00458c9acba587b736`
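
A downloaded file can be checked against the published digest before it is installed. The following is a minimal sketch using the standard library's hashlib; the function names and the local file path are assumptions made for illustration, and the expected value is the SHA256 digest listed above for the source distribution.

```python
import hashlib
import hmac

# SHA256 digest published above for selectolax-0.3.30.tar.gz.
EXPECTED_SHA256 = "2a1790ae1d88adfe54845627de62390301107cad331f63e2020f90ffe5a56a4a"


def sha256_of(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """Return True if the file's SHA256 digest equals `expected_hex`."""
    return hmac.compare_digest(sha256_of(path), expected_hex.lower())


# Example usage, assuming the archive was saved in the current directory:
# print(matches("selectolax-0.3.30.tar.gz", EXPECTED_SHA256))
```

pip can perform the same check itself when requirements are pinned with hashes, which is usually more convenient than verifying files by hand.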

Algorithm	Hash digest
SHA256	`fffd61187f92c0749cfc02d8085a24db0245e623a8329f0376e5c1d41c921c66`
MD5	`dcf54a38806669f847def901b19a14b6`
BLAKE2b-256	`6f37a6cd80a0c0f12a968f9f1bd8149c3ba12c36321b2a6de23fc5033d95f21e`

Algorithm	Hash digest
SHA256	`8b9d4546b598f2d7ad05ef38f086c10ac9e3809f432810d162ea99563b42f300`
MD5	`d9a18c6a87b4129d3ffd2433f85920cc`
BLAKE2b-256	`7c58befd700f1d7a258a5d9342349f93c91859e026455aa75694d68745a84788`

Algorithm	Hash digest
SHA256	`a32541fcccd442bba53ab11478037504e9f414fdbff7aa7eb4e3ee6b4ba38f4f`
MD5	`268d6bd004c6596d7d4e49dc187a4a41`
BLAKE2b-256	`657fc4cb5f85827129ee363ea7cdd7b5a51d94bf9f665b0e0098885d50df921a`

Algorithm	Hash digest
SHA256	`c96b41946b6ea5778b437c3e823d813e6a79644788a497e64e980c3454b021eb`
MD5	`e44656776a1eb8eb8f9f78a16e0b651f`
BLAKE2b-256	`eecaac57c0b61b7df92883734e3097de3d4d890a86f314e3a0e20e9c35ad1295`

Algorithm	Hash digest
SHA256	`5aa53b803fd22f70ee79017c22c0eb2df9dc0effcbb90db83517ca89f6357c13`
MD5	`229412e36f75daae3c004cc3c00152f5`
BLAKE2b-256	`32acbc45bbd86235c6c17c1f6299b783b51ec121deed55ebbc36ecdf71ab54d1`

Algorithm	Hash digest
SHA256	`96b119dcc5b998ea97b5e3968c6260033ebe49f9ff188f0b956d5b0a3cefd05b`
MD5	`33b59bafb86e41ded34888e344635c35`
BLAKE2b-256	`f87fb7b6b0944f8fd427c0de057252122cec204fb5f1a33a4333d701a7699205`

Algorithm	Hash digest
SHA256	`adc4a1445619150835de9277039fb874ac341c77e5ea27b68a57f12fa4d028f0`
MD5	`7073acec3db92883665c6c5a338d1651`
BLAKE2b-256	`bd5c036bf13525e4473a6c4095557ed3cb40e0c0c762dd35246db2175f440bb5`

Algorithm	Hash digest
SHA256	`ce1c9e26cd3df57962eae9df20bf6285fbbf3bd9489a5bd4811af9f499f9e71b`
MD5	`bcf5111d630d55388e2a2617386f9427`
BLAKE2b-256	`87e0ec0391626eda0925fd616e41d845dd9f782db9d44f9fc1f4b8f37e3cf1b2`

Algorithm	Hash digest
SHA256	`aac7b1401d98db999d671a4019320ed2015a51ec293a376049d0e286447983c8`
MD5	`6d047fe5764d8d014ad7bf88f00585e1`
BLAKE2b-256	`d39b6d4acbc5f7f51c890df694ff40cc468970d4667c18e1b427b5fe7045c125`

Algorithm	Hash digest
SHA256	`951293a1cd1f01042157858dcbc3d430488200a7390d58a12b108750b6ba152a`
MD5	`07673518b342566c28604e20f463670c`
BLAKE2b-256	`7bbb729bb4360ff629917f52e9a8a4a928986848c9dfde2f850b0b307c4f86c9`

Algorithm	Hash digest
SHA256	`6a6dfee6a6dff81a10709d8ce9be264cf12be1e7fd743dc24f8d5948b21db07d`
MD5	`25cf5c60084368e8b0ecd824322075c2`
BLAKE2b-256	`1c544419a2d22734dcbf4a11b2f01c1ff6a32b0a0d4c20323c9672892e562825`

Algorithm	Hash digest
SHA256	`da2f28d183c8c34324480c51debdf9407bb035d6eb4a88fda2a62d5f792d44d5`
MD5	`ce110c7cd03e9311114d063081fe2ad8`
BLAKE2b-256	`b14ddd0ff9c14742e972cec0d71574bc5a07d46691c96dd5fc33e1daa9b15b84`

Algorithm	Hash digest
SHA256	`8a43a88dd180cd28924254c8f8aa6ec3a5770e940107a93c6e3b06ee02a0e0b1`
MD5	`d13aa5c973c0abbde60e9163c281b7e3`
BLAKE2b-256	`56c5def75195270a63becd62bf343bd00f19e6f37e5025cca3e0fba6ed98d13e`

Algorithm	Hash digest
SHA256	`26c6f4c43fce72e045620d6749a63e32237f7f25e5201f3c02060071f55be77d`
MD5	`3219380875735499ef61320b17bee754`
BLAKE2b-256	`2df079128c16ede571caaab06008c2d889f6ad1d220534b78c31cefc34077078`

Algorithm	Hash digest
SHA256	`525c63a236187b63213ce6a70e66808589446799f89da254038ee6e0fb5d523b`
MD5	`09bd214c8a1fd402b069697761e5c8e2`
BLAKE2b-256	`8b87e8a1ca15ebfc2dafed42ece0c729115c716c34b44f5f5b04106b0f384911`

Algorithm	Hash digest
SHA256	`87ce96b4ded82273a8b3f25f8ac1109b3c3501c2e5e75e291b829943d2203f25`
MD5	`c499eafacd23e94533a076f566af2802`
BLAKE2b-256	`9ce450c6192a884547469c5a8918976e356e6e3d30e6096e6f2c714659068abf`

Algorithm	Hash digest
SHA256	`8bfeb8aeb13ffd7d3a225f095573fb8da85c8cea9366a9ce69df4012a4136427`
MD5	`07a248b466f0113b17364b70fb55df57`
BLAKE2b-256	`c957458e65fb7106185efcaea5dd9f1699fabf0d5fa7a19ee4cdacbda56e2558`

Algorithm	Hash digest
SHA256	`a417f316c10b545864f0f7b61f95247d57704a68327408e5614323a8e09be590`
MD5	`37f7504634602047a4d70204a5a9c85a`
BLAKE2b-256	`73367adebfbe27e752ab944b24747190b8b68dec6d222f3192fa3e90ac1617b6`

Algorithm	Hash digest
SHA256	`cda8902c963a968d955862b5a1c9505a13123cb337bb52eb7b94d8ba30c4cc6b`
MD5	`8703703968ecfd70222806ca5ee5b2d1`
BLAKE2b-256	`3907df2ab3513c8781896f2991708f533b79647436c2b7f73606bc4fd075f02f`

Algorithm	Hash digest
SHA256	`1051a302295ff215ae2b7a45045fe1f0813706d2541fe29990c0e6f4ba23d64e`
MD5	`214b1ad3aadf9386b0d4ad0cc80666e3`
BLAKE2b-256	`6c580fc7b4370739e8be20befb629a6e2d2290f0237d69c8378a09cd0d05aeb0`

Algorithm	Hash digest
SHA256	`3047500d99cd58c8bb88655c0f30db044d9636cb4cf03c0e27bef3f35b857772`
MD5	`ec8775c5adbc686443e9acd009bd8de5`
BLAKE2b-256	`3595f3a84f7e370c517e32f433ea2b9ba11485eb790c186aa4a83e2622eec253`

Algorithm	Hash digest
SHA256	`76fbc3a79b68cc933560b833020ad1f8e08c3193ac2efc24cc6aad6cd0a4b154`
MD5	`caaba79390fa56db13bb140234eb1d55`
BLAKE2b-256	`5297a16de74300063ac5677579e4a2f467104688cbd4534c57736b026ba8cd2c`

Algorithm	Hash digest
SHA256	`38e136bff97ad50d0a4663b9ba4549b359a06facb90f752b20bb5ca898652d97`
MD5	`1f07aadeb946655aeb291bb88d9153d4`
BLAKE2b-256	`7269ea8f7cc093b117dd9cda7e2f2813878270341f3ecdcaa83d227c04c5249f`

Algorithm	Hash digest
SHA256	`f94432ffafd7f70ea70b445321cb892252674b75256835ea5d0a347e7ebf94fc`
MD5	`aac503fddafb74d8e7a415cfc582cfb0`
BLAKE2b-256	`3592efdc53b8646b7186a6ae6cad5f4ae74101930c414a2908f97a03f264a0d7`

Algorithm	Hash digest
SHA256	`4a2d42164e1bba1069421ecc25c61c1b5fe271c3befe5039a436c05d425d7684`
MD5	`7f9a3075db6ea960f5fa0613a7d82308`
BLAKE2b-256	`dfb070e7bfb8a131f38d8468de68d28ba3d9048f8de903c66ac142dffe8a29fa`

Algorithm	Hash digest
SHA256	`b87de4c22459fd95e9967ef775c08d372db04c9343b1ead3575a016a35d521cb`
MD5	`f741a0e9d53a09f39646168573b04e70`
BLAKE2b-256	`efe6a18930e83b6ab288a3d9c3ecc4f533bade9159ccce600720a237ff3cf65f`

Algorithm	Hash digest
SHA256	`6571e6a2747c3038e878ed677f9569f338164af940ada9660362544ac223c619`
MD5	`46065ee5e9cd918838a10b92d4caa543`
BLAKE2b-256	`9104c894e060a87e2762edefaadf1adc986a1ccf3f3bd81ed0a74b4b5607b45f`

Algorithm	Hash digest
SHA256	`c74b1ef46a3c3f6b0f25b158c84006f1abe0b3ca3195e1202ed25799cbe83c9c`
MD5	`14d2f49390be0080791b5c34d62fdc7b`
BLAKE2b-256	`96fb4968d5acb9cf950fcbd77f51b35e866a330f89694d5e6ca58569202853ce`

Algorithm	Hash digest
SHA256	`a586481150360b2f1f9f48c34cb7a689ef16ca2915867fac3d83a5986265878a`
MD5	`2bfb66497c7223dfe66d68580955aa42`
BLAKE2b-256	`8b7d0afb470d9cc348cba74812ca33d90e01434364bbcf9ac6421f0a540bd7a7`

Algorithm	Hash digest
SHA256	`19c3376717dec83443788a1303c2c8e27c632acfa90a88d7223ec20cfa9aa7cd`
MD5	`999748517379aef51c8bf1e42afee45b`
BLAKE2b-256	`0a9b5094eb73e1d54a6bd96e6f553b726324796b7d238d114a039b91524b31ae`