Skip to main content

Library of web-related functions

Project description

https://github.com/scrapy/w3lib/actions/workflows/tests.yml/badge.svg Coverage report

Overview

This is a Python library of web-related functions, such as:

  • remove comments, or tags from HTML snippets

  • extract base url from HTML snippets

  • translate entites on HTML strings

  • convert raw HTTP headers to dicts and vice-versa

  • construct HTTP auth header

  • converting HTML pages to unicode

  • sanitize urls (like browsers do)

  • extract arguments from urls

Requirements

Python 3.8+

Install

pip install w3lib

Documentation

See http://w3lib.readthedocs.org/

License

The w3lib library is licensed under the BSD license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

w3lib-2.2.1.tar.gz (49.6 kB view details)

Uploaded Source

Built Distribution

w3lib-2.2.1-py3-none-any.whl (21.9 kB view details)

Uploaded Python 3

File details

Details for the file w3lib-2.2.1.tar.gz.

File metadata

  • Download URL: w3lib-2.2.1.tar.gz
  • Upload date:
  • Size: 49.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for w3lib-2.2.1.tar.gz
Algorithm Hash digest
SHA256 756ff2d94c64e41c8d7c0c59fea12a5d0bc55e33a531c7988b4a163deb9b07dd
MD5 33cb5a532eb5650ca2a65b2d05522d86
BLAKE2b-256 ccdd8d080c3bf19f4e853433193e0ffd894d9f5c5a55c11d7283038ee822a0db

See more details on using hashes here.

File details

Details for the file w3lib-2.2.1-py3-none-any.whl.

File metadata

  • Download URL: w3lib-2.2.1-py3-none-any.whl
  • Upload date:
  • Size: 21.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for w3lib-2.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 e56d81c6a6bf507d7039e0c95745ab80abd24b465eb0f248af81e3eaa46eb510
MD5 83db9d39e0eeffed829a9cdb67e15f05
BLAKE2b-256 dfd6ff9000e85b820ab36c0a93f2c8a4b334a80821b631a56c252aed2d0bd2d3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page