Library of web-related functions

Overview

This is a Python library of web-related functions, such as:

  • remove comments or tags from HTML snippets

  • extract base url from HTML snippets

  • translate entities in HTML strings

  • convert raw HTTP headers to dicts and vice-versa

  • construct HTTP auth header

  • convert HTML pages to unicode

  • sanitize urls (like browsers do)

  • extract arguments from urls

Requirements

Python 2.7 or Python 3.3+

Install

pip install w3lib

Documentation

See http://w3lib.readthedocs.org/

License

The w3lib library is licensed under the BSD license.

Download files

Download the file for your platform.

Source Distribution

w3lib-1.12.0.tar.gz (35.3 kB)

Uploaded Source

Built Distribution

w3lib-1.12.0-py2.py3-none-any.whl (14.8 kB)

Uploaded Python 2, Python 3

File details

Details for the file w3lib-1.12.0.tar.gz.

File metadata

  • Download URL: w3lib-1.12.0.tar.gz
  • Size: 35.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for w3lib-1.12.0.tar.gz
Algorithm    Hash digest
SHA256       dedaadf01aa18bc566d9043a37c3a551b4d47623fdb6f6fe75ecb95b6821d7c4
MD5          c2545617229b3fc72d59afed85b18035
BLAKE2b-256  c12ac324bf1d7b2c2de9274c13dec4e8804b41c444c4ef16885ccaf1892077d6
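
One way to use these digests is to hash the downloaded archive locally and compare the result with the published SHA256 value. The sketch below uses only the standard library hashlib module. The helper names (sha256_hexdigest, matches_expected) are made up for this example, and it assumes you have already downloaded the file to a path you know.

```python
import hashlib
import io


def sha256_hexdigest(stream, chunk_size=65536):
    """Return the SHA256 hex digest of a binary file-like object."""
    digest = hashlib.sha256()
    # Read in chunks so a large archive is never held in memory all at once.
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


# SHA256 value published above for w3lib-1.12.0.tar.gz.
EXPECTED_SHA256 = "dedaadf01aa18bc566d9043a37c3a551b4d47623fdb6f6fe75ecb95b6821d7c4"


def matches_expected(path, expected=EXPECTED_SHA256):
    """Hypothetical helper: True if the file at `path` has the expected digest."""
    with open(path, "rb") as archive:
        return sha256_hexdigest(archive) == expected


# Sanity check of the helper on in-memory data (no download needed).
abc_digest = sha256_hexdigest(io.BytesIO(b"abc"))
print(abc_digest)
```

Calling matches_expected("w3lib-1.12.0.tar.gz") on a downloaded copy should return True if the archive is intact.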

File details

Details for the file w3lib-1.12.0-py2.py3-none-any.whl.

File hashes

Hashes for w3lib-1.12.0-py2.py3-none-any.whl
Algorithm    Hash digest
SHA256       9b4569a47185cf5b7cd22d30644837a5ebe444c31076038c3caa93365a285301
MD5          40329673bc2369ff697115778cee4ac1
BLAKE2b-256  00c007693919365bf4866ab416827580334d2f0252d0f6990baa3ff312e1d229
