Skip to main content

Library to find URLs and check their validity.

Project description

urlfinderlib

Python library for finding URLs in documents and arbitrary data and checking their validity.

Basic usage

from urlfinderlib import find_urls

with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read())

base_url usage

If you are trying to find URLs inside of an HTML file, the paths in the URLs are likely relative to their location on the server hosting the HTML. You can use the base_url parameter in this case to extract these "relative" URLs.

from urlfinderlib import find_urls

with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read(), base_url='http://somewebsite.com/')

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

urlfinderlib-0.11.12.tar.gz (11.0 kB view details)

Uploaded Source

Built Distribution

urlfinderlib-0.11.12-py3-none-any.whl (14.6 kB view details)

Uploaded Python 3

File details

Details for the file urlfinderlib-0.11.12.tar.gz.

File metadata

  • Download URL: urlfinderlib-0.11.12.tar.gz
  • Upload date:
  • Size: 11.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.9

File hashes

Hashes for urlfinderlib-0.11.12.tar.gz
Algorithm Hash digest
SHA256 ee8f1bb7ffb85478fcbeaece0459bacd71a858a4ff5c3a38bd6a5c518e12f9f2
MD5 a2940251fc2063aa4eda08f140c27950
BLAKE2b-256 49cb73074df724b691ead57e0420a9ad0747b4ebba34428f7739df3660831e86

See more details on using hashes here.

File details

Details for the file urlfinderlib-0.11.12-py3-none-any.whl.

File metadata

  • Download URL: urlfinderlib-0.11.12-py3-none-any.whl
  • Upload date:
  • Size: 14.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.9

File hashes

Hashes for urlfinderlib-0.11.12-py3-none-any.whl
Algorithm Hash digest
SHA256 76499659b1d7c08779b8848b837030796569d45dd822eaa85f7cb42abdb518eb
MD5 995e576b03166ffedeb047d793581934
BLAKE2b-256 4bf45fbc1cae220bf76da42599955a5dd4a9174b8677323549831a3f9166616e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page