Skip to main content

Library to find URLs and check their validity.

Project description

urlfinderlib

Python library for finding URLs in documents and arbitrary data and checking their validity.

Basic usage

from urlfinderlib import find_urls

with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read())

base_url usage

If you are trying to find URLs inside of an HTML file, the paths in the URLs are likely relative to their location on the server hosting the HTML. You can use the base_url parameter in this case to extract these "relative" URLs.

from urlfinderlib import find_urls

with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read(), base_url='http://somewebsite.com/')

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

urlfinderlib-0.11.6.tar.gz (10.4 kB view details)

Uploaded Source

Built Distribution

urlfinderlib-0.11.6-py3-none-any.whl (13.9 kB view details)

Uploaded Python 3

File details

Details for the file urlfinderlib-0.11.6.tar.gz.

File metadata

  • Download URL: urlfinderlib-0.11.6.tar.gz
  • Upload date:
  • Size: 10.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7

File hashes

Hashes for urlfinderlib-0.11.6.tar.gz
Algorithm Hash digest
SHA256 9841cd089b6492779c8edeb8da11bf62bd39336539444b94d41049c23c2a8582
MD5 a3b9baa6b9a3bd9dd18307ca5e91f85a
BLAKE2b-256 1adeec16805b222f54bfc2b08cae9963302137cf0dedf02e4d1b267e6835bd5f

See more details on using hashes here.

File details

Details for the file urlfinderlib-0.11.6-py3-none-any.whl.

File metadata

  • Download URL: urlfinderlib-0.11.6-py3-none-any.whl
  • Upload date:
  • Size: 13.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7

File hashes

Hashes for urlfinderlib-0.11.6-py3-none-any.whl
Algorithm Hash digest
SHA256 ae03a0e0b13e8cad33e6f6d04444231d3c72ff1774728e6c711749622745a92d
MD5 e29d67e2532edc6e57941f4bd48e84ae
BLAKE2b-256 856822119693084b807a0ce34eec07f30fc2087931997d068f64cf7614fe9e22

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page