Library to find URLs and check their validity.

Project description

urlfinderlib

Python library for finding URLs in documents and arbitrary data and checking their validity.

Basic usage

from urlfinderlib import find_urls

# Read the file as bytes and print every URL found in it.
with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read()))

base_url usage

If you are searching an HTML file for URLs, many of its links are likely to be relative to the file's location on the server that hosts it. In that case, pass the base_url parameter so that these "relative" URLs are extracted as well.

from urlfinderlib import find_urls

# base_url is the address the HTML was served from; relative links are read against it.
with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read(), base_url='http://somewebsite.com/'))
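
To illustrate what "relative to their location on the server" means, the following sketch resolves a few links against a base URL using only the standard library's urllib.parse.urljoin. It shows the general concept, not how urlfinderlib works internally; the helper name resolve_links and the sample hrefs are assumptions made up for this example.

```python
from urllib.parse import urljoin


def resolve_links(base_url, hrefs):
    """Resolve each href against base_url; absolute URLs pass through unchanged."""
    return [urljoin(base_url, href) for href in hrefs]


# Hypothetical links as they might appear in an HTML file.
base_url = "http://somewebsite.com/"
hrefs = ["/index.html", "images/logo.png", "http://example.org/page"]

resolved = resolve_links(base_url, hrefs)
for url in resolved:
    print(url)
# http://somewebsite.com/index.html
# http://somewebsite.com/images/logo.png
# http://example.org/page
```

Without a base URL, the first two links are only paths and cannot be turned into complete URLs, which is why base_url is worth supplying for HTML input.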


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

urlfinderlib-0.11.0.tar.gz (14.9 kB)

Uploaded Source

Built Distribution

urlfinderlib-0.11.0-py3-none-any.whl (13.5 kB)

Uploaded Python 3

File details

Details for the file urlfinderlib-0.11.0.tar.gz.

File metadata

  • Download URL: urlfinderlib-0.11.0.tar.gz
  • Upload date:
  • Size: 14.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.6.7

File hashes

Hashes for urlfinderlib-0.11.0.tar.gz
Algorithm    Hash digest
SHA256       df0498111ff25bb0e2d353677c9acb737839367f913f47bdad71c9cd2e0dad7e
MD5          d37e284eb858281cfa7ff67e6b178e65
BLAKE2b-256  625742cdc39c249c4b02d5a2b5373a69e12c92de7b2e0ce2d50f7429811f8459
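
A published digest can be used to check that a downloaded file is intact. The following is a minimal sketch using Python's standard hashlib module; the helper names sha256_of_file and matches_published_hash are illustrative assumptions and are not part of urlfinderlib or of any PyPI tooling.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, calling matches_published_hash with the path to a downloaded urlfinderlib-0.11.0.tar.gz and the SHA256 value listed for that file should return True if the download is unmodified.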


File details

Details for the file urlfinderlib-0.11.0-py3-none-any.whl.

File metadata

  • Download URL: urlfinderlib-0.11.0-py3-none-any.whl
  • Upload date:
  • Size: 13.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.6.7

File hashes

Hashes for urlfinderlib-0.11.0-py3-none-any.whl
Algorithm    Hash digest
SHA256       b927847987ea2924c99de7d8e5344a2557866ce958fced7ed82c64fd66739fac
MD5          9a6cc5c56929e9bc57e68dfd2b4860f7
BLAKE2b-256  f627a9fd28328b4c43ee929c4b14310459edf19fb8712cb0e9d894fc81bdfbef

