Skip to main content

Library to find URLs and check their validity.

Project description

urlfinderlib

Python library for finding URLs in documents and arbitrary data and checking their validity.

Basic usage

from urlfinderlib import find_urls

with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read())

base_url usage

If you are trying to find URLs inside of an HTML file, the paths in the URLs are likely relative to their location on the server hosting the HTML. You can use the base_url parameter in this case to extract these "relative" URLs.

from urlfinderlib import find_urls

with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read(), base_url='http://somewebsite.com/')

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

urlfinderlib-0.11.9.tar.gz (10.5 kB view details)

Uploaded Source

Built Distribution

urlfinderlib-0.11.9-py3-none-any.whl (14.0 kB view details)

Uploaded Python 3

File details

Details for the file urlfinderlib-0.11.9.tar.gz.

File metadata

  • Download URL: urlfinderlib-0.11.9.tar.gz
  • Upload date:
  • Size: 10.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7

File hashes

Hashes for urlfinderlib-0.11.9.tar.gz
Algorithm Hash digest
SHA256 ce060253da35f283857362e4f77bb50acf9de5dd09fb3b4608a32b3df3cde9b0
MD5 e07727be54232721f370e8d30b5e0577
BLAKE2b-256 985baccc715f7d7b4202d4cecc08d595aef89de1a40b27d39354045b8306c6d9

See more details on using hashes here.

File details

Details for the file urlfinderlib-0.11.9-py3-none-any.whl.

File metadata

  • Download URL: urlfinderlib-0.11.9-py3-none-any.whl
  • Upload date:
  • Size: 14.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7

File hashes

Hashes for urlfinderlib-0.11.9-py3-none-any.whl
Algorithm Hash digest
SHA256 d19f3f274d83e8ba3d52955fb9d77fc6bb76951aea8503691b9dc05b98f3c35a
MD5 2594d09013641b0795dba4b2bc76255f
BLAKE2b-256 392f75d30d285f7e3c8957006cbdd87e6b3e4926564a462885fb09a8c6ecb1f9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page