Skip to main content

Library to find URLs and check their validity.

Project description

urlfinderlib

Python library for finding URLs in documents and arbitrary data and checking their validity.

Basic usage

from urlfinderlib import find_urls

with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read())

base_url usage

If you are trying to find URLs inside of an HTML file, the paths in the URLs are likely relative to their location on the server hosting the HTML. You can use the base_url parameter in this case to extract these "relative" URLs.

from urlfinderlib import find_urls

with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read(), base_url='http://somewebsite.com/')

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

urlfinderlib-0.11.7.tar.gz (10.5 kB view details)

Uploaded Source

Built Distribution

urlfinderlib-0.11.7-py3-none-any.whl (14.0 kB view details)

Uploaded Python 3

File details

Details for the file urlfinderlib-0.11.7.tar.gz.

File metadata

  • Download URL: urlfinderlib-0.11.7.tar.gz
  • Upload date:
  • Size: 10.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7

File hashes

Hashes for urlfinderlib-0.11.7.tar.gz
Algorithm Hash digest
SHA256 15149e93000656400eb5870496e2577a84abde66ccd58c0c4fea0391700330e7
MD5 288db4c2ddd874f20abf117c0b111a2e
BLAKE2b-256 a25e26368c484d08b2aa39086d814673336624fa3b8315a316a371bd5afcded4

See more details on using hashes here.

File details

Details for the file urlfinderlib-0.11.7-py3-none-any.whl.

File metadata

  • Download URL: urlfinderlib-0.11.7-py3-none-any.whl
  • Upload date:
  • Size: 14.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7

File hashes

Hashes for urlfinderlib-0.11.7-py3-none-any.whl
Algorithm Hash digest
SHA256 72c267c885e0b97aa5ac8f8ef4577255d1769542979d9f34b783742b46556ad8
MD5 f6083fe463dcf5a11ade8238977236d7
BLAKE2b-256 c7804657bcbf8111152827f4c4ae9400f5a5355b153879c0f4f855d6e30c0a8a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page