Library to find URLs and check their validity.

Project description

# urlfinderlib

Python library for finding URLs in documents and arbitrary data and checking their validity.
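The validity rules the library applies are not described here. As a rough illustration of what a syntactic validity check can involve, the sketch below uses only the standard library and treats a URL as plausible when it has an http or https scheme and a non-empty host. The function name `looks_like_valid_url` is hypothetical and is not part of urlfinderlib; the criteria are an assumption made for this example.

```python
from urllib.parse import urlsplit


def looks_like_valid_url(url: str) -> bool:
    """Hypothetical helper: a rough syntactic check, not urlfinderlib's logic."""
    try:
        parts = urlsplit(url)
    except ValueError:
        # urlsplit raises ValueError for some malformed inputs (e.g. bad IPv6 hosts).
        return False
    # Assumption for this sketch: only http(s) URLs with a host count as valid.
    return parts.scheme in ("http", "https") and bool(parts.hostname)


if __name__ == "__main__":
    for candidate in ("http://somewebsite.com/index.html", "/images/logo.png"):
        print(candidate, looks_like_valid_url(candidate))
```

A check like this only looks at the shape of the string; it does not confirm that the host exists or that the page can be fetched.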

Basic usage

from urlfinderlib import find_urls

# Read the file as bytes and print the URLs found in it.
with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read()))

base_url usage

If you are trying to find URLs inside an HTML file, the paths in its links are likely relative to the file's location on the server hosting the HTML. In this case, pass the base_url parameter so that these “relative” URLs can be extracted as well.

from urlfinderlib import find_urls

# base_url is the address the HTML was served from; relative links are resolved against it.
with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read(), base_url='http://somewebsite.com/'))
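To make the idea of resolving relative URLs against a base concrete, here is a minimal sketch using the standard library's `urllib.parse.urljoin`. It is not urlfinderlib's implementation, and the helper name `resolve_links` and the sample paths are hypothetical; it only shows how a base URL turns relative paths into absolute URLs.

```python
from typing import Iterable, List
from urllib.parse import urljoin


def resolve_links(base_url: str, hrefs: Iterable[str]) -> List[str]:
    """Hypothetical helper: resolve each href against base_url."""
    # urljoin leaves already-absolute URLs unchanged and resolves relative ones.
    return [urljoin(base_url, href) for href in hrefs]


if __name__ == "__main__":
    links = ["index.html", "/images/logo.png", "http://other.example/page"]
    for url in resolve_links("http://somewebsite.com/", links):
        print(url)
```

Without a base URL, a path such as `/images/logo.png` carries no host, which is why a relative link on its own cannot be reported as a complete URL.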


Download files


Source Distribution

urlfinderlib-0.9.0.tar.gz (14.4 kB)

Uploaded Source

Built Distribution

urlfinderlib-0.9.0-py3-none-any.whl (10.2 kB)

Uploaded Python 3

File details

Details for the file urlfinderlib-0.9.0.tar.gz.

File metadata

  • Download URL: urlfinderlib-0.9.0.tar.gz
  • Upload date:
  • Size: 14.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.6.7

File hashes

Hashes for urlfinderlib-0.9.0.tar.gz
Algorithm Hash digest
SHA256 c231628ef0968a552388eb9aced183416c433dd4ae4261537e181ba18c1362c4
MD5 dbcaa038e3b85c2256dcf11a1a586407
BLAKE2b-256 af5b556e0161f460a3166e15cb0991a49c1952fcef282103cfea293625a695a6


File details

Details for the file urlfinderlib-0.9.0-py3-none-any.whl.

File metadata

  • Download URL: urlfinderlib-0.9.0-py3-none-any.whl
  • Upload date:
  • Size: 10.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.6.7

File hashes

Hashes for urlfinderlib-0.9.0-py3-none-any.whl
Algorithm Hash digest
SHA256 33cddce1bf581fb1b64676b2e22db98d06b9fb3a9fa9bb4194255072d7a72d42
MD5 fdc2be0ce5fcda84eb6d951ceddd4c1a
BLAKE2b-256 a32902eadedf059dfabcf43662f6c93d5516cdb3a80e198d2cd1a792acc810c0

