Skip to main content

Library to find URLs and check their validity.

Project description

urlfinderlib

Python library for finding URLs in documents and arbitrary data and checking their validity.

Basic usage

from urlfinderlib import find_urls

with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read())

base_url usage

If you are trying to find URLs inside of an HTML file, the paths in the URLs are likely relative to their location on the server hosting the HTML. You can use the base_url parameter in this case to extract these "relative" URLs.

from urlfinderlib import find_urls

with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read(), base_url='http://somewebsite.com/')

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

urlfinderlib-0.10.0.tar.gz (14.6 kB view details)

Uploaded Source

Built Distribution

urlfinderlib-0.10.0-py3-none-any.whl (13.2 kB view details)

Uploaded Python 3

File details

Details for the file urlfinderlib-0.10.0.tar.gz.

File metadata

  • Download URL: urlfinderlib-0.10.0.tar.gz
  • Upload date:
  • Size: 14.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.6.7

File hashes

Hashes for urlfinderlib-0.10.0.tar.gz
Algorithm Hash digest
SHA256 d1d3cd0b15e92e6bf4034ff4359ef87c4bf23514c380608b0355f2373d5576b4
MD5 77a0290000d92125a0d8dae826d81518
BLAKE2b-256 f84cb036234827539b32de08c6611fa86a668b212a7557e88d07f2817bea3a1d

See more details on using hashes here.

File details

Details for the file urlfinderlib-0.10.0-py3-none-any.whl.

File metadata

  • Download URL: urlfinderlib-0.10.0-py3-none-any.whl
  • Upload date:
  • Size: 13.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.6.7

File hashes

Hashes for urlfinderlib-0.10.0-py3-none-any.whl
Algorithm Hash digest
SHA256 71563975267c93332cdbf75e313a80cc2a3570d3c97a5288da29b44a674f8b0c
MD5 e19f54df3ce4536c94a53c19251a6285
BLAKE2b-256 4742f3f821e4eb09c579720c09c7f5387d73b5dcd73aeb43d2dc64c208748d12

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page