Library to find URLs and check their validity.
Project description
urlfinderlib
Python library for finding URLs in documents and arbitrary data and checking their validity.
Basic usage
from urlfinderlib import find_urls
with open('/path/to/file', 'rb') as f:
print(find_urls(f.read())
base_url usage
If you are trying to find URLs inside of an HTML file, the paths in the URLs are likely relative to their location on the server hosting the HTML. You can use the base_url parameter in this case to extract these "relative" URLs.
from urlfinderlib import find_urls
with open('/path/to/file', 'rb') as f:
print(find_urls(f.read(), base_url='http://somewebsite.com/')
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
urlfinderlib-0.11.7.tar.gz
(10.5 kB
view details)
Built Distribution
File details
Details for the file urlfinderlib-0.11.7.tar.gz
.
File metadata
- Download URL: urlfinderlib-0.11.7.tar.gz
- Upload date:
- Size: 10.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 15149e93000656400eb5870496e2577a84abde66ccd58c0c4fea0391700330e7 |
|
MD5 | 288db4c2ddd874f20abf117c0b111a2e |
|
BLAKE2b-256 | a25e26368c484d08b2aa39086d814673336624fa3b8315a316a371bd5afcded4 |
File details
Details for the file urlfinderlib-0.11.7-py3-none-any.whl
.
File metadata
- Download URL: urlfinderlib-0.11.7-py3-none-any.whl
- Upload date:
- Size: 14.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 72c267c885e0b97aa5ac8f8ef4577255d1769542979d9f34b783742b46556ad8 |
|
MD5 | f6083fe463dcf5a11ade8238977236d7 |
|
BLAKE2b-256 | c7804657bcbf8111152827f4c4ae9400f5a5355b153879c0f4f855d6e30c0a8a |