extract urls from html files
Project description
xurl
extract links (href data) from html files/web pages.
Installation
pip install xurl
Options
run the xurl -h
or xurl --help
for options
-a = append an URL to start of the links
-c = contain text (REGEX)
-C = not contain text (REGEX)
-q = quiet mode (do not print Errors/Warnings/Infos)
-v = version
Usages
xurl https://example.com
and same for the files
xurl path/to/file
search using regex
xurl https://example.com -c "section\-[1-10].*.[pdf|xlsx]"
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
xurl-0.0.5.tar.gz
(3.4 kB
view details)
Built Distribution
xurl-0.0.5-py3-none-any.whl
(3.6 kB
view details)
File details
Details for the file xurl-0.0.5.tar.gz
.
File metadata
- Download URL: xurl-0.0.5.tar.gz
- Upload date:
- Size: 3.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.14
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 592f2c8e5567931424af630cbfdcf932b3037c7e687fad5cee72c78c8ea9d01d |
|
MD5 | 67b263233b4d1e61dd0e5b12d557e584 |
|
BLAKE2b-256 | 6b343b915dc6a71c6ac4d9f64c1a4362d5dfe3649ee0a65f50d571f53c450148 |
File details
Details for the file xurl-0.0.5-py3-none-any.whl
.
File metadata
- Download URL: xurl-0.0.5-py3-none-any.whl
- Upload date:
- Size: 3.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.14
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 58afb0b93a6cff63e64b03d8a2cb39b8b0059005022933528711949ee303cda9 |
|
MD5 | 623a3b4105f8fcff7261bb7a82a183ff |
|
BLAKE2b-256 | f43592875ab6d4b7f5d78716c5ae1216df1d2c3100879ade1423f3778bc44777 |