extract urls from html files
Project description
xurl
extract links (href data) from html files/web pages.
Installation
pip install xurl
Options
run the xurl -h
or xurl --help
for options
-a = append an URL to start of the links
-c = contain text (REGEX)
-C = not contain text (REGEX)
-q = quiet mode (do not print Errors/Warnings/Infos)
-v = version
Usages
xurl https://example.com
and same for the files
xurl path/to/file
search using regex
xurl https://example.com -c "section\-[1-10].*.[pdf|xlsx]"
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
xurl-0.0.5.tar.gz
(3.4 kB
view hashes)
Built Distribution
xurl-0.0.5-py3-none-any.whl
(3.6 kB
view hashes)