Throw all URIs in a page on to Wayback Machine from CLI.
Project description
wbsv
wbsv
(stands for "WayBack machine SavepageNow") is…
CLI tool for saving webpage on Wayback Machine forever. Enables you to save all URIs in a webpage forever on Wayback Machine.
Install
pip install wbsv
DEMO
Run & Examples
Help
$ wbsv -h
usage: wbsv [-h] [-r times] [-t] [-l level] [-O] [-V] [url ...]
CLI tool for save webpage on Wayback Machine forever.
Save webpage and one 's all URI(s) on Wayback Machine.
positional arguments:
url Saving pages in order. (default: None)
optional arguments:
-h, --help show this help message and exit
-r times, --retry times Set a retry limit on failed save. (default: 3)
-t, --only_target Save just same domain of target (default: False)
-l level, --level level Set maximum recursion depth (default: 1)
-O, --own Only URLs with the same domain as target (default: False)
-V, --version show program's version number and exit
If you don't give the URL,
interactive mode will be launched.
(To quit interactive mode,
type "end", "exit", "exit()",
"break", "bye", ":q" or "finish".)
Interactive mode
$ wbsv
[[Input a target url (ex: https://google.com)]]
>>> https://www.u.tsukuba.ac.jp
[+]Target: ['https://www.u.tsukuba.ac.jp']
[+]61 URI(s) found.
[01/60]: <NOW> https://web.archive.org/web/20200412020015/https://www.u.tsukuba.ac.jp/password/
[02/60]: <FAIL> https://www.u.tsukuba.ac.jp/info_lit/tebiki.html
[03/60]: <NOW> https://web.archive.org/web/20200412020026/https://www.u.tsukuba.ac.jp/account/
...
[58/60]: <NOW> https://web.archive.org/web/20200412022608/https://www.u.tsukuba.ac.jp/phishing/
[59/60]: <FAIL> https://www.u.tsukuba.ac.jp/wordpress/wp-content/uploads/note_usingcomputerrooms.png
[60/60]: <NOW> https://web.archive.org/web/20200412022640/https://www.u.tsukuba.ac.jp/
[+]FIN!: ['https://www.u.tsukuba.ac.jp']
[+]ALL: 60, SAVE: 57, PAST: 0, FAIL: 3
>>>
From stdin
$ wbsv https://tsumanne.net
[+]Target: ['https://tsumanne.net']
[+]4 URI(s) found.
[1/4]: <NOW> https://web.archive.org/web/20200412022931/https://tsumanne.net/si/
[2/4]: <NOW> https://web.archive.org/web/20200412022935/https://tsumanne.net/
[3/4]: <NOW> https://web.archive.org/web/20200412022938/https://tsumanne.net/my/
[4/4]: <NOW> https://web.archive.org/web/20200412022949/https://tsumanne.net/ct/
[+]FIN!: ['https://tsumanne.net']
[+]ALL: 4, SAVE: 4, PAST: 0, FAIL: 0
$
Search links recurcively
wbsv https://programming-place.net/ppp/contents/c/index.html -l 2
Increase limit of retry
wbsv https://tsumanne.net -r 10
LISENCE
MIT
Author
eggplants (haruna)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
wbsv-0.5.4.tar.gz
(8.6 kB
view details)
Built Distribution
wbsv-0.5.4-py3-none-any.whl
(8.6 kB
view details)
File details
Details for the file wbsv-0.5.4.tar.gz
.
File metadata
- Download URL: wbsv-0.5.4.tar.gz
- Upload date:
- Size: 8.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8a3b0d73095f0ef618adf97b480e1ff01c7ea99413d8a983e6f2ef37d0cf6346 |
|
MD5 | 81bdf9f624f5666c4aca5c1739d57ff0 |
|
BLAKE2b-256 | 6b0ca9bf63dd3f0d2f5a5cc137faeeb2937481e69a8f258dc4a0e10693267b2e |
File details
Details for the file wbsv-0.5.4-py3-none-any.whl
.
File metadata
- Download URL: wbsv-0.5.4-py3-none-any.whl
- Upload date:
- Size: 8.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6cb46216d8fe4c6fd06efebdc5015b057099ee305ce7db8f6c3280cb32055916 |
|
MD5 | 3a5214cc64ae7d69ea37db7b5415b42b |
|
BLAKE2b-256 | 00d4721075653dbba085cfee800b1c5bd892b1295ad352bc8d486c3e75bcddbd |