Throw all URIs in a page on to Wayback Machine savepagenow from CLI.
Project description
wbsv
-
wbsv
("Wabisavi", "わびさび", stands for "WayBack machine SavepageNow") is...- CLI tool for saving webpage on Wayback Machine forever.
- Enables you to save all URIs in a webpage forever on Wayback Machine.
DEMO
Install
$ pip install wbsv # Python3.0+
Run & Examples
Help
$ wbsv -h
wbsv 0.0.5
CLI tool for saving webpage on Wayback Machine forever.
Save webpage and one's all URI(s) on Wayback Machine.
Usage:
wbsv [options] <url1> <url2> ... <urln>
Args:
<urls> Saving pages in order.
no arg Launch Interactive mode.
Options:
-h, --help Show help and exit.
-v, --version Show version and exit.
-r, --retry <times> Give the limit of retry when saving fails.
--only-page Save just target webpage(s).
Interactive mode
$ wbsv
[[Input a target url (ex: https://google.com)]]
>>> https://tsukuba.ac.jp
[!]Now: https://tsukuba.ac.jp
[!]class 'urllib.error.URLError'
[!]urlopen error [Errno -2] Name or service not known
[!]traceback object at 0x7eff0d207188
[[Input a target url (ex: https://google.com)]]
>>> https://www.u.tsukuba.ac.jp
[+]Now: https://www.u.tsukuba.ac.jp
87 URI(s) found.
[01]: <NOW> https://web.archive.org/web/20200123135244/https://www.u.tsukuba.ac.jp/20180622terminals/
[02]: <NOW> https://web.archive.org/web/20200123135247/https://www.u.tsukuba.ac.jp/
[03]: <NOW> https://web.archive.org/web/20200123135250/https://www.u.tsukuba.ac.jp/anti-virus/
...
[85]: <NOW> https://web.archive.org/web/20200123140917/https://www.u.tsukuba.ac.jp/snapshot/
[86]: <FAIL> https://www.u.tsukuba.ac.jp/wp-json/oembed/1.0/embed?url=https%3A%2F%2Fwww.u.tsukuba.ac.jp%2F&format=xml
[87]: <FAIL> https://www.u.tsukuba.ac.jp/info_lit/tebiki.html
[+]FIN!: https://www.u.tsukuba.ac.jp
[+]ALL: 87 SAVE: 61 FAIL: 21
[+]To exit, use CTRL+C or type 'end'
[[Input a target url (ex: https://google.com)]]
>>> exit
$
From stdin
$ wbsv https://tsumanne.net https://tsumanne.net/ct
[+]Now: https://tsumanne.net
9 URI(s) found.
[1]: <NOW> https://web.archive.org/web/20200123194439/https://tsumanne.net
...
[9]: <FAIL> https://tsumanne.net/src/iphone.png
[+]FIN!: https://tsumanne.net
[+]ALL: 9 SAVE: 5 FAIL: 4
[+]Now: https://tsumanne.net/ct
7 URI(s) found.
[1]: <NOW> https://web.archive.org/web/20200123194602/https://tsumanne.net/ct/?cat=&of=25
...
[7]: <FAIL> https://tsumanne.net/src/site.js
[+]FIN!: https://tsumanne.net/ct
[+]ALL: 7 SAVE: 5 FAIL: 2
$
Increase limit of retry
$ wbsv https://tsumanne.net --retry 10
VERSION
wbsv 0.0.5
LISENCE
MIT
Author
eggplants (haruna)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
wbsv-0.0.5.tar.gz
(4.5 kB
view hashes)
Built Distribution
wbsv-0.0.5-py3-none-any.whl
(5.0 kB
view hashes)