Skip to main content

Throw all URIs in a page on to Wayback Machine from CLI.

Project description

wbsv

PyPI version Maintainability

wbsv(stands for "WayBack machine SavepageNow") is…

CLI tool for saving webpage on Wayback Machine forever. Enables you to save all URIs in a webpage forever on Wayback Machine.

Install

pip install wbsv

DEMO

demo.gif

Run & Examples

Help

$ wbsv -h
usage: wbsv [-h] [-r times] [-t] [-l level] [-O] [-V] [url ...]

CLI tool for save webpage on Wayback Machine forever.
Save webpage and one 's all URI(s) on Wayback Machine.

positional arguments:
  url                      Saving pages in order. (default: None)

optional arguments:
  -h, --help               show this help message and exit
  -r times, --retry times  Set a retry limit on failed save. (default: 3)
  -t, --only_target        Save just same domain of target (default: False)
  -l level, --level level  Set maximum recursion depth (default: 1)
  -O, --own                Only URLs with the same domain as target (default: False)
  -V, --version            show program's version number and exit

If you don't give the URL,
interactive mode will be launched.
(To quit interactive mode,
type "end", "exit", "exit()",
"break", "bye", ":q" or "finish".)

Interactive mode

$ wbsv
[[Input a target url (ex: https://google.com)]]
>>> https://www.u.tsukuba.ac.jp
[+]Target: ['https://www.u.tsukuba.ac.jp']
[+]61 URI(s) found.
[01/60]: <NOW> https://web.archive.org/web/20200412020015/https://www.u.tsukuba.ac.jp/password/
[02/60]: <FAIL> https://www.u.tsukuba.ac.jp/info_lit/tebiki.html
[03/60]: <NOW> https://web.archive.org/web/20200412020026/https://www.u.tsukuba.ac.jp/account/
...
[58/60]: <NOW> https://web.archive.org/web/20200412022608/https://www.u.tsukuba.ac.jp/phishing/
[59/60]: <FAIL> https://www.u.tsukuba.ac.jp/wordpress/wp-content/uploads/note_usingcomputerrooms.png
[60/60]: <NOW> https://web.archive.org/web/20200412022640/https://www.u.tsukuba.ac.jp/
[+]FIN!: ['https://www.u.tsukuba.ac.jp']
[+]ALL: 60, SAVE: 57, PAST: 0, FAIL: 3
>>>

From stdin

$ wbsv https://tsumanne.net
[+]Target: ['https://tsumanne.net']
[+]4 URI(s) found.
[1/4]: <NOW> https://web.archive.org/web/20200412022931/https://tsumanne.net/si/
[2/4]: <NOW> https://web.archive.org/web/20200412022935/https://tsumanne.net/
[3/4]: <NOW> https://web.archive.org/web/20200412022938/https://tsumanne.net/my/
[4/4]: <NOW> https://web.archive.org/web/20200412022949/https://tsumanne.net/ct/
[+]FIN!: ['https://tsumanne.net']
[+]ALL: 4, SAVE: 4, PAST: 0, FAIL: 0
$

Search links recurcively

wbsv https://programming-place.net/ppp/contents/c/index.html -l 2

Increase limit of retry

wbsv https://tsumanne.net -r 10

LISENCE

MIT

Author

eggplants (haruna)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wbsv-0.5.4.tar.gz (8.6 kB view details)

Uploaded Source

Built Distribution

wbsv-0.5.4-py3-none-any.whl (8.6 kB view details)

Uploaded Python 3

File details

Details for the file wbsv-0.5.4.tar.gz.

File metadata

  • Download URL: wbsv-0.5.4.tar.gz
  • Upload date:
  • Size: 8.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.2

File hashes

Hashes for wbsv-0.5.4.tar.gz
Algorithm Hash digest
SHA256 8a3b0d73095f0ef618adf97b480e1ff01c7ea99413d8a983e6f2ef37d0cf6346
MD5 81bdf9f624f5666c4aca5c1739d57ff0
BLAKE2b-256 6b0ca9bf63dd3f0d2f5a5cc137faeeb2937481e69a8f258dc4a0e10693267b2e

See more details on using hashes here.

File details

Details for the file wbsv-0.5.4-py3-none-any.whl.

File metadata

  • Download URL: wbsv-0.5.4-py3-none-any.whl
  • Upload date:
  • Size: 8.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.2

File hashes

Hashes for wbsv-0.5.4-py3-none-any.whl
Algorithm Hash digest
SHA256 6cb46216d8fe4c6fd06efebdc5015b057099ee305ce7db8f6c3280cb32055916
MD5 3a5214cc64ae7d69ea37db7b5415b42b
BLAKE2b-256 00d4721075653dbba085cfee800b1c5bd892b1295ad352bc8d486c3e75bcddbd

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page