Skip to main content

A tool to download pages from Internet Archive.

Project description

Yet Another WayBack DownLoader

YAWBDL is a tool to download archived pages from Internet Archive, which I wrote because none of other tools worked good enough (or at all) in my use cases.

Usage

pipx install yawbdl
yawbdl

usage: yawbdl [-h] [-d DOMAIN] [-o DST_DIR] [--from FROM_DATE] [--to TO_DATE]
                 [--timeout TIMEOUT] [-n] [--delay DELAY] [--retries RETRIES]
                 [--no-fail]
                 [--skip-timestamps SKIP_TIMESTAMPS [SKIP_TIMESTAMPS ...]]
                 [--latest-only] [--debug]

Download a website from Internet Archive

options:
  -h, --help            show this help message and exit
  -d DOMAIN             domain to download (default: None)
  -o DST_DIR            output directory (default: None)
  --from FROM_DATE      from date, up to 14 digits: yyyyMMddhhmmss (default:
                        None)
  --to TO_DATE          to date (default: None)
  --timeout TIMEOUT     request timeout (default: 10)
  -n                    dry run (default: False)
  --delay DELAY         delay between requests (default: 1)
  --retries RETRIES     max number of retries (default: 0)
  --no-fail             if retries are exceeded, and the file still couldn't
                        have been downloaded, proceed to the next file instead
                        of aborting the run (default: False)
  --skip-timestamps SKIP_TIMESTAMPS [SKIP_TIMESTAMPS ...]
                        skip snapshots with these timestamps (sometimes Internet
                        Archive just fails to serve a specific snapshot)
                        (default: None)
  --latest-only         download only the latest version of each URL (default:
                        False)
  --debug               enable debug logging to show detailed error information
                        (default: False)

Examples

yawbdl -d "tools.oszone.net" -o "tools.oszone.net" --no-fail --retries 3 --delay 2
Getting snapshot list...
Got snapshot list!
(1/1017) 20051203093720 http://tools.oszone.net:80/Vadikan/faq.html [OK]
(2/1017) 20060101071820 http://tools.oszone.net:80/Vadikan/DotNet.exe [OK]
...

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

yawbdl-1.1.2.tar.gz (8.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

yawbdl-1.1.2-py3-none-any.whl (8.8 kB view details)

Uploaded Python 3

File details

Details for the file yawbdl-1.1.2.tar.gz.

File metadata

  • Download URL: yawbdl-1.1.2.tar.gz
  • Upload date:
  • Size: 8.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.8.17

File hashes

Hashes for yawbdl-1.1.2.tar.gz
Algorithm Hash digest
SHA256 0f96f16546993fccce3fe68aa80321566ae5d72992a82fdaf0511d29db77250b
MD5 c9a45c3ee7820a1f22241507762546d2
BLAKE2b-256 69956906febfa76f4cad3f85e5afc58fd7e09730abe050a972d87daaa6264b13

See more details on using hashes here.

File details

Details for the file yawbdl-1.1.2-py3-none-any.whl.

File metadata

  • Download URL: yawbdl-1.1.2-py3-none-any.whl
  • Upload date:
  • Size: 8.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.8.17

File hashes

Hashes for yawbdl-1.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 0250acaa36c3664cb3399ef8af69c9718920133b10bb984b1dab5230592d1669
MD5 7c70911eac0b9b883cba766237d80ee7
BLAKE2b-256 f467deda4cf6a86cd5618f144e93bce7641ae4d0bf5ed503439f1b4161cffc60

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page