Skip to main content

A tool to download pages from Internet Archive.

Project description

Yet Another WayBack DownLoader

YAWBDL is a tool to download archived pages from Internet Archive, which I wrote because none of other tools worked good enough (or at all) in my use cases.

Installation

pipx install yawbdl
# or
uv tool install yawbdl

Usage

yawbdl

usage: yawbdl [-h] [-d DOMAIN] [-o DST_DIR] [--from FROM_DATE] [--to TO_DATE]
                 [--timeout TIMEOUT] [-n] [--delay DELAY] [--retries RETRIES]
                 [--no-fail]
                 [--skip-timestamps SKIP_TIMESTAMPS [SKIP_TIMESTAMPS ...]]
                 [--latest-only] [--debug]

Download a website from Internet Archive

options:
  -h, --help            show this help message and exit
  -d DOMAIN             domain to download (default: None)
  -o DST_DIR            output directory (default: None)
  --from FROM_DATE      from date, up to 14 digits: yyyyMMddhhmmss (default:
                        None)
  --to TO_DATE          to date (default: None)
  --timeout TIMEOUT     request timeout (default: 10)
  -n                    dry run (default: False)
  --delay DELAY         delay between requests (default: 1)
  --retries RETRIES     max number of retries (default: 0)
  --no-fail             if retries are exceeded, and the file still couldn't
                        have been downloaded, proceed to the next file instead
                        of aborting the run (default: False)
  --skip-timestamps SKIP_TIMESTAMPS [SKIP_TIMESTAMPS ...]
                        skip snapshots with these timestamps (sometimes Internet
                        Archive just fails to serve a specific snapshot)
                        (default: None)
  --latest-only         download only the latest version of each URL (default:
                        False)
  --debug               enable debug logging to show detailed error information
                        (default: False)

Examples

yawbdl -d "tools.oszone.net" -o "tools.oszone.net" --no-fail --retries 3 --delay 2
Getting snapshot list...
Got snapshot list!
(1/1017) 20051203093720 http://tools.oszone.net:80/Vadikan/faq.html [OK]
(2/1017) 20060101071820 http://tools.oszone.net:80/Vadikan/DotNet.exe [OK]
...

Development

uv sync                    # Install dependencies
uv run ruff check          # Lint
uv run ruff format --check # Check formatting
uv run ruff check --fix    # Auto-fix lint issues
uv run ruff format         # Auto-format

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

yawbdl-1.1.4.tar.gz (21.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

yawbdl-1.1.4-py3-none-any.whl (9.3 kB view details)

Uploaded Python 3

File details

Details for the file yawbdl-1.1.4.tar.gz.

File metadata

  • Download URL: yawbdl-1.1.4.tar.gz
  • Upload date:
  • Size: 21.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.10 {"installer":{"name":"uv","version":"0.11.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for yawbdl-1.1.4.tar.gz
Algorithm Hash digest
SHA256 3eaa4cb298d634cd6bb39f8023158cd3e78b3067d03b8d09b665aa3c4a316843
MD5 c1de24a4cefff8dae191653f4d1eb172
BLAKE2b-256 b9bdc8986e33cf7afb33cc18f9c0b3f9e8ccadd450a32ba9a8eca4cef18b4fdb

See more details on using hashes here.

File details

Details for the file yawbdl-1.1.4-py3-none-any.whl.

File metadata

  • Download URL: yawbdl-1.1.4-py3-none-any.whl
  • Upload date:
  • Size: 9.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.10 {"installer":{"name":"uv","version":"0.11.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for yawbdl-1.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 657017e075e14b4299a0c4fbfe70d9c2b2ffcf6027dadbafc9dd5139b155a571
MD5 3a2ee8863a5a241026332412ee139809
BLAKE2b-256 b46ba8e777fc93543e696b033a8e4a037fbcb988f749ac3c9570543e9980fb4e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page