Skip to main content

Archive your Chrome and Firefox bookmarks

Project description

bookmarkarchiver

Python script that archives all of your bookmarks on the Internet Archive. Supports all major browsers. Contributions welcome!

bookmarkarchiver uses the official Save Page Now API. Anonymous users are limited to 4,000 requests per day, which should be enough to save around 200 websites. If you create a free account to the Internet Archive and log in with a Chromium-based browser (e.g. Google Chrome) or Firefox, your single-day request limit increases to 100,000 and you should be able to save approximately 5000 websites. This may be significant overestimate if you also choose to captire outlinks. Due to API limitations, each bookmark should take around 10 seconds to archive.

bookmarkarchiver uses the browser_cookie3 module.

Usage

To use bookmarkarchiver, you need a bookmark file. You can get one by exporting them from a browser—instructions vary by browser and are readily available online. Typically, you have to export them through your browser's bookmarks page.

The easiest way to install bookmarkarchiver is to run pip3 install bookmarkarchiver.

$ pip3 install -r requirements.txt
$ python3 bookmarkarchiver.py --help
usage: bookmarkarchiver.py [-h] [--no_capture_all] [--capture_outlinks] [--capture_screenshot] [--delay_wb_availability] [--force_get]
                           [--no_skip_first_archive] [--email_result] [--quit_immediately] [--api_wait_seconds API_WAIT_SECONDS]
                           bookmark_file

Archives your bookmarks with the Wayback Machine.

positional arguments:
  bookmark_file         A Netscape format bookmarks file

optional arguments:
  -h, --help            show this help message and exit
  --no_capture_all, -n  Don't capture error pages
  --capture_outlinks, -c
                        Capture all outlinks
  --capture_screenshot, -s
                        Capture a screenshot
  --delay_wb_availability, -d
                        Delay uploading capture
  --force_get, -f       Force a GET request
  --no_skip_first_archive, -a
                        Don't recapture pages
  --email_result, -e    Email results to user
  --quit_immediately, -q
                        Don't show end results
  --api_wait_seconds API_WAIT_SECONDS, -w API_WAIT_SECONDS

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bookmark-archiver-1.0.0.tar.gz (16.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bookmark_archiver-1.0.0-py3-none-any.whl (16.6 kB view details)

Uploaded Python 3

File details

Details for the file bookmark-archiver-1.0.0.tar.gz.

File metadata

  • Download URL: bookmark-archiver-1.0.0.tar.gz
  • Upload date:
  • Size: 16.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.12

File hashes

Hashes for bookmark-archiver-1.0.0.tar.gz
Algorithm Hash digest
SHA256 bc777ace0ed1a133ec9d2ff25554d2aec0eeeef60a42767bdd8e5d7b0f0a5dff
MD5 daf036e120568ce9f0e950d2100c28aa
BLAKE2b-256 183c61bd77a4431e71d74600027b604303edd73175b56010cc583f3241afedb4

See more details on using hashes here.

File details

Details for the file bookmark_archiver-1.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for bookmark_archiver-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 586da3886a0d8d3b37720d387d49f3bb188625e47daeae6345f7940279aef6dc
MD5 9eb2e513a81e922c1589aa3c0005ea5e
BLAKE2b-256 e5a0e2bbe02897e60d096eaa7c1f46a818741120a4f14fda5167007c95d10292

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page