Skip to main content

Module for reading Apple's webarchive format

Project description

pywebarchive is software for reading Apple's webarchive format.

A webarchive stores a complete web page -- including external media like images, scripts, and style sheets -- in a single file. It is most notable as the default format for the Safari browser's "Save As" command, though other Apple software also uses it for various purposes.

pywebarchive consists of two main components: Webarchive Extractor, a tool to convert webarchives to standard HTML documents; and the webarchive Python module, which is the code "under the hood" that makes it all work.

Webarchive Extractor

Webarchive Extractor converts webarchives to standard HTML documents. It allows opening webarchives on Windows and Linux/Unix systems, where Safari is not available.

Downloads

File Size Description
Webarchive.Extractor.exe 7.3 MB Windows (32-bit, standalone)
Webarchive.Extractor.x64.exe 8.0 MB Windows (64-bit, standalone)
pywebarchive-0.4.0.zip source code (zip)
pywebarchive-0.4.0.tar.gz source code (tar.gz)

Notes

The Windows version runs on Windows 7 and higher. It is a standalone executable -- no installation required.

The pywebarchive source code includes both graphical (extractor-gui.py) and command-line (extractor.py) versions of Webarchive Extractor. The graphical version requires Tkinter, and can optionally use userpaths. The command-line version should run on any system.

Disclaimer

The download links above are intended for users looking at pywebarchive's GitHub page (https://github.com/bmjcode/pywebarchive). If you are reading this anywhere else, they may no longer point to the current version of pywebarchive; visit the GitHub page to ensure you are getting the latest and greatest.

If you are reading this on GitHub, note the download links here are for the most recent stable release. This may be older than the browseable code at the top of the page, which is the latest version in development.

The webarchive module

webarchive is a Python module for reading the webarchive format. While its primary function is to power Webarchive Extractor, applications can also use it to examine webarchives directly.

The recommended way to install the webarchive module is through PyPI. For detailed documentation, try python3 -m pydoc webarchive.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pywebarchive-0.4.0.tar.gz (16.8 kB view details)

Uploaded Source

Built Distribution

pywebarchive-0.4.0-py3-none-any.whl (19.7 kB view details)

Uploaded Python 3

File details

Details for the file pywebarchive-0.4.0.tar.gz.

File metadata

  • Download URL: pywebarchive-0.4.0.tar.gz
  • Upload date:
  • Size: 16.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.63.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.8

File hashes

Hashes for pywebarchive-0.4.0.tar.gz
Algorithm Hash digest
SHA256 3fa6f59516086fe5e79f4a07be61380f698d6325cc880e07fc54daceed5eb12c
MD5 a9aa521a76569b23e1bf974f0da18369
BLAKE2b-256 427e60c2732f6b34d90c85d8793c0bfcf9cd0db170a7d0e49f3c4f117b9f4c64

See more details on using hashes here.

File details

Details for the file pywebarchive-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: pywebarchive-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 19.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.63.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.8

File hashes

Hashes for pywebarchive-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 cdd75c29ed114401dd788f56204d81ffe2a5d960445b6200d5fc391e3f1e801f
MD5 5e03400f1a7a9dae52a3ff26f95f76f2
BLAKE2b-256 a181ab61a584efecea9c6d27366f6eaa0e82d76e858d3aa5ba853b4a34ce7ee6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page