Module for reading Apple's webarchive format
pywebarchive is software for reading Apple's webarchive format.
A webarchive stores a complete web page -- including external media like images, scripts, and style sheets -- in a single file. It is most notable as the default format for the Safari browser's "Save As" command, though other Apple software also uses it for various purposes.
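To make the single-file layout concrete, here is a minimal sketch that uses only Python's standard plistlib. It is not pywebarchive's API. It assumes the commonly observed layout of the format: a binary property list with the main page under WebMainResource and external media under WebSubresources, each entry carrying WebResourceURL, WebResourceMIMEType, and WebResourceData. The function names list_resources and make_sample_archive, and the sample content, are hypothetical and exist only for illustration.

```python
import plistlib


def list_resources(archive_bytes):
    """Return (url, mime_type, size_in_bytes) for each resource in a webarchive.

    Assumes the archive is a property list with a "WebMainResource" entry
    and an optional "WebSubresources" list (an assumption about the format,
    not something taken from pywebarchive).
    """
    plist = plistlib.loads(archive_bytes)
    resources = [plist["WebMainResource"]]
    resources.extend(plist.get("WebSubresources", []))
    return [
        (
            res.get("WebResourceURL", ""),
            res.get("WebResourceMIMEType", ""),
            len(res.get("WebResourceData", b"")),
        )
        for res in resources
    ]


def make_sample_archive():
    """Build a tiny stand-in archive in memory (hypothetical content)."""
    sample = {
        "WebMainResource": {
            "WebResourceURL": "https://example.com/",
            "WebResourceMIMEType": "text/html",
            "WebResourceData": b"<html><body>Hello</body></html>",
        },
        "WebSubresources": [
            {
                "WebResourceURL": "https://example.com/style.css",
                "WebResourceMIMEType": "text/css",
                "WebResourceData": b"body { margin: 0 }",
            }
        ],
    }
    return plistlib.dumps(sample, fmt=plistlib.FMT_BINARY)


if __name__ == "__main__":
    # One line per resource: the page itself first, then its external media.
    for url, mime_type, size in list_resources(make_sample_archive()):
        print(f"{mime_type:10} {size:4d} bytes  {url}")
```

To try this on a real file, read it in binary mode and pass the bytes to list_resources. Archives that contain frames may hold further nested archives, which this sketch does not walk.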
pywebarchive consists of two main components: Webarchive Extractor, a tool to convert webarchives to standard HTML documents; and the webarchive Python module, which is the code "under the hood" that makes it all work.
Webarchive Extractor
Webarchive Extractor converts webarchives to standard HTML documents. It allows opening webarchives on Windows and Linux/Unix systems, where Safari is not available.
Downloads
File | Size | Description |
---|---|---|
Webarchive.Extractor.exe | 7.3 MB | Windows (32-bit, standalone) |
Webarchive.Extractor.x64.exe | 8.0 MB | Windows (64-bit, standalone) |
pywebarchive-0.4.0.zip | | source code (zip) |
pywebarchive-0.4.0.tar.gz | | source code (tar.gz) |
Notes
The Windows version runs on Windows 7 and higher. It is a standalone executable -- no installation required.
The pywebarchive source code includes both graphical (extractor-gui.py) and command-line (extractor.py) versions of Webarchive Extractor. The graphical version requires Tkinter, and can optionally use userpaths. The command-line version should run on any system.
Disclaimer
The download links above are intended for users looking at pywebarchive's GitHub page (https://github.com/bmjcode/pywebarchive). If you are reading this anywhere else, they may no longer point to the current version of pywebarchive; visit the GitHub page to ensure you are getting the latest and greatest.
If you are reading this on GitHub, note that the download links here are for the most recent stable release. That release may be older than the browsable code at the top of the page, which is the latest version in development.
The webarchive module
webarchive is a Python module for reading the webarchive format. While its primary function is to power Webarchive Extractor, applications can also use it to examine webarchives directly.
The recommended way to install the webarchive module is through PyPI. For detailed documentation, try python3 -m pydoc webarchive.
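The same documentation can also be fetched from inside a Python session. The sketch below uses only the standard library (importlib and pydoc) and makes no assumptions about what the webarchive module contains. The helper name module_docs is hypothetical.

```python
import importlib.util
import pydoc


def module_docs(module_name="webarchive"):
    """Return the pydoc text for a module, or None if it is not installed."""
    # find_spec() returns None when no top-level module of that name exists.
    if importlib.util.find_spec(module_name) is None:
        return None
    # pydoc.plaintext renders without terminal bold/overstrike sequences.
    return pydoc.render_doc(module_name, renderer=pydoc.plaintext)


if __name__ == "__main__":
    text = module_docs()
    if text is None:
        print("webarchive is not installed; install pywebarchive from PyPI first.")
    else:
        print(text)
```

Checking with find_spec first lets the script report a missing installation cleanly instead of failing inside pydoc.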
File details
Details for the file pywebarchive-0.4.0.tar.gz.
File metadata
- Download URL: pywebarchive-0.4.0.tar.gz
- Upload date:
- Size: 16.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.63.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.8
File hashes
Algorithm | Hash digest |
---|---|
SHA256 | 3fa6f59516086fe5e79f4a07be61380f698d6325cc880e07fc54daceed5eb12c |
MD5 | a9aa521a76569b23e1bf974f0da18369 |
BLAKE2b-256 | 427e60c2732f6b34d90c85d8793c0bfcf9cd0db170a7d0e49f3c4f117b9f4c64 |
File details
Details for the file pywebarchive-0.4.0-py3-none-any.whl.
File metadata
- Download URL: pywebarchive-0.4.0-py3-none-any.whl
- Upload date:
- Size: 19.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.63.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.8
File hashes
Algorithm | Hash digest |
---|---|
SHA256 | cdd75c29ed114401dd788f56204d81ffe2a5d960445b6200d5fc391e3f1e801f |
MD5 | 5e03400f1a7a9dae52a3ff26f95f76f2 |
BLAKE2b-256 | a181ab61a584efecea9c6d27366f6eaa0e82d76e858d3aa5ba853b4a34ce7ee6 |