Skip to main content

Module for reading Apple's .webarchive files

Project description

pywebarchive is software for reading Apple's webarchive format.

A webarchive stores a complete web page -- including external media like images, scripts, and style sheets -- in a single file. It is most notable as the default format for the Safari browser's "Save As" command, though other Apple software also uses it for various purposes.

pywebarchive consists of two main components: Webarchive Extractor, a tool to convert webarchives to standard HTML documents; and the webarchive Python module, which is the code "under the hood" that makes it all work.

Webarchive Extractor

Webarchive Extractor converts webarchives to standard HTML documents. It allows opening webarchives on Windows and Linux/Unix systems, where Safari is not available.

Downloads

File Size Description
Webarchive.Extractor.exe 7.3 MB Windows (32-bit, standalone)
Webarchive.Extractor.x64.exe 8.1 MB Windows (64-bit, standalone)
pywebarchive-0.3.1.zip source code (zip)
pywebarchive-0.3.1.tar.gz source code (tar.gz)

Notes

The Windows version runs on Windows 7 and higher. It is a standalone executable -- no installation required.

The pywebarchive source code includes both graphical (extractor-gui.py) and command-line (extractor.py) versions of Webarchive Extractor. The graphical version requires Tkinter; the command-line version should run on any system.

These download links are for the most recent stable release. If you're reading this on GitHub, be aware that you may be looking at a newer version of the code than what's linked here.

The webarchive module

webarchive is a Python module for reading the webarchive format. While its primary function is to power Webarchive Extractor, applications can also use it to examine webarchives directly.

The recommended way to install the webarchive module is through PyPI. For detailed documentation, try python3 -m pydoc webarchive.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pywebarchive-0.3.1.tar.gz (13.4 kB view hashes)

Uploaded Source

Built Distribution

pywebarchive-0.3.1-py3-none-any.whl (16.9 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page