Skip to main content

Module for reading Apple's .webarchive files

Project description

pywebarchive is a Python 3 module for reading Apple's .webarchive files. It includes Webarchive Extractor, a tool to convert those files to standard HTML pages that you can open in any browser.

pywebarchive is stable enough for everyday use. It remains in alpha because its support for the .webarchive format is a work in progress; in particular, some pages using advanced HTML5 features may not convert perfectly.

Webarchive Extractor

Builds for Windows are available on the releases page on GitHub. These are standalone executables that run on Windows 7 and higher. On other platforms, Webarchive Extractor is included with the pywebarchive source code.

Information for Developers

Here's an example of how to use the webarchive module:

import webarchive
archive = webarchive.open("example.webarchive")
archive.extract("example.html")

For detailed documentation, try python3 -m pydoc webarchive.

The source distribution also includes two webarchive extraction tools:

  • extractor.py is a command-line version.
  • extractor-gui.py is a GUI version using Tkinter.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for pywebarchive, version 0.2.4
Filename, size File type Python version Upload date Hashes
Filename, size pywebarchive-0.2.4-py3-none-any.whl (12.8 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size pywebarchive-0.2.4.tar.gz (9.8 kB) File type Source Python version None Upload date Hashes View

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page