Skip to main content

Module for reading Apple's .webarchive files

Project description

pywebarchive is a Python 3 module for reading Apple's .webarchive files. It includes Webarchive Extractor, a tool to convert those files to standard HTML pages that you can open in any browser.

pywebarchive is stable enough for everyday use. It remains in beta because its support for the .webarchive format is a work in progress; in particular, some pages using advanced HTML5 features may not convert perfectly.

Webarchive Extractor

Builds for Windows are available on the releases page on GitHub. These are standalone executables that run on Windows 7 and higher. On other platforms, Webarchive Extractor is included with the pywebarchive source code.

Information for Developers

Here's an example of how to use the webarchive module:

import webarchive
archive = webarchive.open("example.webarchive")
archive.extract("example.html")

For detailed documentation, try python3 -m pydoc webarchive.

The source distribution also includes two webarchive extraction tools:

  • extractor.py is a command-line version.
  • extractor-gui.py is a GUI version using Tkinter.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pywebarchive-0.3.0.tar.gz (12.7 kB view details)

Uploaded Source

Built Distribution

pywebarchive-0.3.0-py3-none-any.whl (15.2 kB view details)

Uploaded Python 3

File details

Details for the file pywebarchive-0.3.0.tar.gz.

File metadata

  • Download URL: pywebarchive-0.3.0.tar.gz
  • Upload date:
  • Size: 12.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/57.2.0 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.6.12

File hashes

Hashes for pywebarchive-0.3.0.tar.gz
Algorithm Hash digest
SHA256 ab2feb68439aa8eff582a3b9223692704dcc0f3bc7eef78309cf3c8c5bf8e775
MD5 a742968ba9179a7b60b661eb433b419c
BLAKE2b-256 0e49f859837071ce835dbf6574abd953aeb626e2881221fa5c8d3c30679886e8

See more details on using hashes here.

File details

Details for the file pywebarchive-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: pywebarchive-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 15.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/57.2.0 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.6.12

File hashes

Hashes for pywebarchive-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f914f3696c1cb213653560c8b4647952a78e3ad87fe843dbabf214a163d04a22
MD5 09610c9b2048d7351dab3bb2ba9b33d5
BLAKE2b-256 b021bce7cb6e277e816821b0ac360056a1008c98314a2e46693f57cf0d736251

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page