Module for reading Apple's .webarchive files

Project description

pywebarchive is a Python 3 module for reading Apple's .webarchive files. It includes Webarchive Extractor, a tool to convert those files to standard HTML pages that you can open in any browser.

pywebarchive is stable enough for everyday use. It remains in alpha because its support for the .webarchive format is a work in progress; in particular, some pages using advanced HTML5 features may not convert perfectly.

Webarchive Extractor

Builds for Windows are available on the releases page on GitHub. These are standalone executables that run on Windows 7 and later. On other platforms, Webarchive Extractor is included with the pywebarchive source code.

Information for Developers

Here's an example of how to use the webarchive module:

import webarchive

# Open the archive, then convert it to a standard HTML page
archive = webarchive.open("example.webarchive")
archive.extract("example.html")
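The same two calls can be wrapped in a loop to convert a whole folder of archives. The following is a minimal sketch, not part of pywebarchive: the convert_all helper, its opener parameter, and the directory names are hypothetical, and it assumes only the webarchive.open() and extract() calls shown in the example.

```python
from pathlib import Path


def convert_all(src_dir, dst_dir, opener=None):
    """Convert every .webarchive file in src_dir to an HTML page in dst_dir.

    `opener` is a hypothetical hook added for this sketch; when omitted,
    it falls back to webarchive.open from the example.
    """
    if opener is None:
        # Imported lazily so the helper can be exercised with a stand-in opener.
        import webarchive
        opener = webarchive.open

    dst = Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)

    converted = []
    # Only direct children of src_dir are considered; subfolders are skipped.
    for src in sorted(Path(src_dir).glob("*.webarchive")):
        target = dst / (src.stem + ".html")
        archive = opener(str(src))
        archive.extract(str(target))
        converted.append(target)
    return converted
```

Calling convert_all("archives", "html") would then write one .html file per archive and return the list of output paths.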

For detailed documentation, run python3 -m pydoc webarchive.

The source distribution also includes two webarchive extraction tools:

  • extractor.py is a command-line version.
  • extractor-gui.py is a GUI version using Tkinter.

Download files

Download the file for your platform.

Source Distribution

pywebarchive-0.2.4.tar.gz (9.8 kB)

Uploaded Source

Built Distribution

pywebarchive-0.2.4-py3-none-any.whl (12.8 kB)

Uploaded Python 3

File details

Details for the file pywebarchive-0.2.4.tar.gz.

File metadata

  • Download URL: pywebarchive-0.2.4.tar.gz
  • Upload date:
  • Size: 9.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/40.5.0 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.6.10

File hashes

Hashes for pywebarchive-0.2.4.tar.gz
Algorithm    Hash digest
SHA256       220e036a813881ca035549dd21076f82650858b04a2241057a9669f19e2c7d2d
MD5          15ebeed8b4b039b235529bfd266ac517
BLAKE2b-256  80eda64b1da9eff61457f21820aae5be9c170dfabf0081dbd972b4235b7cab32
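
A downloaded file can be checked against these digests before installing it. The following is a minimal sketch using only Python's standard hashlib module; the helper names and the local file path are assumptions for illustration, and the expected value is the SHA256 digest listed above for pywebarchive-0.2.4.tar.gz.

```python
import hashlib

# SHA256 digest listed above for pywebarchive-0.2.4.tar.gz
EXPECTED_SHA256 = "220e036a813881ca035549dd21076f82650858b04a2241057a9669f19e2c7d2d"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of(path) == expected.lower()
```

For example, matches_expected("pywebarchive-0.2.4.tar.gz") should return True for an intact download saved under that name in the current directory.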

File details

Details for the file pywebarchive-0.2.4-py3-none-any.whl.

File metadata

  • Download URL: pywebarchive-0.2.4-py3-none-any.whl
  • Upload date:
  • Size: 12.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/40.5.0 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.6.10

File hashes

Hashes for pywebarchive-0.2.4-py3-none-any.whl
Algorithm    Hash digest
SHA256       704f8dcb226320f2075bb9ffdd878c97e8e6456e3433b3b791b0a49ed1a071aa
MD5          b8530e670f759c540a5a72a85b2b3258
BLAKE2b-256  67b3110a9297f6de8a8a0330fd2189bb7c1acbead5400d41bbe843661226104f
