Module for reading Apple's .webarchive files

Project description

**pywebarchive** is a Python module for reading Apple's `.webarchive` files. It is currently in the very early stages of development.

It provides the `webarchive` module, which consists of two main classes:

* `WebArchive`, to read `.webarchive` files
* `Extractor`, to extract a `WebArchive` to a standard HTML document

Individual resources (i.e., files) in a `WebArchive` are represented by `WebResource` objects.
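
To give a sense of what a "resource" is, the sketch below reads a webarchive-style property list using only the standard library's `plistlib`. It does not use pywebarchive's API and is not how the module is necessarily implemented. It assumes the commonly observed layout of Apple's format: a binary plist with a `WebMainResource` dictionary and an optional `WebSubresources` list, where each entry carries `WebResourceURL`, `WebResourceMIMEType`, and `WebResourceData`. The helper names `build_sample_archive` and `list_resources`, and the example.com URLs, are hypothetical and exist only for this illustration.

```python
import plistlib


def build_sample_archive() -> bytes:
    """Build a tiny in-memory webarchive-style binary plist (demo data only)."""
    archive = {
        "WebMainResource": {
            "WebResourceURL": "https://example.com/",
            "WebResourceMIMEType": "text/html",
            "WebResourceTextEncodingName": "UTF-8",
            "WebResourceData": b"<html><body><img src='logo.png'></body></html>",
        },
        "WebSubresources": [
            {
                "WebResourceURL": "https://example.com/logo.png",
                "WebResourceMIMEType": "image/png",
                # Just the 8-byte PNG signature, standing in for real image data.
                "WebResourceData": b"\x89PNG\r\n\x1a\n",
            }
        ],
    }
    return plistlib.dumps(archive, fmt=plistlib.FMT_BINARY)


def list_resources(data: bytes) -> list:
    """Return (url, mime_type, size_in_bytes) for the main resource and subresources."""
    plist = plistlib.loads(data)  # the plist format (binary or XML) is auto-detected
    resources = [plist["WebMainResource"]] + plist.get("WebSubresources", [])
    return [
        (r["WebResourceURL"], r["WebResourceMIMEType"], len(r["WebResourceData"]))
        for r in resources
    ]


if __name__ == "__main__":
    for url, mime_type, size in list_resources(build_sample_archive()):
        print(url, mime_type, size)
```

Each tuple returned here corresponds roughly to the kind of information a `WebResource` object would represent: where the file came from, what type it is, and its raw content.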

pywebarchive requires Python 3; there are no current plans to add Python 2 support.

Example usage:

```python
from webarchive import WebArchive, Extractor

archive = WebArchive("example.webarchive")

extractor = Extractor(archive)
extractor.extract("example.html")
```

For detailed documentation, try `python -m pydoc webarchive`.



Download files

Source Distribution

pywebarchive-0.1.0.tar.gz (6.0 kB)

Built Distribution

pywebarchive-0.1.0-py3-none-any.whl (8.8 kB, Python 3)
