ZIM file iterator
Project description
ZIM Scan
Minimal ZIM file reader, designed for article streaming.
Getting Started
Install using pip:
pip install zimscan
Or from Git repository, for latest version:
pip install -U git+https://github.com/jojolebarjos/zimscan.git
Iterate over a records, which are binary file-like objects:
from zimscan import Reader
path = "wikipedia_en_all_nopic_2019-10.zim"
with Reader(open(path, "rb"), skip_metadata=True) as reader:
for record in reader:
data = record.read()
...
Links
- ZIM file format, official documentation
- Kiwix ZIM repository, to download official ZIM files
- Wikipedia ZIM dumps, to download Wikipedia ZIM files
- ZIMply, a ZIM file reader in the browser, in Python
- libzim, the reference implementation, in C++
- pyzim, Python wrapper for libzim
- pyzim, another Python wrapper for libzim
- Internet In A Box, a project to bundle open knowledge locally
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
zimscan-0.2.0.tar.gz
(6.0 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file zimscan-0.2.0.tar.gz.
File metadata
- Download URL: zimscan-0.2.0.tar.gz
- Upload date:
- Size: 6.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.5.1 CPython/3.9.13 Windows/10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3e2b8c60532dcb9834c2b35a2ef1743d97d8074a95df1c36d921d9a1d007645d
|
|
| MD5 |
5715ad61c8410a3fc05040bf525198aa
|
|
| BLAKE2b-256 |
f5059dff0428a4e51d3b63b321f07ccea20dec8ba5c9345458de5a9425fe2ef8
|
File details
Details for the file zimscan-0.2.0-py3-none-any.whl.
File metadata
- Download URL: zimscan-0.2.0-py3-none-any.whl
- Upload date:
- Size: 6.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.5.1 CPython/3.9.13 Windows/10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0511fd14048cb595795d967c1549ed03dc5f36d73885bca71135ad72a2e9de3a
|
|
| MD5 |
360d21d8caa57552b85840cc421de756
|
|
| BLAKE2b-256 |
b0a9258ab47a503e312aff7138fca28852eadde9e6410e28ceba1907fae4fc97
|