Skip to main content

Make zimfile from stackexchange dump

Project description

# Sotoki

*Stack Overflow to Kiwix*

The goal of this project is to create a suite of tools to create
[zim](http://www.openzim.org) files required by
[kiwix](http://kiwix.org/) reader to make available [Stack Overflow](https://stackoverflow.com/)
offline (without access to Internet).

## Getting started

The use of btrfs as a file system is recommended (and required for stackoverflow)

Install non python dependencies:

```
sudo apt-get install jpegoptim pngquant gifsicle advancecomp python-pip python-virtualenv python-dev libxml2-dev libxslt1-dev libbz2-dev p7zip-full python-pillow gif2apng imagemagick
```


Create a virtual environment for python:

```
virtualenv --system-site-packages venv
```

Activate the virtual enviroment:

```
source venv/bin/activate
```


Install this lib:

```
pip install sotoki
```


```
sotoki <domain> <publisher> [--directory=<dir>] [--nozim] [--tag-depth=<tag_depth>] [--threads=<threads>] [--zimpath=<zimpath>] [--reset] [--reset-images] [--clean-previous] [--nofulltextindex] [--ignoreoldsite] [--nopic] [--no-userprofile]

```
You can use `sotoki -h` to have more explanation about these options


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sotoki-1.0.tar.gz (1.6 MB view details)

Uploaded Source

Built Distribution

sotoki-1.0-py2-none-any.whl (1.6 MB view details)

Uploaded Python 2

File details

Details for the file sotoki-1.0.tar.gz.

File metadata

  • Download URL: sotoki-1.0.tar.gz
  • Upload date:
  • Size: 1.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.10.0 pkginfo/1.4.2 requests/2.18.4 setuptools/38.6.0 requests-toolbelt/0.8.0 tqdm/4.19.8 CPython/2.7.13

File hashes

Hashes for sotoki-1.0.tar.gz
Algorithm Hash digest
SHA256 b9100fd458fdb406a475b85825bc195f2f55b1479317f9bfa1584a8597fa63e5
MD5 f124d9c42ac07e50a27098ae6f98d226
BLAKE2b-256 6bb41453b19accbe664f9a77c027a9fc18c48981d7817b291378ffb9f9ac99d3

See more details on using hashes here.

File details

Details for the file sotoki-1.0-py2-none-any.whl.

File metadata

  • Download URL: sotoki-1.0-py2-none-any.whl
  • Upload date:
  • Size: 1.6 MB
  • Tags: Python 2
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.10.0 pkginfo/1.4.2 requests/2.18.4 setuptools/38.6.0 requests-toolbelt/0.8.0 tqdm/4.19.8 CPython/2.7.13

File hashes

Hashes for sotoki-1.0-py2-none-any.whl
Algorithm Hash digest
SHA256 1a046518d4ace7eff330328bc4a9bcdb39b651163248875e2120010dc012cb60
MD5 d792ae0322c247f3893a3e286275261b
BLAKE2b-256 6422477faf5a8bbe34ec90789e6497fdba2142dae823195f58725158324ae8e2

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page