Tools for indexing gzip files to support random-like access.
Installing
To install the library, run the following command:
$ python setup.py install
Testing
$ make test
$ make lint
Indexing a file
Indexing a file from stdin:
$ cat profiles.json.gz | gzipi index --format json --field id > index.json.gz
Indexing a local file:
$ gzipi index -i profiles.json.gz -o index.json.gz --format csv --column 0 --delimiter ','
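How gzipi builds its index is not described here. As a rough illustration of the general idea, the sketch below assumes an archive in which every record is its own gzip member (members concatenated back to back), and records the byte offset and length of each member keyed by one JSON field. The function name `build_index` and the `(offset, length)` index layout are assumptions made for this example, not gzipi's actual format.

```python
import gzip
import json
import zlib


def build_index(data, field):
    """Map each record's `field` value to the (offset, length) of its gzip member."""
    index = {}
    offset = 0
    while offset < len(data):
        # MAX_WBITS | 16 tells zlib to expect a gzip header and trailer.
        decomp = zlib.decompressobj(zlib.MAX_WBITS | 16)
        line = decomp.decompress(data[offset:])
        # Decompression stops at the end of one member; everything after it
        # is left untouched in `unused_data`, which gives the member length.
        length = len(data) - offset - len(decomp.unused_data)
        index[json.loads(line)[field]] = (offset, length)
        offset += length
    return index


# Hypothetical sample data: two profiles, one gzip member per JSON line.
profiles = [{"id": "u1", "name": "Ada"}, {"id": "u2", "name": "Grace"}]
archive = b"".join(
    gzip.compress((json.dumps(p) + "\n").encode("utf-8")) for p in profiles
)
index = build_index(archive, "id")
```

The sketch slices the input on every iteration, which is fine for a small example but would be too slow for a large archive; a real indexer would more likely stream the file.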
Retrieving data
Retrieving all data for specific domains:
$ zcat domains.json.gz | gzipi retrieve -f profiles.json.gz -i index.json.gz --format json --field domain > data.json
Using local and S3 paths:
$ gzipi retrieve -k domains.json.gz -f s3://domain-data/reports/Q1.json.gz -i index.json.gz --format json -o data.json --field domain
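The point of an index is that a lookup does not need to decompress the whole archive. A minimal sketch of that lookup, assuming an index that maps each key to the `(offset, length)` of an independently compressed gzip member, could look like this. The `retrieve` function and the index layout are hypothetical and are not taken from gzipi's source.

```python
import gzip
import io
import json


def retrieve(stream, index, keys):
    """Yield the decoded record for every key that is present in the index."""
    for key in keys:
        if key not in index:
            continue  # keys without an index entry are skipped
        offset, length = index[key]
        stream.seek(offset)
        member = stream.read(length)
        # Only this member is decompressed, not the rest of the archive.
        yield json.loads(gzip.decompress(member))


# Hypothetical two-record archive held in memory, one gzip member per record.
rows = [{"domain": "example.com", "n": 1}, {"domain": "example.org", "n": 2}]
members = [gzip.compress((json.dumps(r) + "\n").encode("utf-8")) for r in rows]
archive = io.BytesIO(b"".join(members))
index = {
    "example.com": (0, len(members[0])),
    "example.org": (len(members[0]), len(members[1])),
}
found = list(retrieve(archive, index, ["example.org", "missing.net"]))
```

For a remote object, the same offset and length could in principle be turned into a byte-range request instead of a `seek` and `read`; whether gzipi handles S3 paths that way is not stated here.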
Repacking existing archives
$ gzipi repack -f profiles.json.gz -i new_index.json.gz -o new_profiles.json.gz --format json --field domain
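What `repack` does internally is not documented here. One plausible reading is that it rewrites an archive so that it can be looked up by a different field. The sketch below shows that idea under stated assumptions: it reads a line-oriented gzip stream, writes every line back out as its own gzip member, and returns a new index keyed by the chosen field. The function name, the index layout, and the assumption that field values are unique are all illustrative.

```python
import gzip
import io
import json


def repack(src, dst, field):
    """Rewrite a line-oriented gzip stream as one gzip member per line.

    Returns a dict mapping each line's `field` value to the (offset, length)
    of its member in `dst`. Assumes `field` is unique across lines.
    """
    index = {}
    offset = 0
    with gzip.open(src, "rb") as lines:
        for line in lines:
            member = gzip.compress(line)
            dst.write(member)
            index[json.loads(line)[field]] = (offset, len(member))
            offset += len(member)
    return index


# Hypothetical input: an ordinary single-member gzip file with two JSON lines.
rows = [{"id": 1, "domain": "example.com"}, {"id": 2, "domain": "example.org"}]
plain = gzip.compress("".join(json.dumps(r) + "\n" for r in rows).encode("utf-8"))
out = io.BytesIO()
new_index = repack(io.BytesIO(plain), out, "domain")
```

Because concatenated gzip members still form a valid gzip stream, the repacked output remains readable by ordinary tools such as `zcat`. The trade-off is a larger file, since each member carries its own header and cannot share compression context with its neighbours.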
Help
To get more information, run the following command:
$ gzipi --help
Source Distribution
gzipi-0.1.3.tar.gz (12.8 kB)
File details
Details for the file gzipi-0.1.3.tar.gz.
File metadata
- Download URL: gzipi-0.1.3.tar.gz
- Upload date:
- Size: 12.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.34.0 CPython/3.7.4
File hashes
Algorithm | Hash digest
---|---
SHA256 | 56baff05d9848de31ca0e956e3e5931146e34ce1ee595be937d246f5ad956d6d
MD5 | 294c5487d4b90383f5b0d3600f51b575
BLAKE2b-256 | 775ef8d509d8b98a9d1e212d67586582d9973f0a52b2a7dc6d6215f6183a36d4