Skip to main content

Scrapinghub Hubstorage Collection scanner.

Project description

High level hubstorage collection scanner

  • Provides convenient way to scan a collection in batches

  • Allows to merge data from multiple collections

  • Accepts endts and startts in many string formats (as accepted by dateparser lib) or standard HS epoch in millisecs

  • Accepts excluded prefixes

  • Adds stopbefore feature (analogous to startafter but the inverse)

  • Provides method for arbitrary prefix aggregation counting

  • Supports partitioned collections

  • Provides a suite for testing hs collection code.

Up to version 0.1.6: Python2 only Starting version 0.2: Python3 only

See usage instructions at scanner.py docstring.

Instalation

pip install collection-scanner

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

collection_scanner-0.4.2.tar.gz (7.7 kB view details)

Uploaded Source

Built Distribution

collection_scanner-0.4.2-py3-none-any.whl (10.5 kB view details)

Uploaded Python 3

File details

Details for the file collection_scanner-0.4.2.tar.gz.

File metadata

  • Download URL: collection_scanner-0.4.2.tar.gz
  • Upload date:
  • Size: 7.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6

File hashes

Hashes for collection_scanner-0.4.2.tar.gz
Algorithm Hash digest
SHA256 8a5ca0eafe1aef1f843aa0f0b43811fd55792eb679960b221b5686eaa92123ba
MD5 8a8b16867ab323dd8c09c0bcbd03058b
BLAKE2b-256 8b2796d28be8c95725d222c686ddd25c972e1f3b819907a07225f3324655b87d

See more details on using hashes here.

File details

Details for the file collection_scanner-0.4.2-py3-none-any.whl.

File metadata

  • Download URL: collection_scanner-0.4.2-py3-none-any.whl
  • Upload date:
  • Size: 10.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6

File hashes

Hashes for collection_scanner-0.4.2-py3-none-any.whl
Algorithm Hash digest
SHA256 db31401d30a69718fcf6c48d946212a92ae3ea8552b68a34962573a9baa28117
MD5 efa1951fc2ae17311e8fe93a7c6996bc
BLAKE2b-256 4dde14ae0c8f1c23d05bab3eb5527ca7ad30370000280ae4f46b5991c3d17bb3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page