Skip to main content

Scrapinghub Hubstorage Collection scanner.

Project description

High level hubstorage collection scanner

  • Provides convenient way to scan a collection in batches

  • Allows to merge data from multiple collections

  • Accepts endts and startts in many string formats (as accepted by dateparser lib) or standard HS epoch in millisecs

  • Accepts excluded prefixes

  • Adds stopbefore feature (analogous to startafter but the inverse)

  • Provides method for arbitrary prefix aggregation counting

  • Supports partitioned collections

  • Provides a suite for testing hs collection code.

Up to version 0.1.6: Python2 only Starting version 0.2: Python3 only

See usage instructions at scanner.py docstring.

Instalation

pip install collection-scanner

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

collection_scanner-0.5.1.tar.gz (9.1 kB view details)

Uploaded Source

Built Distribution

collection_scanner-0.5.1-py3-none-any.whl (10.5 kB view details)

Uploaded Python 3

File details

Details for the file collection_scanner-0.5.1.tar.gz.

File metadata

  • Download URL: collection_scanner-0.5.1.tar.gz
  • Upload date:
  • Size: 9.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.5

File hashes

Hashes for collection_scanner-0.5.1.tar.gz
Algorithm Hash digest
SHA256 94c1e53ef24af2badd2b709434b2d060c4af656b221157ba87d34c4cfc092b06
MD5 94536f1bb72b04a82933f9ce78d77b3e
BLAKE2b-256 af202025555ef26690c2e22d179d166895e0f9982c74a4b680993b954f67d9ed

See more details on using hashes here.

File details

Details for the file collection_scanner-0.5.1-py3-none-any.whl.

File metadata

File hashes

Hashes for collection_scanner-0.5.1-py3-none-any.whl
Algorithm Hash digest
SHA256 fb88b6ee23fc50191cdf82c77464f4fa4641a118f34a70647ab9a07382b57cbd
MD5 7d37aeca5ea84b847f213f621c269b30
BLAKE2b-256 3fc93088372839fb7aa709bab35b703abd04612dd06cf9d4ecf99a244108e54f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page