Scrapinghub Hubstorage Collection scanner.

Project description

  • Provides a convenient way to scan a collection in batches

  • Allows merging data from multiple collections

  • Accepts endts and startts in many string formats (as accepted by the dateparser library) or as standard HS epoch timestamps in milliseconds

  • Accepts excluded prefixes

  • Adds a stopbefore feature (the inverse of startafter)

  • Provides a method for arbitrary prefix aggregation counting

  • Supports partitioned collections

  • Provides a suite for testing HS collection code
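
To make a few of the features above concrete, here is a minimal sketch of bounded scanning, batching, and prefix counting over an in-memory list of sorted (key, value) pairs. It does not use the collection_scanner API: the names scan, batches, and count_by_prefix are hypothetical, and it assumes that both startafter and stopbefore are exclusive bounds. The real usage is documented in the scanner.py docstring.

```python
from collections import Counter
from itertools import islice


def scan(records, startafter=None, stopbefore=None, exclude_prefixes=()):
    """Yield (key, value) pairs whose key lies strictly between the bounds.

    Hypothetical helper: `records` is assumed to be sorted by key, and both
    bounds are assumed to be exclusive.
    """
    for key, value in records:
        if startafter is not None and key <= startafter:
            continue  # not past the lower bound yet
        if stopbefore is not None and key >= stopbefore:
            break  # keys are sorted, so no later key can match
        if any(key.startswith(prefix) for prefix in exclude_prefixes):
            continue  # skip keys under an excluded prefix
        yield key, value


def batches(iterable, size):
    """Group an iterable into lists of at most `size` items."""
    iterator = iter(iterable)
    while True:
        batch = list(islice(iterator, size))
        if not batch:
            return
        yield batch


def count_by_prefix(keys, prefix_len):
    """Count keys grouped by their first `prefix_len` characters."""
    return Counter(key[:prefix_len] for key in keys)


# Made-up sample data standing in for a collection.
records = [
    ("a/1", {"n": 1}),
    ("a/2", {"n": 2}),
    ("b/1", {"n": 3}),
    ("b/2", {"n": 4}),
    ("c/1", {"n": 5}),
]

for batch in batches(scan(records, startafter="a/1", stopbefore="c/1"), size=2):
    print([key for key, _ in batch])

print(count_by_prefix((key for key, _ in records), prefix_len=1))
```

Because the keys are sorted, the scan can stop as soon as it reaches stopbefore instead of reading the rest of the data.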

Up to version 0.1.6: Python 2 only. Starting with version 0.2: Python 3 only.

See the usage instructions in the scanner.py docstring.

Download files

Source Distribution

collection_scanner-0.3.tar.gz (7.3 kB)

Built Distribution

collection_scanner-0.3-py3-none-any.whl (10.6 kB)

File details

Details for the file collection_scanner-0.3.tar.gz.

File metadata

  • Download URL: collection_scanner-0.3.tar.gz
  • Size: 7.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6

File hashes

Hashes for collection_scanner-0.3.tar.gz:

  • SHA256: a4c31788a8860c904e5f8505b59ac97c3c1a915b73db4de7cd31ac24e71d9d8d
  • MD5: 92ce3b33d8cb8c948334737c1664ebf5
  • BLAKE2b-256: 6e42b347735c75b411d577b92841f22ff20b957635f67ff16248122917ff5e04
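
A downloaded file can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the Python standard library. The helper names sha256_of_file and matches_published_hash are hypothetical, and the path passed in is assumed to be wherever the file was saved locally.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare the file's digest with a published hex digest, ignoring case."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, matches_published_hash("collection_scanner-0.3.tar.gz", "a4c31788a8860c904e5f8505b59ac97c3c1a915b73db4de7cd31ac24e71d9d8d") should return True for an unmodified copy of the source distribution.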

File details

Details for the file collection_scanner-0.3-py3-none-any.whl.

File metadata

  • Download URL: collection_scanner-0.3-py3-none-any.whl
  • Size: 10.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6

File hashes

Hashes for collection_scanner-0.3-py3-none-any.whl:

  • SHA256: 1511518094ae71fa6921d7ebcd733388841ab6aff069de2091bc4351651b60f1
  • MD5: 941d951762e2aa21d6eac2e72c866dba
  • BLAKE2b-256: 3618d264b940153b49ae7b37bbee7b4aa1b0a716ae166c24bc1663ae1725ab0b
