Scrapinghub Hubstorage Collection scanner.

Project description

High-level Hubstorage collection scanner.

  • Provides a convenient way to scan a collection in batches

  • Allows merging data from multiple collections

  • Accepts endts and startts either as strings in any format understood by the dateparser library or as standard Hubstorage (HS) epoch timestamps in milliseconds

  • Accepts excluded prefixes

  • Adds a stopbefore option (analogous to startafter, but the inverse)

  • Provides a method for counting keys aggregated by arbitrary prefixes

  • Supports partitioned collections

  • Provides a suite for testing HS collection code
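
The key-range, exclusion, batching and prefix-counting ideas in the list above can be sketched in plain Python. This is not the package's API (the real usage is documented in the scanner.py docstring); the function names scan_keys, in_batches and count_by_prefix are hypothetical, the collection is stood in for by an in-memory dict, and treating both startafter and stopbefore as exclusive bounds is an assumption.

```python
from collections import Counter
from itertools import islice


def scan_keys(records, startafter=None, stopbefore=None, exclude_prefixes=()):
    """Yield (key, value) pairs in key order (hypothetical helper, not the library API)."""
    excluded = tuple(exclude_prefixes)
    for key, value in sorted(records.items()):
        if startafter is not None and key <= startafter:
            continue  # assumed exclusive lower bound: skip keys up to and including it
        if stopbefore is not None and key >= stopbefore:
            break  # assumed exclusive upper bound: stop before reaching this key
        if key.startswith(excluded):
            continue  # drop keys that begin with any excluded prefix
        yield key, value


def in_batches(iterable, batchsize):
    """Group an iterable into lists of at most batchsize items."""
    iterator = iter(iterable)
    while True:
        batch = list(islice(iterator, batchsize))
        if not batch:
            return
        yield batch


def count_by_prefix(keys, prefix_len):
    """Count keys grouped by their first prefix_len characters."""
    return Counter(key[:prefix_len] for key in keys)


# Made-up sample data standing in for a collection.
records = {"a1": 1, "a2": 2, "a3": 3, "b1": 4, "b2": 5, "c1": 6}
selected = scan_keys(records, startafter="a1", stopbefore="c1", exclude_prefixes=["b2"])
batches = list(in_batches(selected, batchsize=2))
prefix_counts = count_by_prefix(records, prefix_len=1)
```

With the sample data, the scan keeps a2, a3 and b1, split into one batch of two items and one batch of one.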

Up to version 0.1.6: Python 2 only. Starting with version 0.2: Python 3 only.

See the usage instructions in the scanner.py docstring.

Installation

pip install collection-scanner

Download files

Source Distribution

collection_scanner-0.5.0.tar.gz (8.3 kB view details)

Uploaded Source

Built Distribution

collection_scanner-0.5.0-py3-none-any.whl (10.5 kB view details)

Uploaded Python 3

File details

Details for the file collection_scanner-0.5.0.tar.gz.

File metadata

  • Download URL: collection_scanner-0.5.0.tar.gz
  • Upload date:
  • Size: 8.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.8.5

File hashes

Hashes for collection_scanner-0.5.0.tar.gz
Algorithm Hash digest
SHA256 c886eb534aee9ed8950cb72606357bb781270bf811c012d4472be0c829b6af47
MD5 7af34652672f38f2254587fb522960cc
BLAKE2b-256 35f99b0256184eef6ceea800b9264ee306032a3b5d854c9d932ae5f022b6e119
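
A downloaded file can be checked against a published SHA256 digest with the standard library. The sketch below is one way to do it; the helper names sha256_of and matches_published_hash are illustrative, and the file path passed in is assumed to point at a local copy of the download.

```python
import hashlib


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at path, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest (case-insensitive)."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, matches_published_hash("collection_scanner-0.5.0.tar.gz", "c886eb534aee9ed8950cb72606357bb781270bf811c012d4472be0c829b6af47") should return True for an intact copy of the source distribution.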

File details

Details for the file collection_scanner-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: collection_scanner-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 10.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.8.5

File hashes

Hashes for collection_scanner-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 74f0543274ccd100d10d58c51dde4d2e325665738e0b0786fae0891c0ecab5ee
MD5 c30c06bdaa9b559a5a6ddbfbff631d41
BLAKE2b-256 28bed599812b32c6c332885ee85a804c84b72d46306a268abc3c58df1582e73c
