Skip to main content

Scrapinghub Hubstorage Collection scanner.

Project description

  • Provides convenient way to scan a collection in batches

  • Allows to merge data from multiple collections

  • Accepts endts and startts in many string formats (as accepted by dateparser lib) or standard HS epoch in millisecs

  • Accepts excluded prefixes

  • Adds stopbefore feature (analogous to startafter but the inverse)

  • Provides method for arbitrary prefix aggregation counting

  • Supports partitioned collections

  • Provides a suite for testing hs collection code.

Up to version 0.1.6: Python2 only Starting version 0.2: Python3 only

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

collection_scanner-0.2.1.tar.gz (7.1 kB view details)

Uploaded Source

Built Distribution

collection_scanner-0.2.1-py3-none-any.whl (10.4 kB view details)

Uploaded Python 3

File details

Details for the file collection_scanner-0.2.1.tar.gz.

File metadata

  • Download URL: collection_scanner-0.2.1.tar.gz
  • Upload date:
  • Size: 7.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6

File hashes

Hashes for collection_scanner-0.2.1.tar.gz
Algorithm Hash digest
SHA256 5abf043db6ceca85e0da7774daa8df84f3128c5a3c3d8a43e521f434f7ac3ff3
MD5 3609c0057ee4e9815215c4cc50e77ddb
BLAKE2b-256 2c901defd24deb0fe885bc2d22e4834fc96ef8fa2736029dd49673ce8eca0870

See more details on using hashes here.

File details

Details for the file collection_scanner-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: collection_scanner-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 10.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6

File hashes

Hashes for collection_scanner-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 86b2fc7efb1848f4326331a9b5d837fc71f2638a957d191d73cf97ba680ba18b
MD5 3cd6d41e0144bb0808e6fd4a84fa8525
BLAKE2b-256 85693b1e3b7ce904070cb2bfd5d6ad739f69456b8ec84ee00fb94d548e7ec97c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page