Skip to main content

tools which can easy the process of making crawler

Project description

Introduction:

This class can be used in crawler project. And it contain two function:

1. proxies IP

2. hide header

If also used pkl cache to speed up the abstract information from IP Pool cache will be expired in one day

How to use: create new crawlerComponent object, and then,you can use:

1. get_an_ip: get a random IP

2. get_a_header: get a random header

3. updateIpLib: update the ip library (original set as once per day)

methods to use


Class Version: 2.0.0 in https://gitlab.com/snippets/1873717

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

crawlerHelper-0.0.1.tar.gz (3.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

crawlerHelper-0.0.1-py3-none-any.whl (4.9 kB view details)

Uploaded Python 3

File details

Details for the file crawlerHelper-0.0.1.tar.gz.

File metadata

  • Download URL: crawlerHelper-0.0.1.tar.gz
  • Upload date:
  • Size: 3.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.4.2 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for crawlerHelper-0.0.1.tar.gz
Algorithm Hash digest
SHA256 bf3010ceebfc4e6bf5c75fe2e3cc40d7b5ca04e3ca607d3a9ea51c0b86ae9d1a
MD5 57aea80f32c2f2164631b66ee739d41f
BLAKE2b-256 0ddfa3f06ba515f8ee67c42c1166743614a41c758bc0996a92ca76238b912c8f

See more details on using hashes here.

File details

Details for the file crawlerHelper-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: crawlerHelper-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 4.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.4.2 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for crawlerHelper-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ecfe6c986857e366d3d26c40464a83cbd0ad1aaf6dffa3a61455fb9a18dcd40c
MD5 2c2530fe1ef5a3bae9768bcb390fc83a
BLAKE2b-256 f15460872d7cc17e76f4787dc1025d002344a9e67f60fb19c4c03b1cf5a00528

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page