Skip to main content

A small crawler to scrape data from swranking.com and store in in the local db of the vm.

Project description

#######################################################

SWAgent Crawler

#######################################################

This is a simple web crawler specifically designed to scrape data from swranking.com continuously in order to build a database with data useful enoughn to train a ML model to make RTA draft predictions in real time.

The package contains two helper classes:

- USERAGENT: creates randomized user_agents to

send through the REST request t obtain data from the websites API.

- SEEKER: this is the actual crawler that finds

the information for us and then sends it out as a json object.

The package main routine focuses on a basic ETL schema. afte obtaining the data from the seeker object it then transfoms the data to be in the format wanted by the Database. Then we send it to the local db of the VM to store for further processing by other jobs.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swagenttools-0.3.19.tar.gz (43.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

swagenttools-0.3.19-py3-none-any.whl (43.3 kB view details)

Uploaded Python 3

File details

Details for the file swagenttools-0.3.19.tar.gz.

File metadata

  • Download URL: swagenttools-0.3.19.tar.gz
  • Upload date:
  • Size: 43.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.19.tar.gz
Algorithm Hash digest
SHA256 5b3bd570569427f185dcbc98f728e6b3dbe1f02af2b861f5a4f27475e7d0d99d
MD5 1309f0d1d368bf6a0c392e63761c3f21
BLAKE2b-256 1d235b99351d3a4d2e1e67f9eb4d901f85c1ff0625f5959e28b64aa51314daa6

See more details on using hashes here.

File details

Details for the file swagenttools-0.3.19-py3-none-any.whl.

File metadata

  • Download URL: swagenttools-0.3.19-py3-none-any.whl
  • Upload date:
  • Size: 43.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.19-py3-none-any.whl
Algorithm Hash digest
SHA256 ec9fa45c4d3b187eecd0ac992ad37a275dc89d34061e9a7eaa1bda40b623df83
MD5 d6e0310511f4a4894d100ff57a400304
BLAKE2b-256 bcd655e78079f519877f59d809ada308c7b398507713fe5e19d156b71be33b30

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page