Skip to main content

A small crawler to scrape data from swranking.com and store in in the local db of the vm.

Project description

#######################################################

SWAgent Crawler

#######################################################

This is a simple web crawler specifically designed to scrape data from swranking.com continuously in order to build a database with data useful enoughn to train a ML model to make RTA draft predictions in real time.

The package contains two helper classes:

- USERAGENT: creates randomized user_agents to

send through the REST request t obtain data from the websites API.

- SEEKER: this is the actual crawler that finds

the information for us and then sends it out as a json object.

The package main routine focuses on a basic ETL schema. afte obtaining the data from the seeker object it then transfoms the data to be in the format wanted by the Database. Then we send it to the local db of the VM to store for further processing by other jobs.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swagenttools-0.3.8.tar.gz (6.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

swagenttools-0.3.8-py3-none-any.whl (6.9 kB view details)

Uploaded Python 3

File details

Details for the file swagenttools-0.3.8.tar.gz.

File metadata

  • Download URL: swagenttools-0.3.8.tar.gz
  • Upload date:
  • Size: 6.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.8.tar.gz
Algorithm Hash digest
SHA256 e087c768affdc5f0bb339cdeefd18de4bd6a3b817544dabc1d5300acd0568514
MD5 4195147829b1eda2a8bc362cd69c6c92
BLAKE2b-256 3de326d4cd1a14728d1062f30cb0b471d7e654b8fb03f2eae4afe62a1cc0d10f

See more details on using hashes here.

File details

Details for the file swagenttools-0.3.8-py3-none-any.whl.

File metadata

  • Download URL: swagenttools-0.3.8-py3-none-any.whl
  • Upload date:
  • Size: 6.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.8-py3-none-any.whl
Algorithm Hash digest
SHA256 70810d2b5e7d816d3a8e7ad6866b1cb62a23216e34f34c0b5cb9284fa294fde3
MD5 af3ae1f6b34db3c877490ffce32fd748
BLAKE2b-256 e3ccede9ef0e3cee59dd60062ce8132d43453d7ee4e99161fac4b2f06c804260

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page