A small crawler that scrapes data from swranking.com and stores it in the VM's local database.

Project description

#######################################################

SWAgent Crawler

#######################################################

This is a simple web crawler designed to scrape data from swranking.com continuously, in order to build a database with enough useful data to train an ML model that makes RTA draft predictions in real time.

The package contains two helper classes:

- USERAGENT: creates randomized user agents to send with the REST requests that obtain data from the website's API.

- SEEKER: the actual crawler, which finds the information and returns it as a JSON object.
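
The idea behind the two helpers can be sketched roughly as follows. This is not the package's actual code: the function names, the pool of User-Agent strings, and the example URL are all assumptions made for illustration, and only the Python standard library is used.

```python
import json
import random
import urllib.request

# Hypothetical pool of User-Agent strings; the real USERAGENT class
# may generate them in a different way.
USER_AGENTS = [
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 "
    "(KHTML, like Gecko) Version/17.0 Safari/605.1.15",
    "Mozilla/5.0 (X11; Linux x86_64; rv:121.0) Gecko/20100101 Firefox/121.0",
]


def random_user_agent(rng=random):
    """Return one User-Agent string chosen at random from the pool."""
    return rng.choice(USER_AGENTS)


def build_request(url, rng=random):
    """Build a GET request that carries a randomized User-Agent header."""
    headers = {"User-Agent": random_user_agent(rng)}
    return urllib.request.Request(url, headers=headers)


def fetch_json(url, opener=urllib.request.urlopen, rng=random):
    """Send the request and decode the response body as JSON.

    `opener` is injectable so the function can be exercised offline.
    """
    with opener(build_request(url, rng)) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call such as `fetch_json("https://example.com/api")` (a placeholder URL, not the site's real endpoint) would return the decoded JSON payload. Each call picks a fresh User-Agent.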

The package's main routine follows a basic ETL pattern. After obtaining the data from the SEEKER object, it transforms the data into the format expected by the database. It then sends the data to the VM's local database, where it is stored for further processing by other jobs.
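
A minimal sketch of such an ETL routine is shown below. It assumes SQLite as the local database and a hypothetical record layout (`id`, `player`, `picks`). The description does not state the real schema or the database engine, so treat every name here as illustrative.

```python
import sqlite3


def transform(records):
    """Turn raw JSON-like dicts into flat rows matching the table columns."""
    rows = []
    for record in records:
        rows.append((
            int(record["id"]),
            str(record["player"]).strip(),
            ",".join(record.get("picks", [])),
        ))
    return rows


def load(conn, rows):
    """Create the table if needed and insert rows, skipping ids already stored."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS drafts ("
        "id INTEGER PRIMARY KEY, player TEXT, picks TEXT)"
    )
    conn.executemany(
        "INSERT OR IGNORE INTO drafts (id, player, picks) VALUES (?, ?, ?)",
        rows,
    )
    conn.commit()


def run_etl(raw_records, conn):
    """Transform already-extracted records and load them; return rows processed."""
    rows = transform(raw_records)
    load(conn, rows)
    return len(rows)
```

Using `INSERT OR IGNORE` keyed on the primary key is one way to keep a continuously running crawler from storing the same record twice. The extract step is left out here, since it would be whatever the SEEKER object returns.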

Download files

Source Distribution

swagenttools-0.3.6.tar.gz (5.9 kB view details)

Uploaded Source

Built Distribution

swagenttools-0.3.6-py3-none-any.whl (6.9 kB view details)

Uploaded Python 3

File details

Details for the file swagenttools-0.3.6.tar.gz.

File metadata

  • Download URL: swagenttools-0.3.6.tar.gz
  • Size: 5.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.6.tar.gz
Algorithm Hash digest
SHA256 b7318a387821e588991774878d85f393219127e63602790843cef1ff6c4383bb
MD5 adf00aa9b3a7358e79ec5ef6a5a9840d
BLAKE2b-256 3f22915f0f75c0dfb348a4b9c0452e00831ca5d31a4ef14de2ceb80f0bb45f9b

File details

Details for the file swagenttools-0.3.6-py3-none-any.whl.

File metadata

  • Download URL: swagenttools-0.3.6-py3-none-any.whl
  • Size: 6.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.6-py3-none-any.whl
Algorithm Hash digest
SHA256 a66ee9bb61ca4e6a61c0018e1c9577838594a47ddd720db2265d4c26880c9a26
MD5 feedb6ecf5afc485c63f9baebb8c10ff
BLAKE2b-256 9292f098cbda87651ef41f97f1e0f404892d767936d07a92a8c40ef44cf691f6
