Skip to main content

A small crawler to scrape data from swranking.com and store in in the local db of the vm.

Project description

#######################################################

SWAgent Crawler

#######################################################

This is a simple web crawler specifically designed to scrape data from swranking.com continuously in order to build a database with data useful enoughn to train a ML model to make RTA draft predictions in real time.

The package contains two helper classes:

- USERAGENT: creates randomized user_agents to

send through the REST request t obtain data from the websites API.

- SEEKER: this is the actual crawler that finds

the information for us and then sends it out as a json object.

The package main routine focuses on a basic ETL schema. afte obtaining the data from the seeker object it then transfoms the data to be in the format wanted by the Database. Then we send it to the local db of the VM to store for further processing by other jobs.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swagenttools-0.3.4.tar.gz (6.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

swagenttools-0.3.4-py3-none-any.whl (6.9 kB view details)

Uploaded Python 3

File details

Details for the file swagenttools-0.3.4.tar.gz.

File metadata

  • Download URL: swagenttools-0.3.4.tar.gz
  • Upload date:
  • Size: 6.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.4.tar.gz
Algorithm Hash digest
SHA256 a8071a244a658b6db7ba3e0b7f42878476488dd22f8f5c1235f694b962171bea
MD5 1c0249aafe036e108ea37dd80920931b
BLAKE2b-256 6a01997481ec6211816c22e5739fd88adcde7b4b79c3f20bc262dd990200bcec

See more details on using hashes here.

File details

Details for the file swagenttools-0.3.4-py3-none-any.whl.

File metadata

  • Download URL: swagenttools-0.3.4-py3-none-any.whl
  • Upload date:
  • Size: 6.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.4-py3-none-any.whl
Algorithm Hash digest
SHA256 43b22740f663ac3fa8977b2feb328de0682fd53ec4f4f97e4fac6e528549d2ce
MD5 c5bb93999d71b76f586e8a25ea90ed4d
BLAKE2b-256 0a3d27c5d4206f2d7eaa45aba874ddb083a80f26a4da5da8ea13487aac11237a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page