Skip to main content

A small crawler to scrape data from swranking.com and store in in the local db of the vm.

Project description

#######################################################

SWAgent Crawler

#######################################################

This is a simple web crawler specifically designed to scrape data from swranking.com continuously in order to build a database with data useful enoughn to train a ML model to make RTA draft predictions in real time.

The package contains two helper classes:

- USERAGENT: creates randomized user_agents to

send through the REST request t obtain data from the websites API.

- SEEKER: this is the actual crawler that finds

the information for us and then sends it out as a json object.

The package main routine focuses on a basic ETL schema. afte obtaining the data from the seeker object it then transfoms the data to be in the format wanted by the Database. Then we send it to the local db of the VM to store for further processing by other jobs.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swagenttools-0.1.2.tar.gz (5.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

swagenttools-0.1.2-py3-none-any.whl (5.9 kB view details)

Uploaded Python 3

File details

Details for the file swagenttools-0.1.2.tar.gz.

File metadata

  • Download URL: swagenttools-0.1.2.tar.gz
  • Upload date:
  • Size: 5.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.1.2.tar.gz
Algorithm Hash digest
SHA256 2ea7417d26d98bdf6355c1b688d5708f112fd8bc0c2ecba5313317ced2ea67de
MD5 2667a588899bd4a587394e305f8b9dd1
BLAKE2b-256 b3cba583e266f585be0a3d26c559a9165d2239a9122565cf6cc80e978db6dfe3

See more details on using hashes here.

File details

Details for the file swagenttools-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: swagenttools-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 5.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 e42c5b1f181994f75ac8463a1ccba866a7b97d86cc827f900ff6f3cb2e005d08
MD5 41e3ad953c5d70072c0a035bdf7bd41a
BLAKE2b-256 f18e422d41a458e50af2067ceed16cb8b37e391b787865f8c06e76f043be3b59

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page