Skip to main content

A small crawler to scrape data from swranking.com and store in in the local db of the vm.

Project description

#######################################################

SWAgent Crawler

#######################################################

This is a simple web crawler specifically designed to scrape data from swranking.com continuously in order to build a database with data useful enoughn to train a ML model to make RTA draft predictions in real time.

The package contains two helper classes:

- USERAGENT: creates randomized user_agents to

send through the REST request t obtain data from the websites API.

- SEEKER: this is the actual crawler that finds

the information for us and then sends it out as a json object.

The package main routine focuses on a basic ETL schema. afte obtaining the data from the seeker object it then transfoms the data to be in the format wanted by the Database. Then we send it to the local db of the VM to store for further processing by other jobs.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swagenttools-0.3.20.tar.gz (43.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

swagenttools-0.3.20-py3-none-any.whl (43.3 kB view details)

Uploaded Python 3

File details

Details for the file swagenttools-0.3.20.tar.gz.

File metadata

  • Download URL: swagenttools-0.3.20.tar.gz
  • Upload date:
  • Size: 43.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.20.tar.gz
Algorithm Hash digest
SHA256 b5eeacefda646112a21160499a52c64516b3ff8368ef05ec9166f65bf9520bf2
MD5 3e5480d4a8fc27727253f544754cf10f
BLAKE2b-256 09fe1d3d7a97c6957b6dad4f8cb4c79a0161bc7f497343101d33c8ee8f79c17e

See more details on using hashes here.

File details

Details for the file swagenttools-0.3.20-py3-none-any.whl.

File metadata

  • Download URL: swagenttools-0.3.20-py3-none-any.whl
  • Upload date:
  • Size: 43.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.20-py3-none-any.whl
Algorithm Hash digest
SHA256 1fc349aa2362ca188597967223b4315f2c3c30938d49a1e5f38538eb1abb1f6e
MD5 14d1cb7bf85abea467a7a1b0fd38f0ae
BLAKE2b-256 af35f73c8c48302b29fcae09ae84e44ff9ec1e9af042856b887c739c59e2b2a4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page