A small crawler that scrapes data from swranking.com and stores it in the local database of the VM.

Project description

#######################################################

SWAgent Crawler

#######################################################

This is a simple web crawler designed to scrape data from swranking.com continuously, building a database with enough useful data to train an ML model that makes RTA draft predictions in real time.

The package contains two helper classes:

- USERAGENT: creates randomized user agents to send with the REST requests that obtain data from the website's API.

- SEEKER: the actual crawler, which finds the information and returns it as a JSON object.
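As a rough illustration of the two helpers described above, the sketch below builds a randomized User-Agent string and attaches it to a request for a JSON endpoint. It is not the package's actual code: the function names `random_user_agent`, `build_request`, and `fetch_json`, the platform list, and the version range are all assumptions made for this example, and no real swranking.com endpoint is assumed.

```python
import json
import random
import urllib.request

# Hypothetical building blocks; the real USERAGENT class may use other values.
PLATFORMS = [
    "Windows NT 10.0; Win64; x64",
    "Macintosh; Intel Mac OS X 10_15_7",
    "X11; Linux x86_64",
]
CHROME_MAJOR_VERSIONS = range(100, 121)


def random_user_agent(rng=random):
    """Return a Chrome-like User-Agent string with a random platform and version."""
    platform = rng.choice(PLATFORMS)
    major = rng.choice(CHROME_MAJOR_VERSIONS)
    return (
        f"Mozilla/5.0 ({platform}) AppleWebKit/537.36 "
        f"(KHTML, like Gecko) Chrome/{major}.0.0.0 Safari/537.36"
    )


def build_request(url, rng=random):
    """Build a GET request that carries a freshly randomized User-Agent."""
    headers = {
        "User-Agent": random_user_agent(rng),
        "Accept": "application/json",
    }
    return urllib.request.Request(url, headers=headers)


def fetch_json(url, rng=random, timeout=10):
    """Send the request and decode the response body as JSON (performs network I/O)."""
    with urllib.request.urlopen(build_request(url, rng), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Passing a seeded `random.Random` instance as `rng` makes the generated header reproducible, which can help when testing a crawler without touching the network.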

The package's main routine follows a basic ETL scheme. After obtaining the data from the SEEKER object, it transforms the data into the format expected by the database, then sends it to the local database of the VM, where it is stored for further processing by other jobs.
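The transform and load steps of such an ETL routine might look roughly like the sketch below. Everything specific here is an assumption for illustration: the payload shape (`matches`, `id`, `player`, `picks`), the `drafts` table, and the use of SQLite as a stand-in for the VM's local database, whose actual engine and schema are not stated.

```python
import sqlite3


def transform(payload):
    """Flatten a (hypothetical) JSON payload into rows matching the table layout."""
    rows = []
    for match in payload.get("matches", []):
        picks = ",".join(str(monster_id) for monster_id in match["picks"])
        rows.append((str(match["id"]), match["player"], picks))
    return rows


def load(conn, rows):
    """Insert or update the rows, then return the total number of stored rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS drafts ("
        "match_id TEXT PRIMARY KEY, player TEXT, picks TEXT)"
    )
    # INSERT OR REPLACE keeps repeated crawls from duplicating a match.
    conn.executemany("INSERT OR REPLACE INTO drafts VALUES (?, ?, ?)", rows)
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM drafts").fetchone()[0]


def run_once(payload, conn):
    """One ETL pass: the extract step is assumed to have produced `payload`."""
    return load(conn, transform(payload))
```

Keying the table on the match identifier means a continuously running crawler can see the same match more than once without growing the table, which suits a job that re-polls the same source.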

Download files


Source Distribution

swagenttools-0.3.18.tar.gz (43.3 kB view details)

Uploaded Source

Built Distribution


swagenttools-0.3.18-py3-none-any.whl (43.3 kB view details)

Uploaded Python 3

File details

Details for the file swagenttools-0.3.18.tar.gz.

File metadata

  • Download URL: swagenttools-0.3.18.tar.gz
  • Upload date:
  • Size: 43.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.18.tar.gz
Algorithm Hash digest
SHA256 49d368b7dfbb11992d698d60765c043f66df88a3b9abd481bfe40f40b398f26c
MD5 a73e020a8df1d0618ff52310082bb567
BLAKE2b-256 53f34aea84549b107af10dc70704097465bb63b31e19319f12dfaca2ea906ae5
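A downloaded archive can be checked against the SHA256 digest listed above with Python's standard `hashlib` module. This is a minimal sketch; the helper names `sha256_of` and `matches_published_hash` are made up for this example, and the file path is whatever location the archive was saved to.

```python
import hashlib

# SHA256 digest published for swagenttools-0.3.18.tar.gz.
EXPECTED_SHA256 = "49d368b7dfbb11992d698d60765c043f66df88a3b9abd481bfe40f40b398f26c"


def sha256_of(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Return True if the file's digest equals the expected value."""
    return sha256_of(path) == expected.lower()
```

For example, `matches_published_hash("swagenttools-0.3.18.tar.gz")` should return `True` for an intact download saved under that name in the current directory.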


File details

Details for the file swagenttools-0.3.18-py3-none-any.whl.

File metadata

  • Download URL: swagenttools-0.3.18-py3-none-any.whl
  • Upload date:
  • Size: 43.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.18-py3-none-any.whl
Algorithm Hash digest
SHA256 42b074ee75a3ab0ee0e3e90a73b6fe62fb40778c85c621f3049f4c2010f72035
MD5 3786fd3271305945e4ee7a87253c3c1b
BLAKE2b-256 08d73e4d25869fa9d21c8c8478620f162e393c50e26f8581b6bd82d2f54fa58d

