Skip to main content

A small crawler to scrape data from swranking.com and store in in the local db of the vm.

Project description

#######################################################

SWAgent Crawler

#######################################################

This is a simple web crawler specifically designed to scrape data from swranking.com continuously in order to build a database with data useful enoughn to train a ML model to make RTA draft predictions in real time.

The package contains two helper classes:

- USERAGENT: creates randomized user_agents to

send through the REST request t obtain data from the websites API.

- SEEKER: this is the actual crawler that finds

the information for us and then sends it out as a json object.

The package main routine focuses on a basic ETL schema. afte obtaining the data from the seeker object it then transfoms the data to be in the format wanted by the Database. Then we send it to the local db of the VM to store for further processing by other jobs.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swagenttools-0.3.9.tar.gz (6.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

swagenttools-0.3.9-py3-none-any.whl (6.9 kB view details)

Uploaded Python 3

File details

Details for the file swagenttools-0.3.9.tar.gz.

File metadata

  • Download URL: swagenttools-0.3.9.tar.gz
  • Upload date:
  • Size: 6.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.9.tar.gz
Algorithm Hash digest
SHA256 5920ab2598325c6ed02e8af3a1a09b5fe7d1de4e75ea4f3e857879f17f2cc226
MD5 ac0aa1497ae1f452c534034511c493c9
BLAKE2b-256 cdb859e0b5d9e2e3c12e9d332cbf0650c54ae10ea8240ff80e5d6ac1f588df27

See more details on using hashes here.

File details

Details for the file swagenttools-0.3.9-py3-none-any.whl.

File metadata

  • Download URL: swagenttools-0.3.9-py3-none-any.whl
  • Upload date:
  • Size: 6.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for swagenttools-0.3.9-py3-none-any.whl
Algorithm Hash digest
SHA256 b06dec806e2b48fe0e76fed6aa8ad87136ea665f7ca2ef52b128ece90df9c2fb
MD5 ff56e053c44a828e8eeae2b51882a90e
BLAKE2b-256 222fdf1490643aa62a80e80e7ad99e2ad4bef91515ea58dc4b2cd99234739962

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page