Skip to main content

No project description provided

Project description

RobotsTxt

Description

The project is a robots.txt parser. The parser is written in Python, easy to use and has small improvements for typos. The project is open source and encourages contributions and ideas for improvements.

Usage

The usage of the package is simple. first you have to init the parser.

USER_AGENT = "GOOGLEBOT"
RobotsTxt_TheHunterDog.initRobotsTxt(USER_AGENT, robots_txt_url = "https://www.google.com/robots.txt")

when it has been initailized the allowed and disallowd list has been parsed and stored. Each get request must be checked against the list. this is being done with the built in make get request function.

USER_AGENT = "GOOGLEBOT"
RobotsTxt_TheHunterDog.main.makeRequest("https://www.google.com", USER_AGENT)

The user agent is required for each request, so that the user agent is supplied to the server.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

robotstxt_thehunterdog-0.0.3.tar.gz (2.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

robotstxt_thehunterdog-0.0.3-py3-none-any.whl (2.9 kB view details)

Uploaded Python 3

File details

Details for the file robotstxt_thehunterdog-0.0.3.tar.gz.

File metadata

  • Download URL: robotstxt_thehunterdog-0.0.3.tar.gz
  • Upload date:
  • Size: 2.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for robotstxt_thehunterdog-0.0.3.tar.gz
Algorithm Hash digest
SHA256 50c2ecd9011d7842c70bb12b1faad1babe9974b87aa7031da7f4c6937b574cf3
MD5 6a0ca1640ad43ee5a5bdbbc8607d9a0e
BLAKE2b-256 85e084565370622b07a7ffcf11a133a066debf0d44ffea1d2822da6ce79ea049

See more details on using hashes here.

File details

Details for the file robotstxt_thehunterdog-0.0.3-py3-none-any.whl.

File metadata

File hashes

Hashes for robotstxt_thehunterdog-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 6ced83c799cb7b80bc7a5e2bc14a4822c6b27525e4c2de4eb1215525d4ad8a65
MD5 1d955517bf4871b38ed8cf1a3fd133b2
BLAKE2b-256 41d2492a3acbb307ddf54f96875f1452d8a86a7f39f8b8af16ea14224d659d5e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page