Skip to main content

No project description provided

Project description

RobotsTxt

Description

The project is a robots.txt parser. The parser is written in Python, easy to use and has small improvements for typos. The project is open source and encourages contributions and ideas for improvements.

Usage

The usage of the package is simple. first you have to init the parser.

USER_AGENT = "GOOGLEBOT"
RobotsTxt_TheHunterDog.initRobotsTxt(USER_AGENT, robots_txt_url = "https://www.google.com/robots.txt")

when it has been initailized the allowed and disallowd list has been parsed and stored. Each get request must be checked against the list. this is being done with the built in make get request function.

RobotsTxt_TheHunterDog.main.makeRequest("https://www.google.com")

The user agent that has been initailized will be used for each get request made using the function.

Contributing

Pull requests are welcome, for major changes please open an issue first to discuss what you would like to change. Issues are also welcome to report bugs, request features or suggest improvements.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

robotstxt_thehunterdog-0.0.5.tar.gz (3.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

robotstxt_thehunterdog-0.0.5-py3-none-any.whl (4.0 kB view details)

Uploaded Python 3

File details

Details for the file robotstxt_thehunterdog-0.0.5.tar.gz.

File metadata

  • Download URL: robotstxt_thehunterdog-0.0.5.tar.gz
  • Upload date:
  • Size: 3.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for robotstxt_thehunterdog-0.0.5.tar.gz
Algorithm Hash digest
SHA256 87652cd5ede298c3ce5e5f6876d15b40e8d7461c42eb2e1162b03f7c0908b89e
MD5 05ab2899d2558bd716b5c2e1cf2068d0
BLAKE2b-256 e7cac03b9dddb1dda6fa51868ec2e54e541eb9b625ca3ae2b0c03e6468ac1f85

See more details on using hashes here.

File details

Details for the file robotstxt_thehunterdog-0.0.5-py3-none-any.whl.

File metadata

File hashes

Hashes for robotstxt_thehunterdog-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 714be2560958756e50df62badf2a58c7e51b39898abaa713b33692c3df94b0d9
MD5 1005390f128a2fae2a232e5b6d628f15
BLAKE2b-256 400b9232b5a0e7358a1c3af2baaed919bff72444ff93eab623c65e64e39de071

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page