A Python module for web scraping and image scraping

Thorn Remastered

Thorn Remastered is an enhanced version of the original Thorn.py project, designed to simplify web scraping tasks by providing a versatile set of tools.

Features

  • Scrapes a wide range of content types, including HTML, JSON, CSS, and more.
  • Image scraping capabilities for downloading images from websites.
  • User-friendly and interactive command-line interface.
  • Flexible and extensible architecture for adding new scraping functionality.
  • Built-in error handling to ensure robust scraping.
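
How Thorn Remastered handles errors internally is not described here, so a caller may still want a safety net of its own. The following is a minimal sketch of a generic retry wrapper built only on the standard library; `with_retries`, `attempts`, and `delay` are hypothetical names introduced for illustration and are not part of the package.

```python
import time


def with_retries(func, *args, attempts=3, delay=1.0, **kwargs):
    """Call func(*args, **kwargs), retrying on any Exception up to `attempts` times."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            return func(*args, **kwargs)
        except Exception as exc:
            # Broad on purpose: the exception types Thorn raises are not
            # documented here, so this sketch assumes nothing about them.
            last_error = exc
            if attempt < attempts:
                # Linear backoff: wait a little longer after each failure.
                time.sleep(delay * attempt)
    # Every attempt failed, so surface the last error to the caller.
    raise last_error
```

Under these assumptions, any scraping call could be wrapped as `with_retries(thorn.scrape_html, "https://example.com")`.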

Installation

You can install Thorn Remastered via pip:

pip install thorn.py-remast
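
To confirm that the install succeeded, the installed version can be queried from Python. This is a small sketch using the standard library's `importlib.metadata`; `installed_version` is a hypothetical helper name, and the distribution name is copied from the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# Prints a version string if the package is installed, otherwise None.
print(installed_version("thorn.py-remast"))
```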

Usage

Here are some examples of how to use Thorn Remastered:

Scraping HTML

from thorn import Thorn

thorn = Thorn()
html_content = thorn.scrape_html("https://example.com")
print(html_content)

Scraping JSON

from thorn import Thorn

thorn = Thorn()
json_data = thorn.scrape_json("https://api.example.com/data.json")
print(json_data)

Scraping Images

from thorn import ImageThorn

image_thorn = ImageThorn()
image_thorn.scrape_images("https://example.com", output_folder="./example_images")
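
To give a sense of what image scraping involves, the sketch below collects image URLs from a page's markup using only the standard library. It is not `ImageThorn`'s implementation, which is not shown here; `ImgSrcCollector` and `find_image_urls` are hypothetical names, and the sample markup is made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ImgSrcCollector(HTMLParser):
    """Collect the src attribute of every <img> tag as an absolute URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        src = dict(attrs).get("src")
        if src:
            # Resolve relative paths such as "/logo.png" against the page URL.
            self.urls.append(urljoin(self.base_url, src))


def find_image_urls(html, base_url):
    """Return absolute image URLs found in an HTML string."""
    collector = ImgSrcCollector(base_url)
    collector.feed(html)
    return collector.urls


sample = (
    '<p><img src="/logo.png">'
    '<img src="https://cdn.example.com/a.jpg">'
    '<img alt="no src"></p>'
)
print(find_image_urls(sample, "https://example.com/page"))
```

Downloading each URL into a folder would be a separate step; the sketch stops at discovery so that it runs without network access.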

Scraping Python Code

from thorn import Thorn

thorn = Thorn()
python_code = thorn.scrape_python("https://raw.githubusercontent.com/example/repo/main/script.py")
print(python_code)

Contributing

Contributions are welcome! Feel free to open issues, submit pull requests, or suggest improvements.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Special thanks to the original Thorn.py project for inspiration.


Note: Always ensure compliance with website terms of service and legal regulations when performing web scraping.
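
One concrete part of that compliance is honoring a site's robots.txt. The sketch below checks a URL against robots.txt rules with the standard library's `urllib.robotparser`; `is_allowed` is a hypothetical helper, the rules are a made-up example, and nothing here is claimed to be done by Thorn Remastered itself.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt, url, user_agent="*"):
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


rules = """\
User-agent: *
Disallow: /private/
"""
print(is_allowed(rules, "https://example.com/index.html"))
print(is_allowed(rules, "https://example.com/private/data.html"))
```

Against a live site, `RobotFileParser.set_url()` followed by `read()` fetches the real file. A robots.txt check does not replace reading the site's terms of service.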

Download files

Source Distribution: thorn.py-remast-2.1.9.tar.gz (4.3 kB)

Built Distribution: thorn.py_remast-2.1.9-py3-none-any.whl (5.1 kB, Python 3)
