Skip to main content

A Python module for web scraping and image scraping

Project description

Thorn Remastered

Thorn Remastered Logo

Thorn Remastered is an enhanced version of the original Thorn.py project, designed to simplify web scraping tasks by providing a versatile set of tools.

Features

  • Scrapes a wide range of content types, including HTML, JSON, CSS, and more.
  • Image scraping capabilities for downloading images from websites.
  • User-friendly and interactive command-line interface.
  • Flexible and extensible architecture for adding new scraping functionality.
  • Built-in error handling to ensure robust scraping.

Installation

You can install Thorn Remastered via pip:

pip install thorn.py-remast

Usage

Here are some examples of how to use Thorn Remastered:

Scraping HTML

from thorn import Thorn

thorn = Thorn()
html_content = thorn.scrape_html("https://example.com")
print(html_content)

Scraping JSON

from thorn import Thorn

thorn = Thorn()
json_data = thorn.scrape_json("https://api.example.com/data.json")
print(json_data)

Scraping Images

from thorn import ImageThorn

image_thorn = ImageThorn()
image_thorn.scrape_images("https://example.com", output_folder="./example_images")

Scraping Python Code

from thorn import Thorn

thorn = Thorn()
python_code = thorn.scrape_python("https://raw.githubusercontent.com/example/repo/main/script.py")
print(python_code)

Contributing

Contributions are welcome! Feel free to open issues, submit pull requests, or suggest improvements.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Special thanks to the original Thorn.py project for inspiration.


Note: Always ensure compliance with website terms of service and legal regulations when performing web scraping.

Table of Contents

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

thorn.py-remast-2.1.9.tar.gz (4.3 kB view details)

Uploaded Source

Built Distribution

thorn.py_remast-2.1.9-py3-none-any.whl (5.1 kB view details)

Uploaded Python 3

File details

Details for the file thorn.py-remast-2.1.9.tar.gz.

File metadata

  • Download URL: thorn.py-remast-2.1.9.tar.gz
  • Upload date:
  • Size: 4.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for thorn.py-remast-2.1.9.tar.gz
Algorithm Hash digest
SHA256 fef427cac51594d894e3b94b6227b234d6a200a830aec515b1cabe92b5d4188d
MD5 ec6936d2d860048e0887058be33118f6
BLAKE2b-256 ca3888e7d055e5b4ffb2c2f82dabe6796119441a582c3ee6bffd7b7964f9d38a

See more details on using hashes here.

File details

Details for the file thorn.py_remast-2.1.9-py3-none-any.whl.

File metadata

File hashes

Hashes for thorn.py_remast-2.1.9-py3-none-any.whl
Algorithm Hash digest
SHA256 d9a25180cdca7ff0dfa97252edb2274b1cc24481a0faed572a1d5d66e4d35a4e
MD5 df50270a3d0badef76050cdcb1576b7e
BLAKE2b-256 57034780ad6440a1b7053b56de7a4a78118fb3dfd8757ddd2b7dda9fc4e98feb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page