A Python module for web scraping and image scraping
Thorn Remastered
Thorn Remastered is an enhanced version of the original Thorn.py project. It simplifies common web scraping tasks with helpers for fetching HTML, JSON, Python source files, and images.
Features
- Scrapes a wide range of content types, including HTML, JSON, CSS, and more.
- Image scraping capabilities for downloading images from websites.
- User-friendly and interactive command-line interface.
- Flexible and extensible architecture for adding new scraping functionality.
- Built-in error handling to ensure robust scraping.
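The feature list mentions built-in error handling, but this page does not say which exceptions, if any, can still reach the caller. As a minimal sketch under that uncertainty, a caller-side wrapper can retry any scrape function and fall back to None. The name try_scrape and its retries parameter are hypothetical and not part of Thorn Remastered.

```python
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def try_scrape(scrape: Callable[[str], T], url: str, retries: int = 2) -> Optional[T]:
    """Call scrape(url), retrying on failure; return None if every attempt fails.

    Hypothetical helper: `scrape` can be any one-argument callable,
    for example a bound method such as Thorn().scrape_html.
    """
    # One initial attempt plus `retries` further attempts.
    for attempt in range(1, retries + 2):
        try:
            return scrape(url)
        except Exception as exc:
            # Deliberately broad: the exception types raised by the library
            # are not documented on this page.
            print(f"attempt {attempt} for {url} failed: {exc}")
    return None
```

Assuming the usage shown in the next section, a call could look like try_scrape(Thorn().scrape_html, "https://example.com").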
Installation
You can install Thorn Remastered via pip:
pip install thorn.py-remast
Usage
Here are some examples of how to use Thorn Remastered:
Scraping HTML
from thorn import Thorn
thorn = Thorn()
html_content = thorn.scrape_html("https://example.com")
print(html_content)
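Once the page is fetched, a typical next step is pulling something out of it. The sketch below assumes scrape_html returns the page as a string, which the example above suggests but does not confirm. It uses only the standard library, and the names LinkCollector and extract_links are hypothetical.

```python
from html.parser import HTMLParser
from typing import List


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag seen while parsing."""

    def __init__(self) -> None:
        super().__init__()
        self.links: List[str] = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html_text: str) -> List[str]:
    """Return the href values found in html_text, in document order."""
    collector = LinkCollector()
    collector.feed(html_text)
    collector.close()
    return collector.links
```

With the example above, extract_links(html_content) would list the links found on the page, provided html_content is a string.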
Scraping JSON
from thorn import Thorn
thorn = Thorn()
json_data = thorn.scrape_json("https://api.example.com/data.json")
print(json_data)
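This page does not state whether scrape_json returns already-parsed data or the raw response text. A small sketch that tolerates both cases, using a hypothetical helper named as_parsed_json:

```python
import json
from typing import Any


def as_parsed_json(data: Any) -> Any:
    """Return parsed JSON whether `data` is raw text, raw bytes, or already parsed.

    Caveat: if the library already returned a parsed JSON *string value*,
    this would try to parse it a second time.
    """
    if isinstance(data, (bytes, bytearray)):
        data = data.decode("utf-8")
    if isinstance(data, str):
        return json.loads(data)
    # Assume dicts, lists, numbers and so on are already parsed.
    return data
```

Calling as_parsed_json(json_data) then gives a dict or list to work with in either case.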
Scraping Images
from thorn import ImageThorn
image_thorn = ImageThorn()
# Download the images found on the page into output_folder
image_thorn.scrape_images("https://example.com", output_folder="./example_images")
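The scrape_images call above prints nothing, so it can help to check what ended up in the output folder. The following sketch only inspects the file system and makes no assumption about how the library names the files it saves. The helper list_images and the suffix set are hypothetical choices.

```python
from pathlib import Path
from typing import List

# Illustrative set of suffixes; adjust it to the formats you expect.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".gif", ".webp", ".svg"}


def list_images(folder: str) -> List[Path]:
    """Return the image files directly inside `folder`, sorted by path."""
    root = Path(folder)
    if not root.is_dir():
        # Nothing was written, or the folder name is wrong.
        return []
    return sorted(
        p for p in root.iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )
```

For the example above, list_images("./example_images") would show which files were written.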
Scraping Python Code
from thorn import Thorn
thorn = Thorn()
python_code = thorn.scrape_python("https://raw.githubusercontent.com/example/repo/main/script.py")
print(python_code)
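A fetched file is not always what it claims to be; an error page served in place of the script is a common case. Assuming scrape_python returns the source as a string, a syntax check with the standard ast module is a cheap sanity test before saving or reading the code. The helper name is_valid_python is hypothetical.

```python
import ast


def is_valid_python(source: str) -> bool:
    """Return True if `source` parses as Python; the code is never executed."""
    try:
        ast.parse(source)
    except (SyntaxError, ValueError):
        # ValueError covers input such as null bytes on some Python versions.
        return False
    return True
```

Parsing only checks syntax. It says nothing about whether the code is safe to run.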
Contributing
Contributions are welcome! Feel free to open issues, submit pull requests, or suggest improvements.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Acknowledgments
Special thanks to the original Thorn.py project for inspiration.
Note: Always ensure compliance with website terms of service and legal regulations when performing web scraping.
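One part of that check can be automated: a site's robots.txt rules. The sketch below uses the standard library's urllib.robotparser on robots.txt text you have already obtained. The helper name allowed_by_robots is hypothetical, and robots.txt does not replace reading a site's terms of service.

```python
from urllib.robotparser import RobotFileParser


def allowed_by_robots(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network access happens here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)
```

To load the rules directly from a site, RobotFileParser also provides set_url() and read().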
File details
Details for the file thorn.py-remast-2.1.9.tar.gz.
File metadata
- Download URL: thorn.py-remast-2.1.9.tar.gz
- Upload date:
- Size: 4.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.4
File hashes
Algorithm | Hash digest
---|---
SHA256 | fef427cac51594d894e3b94b6227b234d6a200a830aec515b1cabe92b5d4188d
MD5 | ec6936d2d860048e0887058be33118f6
BLAKE2b-256 | ca3888e7d055e5b4ffb2c2f82dabe6796119441a582c3ee6bffd7b7964f9d38a
File details
Details for the file thorn.py_remast-2.1.9-py3-none-any.whl.
File metadata
- Download URL: thorn.py_remast-2.1.9-py3-none-any.whl
- Upload date:
- Size: 5.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.4
File hashes
Algorithm | Hash digest
---|---
SHA256 | d9a25180cdca7ff0dfa97252edb2274b1cc24481a0faed572a1d5d66e4d35a4e
MD5 | df50270a3d0badef76050cdcb1576b7e
BLAKE2b-256 | 57034780ad6440a1b7053b56de7a4a78118fb3dfd8757ddd2b7dda9fc4e98feb
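The SHA256 digests listed above can be used to check a manually downloaded file. A minimal sketch with the standard hashlib module follows; the helper name sha256_matches is hypothetical.

```python
import hashlib
import hmac


def sha256_matches(path: str, expected_hex: str) -> bool:
    """Return True if the file at `path` has the given SHA256 hex digest."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    # Normalise whitespace and case before comparing the hex strings.
    return hmac.compare_digest(digest.hexdigest(), expected_hex.strip().lower())
```

For the source distribution, the call would be sha256_matches("thorn.py-remast-2.1.9.tar.gz", "fef427cac51594d894e3b94b6227b234d6a200a830aec515b1cabe92b5d4188d"), run from the directory containing the download.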