A powerful and easy-to-use web scraper for collecting data from the web. Supports scraping of images, text, videos, metadata, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code.

Project description

Webtrench

WebTrench provides a comprehensive and powerful toolkit for web scraping. Whether you're working on a machine learning project, conducting research, or simply need to gather data from the web, WebTrench is the perfect tool for the job. So why wait? Start using WebTrench today and streamline your data collection process!

pip install Webtrench

Check Documentation Here

Why WebTrench

  • Easy to use: With its simple and intuitive interface, WebTrench makes it easy to extract data from the web.
  • Comprehensive: WebTrench includes functions for extracting a wide range of data, from images to tables and beyond.
  • Fast and efficient: WebTrench is designed to be fast and efficient, so you can quickly gather the data you need.
  • Suitable for a variety of use cases: Whether you're working on a machine learning project, conducting research, or simply need to gather data from the web, WebTrench is a versatile tool that can meet your needs.

from Webtrench import ImageScrapper

# Page to scrape and the local folder that receives the downloaded files
url = 'https://example.com'
folder_path = './images'
ImageScrapper.all_image_from_url(url, folder_path)

This code snippet downloads the images found on the page at https://example.com and saves them in the ./images folder, using a random number as each file name.
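To give a feel for what collecting images from a page involves, here is a minimal sketch that uses only the Python standard library. It is not WebTrench's implementation; the class ImageSrcCollector, the function find_image_urls, and the sample HTML are hypothetical names chosen for illustration. It only extracts image URLs from an HTML string and does not download anything.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ImageSrcCollector(HTMLParser):
    """Collects absolute image URLs from <img> tags (hypothetical helper)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.image_urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        src = dict(attrs).get("src")
        if src:
            # Resolve relative paths such as "/a.png" against the page URL.
            self.image_urls.append(urljoin(self.base_url, src))


def find_image_urls(html, base_url):
    """Return the image URLs referenced by <img> tags in the given HTML."""
    parser = ImageSrcCollector(base_url)
    parser.feed(html)
    return parser.image_urls


# Sample page content, made up for this example.
sample_html = (
    '<html><body>'
    '<img src="/a.png">'
    '<img src="https://cdn.example.com/b.jpg">'
    '<img alt="no source">'
    '</body></html>'
)

print(find_image_urls(sample_html, "https://example.com"))
```

A real scraper would then fetch each URL and write the response bytes to the target folder. The sketch stops before that step so it can run without network access.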

Limitations of WebTrench

  • Depends on website structure: The success of web scraping with WebTrench depends on the structure of the website being scraped. If the website's structure changes, WebTrench may not work as expected.
  • Legal restrictions: There may be legal restrictions on the use of web scraping, so it's important to familiarize yourself with the laws in your jurisdiction before using WebTrench.

Privacy Policy

WebTrench respects the privacy of its users and is committed to protecting their data. We do not collect or store any personal information, and all data collected through the use of WebTrench is kept confidential.

Web Scraping Ethics

When using WebTrench or any other web scraping tool, it's important to follow ethical guidelines and avoid scraping websites without the owner's permission. This includes websites that explicitly prohibit scraping, as well as websites that contain sensitive or confidential information.
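One common way a site states what it allows crawlers to fetch is its robots.txt file. The sketch below shows how such rules could be checked before scraping, using Python's standard urllib.robotparser module. It is independent of WebTrench; the helper is_allowed, the user agent name "my-scraper", and the sample rules are assumptions made for illustration. A robots.txt check does not replace the owner's permission or a site's terms of use.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt, user_agent, url):
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() accepts the file content as an iterable of lines.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Sample rules, made up for this example.
sample_robots = """\
User-agent: *
Disallow: /private/
"""

print(is_allowed(sample_robots, "my-scraper", "https://example.com/images/"))
print(is_allowed(sample_robots, "my-scraper", "https://example.com/private/data"))
```

In practice the rules would be loaded from the site itself, for example with RobotFileParser.set_url() followed by read(). The text is passed in directly here so the example runs offline.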

Legal Warning

The use of web scraping may be subject to legal restrictions, and the legality of web scraping depends on the jurisdiction in which it is being used. Before using WebTrench, it's important to familiarize yourself with the laws in your jurisdiction and ensure that your use of the tool complies with all applicable laws. WebTrench cannot be held responsible for any illegal use of the tool.

Contributing Guide

We welcome contributions from the community! If you are interested in contributing to the WebTrench project, here are some guidelines to get started:

  • Check the issues page to see if there are any open bugs or features that you would like to work on.
  • Fork the repository and make your changes in a separate branch.
  • Once you have made your changes, submit a pull request for review.
  • The project maintainers will review your pull request and provide feedback. If necessary, make any requested changes and resubmit your pull request.
  • Once your pull request is approved and merged, you will become a contributor to the WebTrench project!

Project Clone Guide

If you would like to clone the WebTrench repository, follow these steps:

  • Install Git on your computer.
  • Open a terminal window and navigate to the directory where you would like to clone the repository.
  • Run the following command: git clone https://github.com/nuhmanpk/WebTrench.git
  • The repository will be cloned to your computer, and you can now make changes to the code and contribute to the project.

Reminder

Please note that WebTrench is currently in the pre-release stage and is not yet finished. If you encounter any issues, please check the issues page, or consider contributing to make a better version of WebTrench!

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Webtrench-0.1.2.tar.gz (9.4 kB view details)

Uploaded Source

Built Distribution

Webtrench-0.1.2-py3-none-any.whl (9.4 kB view details)

Uploaded Python 3

File details

Details for the file Webtrench-0.1.2.tar.gz.

File metadata

  • Download URL: Webtrench-0.1.2.tar.gz
  • Upload date:
  • Size: 9.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for Webtrench-0.1.2.tar.gz
Algorithm    Hash digest
SHA256       c7b670b7530794cc8d3ef002308ffa1c638259c02d43fb061d889c1e61f8b976
MD5          3cc02369eeb8f0afa705c01a3cef9609
BLAKE2b-256  771a09db88be8c1289ef130631c9771eb8e258432819fa22405cba6fccafca14
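A published digest can be used to check that a downloaded file arrived intact. The following is a minimal sketch using Python's standard hashlib module; the helper names sha256_of_file and matches_published_hash are hypothetical and are not part of WebTrench or of any packaging tool.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at path, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare the file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For the source distribution, the call would look like matches_published_hash("Webtrench-0.1.2.tar.gz", "c7b670b7530794cc8d3ef002308ffa1c638259c02d43fb061d889c1e61f8b976"), assuming the archive has been downloaded to the current directory.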


File details

Details for the file Webtrench-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: Webtrench-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 9.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for Webtrench-0.1.2-py3-none-any.whl
Algorithm    Hash digest
SHA256       213e0b0f635cc772549b2b5948038149a42a54fcee5899dd6d1d198219e3c94a
MD5          d7d5d9d39752c6a044d9bee13c13e2e2
BLAKE2b-256  8a20db7d39dd3bcbf4bb8c9d15517c3b19a98ad0c6d94957eb1b1d3fa3f4083e
