# WebSchaber

WebSchaber is a Python 3 script that extracts text and image content from the search engine `bing.com`.

It returns only the meaningful results and images for a search query. It does not download ad content, which saves data for the user.

The script prompts for a search term and creates a directory for it (if one does not already exist) in the script's root directory, where all content for that search is stored. It downloads each result's link text and the hyperlink it points to, and saves them to a .txt file inside that directory. The same directory also holds the images downloaded by the script.
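The directory-and-text-file step described above can be sketched roughly as follows. This is a minimal illustration using only the standard library, not WebSchaber's actual code: the function name `save_results`, the file name `results.txt`, and the choice to replace spaces with underscores in the folder name are all assumptions made for the example.

```python
import os
import tempfile


def save_results(search_term, results, root="."):
    """Write (link text, URL) pairs to a .txt file in a per-search folder.

    Hypothetical helper: the folder and file naming are assumptions,
    not WebSchaber's real layout.
    """
    # One folder per search term, created only if it does not exist yet.
    folder = os.path.join(root, search_term.strip().replace(" ", "_"))
    os.makedirs(folder, exist_ok=True)

    out_path = os.path.join(folder, "results.txt")
    with open(out_path, "w", encoding="utf-8") as fh:
        for text, link in results:
            # Link text on one line, its URL on the next, then a blank line.
            fh.write(f"{text}\n{link}\n\n")
    return out_path


if __name__ == "__main__":
    # Demo in a throwaway directory so nothing is left behind.
    with tempfile.TemporaryDirectory() as tmp:
        path = save_results(
            "red pandas",
            [("Red panda - Wikipedia", "https://en.wikipedia.org/wiki/Red_panda")],
            root=tmp,
        )
        print(open(path, encoding="utf-8").read())
```

Using `os.makedirs(..., exist_ok=True)` covers the "if not made previously" case without a separate existence check.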

## Requirements
1. Modules:

a. requests: Used for sending HTTP requests. This project uses the GET method.

b. BeautifulSoup: Used for parsing HTML into a navigable tree of elements with an HTML parser. This project uses bs4.

c. os: Used for checking and making directories.

d. PIL.Image: Used for decoding image content.

e. io.BytesIO: Used for wrapping the downloaded image bytes in an in-memory buffer so that PIL.Image can open and save them.

2. Internet connection: A continuous, high-speed internet connection is required for the script to work properly, because it continuously copies images to the local machine.

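As a rough illustration of how these modules fit together, the sketch below parses a small inline HTML snippet with bs4 and decodes image bytes through `io.BytesIO` and `PIL.Image`. It makes no network calls. The sample markup, the class names `b_algo` and `b_ad`, and the helper names `extract_links`, `decode_image` and `make_sample_png` are assumptions for the example, not WebSchaber's real code or a statement about Bing's actual page structure.

```python
from io import BytesIO

from bs4 import BeautifulSoup
from PIL import Image

# Made-up markup standing in for a results page (class names are assumed).
SAMPLE_HTML = """
<ol>
  <li class="b_algo"><h2><a href="https://example.com/a">First result</a></h2></li>
  <li class="b_ad"><h2><a href="https://ads.example.com">Sponsored</a></h2></li>
  <li class="b_algo"><h2><a href="https://example.com/b">Second result</a></h2></li>
</ol>
"""


def extract_links(html):
    """Return (link text, URL) pairs for result items, skipping ad items."""
    soup = BeautifulSoup(html, "html.parser")
    pairs = []
    for item in soup.find_all("li", class_="b_algo"):
        anchor = item.find("a", href=True)
        if anchor is not None:
            pairs.append((anchor.get_text(strip=True), anchor["href"]))
    return pairs


def decode_image(raw_bytes):
    """Open raw image bytes with PIL via an in-memory buffer."""
    # In a real run the bytes would come from something like
    # requests.get(image_url).content.
    image = Image.open(BytesIO(raw_bytes))
    image.load()  # force decoding so the buffer is no longer needed
    return image


def make_sample_png():
    """Build a tiny PNG in memory to stand in for a downloaded image."""
    buffer = BytesIO()
    Image.new("RGB", (4, 3), "white").save(buffer, format="PNG")
    return buffer.getvalue()


if __name__ == "__main__":
    print(extract_links(SAMPLE_HTML))
    print(decode_image(make_sample_png()).size)
```

Filtering on the element class is only one possible way to leave out ad content; the actual script may use a different rule.
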
## Installation

`pip install webschaber`

