A piece of cake way to scrape product images from Amazon or flipkart
Project description
Imgscrape is a selenium and pillow based image scraper.
Supported Websites
Features!
Scrapes image link
Convert image into thumbnail
Encode image into Base64
Save image directly to folder : In development
Imgscrape is a selenium and pillow based image scraper. You just provide the product link or a list of the same and it will provide you the links in image or list of it based on your argument.
The reason behind this idea is to scrape image links are unreachable by using Beautifulsoup4. Thus switching to selenium gets you the images you need.
Installation & Usage
$ pip install imgscrape
Imgscrape requires chromedriver to run. Download it.
After downloading you’ll need to provide the path of chromedriver to the constructer.
from imgscrape import imgscrape
any_var = #path to driver
var = imgscrape(producturl, path= any_var)
you can feed a list of product urls and get the image links in a list.
from imgscrape import imgscrape
any_var = #path to driver
lst_var = ['link1', 'link2', 'link3']
var = imgscrape(*lst_var, path= any_var)
Development
Want to contribute? Great! Create a pull request and we’ll see.
Support
linux
windows
### Todos
Add support for MacOS
Add more supported websites
License
MIT
Free Software, Hell Yeah!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file imgscrape-0.1.1.tar.gz
.
File metadata
- Download URL: imgscrape-0.1.1.tar.gz
- Upload date:
- Size: 2.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fdf8403046e3c8229a767c448dc41f9407d58197c0e66a6a7267eaceff340d82 |
|
MD5 | 01cb39b8b40282e149c72f4de941c6a6 |
|
BLAKE2b-256 | bce482190c147acbad81b1f21316604ebdfd4a20a7a64ef92df0247724d3d285 |