A simple image scraper that downloads all images from a given URL
ImageScraper
============
First python app :D
A simple Python script that downloads all the images on a given webpage.
Download
--------
tar file:
Grab the latest build from https://pypi.python.org/pypi/ImageScraper
pip install:
$ pip install ImageScraper
Usage
-----
Using the tar file:
Extract the contents of the tar file.
Note that ``ImageScraper`` depends on ``lxml`` and ``requests``.
If ``lxml`` fails to compile when installed through ``pip``, install the ``libxml2-dev`` and ``libxslt-dev`` packages on your system first.
$ cd ImageScraper/image_scraper/
$ python __init__.py
Enter URL to scrap: https://github.com
Found 6 images:
How many images do you want ? : 6
Done.
If installed using pip:
Open Python in a terminal.
$ python
>>> import image_scraper
Enter URL to scrap: https://github.com
Found 6 images:
How many images do you want ? : 6
Done.
NOTE:
A new folder called "images" will be created in the same place, containing all the downloaded images.
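The flow in the sessions above (find the images on a page, then save the requested number of them into an "images" folder) can be sketched roughly as follows. This is not ImageScraper's own code: the package depends on ``lxml`` and ``requests``, while this sketch swaps in the Python standard library so it runs without extra installs. The names ``ImgSrcParser``, ``find_image_urls`` and ``download_images`` are hypothetical and chosen only for illustration.

```python
import os
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen


class ImgSrcParser(HTMLParser):
    """Collects the src attribute of every <img> tag in static HTML."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:  # skip <img> tags that have no src attribute
                self.sources.append(src)


def find_image_urls(html, base_url):
    """Return absolute image URLs found in html, in document order."""
    parser = ImgSrcParser()
    parser.feed(html)
    parser.close()
    # Relative paths such as "/logo.png" are resolved against the page URL.
    return [urljoin(base_url, src) for src in parser.sources]


def download_images(image_urls, limit, folder="images"):
    """Save up to `limit` images into `folder`, creating it if needed."""
    os.makedirs(folder, exist_ok=True)
    saved = []
    for url in image_urls[:limit]:
        # Fall back to a generic name when the URL path has no file name.
        name = os.path.basename(urlparse(url).path) or "image"
        path = os.path.join(folder, name)
        with urlopen(url) as response, open(path, "wb") as out:
            out.write(response.read())
        saved.append(path)
    return saved
```

In this sketch, ``find_image_urls(page_html, page_url)`` plays the part of the "Found 6 images" step, and ``download_images(urls, 6)`` the part of the "How many images do you want ?" step. Two images with the same file name would overwrite each other here; a real tool would need to handle that case.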
Issues
------
Q.) Not all images were downloaded?
The missing images may have been injected into the page via JavaScript, which this scraper does not run.
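A small illustration of why this happens, assuming a made-up page (``PAGE``) and a hypothetical helper (``static_image_sources``) built on the standard library's ``html.parser`` rather than on ImageScraper itself: a parser that only reads the HTML as delivered sees the static ``<img>`` tag, but never the one a script would create in a browser.

```python
from html.parser import HTMLParser

# Made-up page: one static image, plus one that only exists after the
# script has run inside a browser.
PAGE = """
<html><body>
<img src="static.png">
<div id="gallery"></div>
<script>
  var img = document.createElement("img");
  img.src = "injected.png";
  document.getElementById("gallery").appendChild(img);
</script>
</body></html>
"""


class ImgCollector(HTMLParser):
    """Records the src of each <img> tag present in the raw HTML."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            self.sources.append(dict(attrs).get("src"))


def static_image_sources(html):
    """Return the image sources visible without executing any JavaScript."""
    collector = ImgCollector()
    collector.feed(html)
    collector.close()
    return collector.sources


print(static_image_sources(PAGE))  # prints ['static.png']
```

Only ``static.png`` is found; ``injected.png`` would appear only once the script had been executed, which a plain HTML parser does not do.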
Todo
----
Scrape sites that inject image tags via JavaScript, using PhantomJS or Selenium.
Source Distribution
ImageScraper-1.0.3.tar.gz (1.8 kB)
File metadata
- Download URL: ImageScraper-1.0.3.tar.gz
- Size: 1.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 9687fe58b68cb0db475135f0df06502040dad51f16fc849f2d24553684a03633 |
| MD5 | 0ff75419f98b355bc2865d2d35d24bd5 |
| BLAKE2b-256 | d5d2fbb168dfc2b09669ef4f4b40d4cdea05ec3a1d98a4528c80d1a6268c9011 |