imagebot·PyPI

A web bot to scrape images from websites.

Project description

A web bot to scrape images from websites.

Features

Supported platform: Linux (+Gnome) / Python 2.x.
Uses scrapy web crawling framework.
Maintains a database of all downloaded images to avoid duplicate downloads.
Optionally, it can scrape only under a particular url, e.g. scraping “http://website.com/albums/new” with this option will only download from new album.
You can specify minimum image size to be downloaded.
Scrapes through javascript popup links.
Live monitor window for displaying images as they are scraped.

Usage

Scrape images from http://website.com:
```
imagebot http://website.com
```
Scrape images from http://website.com while allowing images from a cdn such as amazonaws.com (add multiple domains with comma separated list):
```
imagebot http://website.com -d amazonaws.com
```
Specify minimum size of image to be downloaded (width x height):
```
imagebot http://website.com -s 300x300
```

Stay under http://website.com/albums/new:

imagebot http://website.com/albums/new -u http://website.com/albums/new

Launch monitor windows for live images:
```
imagebot http://website.com -m
```

Set user-agent:

imagebot http://website.com -a "my_imagebot(http://mysite.com)"

For more options, get help:
```
imagebot -h
```

Dependencies

python-gi (Python GObject Introspection API)
On Ubuntu:
```
apt-get install python-gi
```
scrapy (a powerful web crawling framework)

It will be automatically installed by pip.
Pillow (Python Imaging Library)

It will be automatically installed by pip.

Download

PyPI: http://pypi.python.org/pypi/imagebot/
Source: https://bitbucket.org/amol9/imagebot/

Project details

Release history Release notifications | RSS feed

1.2.1

Jul 13, 2015

1.2.0

Mar 6, 2015

1.1.1

Feb 24, 2015

1.1.0

Feb 23, 2015

1.0.3

Feb 3, 2015

This version

1.0.2

Feb 1, 2015

1.0.1

Jan 31, 2015

1.0

Jan 31, 2015

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

imagebot-1.0.2.tar.gz (11.9 kB view details)

Uploaded Feb 1, 2015 Source

File details

Details for the file imagebot-1.0.2.tar.gz.

File metadata

Download URL: imagebot-1.0.2.tar.gz
Upload date: Feb 1, 2015
Size: 11.9 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for imagebot-1.0.2.tar.gz
Algorithm	Hash digest
SHA256	`ea091b734269c896967a159a7de48c8aee8d351d2ac3ab335dc9cd42aa548551`
MD5	`c02cb8d652b511bbdce85796f2947b43`
BLAKE2b-256	`c80355a8b1cd1832cc992755b412b0d4ec2d8432922422a1bb2694c660fcbd0c`

See more details on using hashes here.

imagebot 1.0.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta