Skip to main content

Image scraper for DuckDuckGo for creating deep learning datasets

Project description


An image scraping library for creating deep learning datasets.

This library is for creating deep learning datasets.

It uses DuckDuckGo for the image scraping as they return nice big images and have some rather nice parameters to make your life easier, for example we can filter the searches to only return square images which are photos.

jmd_imagescraper.core contains the main scraping/downloading functionality.

jmd_imagescraper.imagecleaner contains an image cleaner you can use from within your notebook to clean up the results and delete anything unsuitable.


pip install jmd_imagescraper

How to use

from jmd_imagescraper.core import * # dont't worry, it's designed to work with import *
from pathlib import Path

root = Path().cwd()/"images"

duckduckgo_search(root, "Cats", "cute kittens", max_results=20)
from jmd_imagescraper.imagecleaner import *



If you're reading this on you can find the docs at


20/09/2020 add: PR from @butchland, add uuid to filenames, fix for users of
18/09/2020 rel: version 1 released as pypi package

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for jmd-imagescraper, version 1.0.2
Filename, size File type Python version Upload date Hashes
Filename, size jmd_imagescraper-1.0.2-py3-none-any.whl (12.7 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size jmd_imagescraper-1.0.2.tar.gz (13.9 kB) File type Source Python version None Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring DigiCert DigiCert EV certificate Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page