Image scraper for DuckDuckGo for creating deep learning datasets
jmd_imagescraper
An image scraping library for creating deep learning datasets.
It uses DuckDuckGo for the image scraping because it returns nice big images and has some rather nice parameters to make your life easier. For example, searches can be filtered to return only square images that are photos.
jmd_imagescraper.core contains the main scraping/downloading functionality.
jmd_imagescraper.imagecleaner contains an image cleaner you can use from within your notebook to clean up the results and delete anything unsuitable.
Install
pip install jmd_imagescraper
How to use
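from jmd_imagescraper.core import *
from pathlib import Path

# Download into an "images" folder under the current working directory
root = Path.cwd()/"images"
# Arguments: root folder, label (used as the subfolder name), search keywords
duckduckgo_search(root, "Puppies", "cute puppies", max_results=10)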
Duckduckgo search: cute puppies
Downloading results into C:\Users\Joe\Documents\GitHub\jmd_imagescraper\images\Puppies
[WindowsPath('C:/Users/Joe/Documents/GitHub/jmd_imagescraper/images/Puppies/001.jpg'),
WindowsPath('C:/Users/Joe/Documents/GitHub/jmd_imagescraper/images/Puppies/002.jpg'),
WindowsPath('C:/Users/Joe/Documents/GitHub/jmd_imagescraper/images/Puppies/003.jpg'),
WindowsPath('C:/Users/Joe/Documents/GitHub/jmd_imagescraper/images/Puppies/004.jpg'),
WindowsPath('C:/Users/Joe/Documents/GitHub/jmd_imagescraper/images/Puppies/005.jpg'),
WindowsPath('C:/Users/Joe/Documents/GitHub/jmd_imagescraper/images/Puppies/006.jpg'),
WindowsPath('C:/Users/Joe/Documents/GitHub/jmd_imagescraper/images/Puppies/007.jpg'),
WindowsPath('C:/Users/Joe/Documents/GitHub/jmd_imagescraper/images/Puppies/008.jpg'),
WindowsPath('C:/Users/Joe/Documents/GitHub/jmd_imagescraper/images/Puppies/009.jpg'),
WindowsPath('C:/Users/Joe/Documents/GitHub/jmd_imagescraper/images/Puppies/010.jpg')]
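A deep learning dataset usually needs more than one class, so the call above tends to be repeated once per label. The following is a minimal sketch of that loop, not part of the library: build_dataset is a hypothetical helper, and it assumes the search function accepts (root, label, keywords, max_results=...) and returns the downloaded paths, as the example output above suggests.

```python
from pathlib import Path
from typing import Callable, Dict, List


def build_dataset(
    root: Path,
    queries: Dict[str, str],
    search_fn: Callable[..., List[Path]],
    max_results: int = 10,
) -> Dict[str, List[Path]]:
    """Run one search per label and collect the returned file paths.

    build_dataset is a hypothetical helper, not part of jmd_imagescraper.
    search_fn is assumed to be called the same way as in the example above:
    search_fn(root, label, keywords, max_results=...).
    """
    results: Dict[str, List[Path]] = {}
    for label, keywords in queries.items():
        paths = search_fn(root, label, keywords, max_results=max_results)
        # Tolerate a search function that returns None instead of a list
        results[label] = list(paths or [])
    return results
```

With the library installed, this could be called as build_dataset(root, {"Puppies": "cute puppies", "Kittens": "cute kittens"}, duckduckgo_search). Passing the search function in as an argument keeps the helper easy to try out with a stand-in function before hitting the network.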
from jmd_imagescraper.imagecleaner import *
display_image_cleaner(root)
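After deleting unsuitable images it can be useful to check how many remain for each label. Below is a small sketch using only the standard library. count_images is a hypothetical helper, not part of jmd_imagescraper; it assumes the layout shown in the output above (one subfolder per label under the root folder), and the set of file extensions is likewise an assumption.

```python
from pathlib import Path
from typing import Dict

# Assumed set of image extensions; adjust to match your downloads
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def count_images(root: Path) -> Dict[str, int]:
    """Count image files in each label subfolder directly under root."""
    root = Path(root)
    counts: Dict[str, int] = {}
    if not root.is_dir():
        return counts
    for folder in sorted(p for p in root.iterdir() if p.is_dir()):
        counts[folder.name] = sum(
            1
            for f in folder.iterdir()
            if f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
        )
    return counts
```

Calling count_images(root) after a cleaning pass returns a dictionary mapping each label folder to its remaining image count, which makes class imbalance easy to spot before training.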
Hashes for jmd_imagescraper-0.0.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 0258ffba5f99cf827ea6c343f32b5a294d9ce4318d9d4d71bfc87e2268add0e0
MD5 | 159e33ab0ae6ecdbae5667d1f0567250
BLAKE2b-256 | a9ed4a85d715d5d6b35b6cc4e1a78abdefa4e165fd24ae6aeacc9fd4690a7949