A downloader for named images containing faces from Wiki servers.
Project description
Wiki Faces:
TLDR
This project downloads images from a Wiki that include human faces. Specifically, images that are associated with certain wikipedia categories.
Installation
Pip Installation Procedure:
From PIP:
pip install wikifaces
From Repo:
git clone git@github.com:tford9/Wiki-Faces-Downloader.git
cd Wiki-Faces-Downloader
python setup.py
pip install wikifaces
Usage
Command-Line Example
python downloader -i "indonesian engineers" -o ../data/ -d
Package Example
from wikifaces.downloader import WikiFace
wikiface_obj = WikiFace()
wikiface_obj.download(categories=['facebook'], depth=2, output_location='../data/')
The following structure is output:
facebook
cached_1_people_pages_d2.pkl
cached_pages_d2.pkl
alan_rushbridger
Alan_Rusbridger_01.jpg-p0.jpg
...
mark_zuckerberg
MarkZuckerbergcrop.jpg-p1.jpg
...
The process is carried out as follows:
- Given a category from a Wiki, collect n pages that contain the same category as well as at least one category containing "people" in the title.
- With those pages, crawl across their included categories and collect y pages that contain those categories as well as at least one "people" category.
- Given the collected Wiki pages, download the primary image from the page and determine if it is a human face using light facial detection.
- We capture all images from the wiki that contain the name of the page (if it's a person then the filename contains their name),
- Using the captured name and images, we create a dataset for that face.
TODOs:
- Currently, a part of this process uses a recursive call structure to get all related pages; there may be a way to linearize, or parallelize this.
- Currently, we are only pulling images contain the person's name in the title and only have one visible face in the image. All other images are not considered. A voting system should be added to get the most represented faces across multiple images.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
wikifaces-1.0.6.tar.gz
(9.8 kB
view hashes)
Built Distribution
Close
Hashes for wikifaces-1.0.6-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b27cab0880f16f0022ecaa984b27e0929d50222d90aaa5087ec30ef2057cfd62 |
|
MD5 | ec1193d71140f61ff875927b27a6447b |
|
BLAKE2b-256 | 8702f3199a71a4674eb3f6bfc8c891ed391d0567042540da27d3f3b760e3e49e |