A downloader for named images containing faces from Wiki servers.
Project description
Wiki Faces:
TLDR
This project downloads images from a Wiki that include human faces. Specifically, images that are associated with certain wikipedia categories.
Installation
Pip Installation Procedure:
From PIP:
pip install wikifaces
From Repo:
git clone git@github.com:tford9/Wiki-Faces-Downloader.git
cd Wiki-Faces-Downloader
python setup.py
pip install wikifaces
Usage
Command-Line Example
python downloader -i "indonesian engineers" -o ../data/ -d
Package Example
from wikifaces.downloader import WikiFace
wikiface_obj = WikiFace()
wikiface_obj.download(categories=['facebook'], depth=2, output_location='../data/')
The following structure is output:
facebook
cached_1_people_pages_d2.pkl
cached_pages_d2.pkl
alan_rushbridger
Alan_Rusbridger_01.jpg-p0.jpg
...
mark_zuckerberg
MarkZuckerbergcrop.jpg-p1.jpg
...
The process is carried out as follows:
- Given a category from a Wiki, collect n pages that contain the same category as well as at least one category containing "people" in the title.
- With those pages, crawl across their included categories and collect y pages that contain those categories as well as at least one "people" category.
- Given the collected Wiki pages, download the primary image from the page and determine if it is a human face using light facial detection.
- We capture all images from the wiki that contain the name of the page (if it's a person then the filename contains their name),
- Using the captured name and images, we create a dataset for that face.
TODOs:
- Currently, a part of this process uses a recursive call structure to get all related pages; there may be a way to linearize, or parallelize this.
- Currently, we are only pulling images contain the person's name in the title and only have one visible face in the image. All other images are not considered. A voting system should be added to get the most represented faces across multiple images.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
wikifaces-1.0.8.tar.gz
(11.5 kB
view details)
Built Distribution
wikifaces-1.0.8-py3-none-any.whl
(10.7 kB
view details)
File details
Details for the file wikifaces-1.0.8.tar.gz
.
File metadata
- Download URL: wikifaces-1.0.8.tar.gz
- Upload date:
- Size: 11.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 700fbe6d8e4a640463b67717332cfa3dcebbdec7982b6f721426c15131f0e751 |
|
MD5 | 2ec624878a86bb334bd258528c922565 |
|
BLAKE2b-256 | 3ffef11ef0f16541eda8a1059c81a0a43a3afe5470c42b86b66db491b4b80b44 |
File details
Details for the file wikifaces-1.0.8-py3-none-any.whl
.
File metadata
- Download URL: wikifaces-1.0.8-py3-none-any.whl
- Upload date:
- Size: 10.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 18ebc2810a8bb5bc0d5968a526a40af2729d4db86b7aedc1f1f4ee045966bffd |
|
MD5 | 724e4c8f7ba61191cb50d0871278f7fc |
|
BLAKE2b-256 | 0e86f8a5ff8c683e3c6772d08badb581a5c689e75abaa54cf3d48e3cea58c19a |