Off-the-shelf computer vision ML models. Yolov5, gender and age determination.
Project description
QuickTake
Off-the-shelf computer vision ML models. Yolov5, gender and age determination.
The goal of this repository is to provide, easy to use, abstracted, APIs to powerful computer vision models.
Models
$3$ models are currently available:
Object detection
Gender determination
Age determination
Model Engine
The models
-
YoloV5
: Object detection. This forms the basis of the other models. Pretrained onCOCO
. Documentation here -
Gender
:ResNet18
is used as the models backend. Transfer learning is applied to model gender. The additional gender training was done on the gender classification dataset, using code extract from here. -
Age
: The age model is an implementation of theSSR-Net
paper: SSR-Net: A Compact Soft Stagewise Regression Network for Age Estimation. ThepyTorch
model was largely derived from oukohou.
Getting Started
Install the package with pip:
pip install quicktake
Usage
Build an instance of the class:
from quicktake import QuickTake
Image Input
Each model is designed to handle $3$ types of input:
raw pixels (torch.Tensor)
: raw pixels of a single image. Used when streaming video input.image path (str)
: path to an image. Used when processing a single image.image directory (str)
: path to a directory of images. Used when processing a directory of images.
Expected Use
Gender
and age
determination models are trained on faces. They work fine on a larger image, however, will fail to make multiple predictions in the case of multiple faces in a single image.
The API is currently designed to chain models:
yolo
is used to identify objects.IF
a person is detected, thegender
andage
models are used to make predictions.
This is neatly bundled in the QuickTake.yolo_loop()
method.
Getting Started
Launch a webcam stream:
QL = QuickTake()
QL.launchStream()
Note: Each model returns the results results_
as well as the runtime time_
.
Run on a single frame:
from IPython.display import display
from PIL import Image
import cv2
# example images
img = './data/random/dave.png'
# to avoid distractions
import warnings
warnings.filterwarnings('ignore')
# init module
from quicktake import QuickTake
qt = QuickTake()
# extract frame from raw image path
frame = qt.read_image(img)
We can now fit qt.age(<frame>)
or qt.gender(<frame>)
on the frame. Alternatively we can cycle through the objects detected by yolo
and if a person is detected, fit qt.age()
and qt.gender()
:
# generate points
for _label, x0,y0,x1,y1, colour, thickness, results, res_df, age_, gender_ in qt.yolo_loop(frame):
_label = QuickTake.generate_yolo_label(_label)
QuickTake.add_block_to_image(frame, _label, x0,y0,x1,y1, colour=colour, thickness=thickness)
The result is an image with the bounding boxes and labels, confidence (in yolo prediction), age, and gender if a person is detected.
.
The staged output is also useful:
.
For a more comprehensive example directory.
Future
I have many more models; deployment methods & applications in the pipeline.
If you wish to contribute, please email me @zachcolinwolpe@gmail.com.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file quicktake-0.0.14.tar.gz
.
File metadata
- Download URL: quicktake-0.0.14.tar.gz
- Upload date:
- Size: 6.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 06575afbb003f6e69c00b2962dc20d12e59e0d7c429714628ee089dec7d867bd |
|
MD5 | 054500d9e804d44d318220352d81825a |
|
BLAKE2b-256 | bac544bd4561cb99defdd838464877acbabbacc0c16c5f94609d059c7d7f371e |
File details
Details for the file quicktake-0.0.14-py3-none-any.whl
.
File metadata
- Download URL: quicktake-0.0.14-py3-none-any.whl
- Upload date:
- Size: 6.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9287d6a30c26cd6a528e5ed0979735f4176feb35a05b31255dbfbd573ba89c06 |
|
MD5 | 1919f6495c12ff0b3b7a50df8f5933fe |
|
BLAKE2b-256 | 3abb33af48c605c8d97758637be2bccaf6bd4268a7ba131347f7c6a4f6a0ac62 |