Predict gender using first name using data from World Gender Name Dictionary 2.0.
Project description
Global Gender Predictor
Predict gender using first name using data from World Gender Name Dictionary 2.0.
The dataset contains 4,148,966 unique names. The predictor is case-insensitive and predicts Male
, Female
, or Unknown
(i.e. unisex or not found in data)
Install using pip:
pip install global_gender_predictor
Usage
from global_gender_predictor import GlobalGenderPredictor
predictor = GlobalGenderPredictor()
predictor.predict_gender('John')
'Male'
predictor.predict_gender('Jane')
'Female'
predictor.predict_gender('attract.ai')
'Unknown'
The dataset contains probabilities for each name:
{'name': 'taylor', 'gender_prob': {'F': 0.699, 'M': 0.301}}
.
Change the probability threshold for unisex names:
predictor.predict_gender('taylor',threshold=0.5)
'Female'
predictor.predict_gender('taylor',threshold=0.8)
'Unknown'
Citation
World Gender Name Dictionary (WGND 2.0)
@data{DVN/MSEGSJ_2021,
author = {Raffo, Julio},
publisher = {Harvard Dataverse},
title = {{WGND 2.0}},
UNF = {UNF:6:5rI3h1mXzd6zkVhHurelLw==},
year = {2021},
version = {DRAFT VERSION},
doi = {10.7910/DVN/MSEGSJ},
url = {https://doi.org/10.7910/DVN/MSEGSJ}
}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for global_gender_predictor-0.0.3.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | ffc67932d207e0f3bc74340ef8c1a64a57483ce8cda3fce380df3fc93fb16242 |
|
MD5 | 94a26980307cf67d513524fc293c04a1 |
|
BLAKE2b-256 | b05921c08132e8ea8c94cc537b25e0ef0b3d0677a9c8da7c2a851d0ef27dad1c |
Close
Hashes for global_gender_predictor-0.0.3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9cc224043b5bc1ae0426fd33c5e8ba47abf3c5ce134514e5def19d724a9a329d |
|
MD5 | 17a4bd2c27e4045c7babb80f8129ca9f |
|
BLAKE2b-256 | f1eaea57fe0caa9f720fd1b6892b2e96fa998db94ed3510da5b5dd61a84c0d9d |