
Python library to guess gender given a Spanish full name

Project description

Genderator is a Python library that processes Spanish names to guess their gender.

To do this, the library uses the following datasets from the Instituto Nacional de Estadística (INE):

  • name_surname_ratio: List of words that can be either a name or a surname, with the probability that each one is a surname.

  • names_ine: List of names registered in Spain, with the probability that each one is a male or a female name.

  • surnames_ine: List of surnames registered in Spain.

Installation

The easiest way to install the latest version is by using pip to pull it from PyPI:

pip install genderator

You may also use Git to clone the repository from GitHub and install it manually:

git clone https://github.com/davidmogar/genderator.git
cd genderator
python setup.py install

Usage

The following code shows a sample usage of this library:

import collections
import json
from genderator.genderator import Genderator

guesser = Genderator()
answer = guesser.guess_gender('David Moreno García')
if answer:
    # Keep returned keys order
    parsed = json.JSONDecoder(object_pairs_hook=collections.OrderedDict).decode(answer)
    print(json.dumps(parsed, indent=4))
else:
    print('No answer')

Output:

{
    "names": {
        "david": 0.991
    },
    "surnames": {
        "moreno": 1.0
    },
    "real_name": "david",
    "gender": "Male",
    "confidence": 1.0
}
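
The result can also be consumed programmatically rather than printed. The following is a minimal sketch, assuming that guess_gender returns a JSON string with the keys shown in the sample output. The helper gender_or_none and its min_confidence threshold are hypothetical and not part of Genderator; the sample string is copied from the output above so the snippet runs without the library installed.

```python
import json

# Sample JSON string copied from the output shown above; in real use this
# would be the value returned by guesser.guess_gender(...).
answer = '''{
    "names": {"david": 0.991},
    "surnames": {"moreno": 1.0},
    "real_name": "david",
    "gender": "Male",
    "confidence": 1.0
}'''


def gender_or_none(answer_json, min_confidence=0.9):
    """Return the guessed gender, or None when there is no usable answer.

    Hypothetical helper: min_confidence is an illustrative threshold,
    not a Genderator parameter.
    """
    if not answer_json:
        # Mirrors the "No answer" branch of the usage example.
        return None
    result = json.loads(answer_json)
    if result.get("confidence", 0.0) < min_confidence:
        return None
    return result.get("gender")


print(gender_or_none(answer))
```

Returning None for both a missing answer and a low-confidence one keeps the calling code to a single check. A caller that needs to tell the two cases apart would have to handle them separately.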

