genderator

Python library to guess gender given a spanish full name

These details have not been verified by PyPI

Project links

Project description

https://img.shields.io/travis/davidmogar/genderator.svg

https://img.shields.io/coveralls/davidmogar/genderator.svg

https://img.shields.io/pypi/v/genderator.svg

https://img.shields.io/badge/license-MIT-blue.svg

https://img.shields.io/badge/gitter-join%20chat-brightgreen.svg

Genderator is a Python library to process Spanish names (from Spain) to guess their gender.

For this to work, the libray uses the next datasets from Instituto Nacional de Estadística:

name_surname_ratio: List of words that could be both, a name or a surname, and shows the probability to be a surname.
names_ine: List of registered names on Spain, with the probability for each one to be a male or a female name.
surnames_ine: List of registeres surnames on Spain.

Installation

The easiest way to install the latest version is by using pip to pull it from PyPI:

pip install genderator

You may also use Git to clone the repository from Github and install it manually:

git clone https://github.com/davidmogar/genderator.git
cd genderator
python setup.py install

Python 3.3 & 3.4 are supported.

Usage

The next code shows a sample usage of this library:

import genderator

guesser = genderator.Parser()
answer = guesser.guess_gender('David Moreno García')
if answer:
    print(answer)
else:
    print('Name doesn\'t match')

Output:

OrderedDict([
    ('names', ['david']),
    ('surnames', ['moreno', 'garcia']),
    ('real_name', 'david'),
    ('gender', 'Male'),
    ('confidence', 1.0)
])

Options

Genderator’s parser can receive some arguments to control its behaviour. Those arguments are:

force_combinations=Boolean: Force combinations during classification.
force_split=Boolean: Force name split if no surnames are detected.
normalize=Boolean: Enable or disable normalization.
normalizer_options=Dictionary: Normalizer options to be applied.

Normalizer options are a dictionary to control what normalization rules are applied to each name. Possible options are:

hyphens: Boolean option to enable or disable hyphens removal.
symbols: Boolean option to enable or disable symbols removal.
whitespaces: Boolean option to enable or disable extra whitespaces removal.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.7.9

May 14, 2015

0.2.7.8

May 7, 2015

0.2.7.7

May 7, 2015

0.2.7.6

May 6, 2015

0.2.7.5

May 6, 2015

0.2.7.4

May 6, 2015

0.2.7.3

May 6, 2015

0.2.7.2

May 6, 2015

0.2.7.1

May 6, 2015

0.2.7

May 6, 2015

0.2.6.2

May 5, 2015

0.2.6.1

May 5, 2015

0.2.6

May 5, 2015

0.2.5

May 4, 2015

0.2.3

May 1, 2015

0.2.2

Apr 29, 2015

0.2.1

Apr 29, 2015

0.2

Apr 29, 2015

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

genderator-0.2.7.9.zip (891.6 kB view details)

Uploaded May 14, 2015 Source

File details

Details for the file genderator-0.2.7.9.zip.

File metadata

Download URL: genderator-0.2.7.9.zip
Upload date: May 14, 2015
Size: 891.6 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for genderator-0.2.7.9.zip
Algorithm	Hash digest
SHA256	`aa8fe9e1527c84c20741aabf85697ffb8c811cdc0c41aefdc106f6f5ffcbb708`
MD5	`6d2ae342bba51429feb27e5ec62512d0`
BLAKE2b-256	`f5005024c829901e4b5f4070abeff0e909a16a47ab9295061cd56d882a6f50f5`

See more details on using hashes here.

genderator 0.2.7.9

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Installation

Usage

Options

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes