Get the gender from a first name.

This package uses the underlying data from the program “gender” by Jorg Michael (described here). Its use is pretty straightforward:

>>> import gender_guesser.detector as gender
>>> d = gender.Detector()
>>> print(d.get_gender(u"Bob"))
male
>>> print(d.get_gender(u"Sally"))
female
>>> print(d.get_gender(u"Pauley")) # should be androgynous
andy

The result will be one of unknown (name not found), andy (androgynous), male, female, mostly_male, or mostly_female. The difference between andy and unknown is that the former means the name was found and is equally likely to be male or female, while the latter means the name was not found in the database.
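Callers that want a different value for names that are not found can wrap the call. The following is a minimal sketch, not part of the package: the helper name get_gender_or and its parameters are hypothetical, and it only assumes that the detector passed in has the get_gender method shown above.

```python
def get_gender_or(detector, name, unknown_value, country=None):
    """Like detector.get_gender(), but return `unknown_value`
    when the name is not found in the database.

    Hypothetical helper: `detector` is any object with a
    get_gender(name[, country]) method, e.g. gender.Detector().
    """
    if country is None:
        result = detector.get_gender(name)
    else:
        result = detector.get_gender(name, country)
    # Only "unknown" is replaced; "andy" means the name *was* found.
    return unknown_value if result == "unknown" else result
```

With the detector d from above, get_gender_or(d, u"Bob", None) would pass the male result through unchanged, and a name missing from the database would yield None.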

I18N is fully supported:

>>> print(d.get_gender(u"\xc1lfr\xfan"))  # u"Álfrún"
female

Additionally, you can give preference to specific countries:

>>> print(d.get_gender(u"Jamie"))
mostly_female
>>> print(d.get_gender(u"Jamie", u'great_britain'))
mostly_male

You can also create a detector that is not case sensitive (the default is case sensitive):

>>> d = gender.Detector(case_sensitive=False)
>>> print(d.get_gender(u"sally"))
female
>>> print(d.get_gender(u"Sally"))
female

Try to avoid creating many Detector instances, as each one reads the data file when it is created.
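One way to follow this advice is to build a detector once per setting and hand out the same instance afterwards. The sketch below is an assumption about how an application might do this, not something the package provides: shared_detector and the module-level _detectors cache are hypothetical names, and the factory argument is expected to be gender.Detector.

```python
# Hypothetical cache: case_sensitive flag -> detector already built.
_detectors = {}


def shared_detector(factory, case_sensitive=True):
    """Build a detector with `factory` once per setting, then reuse it.

    `factory` is called as factory(case_sensitive=...), which matches
    the gender.Detector(case_sensitive=False) call shown above.
    """
    if case_sensitive not in _detectors:
        # First request for this setting: the data file is read here.
        _detectors[case_sensitive] = factory(case_sensitive=case_sensitive)
    return _detectors[case_sensitive]
```

Calling shared_detector(gender.Detector) in several places then returns the same object each time, so the data file is read at most once per case_sensitive setting.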

Licenses

The generator code is distributed under the GPLv3. The data file nam_dict.txt is released under the GNU Free Documentation License.

Changelog

0.3.0 (2016-07-02)

  • Remove unknown_value init option, since it can be implemented very easily with a wrapper if needed.

  • Return unknown when the name is not found and andy when it is equally valid for male and female.

  • Test README examples as doctests.

  • Fix incorrect country-wise gender detection for non-ISO-8859-15 names, caused by a line length change after the data file was converted to UTF-8. See #gh2. Thanks @miquelcamprodon.

0.2.0 (2015-12-06)

  • Wire in tox to test in both Python 2 and Python 3.

  • Python 2 and 3 compatibility.

  • Remove obsolete character mapper code.

For previous versions, see sexmachine.

Credits

This is a fork of the SexMachine package by Ferhat Elmas. It was created to publish a Python 3 compatible version to PyPI and to be able to add further improvements without bothering the original author.
