Skip to main content

Hand-curated dataset of English given names and nicknames

Project description

Nicknames

A hand-curated CSV file containing English given names (first name) and their associated nicknames or diminutive names.

There are Java, Perl, Python, and R parsers provided for convenience.

This is a relatively large list with roughly 1600 names. Any help from people to clean this list up and add to it is greatly appreciated.

This lookup file was initially created by mining this genealogy page. Because the lookup originates from a dataset used for genealogy purposes there are old names that aren't commonly used these days, but there are recent ones as well. Examples are "gregory", "greg", or "geoffrey", "geoff". There was also a significant effort to make it machine readable, i.e. separate it with commas, remove human conventions like "rickie(y)" would need to be made into two different names "rickie", and "ricky".

This project was created by Old Dominion University - Web Science and Digital Libraries Research Group. More information about the creation of this lookup can be found on this blog post about the creation of this library

Python API

The Python parser is available on PyPI from

pip install nicknames

and then you can do:

from nicknames import NickNamer

nn = NickNamer()

# Get the nicknames for a given name as a set of strings
nicks = nn.nicknames_of("Alexander")
assert isinstance(nicks, set)
assert "al" in nicks
assert "alex" in nicks

# Note that the relationship isn't symmetric: al is a nickname for alexander,
# but alexander is not a nickname for al.
assert "alexander" not in nn.nicknames_of("al")

# Capitalization is ignored and leading and trailing whitespace is ignored
assert nn.nicknames_of("alexander") == nn.nicknames_of(" ALEXANDER ")

# Queries that aren't found return an empty set
assert nn.nicknames_of("not a name") == set()

# The other useful thing is to go the other way, nickname to canonical:
# It acts very similarly to nicknames_of.
can = nn.canonicals_of("al")
assert isinstance(can, set)
assert "alexander" in can
assert "alex" in can

assert "al" not in nn.canonicals_of("alexander")

# You can combine these to see if two names are interchangeable:
union = nn.nicknames_of("al") | nn.canonicals_of("al")
are_interchangeable = "alexander" in union

For more advanced usage, such as loading your own data, read the source code.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nicknames-0.1.0.tar.gz (18.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nicknames-0.1.0-py3-none-any.whl (16.3 kB view details)

Uploaded Python 3

File details

Details for the file nicknames-0.1.0.tar.gz.

File metadata

  • Download URL: nicknames-0.1.0.tar.gz
  • Upload date:
  • Size: 18.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.23.0

File hashes

Hashes for nicknames-0.1.0.tar.gz
Algorithm Hash digest
SHA256 6027c706ae429882aaaa1bd88d4588e0bbf3857283b99fd5656ee86986f3bf8a
MD5 eded5e6a58dad97ced7b983e0e00794c
BLAKE2b-256 6e583eb0d21dbac27dd38428736a592b0995e09b7c083f8e3b24dbcede0db255

See more details on using hashes here.

File details

Details for the file nicknames-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: nicknames-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 16.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.23.0

File hashes

Hashes for nicknames-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 dcf3bd43a2fbceb64be0375097db986bdcf3e148564e79aec4a293ddae31a7b4
MD5 3dda65f012da809bb4c9dadb2f889482
BLAKE2b-256 0a89c5b9b190c122a66105f2172f099ddbe5f6d4b12835ab141a237873098f46

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page