nameparser

A simple Python module for parsing human names into their individual components.

These details have not been verified by PyPI

Project links

Homepage

Project description

A simple Python (3.2+ & 2.6+) module for parsing human names into their individual components.

hn.title
hn.first
hn.middle
hn.last
hn.suffix
hn.nickname
hn.surnames (middle + last)

Supported Name Structures

The supported name structure is generally “Title First Middle Last Suffix”, where all pieces are optional. Comma-separated format like “Last, First” is also supported.

Title Firstname “Nickname” Middle Middle Lastname Suffix
Lastname [Suffix], Title Firstname (Nickname) Middle Middle[,] Suffix [, Suffix]
Title Firstname M Lastname [Suffix], Suffix [Suffix] [, Suffix]

Instantiating the HumanName class with a string splits the string on commas and then on spaces, and classifies each name part by its position in the string and by whether it matches a known name piece such as a title or suffix.
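The idea of "split on commas, then spaces, then classify by position" can be sketched in a few lines of plain Python. This is a toy illustration only and not nameparser's implementation: the function name `toy_parse` and the tiny `TITLES` and `SUFFIXES` sets are hypothetical, and it ignores nicknames, prefixes, conjunctions, and suffixes after a comma.

```python
# Toy illustration only: NOT nameparser's implementation.
TITLES = {"dr", "mr", "mrs", "ms"}          # hypothetical, tiny sample sets
SUFFIXES = {"jr", "sr", "ii", "iii", "phd"}


def norm(piece):
    """Lowercase a piece and drop periods so 'Dr.' matches 'dr'."""
    return piece.lower().replace(".", "")


def toy_parse(full_name):
    """Classify the pieces of a name string by position and known sets."""
    # Split on commas first: in "Last, First Middle" the last name leads.
    parts = [p.strip() for p in full_name.split(",") if p.strip()]
    last_from_comma = None
    if len(parts) > 1:
        last_from_comma, pieces = parts[0], parts[1].split()
    else:
        pieces = parts[0].split() if parts else []

    parsed = {"title": [], "first": [], "middle": [], "last": [], "suffix": []}
    # Leading pieces found in TITLES are titles.
    while pieces and norm(pieces[0]) in TITLES:
        parsed["title"].append(pieces.pop(0))
    # Trailing pieces found in SUFFIXES are suffixes.
    while len(pieces) > 1 and norm(pieces[-1]) in SUFFIXES:
        parsed["suffix"].insert(0, pieces.pop())
    # What remains is the first name, then middle name(s), then the last name.
    if pieces:
        parsed["first"].append(pieces.pop(0))
    if last_from_comma is not None:
        parsed["last"].append(last_from_comma)
    elif pieces:
        parsed["last"].append(pieces.pop())
    parsed["middle"].extend(pieces)
    return {key: " ".join(value) for key, value in parsed.items()}


print(toy_parse("Dr. Juan Q. Vega III"))
print(toy_parse("Vega, Juan Q."))
```

The real library layers much more on top of this (chained titles, last-name prefixes, nicknames in quotes or parentheses), which is why the configured sets of known pieces matter so much.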

It correctly handles some common conjunctions and special prefixes to last names like “del”. Titles and conjunctions can be chained together to handle complex titles like “Asst Secretary of State”. It can also try to correct the capitalization of names that are entirely upper- or lowercase.

It attempts the best guess that can be made with a simple, rule-based approach. Its main use case is English and it is not likely to be useful for languages that do not conform to the supported name structure. It’s not perfect, but it gets you pretty far.

Installation

pip install nameparser

If you want to try out the latest code from GitHub you can install with pip using the command below.

pip install -e git+https://github.com/derek73/python-nameparser.git#egg=nameparser

If you need to handle lists of names, check out namesparser, a complement to this module that handles multiple names in a string.

Quick Start Example

>>> from nameparser import HumanName
>>> name = HumanName("Dr. Juan Q. Xavier de la Vega III (Doc Vega)")
>>> name
<HumanName : [
    title: 'Dr.'
    first: 'Juan'
    middle: 'Q. Xavier'
    last: 'de la Vega'
    suffix: 'III'
    nickname: 'Doc Vega'
]>
>>> name.last
'de la Vega'
>>> name.as_dict()
{'title': 'Dr.', 'first': 'Juan', 'middle': 'Q. Xavier', 'last': 'de la Vega', 'suffix': 'III', 'nickname': 'Doc Vega'}
>>> str(name)
'Dr. Juan Q. Xavier de la Vega III (Doc Vega)'
>>> name.string_format = "{first} {last}"
>>> str(name)
'Juan de la Vega'

The parser does not attempt to correct mistakes in the input. It mostly just splits on whitespace and puts things in buckets based on their position in the string. This also means the difference between ‘title’ and ‘suffix’ is positional, not semantic. “Dr” is a title when it comes before the name and a suffix when it comes after. (“Pre-nominal” and “post-nominal” would probably be better names.)

>>> name = HumanName("1 & 2, 3 4 5, Mr.")
>>> name
<HumanName : [
    title: ''
    first: '3'
    middle: '4 5'
    last: '1 & 2'
    suffix: 'Mr.'
    nickname: ''
]>

Customization

Your project may need some adjustment for your dataset. You can do this in your own pre- or post-processing, by customizing the configured pre-defined sets of titles, prefixes, etc., or by subclassing the HumanName class. See the full documentation for more information.

Full documentation

Contributing

If you come across a name piece that you think should be in the default config, you’re probably right. Start a New Issue and we can get it added.

Please let me know if there are ways this library could be structured to make it easier for you to use in your projects. Read CONTRIBUTING.md for more info on running the tests and contributing to the project.

GitHub Project

https://github.com/derek73/python-nameparser

Project details

Release history

Version	Release date
1.1.3	Sep 21, 2023
1.1.2	Nov 14, 2022
1.1.1	Jan 29, 2022
1.1.0	Jan 4, 2022 (this version)
1.0.6	Feb 8, 2020
1.0.5	Dec 12, 2019
1.0.4	Jun 27, 2019
1.0.3	Apr 19, 2019
1.0.2	Oct 26, 2018
1.0.1	Sep 1, 2018
1.0.0	Aug 31, 2018
0.5.8	Aug 20, 2018
0.5.7	Jun 16, 2018
0.5.6	Jan 15, 2018
0.5.5	Jan 11, 2018
0.5.4	Dec 7, 2017
0.5.3	Jun 28, 2017
0.5.2	Mar 20, 2017
0.5.1	Aug 12, 2016
0.5.0	Aug 10, 2016
0.4.1	Jul 26, 2016
0.4.0	Jun 2, 2016
0.3.16	Mar 24, 2016
0.3.15	Mar 21, 2016
0.3.14	Mar 19, 2016
0.3.13	Mar 15, 2016
0.3.12	Mar 14, 2016
0.3.11	Oct 18, 2015
0.3.10	Sep 20, 2015
0.3.9	Sep 5, 2015
0.3.8	Sep 3, 2015
0.3.7	Aug 31, 2015
0.3.6	Aug 6, 2015
0.3.5	Aug 4, 2015
0.3.4	Mar 2, 2015
0.3.3	Aug 4, 2014
0.3.2	Jul 17, 2014
0.3.1	Jul 5, 2014
0.3.0	Jul 4, 2014
0.2.10	May 17, 2014
0.2.9	Apr 2, 2014
0.2.8	Oct 25, 2013
0.2.7	Feb 14, 2013
0.2.6	Feb 13, 2013
0.2.5	Feb 12, 2013
0.2.4	Feb 11, 2013
0.2.3	Oct 7, 2012
0.2.2	Aug 24, 2012
0.2.0	Jan 16, 2012
0.1.4	Jan 13, 2012
0.1.3	Feb 4, 2011

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nameparser-1.1.0.tar.gz (35.9 kB)

Uploaded Jan 4, 2022 Source

Built Distribution

nameparser-1.1.0-py2.py3-none-any.whl (24.4 kB)

Uploaded Jan 4, 2022 Python 2, Python 3

File details

Details for the file nameparser-1.1.0.tar.gz.

File metadata

Download URL: nameparser-1.1.0.tar.gz
Upload date: Jan 4, 2022
Size: 35.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.7.1 importlib_metadata/4.10.0 pkginfo/1.8.2 requests/2.27.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.9

File hashes

Hashes for nameparser-1.1.0.tar.gz
Algorithm	Hash digest
SHA256	`bbd4831c72426757ec59674a1aad29c40bf411358a6d6e1cdd68002cbcb90d08`
MD5	`2ffc34feb141986d4c62c229c4b0db2a`
BLAKE2b-256	`eda56f83562b1d669c1298cadd05a0197c0d1d994d7d10c8d22c8c944e4c23e7`
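
A downloaded archive can be checked against the SHA256 digest listed above with the standard library. This is a minimal sketch: the helper name `sha256_of` and the local file path are assumptions for illustration, so adjust the path to wherever the file was saved.

```python
import hashlib
from pathlib import Path

# SHA256 digest published above for nameparser-1.1.0.tar.gz.
EXPECTED_SHA256 = "bbd4831c72426757ec59674a1aad29c40bf411358a6d6e1cdd68002cbcb90d08"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Assumed location of the downloaded archive; adjust as needed.
archive = Path("nameparser-1.1.0.tar.gz")
if archive.exists():
    print("match" if sha256_of(archive) == EXPECTED_SHA256 else "MISMATCH")
```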

File details

Details for the file nameparser-1.1.0-py2.py3-none-any.whl.

File metadata

Download URL: nameparser-1.1.0-py2.py3-none-any.whl
Upload date: Jan 4, 2022
Size: 24.4 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.7.1 importlib_metadata/4.10.0 pkginfo/1.8.2 requests/2.27.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.9

File hashes

Hashes for nameparser-1.1.0-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`4efc7b6d3e77bb399936610f0d67fe64b14a4b7877b8a01ddf34273edd724a1a`
MD5	`f24d583aebeac6cd02ce049eabebd6e4`
BLAKE2b-256	`dfef326ca9101560d47954bff7e7c6a90194af5dc1da78e3d61b116bd36b7277`
