Skip to main content

Determine if romanized Japanese names are family names

Project description

About

This library identifies (kenchi) romanized Japanese family names (myouji), largely in order to split romanized Japanese names into given name and family name.

Usage

>>> import myouji_kenchi
>>> myouji_kenchi.order_names(['Yamada', 'Satoshi'])
['Satoshi', 'Yamada']
>>> myouji_kenchi.get_score_as_myouji('Yamada')
201046.0
>>> myouji_kenchi.get_score_as_myouji('Satoshi')
329.0
>>> transliterator = myouji_kenchi.MyoujiBackTransliteration()
>>> transliterator.back_transliterate('Yamada')
[('ヤマダ', 201046.0)]

Background

The Japanese ordering of a name is family name given name, whereas the Western ordering is given name family name. When a name is written in Japanese script one can assume it follows that convention. However, when a Japanese name is written in Latin characters it might be in either order, depending on the context and the author. While sometimes a person's name can be treated as atomic, it is often desirable to know which name is which.

One major complication is the variety of romanization schemes in active use. Most libraries for back transliterating from Latin characters to Japanese script presuppose that the romanization scheme for the original transliteration is known. Often in the sort of situation where you do not know the order of a Japanese name the romanization scheme will also be unknown. This library targets Kunrei-shiki, Nihon-shiki, and (Modified) Hepburn, with allowance for the common deviations of omitting macrons and apostrophes.

Installation/Dependencies

  1. Install OpenFst.

    It is necessary that the version of OpenFst and the version of the OpenFst Python binding match. Note that OpenFst must be compiled with the --enable-far option in order to support that binding package.

    wget http://www.openfst.org/twiki/pub/FST/FstDownload/openfst-1.6.6.tar.gz
    tar xf openfst-1.6.6.tar.gz
    cd openfst-1.6.6
    ./configure --enable-far
    make
    sudo make install 
    
  2. Pip install myouji-kenchi

    pip install myouji-kenchi
    

Compiling OpenFst on OS X can be troublesome. Making sure you have the latest version of your compiler and prefixing pip install and/or make with CFLAGS="-std=c++11 -stdlib=libc++" can be helpful.

For Python dependencies see setup.py and requirements_dev.txt.

License

MIT License (see LICENSE file)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

myouji_kenchi-1.0.1.tar.gz (7.3 kB view details)

Uploaded Source

Built Distribution

myouji_kenchi-1.0.1-py2.py3-none-any.whl (401.8 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file myouji_kenchi-1.0.1.tar.gz.

File metadata

File hashes

Hashes for myouji_kenchi-1.0.1.tar.gz
Algorithm Hash digest
SHA256 9e82343dd98c001883321b3be4e52ce1cbcf37e0560e32f950e16fe7e88ad6e4
MD5 5ced6105c3c3b00906559ab87f89ad54
BLAKE2b-256 40dad721a686b2694426ffff561ad70915606888a27307127ac026d3dd7bf707

See more details on using hashes here.

File details

Details for the file myouji_kenchi-1.0.1-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for myouji_kenchi-1.0.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 92fefb72a90842b5c4d26db49c398472554300163001102eeca616569e3da1dd
MD5 592294e740dba293cf0677743e9ef2d5
BLAKE2b-256 74b61f1e30e10a3b785f5006931724330cf6d97669048d30bba6d284101bb610

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page