Determine if romanized Japanese names are family names
Project description
About
This library identifies (kenchi) romanized Japanese family names (myouji), largely in order to split romanized Japanese names into given name and family name.
Usage
>>> import myouji_kenchi
>>> myouji_kenchi.order_names(['Yamada', 'Satoshi'])
['Satoshi', 'Yamada']
>>> myouji_kenchi.get_score_as_myouji('Yamada')
201046.0
>>> myouji_kenchi.get_score_as_myouji('Satoshi')
329.0
>>> transliterator = myouji_kenchi.MyoujiBackTransliteration()
>>> transliterator.back_transliterate('Yamada')
[('ヤマダ', 201046.0)]
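As an illustration of how the scores above might be used to split a two-part name, here is a minimal sketch. The names `split_full_name`, `example_score`, and `EXAMPLE_SCORES` are hypothetical and not part of this library. The sketch assumes that a higher score means a name is more likely to be a family name, which is consistent with the output shown above but does not describe how `order_names` is actually implemented.

```python
from typing import Callable, Tuple

# Scores copied from the usage example above. A real caller would pass
# myouji_kenchi.get_score_as_myouji instead of this stand-in scorer.
EXAMPLE_SCORES = {"Yamada": 201046.0, "Satoshi": 329.0}


def example_score(name: str) -> float:
    """Stand-in scorer (hypothetical): names not in the table score 0.0."""
    return EXAMPLE_SCORES.get(name, 0.0)


def split_full_name(
    full_name: str, score: Callable[[str], float] = example_score
) -> Tuple[str, str]:
    """Return (given_name, family_name) for a two-part romanized name.

    The part with the higher family-name score is treated as the family
    name. On a tie the input is assumed to be given-name-first, which is
    an arbitrary choice made for this sketch.
    """
    parts = full_name.split()
    if len(parts) != 2:
        raise ValueError("expected exactly two name parts")
    first, second = parts
    if score(first) > score(second):
        # The first part looks more like a family name: Japanese order.
        return second, first
    return first, second


print(split_full_name("Yamada Satoshi"))  # ('Satoshi', 'Yamada')
print(split_full_name("Satoshi Yamada"))  # ('Satoshi', 'Yamada')
```

Passing the scorer as a parameter keeps the sketch runnable without OpenFst installed; with the library available, `split_full_name(name, myouji_kenchi.get_score_as_myouji)` would be the intended call.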
Background
The Japanese ordering of a name is family name, given name, whereas the Western ordering is given name, family name. When a name is written in Japanese script, one can assume it follows the Japanese convention. However, when a Japanese name is written in Latin characters it might be in either order, depending on the context and the author. While sometimes a person's name can be treated as atomic, it is often desirable to know which name is which.
One major complication is the variety of romanization schemes in active use. Most libraries for back-transliterating from Latin characters to Japanese script presuppose that the romanization scheme used for the original transliteration is known. In situations where the order of a Japanese name is unknown, the romanization scheme is often unknown as well. This library targets Kunrei-shiki, Nihon-shiki, and (Modified) Hepburn, with allowance for the common deviations of omitting macrons and apostrophes.
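To make the ambiguity concrete, the following sketch shows how a single katakana name can produce different Latin spellings depending on the scheme, and what omitting macrons and apostrophes does to a romanized name. The tables cover only a handful of kana, and the names `romanize` and `drop_macrons_and_apostrophes` are hypothetical helpers written for this illustration. They are not part of this library and say nothing about how it handles these cases internally.

```python
import unicodedata

# A few kana whose romanization differs between schemes (not exhaustive).
# Tuple order: (Hepburn, Kunrei-shiki, Nihon-shiki).
DIFFERING_KANA = {
    "シ": ("shi", "si", "si"),
    "チ": ("chi", "ti", "ti"),
    "ツ": ("tsu", "tu", "tu"),
    "フ": ("fu", "hu", "hu"),
    "ジ": ("ji", "zi", "zi"),
    "ヂ": ("ji", "zi", "di"),
    "ヅ": ("zu", "zu", "du"),
}
# Kana that all three schemes spell identically (only those used below).
SHARED_KANA = {"ヤ": "ya", "タ": "ta", "マ": "ma", "ダ": "da"}
SCHEMES = ("hepburn", "kunrei", "nihon")


def romanize(katakana: str, scheme: str) -> str:
    """Romanize a katakana string using the tiny tables above."""
    idx = SCHEMES.index(scheme)
    out = []
    # NFC keeps voiced kana such as ジ as single code points.
    for ch in unicodedata.normalize("NFC", katakana):
        if ch in DIFFERING_KANA:
            out.append(DIFFERING_KANA[ch][idx])
        else:
            out.append(SHARED_KANA[ch])
    return "".join(out).capitalize()


def drop_macrons_and_apostrophes(romaji: str) -> str:
    """Mimic the common deviation of omitting macrons and apostrophes."""
    decomposed = unicodedata.normalize("NFD", romaji)
    # U+0304 is the combining macron; also drop straight and curly apostrophes.
    kept = [c for c in decomposed if c != "\u0304" and c not in "'\u2019"]
    return unicodedata.normalize("NFC", "".join(kept))


for scheme in SCHEMES:
    print(scheme, romanize("ツチヤ", scheme), romanize("フジタ", scheme))
# hepburn Tsuchiya Fujita
# kunrei Tutiya Huzita
# nihon Tutiya Huzita

print(drop_macrons_and_apostrophes("Ōno"))       # Ono
print(drop_macrons_and_apostrophes("Jun'ichi"))  # Junichi
```

A reader who sees only "Tutiya" or "Ono" cannot tell which scheme or which omissions produced it, which is why a back-transliterator that assumes a single known scheme can fail on such input.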
Installation/Dependencies
- Install OpenFst.
  It is necessary that the version of OpenFst and the version of the OpenFst Python binding match. Note that OpenFst must be compiled with the --enable-far option in order to support that binding package.
    wget http://www.openfst.org/twiki/pub/FST/FstDownload/openfst-1.6.6.tar.gz
    tar xf openfst-1.6.6.tar.gz
    cd openfst-1.6.6
    ./configure --enable-far
    make
    sudo make install
- Pip install myouji-kenchi
    pip install myouji-kenchi
Compiling OpenFst on OS X can be troublesome. Making sure you have the latest version of your compiler and prefixing pip install and/or make with CFLAGS="-std=c++11 -stdlib=libc++" can be helpful.
For Python dependencies see setup.py and requirements_dev.txt.
License
MIT License (see LICENSE file)
File details
Details for the file myouji_kenchi-1.0.1.tar.gz.
File metadata
- Download URL: myouji_kenchi-1.0.1.tar.gz
- Upload date:
- Size: 7.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | 9e82343dd98c001883321b3be4e52ce1cbcf37e0560e32f950e16fe7e88ad6e4
MD5 | 5ced6105c3c3b00906559ab87f89ad54
BLAKE2b-256 | 40dad721a686b2694426ffff561ad70915606888a27307127ac026d3dd7bf707
File details
Details for the file myouji_kenchi-1.0.1-py2.py3-none-any.whl.
File metadata
- Download URL: myouji_kenchi-1.0.1-py2.py3-none-any.whl
- Upload date:
- Size: 401.8 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | 92fefb72a90842b5c4d26db49c398472554300163001102eeca616569e3da1dd
MD5 | 592294e740dba293cf0677743e9ef2d5
BLAKE2b-256 | 74b61f1e30e10a3b785f5006931724330cf6d97669048d30bba6d284101bb610