Japanese Postal Code Data


posuto

Posuto is a wrapper for the postal code data distributed by Japan Post. It makes mapping Japanese postal codes to addresses easier than working with the raw CSV.

You can read more about the motivations for posuto in "Parsing the Infamous Japanese Postal CSV".

Issues do not need to be written in English (issueを英語で書く必要はありません).

Features:

- multi-line neighborhoods are joined
- parenthetical notes are put in a separate field
- change reasons are converted from flags to labels
- romaji and kana records are unified for easy access
- codes with multiple areas provide a list of alternates

To install:

pip install posuto

Example usage:

import posuto as 〒

🗼 = 〒.get('〒105-0011')

print(🗼)
# "東京都港区芝公園"
print(🗼.prefecture)
# "東京都"
print(🗼.kana)
# "トウキョウトミナトクシバコウエン"
print(🗼.romaji)
# "Tokyo To, Minato Ku, Shibakoen"
print(🗼.note)
# None

Note: Unfortunately 〒 and 🗼 are not valid identifiers in Python, so the above is pseudocode. See examples/sample.py for an executable version.

You can provide a postal code with basic formatting (for example with or without the 〒 mark and hyphen, as in '〒105-0011' or '1050000'), and the postal data is returned as a named tuple with a few convenience functions. Read on for details of how quirks in the original data are handled.
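
As a rough illustration of what accepting "basic formatting" can involve, the sketch below reduces a loosely formatted code to seven ASCII digits. `normalize_code` is a hypothetical helper written for this example, not posuto's own implementation, and the exact set of input formats posuto accepts is not specified here.

```python
import re
import unicodedata


def normalize_code(raw: str) -> str:
    """Reduce a loosely formatted postal code to seven ASCII digits.

    Hypothetical helper for illustration; not part of posuto's API.
    """
    # NFKC folds full-width digits and the full-width hyphen-minus to ASCII.
    text = unicodedata.normalize("NFKC", raw)
    # Drop everything that is not an ASCII digit: the 〒 mark, hyphens, spaces.
    digits = re.sub(r"[^0-9]", "", text)
    if len(digits) != 7:
        raise ValueError(f"expected 7 digits, got {raw!r}")
    return digits


print(normalize_code("〒105-0011"))
print(normalize_code("１０５－００１１"))
```

Normalizing before the lookup keeps the stored keys simple: only one spelling of each code ever needs to exist in the database.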

Details

The original CSV files are kept in the project's source repository but are not distributed as part of the pip package. Instead, the CSV is converted to JSON, which is then loaded into an SQLite database that is included in the package distribution. As a result, most of the code complexity in this package is in the build step rather than at runtime.
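
The CSV to JSON to SQLite flow can be sketched with the standard library alone. Everything below is an assumption made for illustration: the four-column CSV layout, the `postal_data` table, and the `build_db` and `lookup` helpers are invented here and do not reflect posuto's actual schema or build scripts.

```python
import csv
import io
import json
import sqlite3

# Two simplified rows: code, prefecture, city, neighborhood.
# The real Japan Post CSV has more columns and needs encoding conversion.
RAW_CSV = "1050011,東京都,港区,芝公園\n1000001,東京都,千代田区,千代田\n"


def build_db(raw_csv: str) -> sqlite3.Connection:
    """Build step: parse the CSV and store one JSON document per code."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE postal_data (code TEXT PRIMARY KEY, data TEXT)")
    for code, pref, city, hood in csv.reader(io.StringIO(raw_csv)):
        record = {"prefecture": pref, "city": city, "neighborhood": hood}
        conn.execute(
            "INSERT INTO postal_data VALUES (?, ?)",
            (code, json.dumps(record, ensure_ascii=False)),
        )
    conn.commit()
    return conn


def lookup(conn: sqlite3.Connection, code: str) -> dict:
    """Runtime step: a single keyed query plus JSON decoding."""
    row = conn.execute(
        "SELECT data FROM postal_data WHERE code = ?", (code,)
    ).fetchone()
    if row is None:
        raise KeyError(code)
    return json.loads(row[0])


conn = build_db(RAW_CSV)
print(lookup(conn, "1050011")["city"])
```

With this split, all cleanup of the raw data happens once in `build_db`, and the runtime path stays a plain key lookup.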

The postal code data has many irregularities. This section explains how each of them is handled.

In normal usage posuto doesn't require any dependencies. When building the postal data from the raw CSVs, mojimoji is used for character conversion and iconv for encoding conversion.

Field names

The primary fields of an address and the translations preferred here for each are:

- 都道府県: prefecture
- 市区町村: city
- 町域名: neighborhood

    import posuto

    # 🗼
    tt = posuto.get('〒105-0011')
    print(tt.prefecture, tt.city, tt.neighborhood)
    # "東京都 港区 芝公園"

Notes

The postal data often includes notes in the neighborhood field. These are always in parentheses, with one exception: "以下に掲載がない場合" ("when not listed below"). All notes are put in the note field, and no attempt is made to extract their yomigana or romaji (which are often not available anyway).

minatoku = posuto.get('1050000')
print(minatoku.note)
# "以下に掲載がない場合"
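
The separation of notes described above could look roughly like the following. This is a sketch, not posuto's code: `split_note` is a hypothetical helper, it assumes notes use full-width parentheses at the end of the field, and returning an empty neighborhood for the unparenthesized special case is a choice made only for this example.

```python
import re

# The one note that appears without parentheses in the source data.
UNLISTED = "以下に掲載がない場合"

# A trailing parenthetical in full-width parentheses.
NOTE_RE = re.compile(r"^(?P<name>.*?)（(?P<note>.*)）$")


def split_note(neighborhood: str):
    """Return (neighborhood, note); note is None when there is none."""
    if neighborhood == UNLISTED:
        # Assumption for this sketch: the whole field is the note.
        return "", UNLISTED
    match = NOTE_RE.match(neighborhood)
    if match:
        return match.group("name"), match.group("note")
    return neighborhood, None


print(split_note("大手町　ＪＡビル（地階・階層不明）"))
print(split_note("芝公園"))
```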

Yomigana

Yomigana are converted to full-width kana.
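
For a feel of what this conversion does, here is a minimal sketch that assumes the source yomigana are half-width katakana. It uses Unicode NFKC normalization from the standard library in place of mojimoji, which the build actually uses; `to_fullwidth_kana` is a name invented for this example.

```python
import unicodedata


def to_fullwidth_kana(text: str) -> str:
    """Convert half-width katakana to full-width via NFKC normalization.

    NFKC also composes a kana followed by a half-width voicing mark into a
    single character, for example ﾊ + ﾞ becomes バ.
    """
    return unicodedata.normalize("NFKC", text)


print(to_fullwidth_kana("ﾄｳｷｮｳﾄ"))
print(to_fullwidth_kana("ｼﾊﾞｺｳｴﾝ"))
```

Note that NFKC is broader than a kana-only conversion: it also folds full-width Latin letters and digits to ASCII, so it is not a drop-in replacement where those must be preserved.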

Romaji

Romaji in the original file are in all caps. This is converted to title case.
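
A minimal sketch of the case conversion, assuming the raw romaji arrive as separate all-caps prefecture, city, and neighborhood strings. The helper names and the raw input values are assumptions for this example; only the resulting format matches the output shown elsewhere in this README.

```python
def titlecase_romaji(raw: str) -> str:
    """Capitalize each space-separated word and lowercase the rest."""
    return " ".join(word.capitalize() for word in raw.split())


def format_romaji(prefecture: str, city: str, neighborhood: str) -> str:
    """Join the non-empty parts with commas after title-casing each one."""
    parts = (prefecture, city, neighborhood)
    return ", ".join(titlecase_romaji(part) for part in parts if part)


print(format_romaji("TOKYO TO", "MINATO KU", "SHIBAKOEN"))
```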

The supplied romaji make no effort to accommodate words of foreign origin, so "スウェーデンヒルズ" is rendered as "Suedenhiruzu" rather than "Sweden Hills". It may be possible to improve on this but it's outside the scope of this library; it's better to use a good romanization library, like cutlet.

Some more issues:

- 1006890: "大手町　ＪＡビル（地階・階層不明）" → "OTEMACHI JIEIEIBIRU(CHIKAI.KAISOFUM"
  - JA → JIEIEI
  - ・ → .
  - the transliteration is cut off at an arbitrary point, and the note is transliterated rather than translated
- 1000004: "次のビルを除く" → "TSUGINOBIRUONOZOKU"

In general, use the romaji here with caution.

sweden = posuto.get('0613777')
print(sweden.romaji)
# "Hokkaido, Ishikari Gun Tobetsu Cho, Suedenhiruzu"

Long Neighborhood Names

The postal data README explains that when the neighborhood field is over 38 characters it is continued onto multiple lines. This is not explicitly marked in the data, and the position of the line breaks appears to be arbitrary (it is often neither after the 38th character nor at a reasonable word boundary). The only indicator of a continued entry is an unclosed parenthesis on the first line. The lines of such an entry are always in order in the original file.

In posuto, the parenthetical information is considered a note and put in the note field.

omiya = posuto.get('6020847')
print(omiya)
# "京都府京都市上京区大宮町"
print(omiya.note)
# "今出川通河原町西入、今出川通寺町東入、今出川通寺町東入下る、河原町通今出川下る、河原町通今出川下る西入、寺町通今出川下る東入、中筋通石薬師上る"
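
The unclosed-parenthesis rule suggests a simple way to rejoin continued entries, sketched below. `join_continued` is a hypothetical helper, the rows are reduced to (code, neighborhood) pairs, and the sample data is a shortened, made-up split of the entry above rather than the real lines from the CSV.

```python
from typing import Iterable, Iterator, Optional, Tuple

Row = Tuple[str, str]  # (postal code, neighborhood text)


def join_continued(rows: Iterable[Row]) -> Iterator[Row]:
    """Merge consecutive rows while a full-width parenthesis is unclosed."""
    pending_code: Optional[str] = None
    pending_text = ""
    for code, text in rows:
        if pending_code is None:
            pending_code, pending_text = code, text
        else:
            # Still inside an unclosed parenthesis: append the continuation.
            pending_text += text
        if pending_text.count("（") > pending_text.count("）"):
            continue  # wait for the line that closes the parenthesis
        yield pending_code, pending_text
        pending_code, pending_text = None, ""
    if pending_code is not None:
        # Input ended with the parenthesis still open; emit what we have.
        yield pending_code, pending_text


rows = [
    ("6020847", "大宮町（今出川通河原町西入、今出川通寺町東入、"),
    ("6020847", "今出川通寺町東入下る）"),
    ("1050011", "芝公園"),
]
for code, text in join_continued(rows):
    print(code, text)
```

This relies on the continuation lines being adjacent and in order, which is the property of the original file noted above.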

Multiple Regions in One Code

Sometimes a postal code covers multiple regions. Often the city is the same and just the neighborhood varies, but sometimes part of the city field varies, or even the whole city field. Codes like this are indicated by the "一つの郵便番号で二以上の町域を表す場合の表示" field in the original CSV data, which is called multi here.

For now, if more than one region shares a code, the main entry is the first region listed in the main CSV, and the other regions are stored as a list in the alternates property. There may be a better way to do this.
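
The "first region is the main entry, the rest are alternates" rule can be sketched as a single pass over the rows. The `Entry` class, the `group_by_code` helper, and the placeholder rows are all invented for this example and are not posuto's actual data model.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Tuple


@dataclass
class Entry:
    code: str
    city: str
    neighborhood: str
    alternates: List["Entry"] = field(default_factory=list)


def group_by_code(rows: Iterable[Tuple[str, str, str]]) -> Dict[str, Entry]:
    """Keep the first row per code as the main entry; attach later ones."""
    entries: Dict[str, Entry] = {}
    for code, city, neighborhood in rows:
        entry = Entry(code, city, neighborhood)
        main = entries.get(code)
        if main is None:
            entries[code] = entry
        else:
            main.alternates.append(entry)
    return entries


# Placeholder rows, not real postal data.
rows = [
    ("0000000", "City A", "North"),
    ("0000000", "City A", "South"),
    ("1111111", "City B", "Center"),
]
grouped = group_by_code(rows)
print(grouped["0000000"].neighborhood)
print([alt.neighborhood for alt in grouped["0000000"].alternates])
```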

Programming Notes

This section is for notes on the use of the library itself as opposed to notes about the data structure.

Multi-threaded Environments

By default, posuto creates a DB connection and cursor on startup and reuses it for all requests. In the typical single-threaded, read-only scenario this is not a problem, but it causes warnings (and may cause problems) in a multi-threaded scenario. In that case you can manage db connections manually using a context manager object.

from posuto import Posuto

with Posuto() as pp:
    tower = pp.get('〒105-0011')

When the object is used this way, the connection is closed automatically when the with block exits.
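
To show the general pattern without depending on posuto itself, the sketch below uses a hypothetical `CodeDB` class backed by a throwaway SQLite file. Each worker thread opens its own connection inside a with block, so no connection is ever shared across threads. The class, table layout, and helper names are assumptions made for this example.

```python
import os
import sqlite3
import tempfile
from concurrent.futures import ThreadPoolExecutor


class CodeDB:
    """Hypothetical lookup object that owns one connection per with block."""

    def __init__(self, path: str):
        self.path = path
        self.conn = None

    def __enter__(self):
        self.conn = sqlite3.connect(self.path)
        return self

    def __exit__(self, exc_type, exc, tb):
        self.conn.close()
        self.conn = None
        return False  # do not swallow exceptions

    def get(self, code: str):
        row = self.conn.execute(
            "SELECT name FROM codes WHERE code = ?", (code,)
        ).fetchone()
        return row[0] if row else None


def make_demo_db() -> str:
    """Create a small on-disk database so several threads can open it."""
    fd, path = tempfile.mkstemp(suffix=".db")
    os.close(fd)
    conn = sqlite3.connect(path)
    conn.execute("CREATE TABLE codes (code TEXT PRIMARY KEY, name TEXT)")
    conn.execute("INSERT INTO codes VALUES (?, ?)", ("1050011", "東京都港区芝公園"))
    conn.commit()
    conn.close()
    return path


def lookup_in_thread(path: str, code: str):
    # The connection is created and closed in the thread that uses it.
    with CodeDB(path) as db:
        return db.get(code)


path = make_demo_db()
with ThreadPoolExecutor(max_workers=4) as pool:
    results = list(pool.map(lambda c: lookup_in_thread(path, c), ["1050011"] * 8))
os.remove(path)
print(results[0], len(results))
```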

License

The original postal data is provided by Japan Post with an indication that they will not assert copyright. The code in this repository is released under the MIT or WTFPL license.



