Library for CJK (Chinese, Japanese, Korean) language data.

Project description

cihai

Python library for CJK (Chinese, Japanese, Korean) data.

This project is under active development. Follow our progress and check back for updates!

Quickstart

API / Library (this repository)

$ pip install --user cihai

from cihai.core import Cihai

c = Cihai()

if not c.unihan.is_bootstrapped:  # download and install Unihan to db
    c.unihan.bootstrap()

query = c.unihan.lookup_char('好')
glyph = query.first()
print("lookup for 好: %s" % glyph.kDefinition)
# lookup for 好: good, excellent, fine; well

query = c.unihan.reverse_char('good')
print('matches for "good": %s' % ', '.join(glyph.char for glyph in query))
# matches for "good": 㑘, 㑤, 㓛, 㘬, 㙉, 㚃, 㚒, 㚥, 㛦, 㜴, 㜺, 㝖, 㤛, 㦝, ...

See API documentation and /examples.

CLI (cihai-cli)

$ pip install --user cihai-cli

Character lookup:

$ cihai info 好
char: 好
kCantonese: hou2 hou3
kDefinition: good, excellent, fine; well
kHangul: 
kJapaneseOn: KOU
kKorean: HO
kMandarin: hǎo
kTang: "*xɑ̀u *xɑ̌u"
kTotalStrokes: "6"
kVietnamese: háo
ucn: U+597D

Reverse lookup:

$ cihai reverse library
char: 圕
kCangjie: WLGA
kCantonese: syu1
kCihaiT: '308.302'
kDefinition: library
kMandarin: 
kTotalStrokes: '13'
ucn: U+5715
--------
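The ucn field in the output above is the character's Unicode code point written as U+ followed by hexadecimal digits. As a minimal sketch using only the Python standard library (the helper names ucn_to_char and char_to_ucn are hypothetical and not part of cihai or cihai-cli), the conversion in both directions could look like this:

```python
def ucn_to_char(ucn: str) -> str:
    """Convert a code point string such as 'U+597D' to its character.

    Hypothetical helper for illustration; not part of the cihai API.
    """
    if not ucn.upper().startswith("U+"):
        raise ValueError(f"not a U+XXXX string: {ucn!r}")
    # The part after 'U+' is a hexadecimal code point.
    return chr(int(ucn[2:], 16))


def char_to_ucn(char: str) -> str:
    """Convert a single character to 'U+XXXX' notation (at least 4 hex digits)."""
    if len(char) != 1:
        raise ValueError("expected exactly one character")
    return f"U+{ord(char):04X}"


print(ucn_to_char("U+597D"))  # the character from the 'cihai info' example
print(char_to_ucn("好"))
```

This only relies on the built-in chr, ord, and int functions, so it works without Unihan data being installed.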

UNIHAN data

All datasets that cihai uses have stand-alone tools to export their data. No library required.

Developing

$ git clone https://github.com/cihai/cihai.git
$ cd cihai/

Bootstrap your environment and learn more about contributing. We use the same conventions / tools across all cihai projects: pytest, sphinx, mypy, ruff, tmuxp, and file watcher helpers (e.g. entr(1)).

Python versions

  • 0.19.0: Last Python 3.7 release

Download files

Source Distribution

cihai-0.38.0.tar.gz (292.5 kB view details)

Uploaded Source

Built Distribution

cihai-0.38.0-py3-none-any.whl (25.3 kB view details)

Uploaded Python 3

File details

Details for the file cihai-0.38.0.tar.gz.

File metadata

  • Download URL: cihai-0.38.0.tar.gz
  • Upload date:
  • Size: 292.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for cihai-0.38.0.tar.gz
Algorithm Hash digest
SHA256 3f34994c7c7973cf8c300aafdc29d2e3e89fedf5d4d80896b5d88cc1fdd30ce9
MD5 f5a66394ecc68962fa7e92d146de5087
BLAKE2b-256 e152a8444d7c2707d83ac35c4d0e9ffc3b9b8177ebe5558a6f75501ab0953beb

Provenance

The following attestation bundles were made for cihai-0.38.0.tar.gz:

Publisher: tests.yml on cihai/cihai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file cihai-0.38.0-py3-none-any.whl.

File metadata

  • Download URL: cihai-0.38.0-py3-none-any.whl
  • Upload date:
  • Size: 25.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for cihai-0.38.0-py3-none-any.whl
Algorithm Hash digest
SHA256 94fc953ca66681bcbbf8361d7b7b0015d35c5186a10ce4a27f4dbbbdfd169638
MD5 d2cddf468fef37e00e475239cf1a8188
BLAKE2b-256 7c18c994464304341054c7a7dffd62085e91c635df39fe5d3addcdf113db0734
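The SHA256 digests listed above can be checked against a downloaded file. The following is a minimal sketch using Python's standard hashlib module; the helper names sha256_of and matches_published_hash are hypothetical, and the local file path assumes the archive was saved to the current directory under its published name.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest (case-insensitive)."""
    return sha256_of(path) == expected_hex.strip().lower()


# Assumed local path: the source distribution saved in the current directory.
archive = Path("cihai-0.38.0.tar.gz")
if archive.exists():
    published = "3f34994c7c7973cf8c300aafdc29d2e3e89fedf5d4d80896b5d88cc1fdd30ce9"
    print("hash matches:", matches_published_hash(archive, published))
```

pip can also enforce hashes itself through its hash-checking mode, which may be preferable for automated installs.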

Provenance

The following attestation bundles were made for cihai-0.38.0-py3-none-any.whl:

Publisher: tests.yml on cihai/cihai
