A Python parser for the CC-CEDICT Chinese-English dictionary


A parser for the CC-CEDICT Chinese-English dictionary.

Install

PIP

https://pypi.org/project/cedict-utils/

pip install cedict-utils

REPO

  1. Create virtualenv.
python3 -m venv ~/.virtualenvs/cedict-utils
  2. Activate venv.
source ~/.virtualenvs/cedict-utils/bin/activate
  3. Install requirements.
pip install -r requirements.txt

Dictionary

Download dictionary

wget -O - https://www.mdbg.net/chinese/export/cedict/cedict_1_0_ts_utf-8_mdbg.txt.gz | gunzip > data/cedict_ts.u8
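
Where wget and gunzip are not available, the same download can be sketched in Python with only the standard library. The helper names gunzip_stream and download_cedict are hypothetical and not part of cedict-utils; the URL and the default output path are the ones from the command above.

```python
import gzip
import shutil
import urllib.request
from pathlib import Path

# URL taken from the wget command above.
CEDICT_URL = "https://www.mdbg.net/chinese/export/cedict/cedict_1_0_ts_utf-8_mdbg.txt.gz"


def gunzip_stream(src, dest_path):
    """Decompress a gzip byte stream `src` into the file at `dest_path`.

    Hypothetical helper, not part of cedict-utils.
    """
    dest = Path(dest_path)
    # Create the target directory (e.g. data/) if it does not exist yet.
    dest.parent.mkdir(parents=True, exist_ok=True)
    with gzip.GzipFile(fileobj=src, mode="rb") as unpacked, dest.open("wb") as out:
        shutil.copyfileobj(unpacked, out)
    return dest


def download_cedict(dest_path="data/cedict_ts.u8", url=CEDICT_URL):
    """Fetch the gzipped dictionary and write it uncompressed to `dest_path`."""
    with urllib.request.urlopen(url) as response:
        return gunzip_stream(response, dest_path)
```

Calling download_cedict() with no arguments writes to data/cedict_ts.u8, the same path the shell command uses.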

Usage

$ python
>>> from cedict_utils.cedict import CedictParser
>>> parser = CedictParser()
>>> parser.read_file()
>>> entries = parser.parse()
>>> for e in entries:
...     print(e)
... 
..
龟缩 (龜縮) - gui1 suo1
龟背竹 (龜背竹) - gui1 bei4 zhu2
龟船 (龜船) - gui1 chuan2
..
>>> entries[200].simplified
'敦'
>>> entries[200].traditional
'㪟'
>>> entries[200].pinyin
'dun1'
>>> entries[200].raw_line
'㪟 敦 [dun1] /variant of 敦[dun1]/'
>>> entries[200].meanings
['variant of 敦[dun1]']
>>> 
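
The raw_line above suggests how each dictionary line maps onto the entry attributes: traditional form, simplified form, pinyin in square brackets, then meanings separated by slashes. The following is a minimal sketch of that mapping, assuming this layout holds for every entry. It is not the code CedictParser uses, and Entry, LINE_RE and parse_line are hypothetical names introduced for illustration.

```python
import re
from typing import List, NamedTuple, Optional

# Assumed layout, inferred from the raw_line shown above:
#   TRADITIONAL SIMPLIFIED [pinyin] /meaning 1/meaning 2/
LINE_RE = re.compile(r"^(\S+)\s+(\S+)\s+\[([^\]]*)\]\s+/(.*)/\s*$")


class Entry(NamedTuple):
    """Illustrative stand-in for a parsed entry (not the library's class)."""
    traditional: str
    simplified: str
    pinyin: str
    meanings: List[str]


def parse_line(line: str) -> Optional[Entry]:
    """Split one dictionary line into its fields; return None if it does not match."""
    # Header and comment lines in the dictionary file start with '#'.
    if line.startswith("#"):
        return None
    match = LINE_RE.match(line)
    if match is None:
        return None
    traditional, simplified, pinyin, meanings = match.groups()
    return Entry(traditional, simplified, pinyin, meanings.split("/"))


entry = parse_line("㪟 敦 [dun1] /variant of 敦[dun1]/")
print(entry.simplified, entry.pinyin, entry.meanings)
# prints: 敦 dun1 ['variant of 敦[dun1]']
```

The meanings group is matched greedily up to the last slash, so brackets inside a meaning, such as 敦[dun1], stay intact.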

Tests

Running tests

$ make test
pytest
================================================================ test session starts ================================================================
platform linux -- Python 3.6.8, pytest-3.2.3, py-1.4.34, pluggy-0.4.0
rootdir: /Development/cedict-utils, inifile:
collected 8 items                                                                                                                                    

tests/test_cedict.py ........

============================================================= 8 passed in 0.09 seconds ==============================================================

Resources
