A Python parser for the CC-CEDICT Chinese-English dictionary
Project description
A parser for the CC-CEDICT Chinese-English dictionary.
# Install
## PIP
https://pypi.org/project/cedict-utils/
~~~ pip install cedict-utils ~~~
## REPO
Create virtualenv.
~~~ python3 -m venv ~/.virtualenvs/cedict-utils ~~~
Activate venv.
~~~ source ~/.virtualenvs/cedict-utils/bin/activate ~~~
Install requirements.
~~~ pip install -r requirements.txt ~~~
# Dictionary
Download dictionary
~~~ wget -O - https://www.mdbg.net/chinese/export/cedict/cedict_1_0_ts_utf-8_mdbg.txt.gz | gunzip > data/cedict_ts.u8 ~~~
# Usage
~~~ python $ python >>> from cedict_utils.cedict import CedictParser >>> parser = CedictParser() >>> parser.read_file() >>> entries = parser.parse() >>> for e in entries: … print(e) … .. 龟缩 (龜縮) - gui1 suo1 龟背竹 (龜背竹) - gui1 bei4 zhu2 龟船 (龜船) - gui1 chuan2 .. >>> entries[200].simplified ‘敦’ >>> entries[200].traditional ‘㪟’ >>> entries[200].pinyin ‘dun1’ >>> entries[200].raw_line ‘㪟 敦 [dun1] /variant of 敦[dun1]/’ >>> entries[200].meanings [‘variant of 敦[dun1]’] >>> ~~~
# Tests
Running tests
~~~ $ make test pytest ================================================================ test session starts ================================================================ platform linux – Python 3.6.8, pytest-3.2.3, py-1.4.34, pluggy-0.4.0 rootdir: /Development/cedict-utils, inifile: collected 8 items
tests/test_cedict.py ……..
============================================================= 8 passed in 0.09 seconds ============================================================== ~~~
# Resources
Cedict project https://www.mdbg.net/chinese/dictionary?page=cc-cedict
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Hashes for cedict_utils-0.0.3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ec7a6266886dc7bf3914716065234f00f2ca491c22d2f0d8a7d6ac310438cc4e |
|
MD5 | d964bd0370c794c86f16301cdc56680b |
|
BLAKE2b-256 | d77b011791c4581d3c94bb4fb6b8f97ad05db1ccced2ff148adf3f4e40d40042 |