Programmatic interface to the Chinese Vocab List
Project description
Chinese Vocab List
A list of Chinese vocabulary words with definitions, pronunciations, and example sentences. Under a CC-BY-SA license. See chinese_vocab_list.yaml for the list itself.
Used by the Chinese Prestudy Anki addon. See this blog post for more details.
Contributing
There are a few ways to contribute:
- Making changes to the source code in
src/. - Making changes files in
contrib_files/:subtlex_dupes.yamllists words that are redundant with other words. For example,身上: 身in that file means that instead of learning the word "身上", someone should just learn the word "身".preferred_entries.yamlindicates which entries from CC-CEDICT are the best to use for each word. Only needed when you increase the size of the vocab list and it complains because it finds a word with multiple definition. Note: some words have multiple meanings that are worth learning but are split across different entries in CC-CEDICT. For example, 只 and 面. I don't have a good way to represent these inchinese_vocab_list.yamlyet.
- Directly modifying
chinese_vocab_list.yaml.
If you change src/ or contrib_files/, be sure to run make chinese_vocab_list.yaml and check in both your changes and the generated changes to chinese_vocab_list.yaml.
Updating reference_files:
cc_cedict.txt: Runcurl https://www.mdbg.net/chinese/export/cedict/cedict_1_0_ts_utf-8_mdbg.txt.gz | gunzip > reference_files/cc_cedict.txt- You may need to update contrib_files/preferred_entries.yaml and/or other files in order to handle the update. Run
makeand fix errors until the vocab list builds cleanly.
- You may need to update contrib_files/preferred_entries.yaml and/or other files in order to handle the update. Run
Publishing to PyPI
If your name is Kerrick, you can publish the chinesevocablist package to PyPI by running these commands from the root of the repo:
rm -rf dist/*
python3 -m build
python3 -m twine upload dist/*
Note that this directly uploads to prod PyPI and skips uploading to test PyPI.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file chinesevocablist-0.3.8.tar.gz.
File metadata
- Download URL: chinesevocablist-0.3.8.tar.gz
- Upload date:
- Size: 523.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
45e4e21d632af513d94aa0aced3fa85a8c2187a165b1d49d06101cc25f213b2b
|
|
| MD5 |
5df0d743da64bb1f7185e43bdad4d2fb
|
|
| BLAKE2b-256 |
94b4092e2ea0c40dfa4b2070916b884b2fd20e7b10c723c0247e0bcd6f37c011
|
File details
Details for the file chinesevocablist-0.3.8-py3-none-any.whl.
File metadata
- Download URL: chinesevocablist-0.3.8-py3-none-any.whl
- Upload date:
- Size: 527.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
51345d779681034452f509b5f4e4685ce60e5b965a0725577550a04b8cf871d5
|
|
| MD5 |
6a8e49a83917fc09d326bb832c023959
|
|
| BLAKE2b-256 |
fbbbab7504d859e7f645be9f4295bd476236aec7c9ebf6be29218a5730aa382c
|