
Programmatic interface to the Chinese Vocab List


Chinese Vocab List

A list of Chinese vocabulary words with definitions, pronunciations, and example sentences, released under a CC-BY-SA license. See chinese_vocab_list.yaml for the list itself.

Used by the Chinese Prestudy Anki addon. See this blog post for more details.


Contributing

There are a few ways to contribute:

  • Making changes to the source code in src/.
  • Making changes to files in contrib_files/:
    • subtlex_dupes.yaml lists words that are redundant with other words. For example, 身上: 身 in that file means that instead of learning the word "身上", someone should just learn the word "身".
    • preferred_entries.yaml indicates which CC-CEDICT entry is the best one to use for each word. It only needs updating when you increase the size of the vocab list and the build complains because it finds a word with multiple definitions. Note: some words, such as 只 and 面, have multiple meanings that are worth learning but are split across different entries in CC-CEDICT. I don't have a good way to represent these in chinese_vocab_list.yaml yet.
  • Directly modifying chinese_vocab_list.yaml.
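
To make the subtlex_dupes.yaml mapping concrete, here is a minimal sketch of how a "redundant word: word to learn instead" mapping could be applied to a word list. It is not the project's actual code in src/. The dupes dict is a hand-written stand-in for the parsed YAML file, and collapse_dupes is a hypothetical helper name.

```python
# Hypothetical stand-in for the parsed contents of subtlex_dupes.yaml:
# each key is a redundant word, each value is the word to learn instead.
dupes = {"身上": "身"}


def collapse_dupes(words, dupes):
    """Map redundant words to their replacement, keeping first-seen order."""
    seen = set()
    result = []
    for word in words:
        canonical = dupes.get(word, word)  # unchanged if not listed as a dupe
        if canonical not in seen:
            seen.add(canonical)
            result.append(canonical)
    return result


print(collapse_dupes(["身上", "你好", "身"], dupes))  # ['身', '你好']
```

The sketch follows only one level of mapping. Whether the real file ever chains entries (A to B, B to C) is not stated here, so that case is left out.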

If you change src/ or contrib_files/, be sure to run make chinese_vocab_list.yaml and check in both your changes and the generated changes to chinese_vocab_list.yaml.

Updating reference_files:

  • cc_cedict.txt: Run curl https://www.mdbg.net/chinese/export/cedict/cedict_1_0_ts_utf-8_mdbg.txt.gz | gunzip > reference_files/cc_cedict.txt
    • You may need to update contrib_files/preferred_entries.yaml and/or other files in order to handle the update. Run make and fix errors until the vocab list builds cleanly.
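
When a CC-CEDICT update makes the build fail, it can help to see which words have more than one entry, since those are the words that may need a line in preferred_entries.yaml. The sketch below is one way to list them and is not the project's build logic. It assumes the standard CC-CEDICT line layout (Traditional Simplified [pinyin] /gloss/gloss/), the function names are hypothetical, and the sample lines are abbreviated illustrations rather than verbatim dictionary entries.

```python
import re
from collections import defaultdict

# Assumed CC-CEDICT layout: Traditional Simplified [pinyin] /gloss 1/gloss 2/
ENTRY_RE = re.compile(r"^(\S+) (\S+) \[([^\]]*)\] /(.*)/$")


def parse_cedict(lines):
    """Group (pinyin, glosses) entries by simplified headword."""
    entries = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):  # skip blanks and header comments
            continue
        match = ENTRY_RE.match(line)
        if match is None:  # skip anything that does not fit the assumed layout
            continue
        _traditional, simplified, pinyin, glosses = match.groups()
        entries[simplified].append((pinyin, glosses.split("/")))
    return entries


def ambiguous_words(entries):
    """Return the headwords that have more than one dictionary entry."""
    return sorted(word for word, found in entries.items() if len(found) > 1)


# Abbreviated, illustrative lines (not verbatim CC-CEDICT data).
sample = [
    "# comment line",
    "隻 只 [zhi1] /classifier for birds and certain animals/",
    "只 只 [zhi3] /only/merely/",
    "身 身 [shen1] /body/",
]

print(ambiguous_words(parse_cedict(sample)))  # ['只']
```

To run this against the real dictionary, pass the open file for reference_files/cc_cedict.txt (opened with encoding="utf-8") to parse_cedict in place of sample.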

Publishing to PyPI

If your name is Kerrick, you can publish the chinesevocablist package to PyPI by running these commands from the root of the repo:

rm -rf dist/*
python3 -m build
python3 -m twine upload dist/*

Note that this directly uploads to prod PyPI and skips uploading to test PyPI.

