Skip to main content

the Old Chinese language for spaCy

Project description

spacy-och

ci pypi

the Old Chinese (och) language for the spaCy NLP library.

installation

requires spacy v3.

$ pip install spacy-och

usage

this package currently doesn't include trained models and is intended for basic NLP usage only, via nlp.blank(). it tokenizes texts by character and supports the Token.like_num and Token.is_stop attributes.

>>> import spacy
>>> nlp = spacy.blank("och")
>>> from spacy_och.examples import sentences
>>> doc = nlp(sentences[0])
>>> doc.text
子曰:「上下无常非為邪也進退无恆非離群也君子進德脩業欲及時也故无咎。」
>>> [t for t in doc if t.is_stop] # all stop words
[, , , , , , , , , , , , , ]

more functionality is coming soon!

developing

after cloning the repository:

$ pip install -e ".[dev]"
$ pre-commit install

building

build a source archive and distribution for a release:

$ rm -rf dist/*
$ python -m build

publish the release on test PyPI (useful for making sure everything worked):

$ python -m twine upload --repository testpypi dist/*

if everything looks ok, upload to the real PyPI:

$ python -m twine upload dist/*

license

code is licensed under the MIT license. some lookups data is derived from files licensed under the unicode data files and software license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spacy-och-0.1.2.tar.gz (358.5 kB view details)

Uploaded Source

Built Distribution

spacy_och-0.1.2-py3-none-any.whl (372.2 kB view details)

Uploaded Python 3

File details

Details for the file spacy-och-0.1.2.tar.gz.

File metadata

  • Download URL: spacy-och-0.1.2.tar.gz
  • Upload date:
  • Size: 358.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.7 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.7

File hashes

Hashes for spacy-och-0.1.2.tar.gz
Algorithm Hash digest
SHA256 6314bce6907047c107662476ce50ff83f618832d7ce044d79e3ed40ecc898e05
MD5 320340cfbee2db2c2c4e43a4f6ee6b0e
BLAKE2b-256 f496d37c3419778ba8c382b27c1bd138de9bf53a40f5eae4e54d181964e3317f

See more details on using hashes here.

File details

Details for the file spacy_och-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: spacy_och-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 372.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.7 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.7

File hashes

Hashes for spacy_och-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 ba114f0e35f6b88a3cf77a3f5167fd05b300a9d29cfb275dcdad033c6b2e3291
MD5 57ce556e11f8d99e49de3b27d366eee8
BLAKE2b-256 5c588da555ff8d821b5855fc4401b0a972969d5b6d47d9d6ca46331f76459f4b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page