Underthesea Core

Underthesea Core is an extension of the Underthesea natural language processing library. It provides data preprocessing tools and machine learning models for training. The core is written in Rust for performance and ships with Python bindings, so it can be used from existing Python projects.

Usage

CRFFeaturizer

>>> from underthesea_core import CRFFeaturizer
>>> features = ["T[-1]", "T[0]", "T[1]"]
>>> dictionary = set(["sinh viên"])
>>> featurizer = CRFFeaturizer(features, dictionary)
>>> sentences = [[["sinh", "X"], ["viên", "X"], ["đi", "X"], ["học", "X"]]]
>>> featurizer.process(sentences)
[[['T[-1]=BOS', 'T[0]=sinh', 'T[1]=viên'],
  ['T[-1]=sinh', 'T[0]=viên', 'T[1]=đi'],
  ['T[-1]=viên', 'T[0]=đi', 'T[1]=học'],
  ['T[-1]=đi', 'T[0]=học', 'T[1]=EOS']]]
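
To make the output above easier to read, here is a minimal pure-Python sketch of how templates such as T[-1], T[0] and T[1] could be expanded. It is an illustration only, not the Rust implementation behind CRFFeaturizer: the helper name expand_templates is hypothetical, only the single-offset T[i] form is handled, the dictionary argument is ignored, and the BOS/EOS padding is inferred from the example output.

```python
import re

# Hypothetical helper, NOT part of underthesea_core. It handles only
# templates of the form "T[i]" and reads the first column of each token.
TEMPLATE = re.compile(r"^T\[(-?\d+)\]$")


def expand_templates(sentence, templates):
    """Return one list of 'template=value' strings per token."""
    tokens = [row[0] for row in sentence]
    rows = []
    for position in range(len(tokens)):
        row = []
        for template in templates:
            match = TEMPLATE.match(template)
            if match is None:
                raise ValueError(f"unsupported template: {template}")
            index = position + int(match.group(1))
            if index < 0:
                value = "BOS"  # offset points before the sentence start
            elif index >= len(tokens):
                value = "EOS"  # offset points past the sentence end
            else:
                value = tokens[index]
            row.append(f"{template}={value}")
        rows.append(row)
    return rows


sentence = [["sinh", "X"], ["viên", "X"], ["đi", "X"], ["học", "X"]]
result = expand_templates(sentence, ["T[-1]", "T[0]", "T[1]"])
for row in result:
    print(row)
```

Each template contributes one feature per token, and the offset in brackets selects a neighbouring token relative to the current position.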

Release Workflow

  1. Change the version in Cargo.toml and pyproject.toml.
  2. Push to the core branch with the commit message "Publish Underthesea Core".
     • This triggers the release-pypi-core action.
  3. Check the latest version on PyPI.

Note: Building for macos-arm requires running a self-hosted runner.
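
Because the first step of the release workflow edits the version in two files, a small check before pushing can catch a mismatch. The sketch below is an assumption-laden illustration, not part of the project's tooling: the helper names are hypothetical, and it assumes both files declare the version on a line of the form version = "..." and that the first such line is the package version.

```python
import re
from pathlib import Path

# Assumption: the package version appears as a top-level line such as
# version = "1.0.4" and is the first such line in each file.
VERSION_LINE = re.compile(r'^version\s*=\s*"([^"]+)"', re.MULTILINE)


def read_version(text):
    """Return the first declared version string, or None if absent."""
    match = VERSION_LINE.search(text)
    return match.group(1) if match else None


def versions_match(cargo_text, pyproject_text):
    """True only when both files declare the same version."""
    cargo = read_version(cargo_text)
    pyproject = read_version(pyproject_text)
    return cargo is not None and cargo == pyproject


def check_release_files(cargo_path="Cargo.toml", pyproject_path="pyproject.toml"):
    """Hypothetical entry point: read both files from disk and compare."""
    cargo_text = Path(cargo_path).read_text(encoding="utf-8")
    pyproject_text = Path(pyproject_path).read_text(encoding="utf-8")
    return versions_match(cargo_text, pyproject_text)
```

Calling check_release_files() from the directory that holds both files returns False when the versions differ or when either file has no matching line.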

Download files

Download the file for your platform.

Source Distribution

underthesea_core-1.0.4.tar.gz (560.4 kB): Source

Built Distributions

underthesea_core-1.0.4-cp311-none-win_amd64.whl (553.0 kB): CPython 3.11, Windows x86-64
underthesea_core-1.0.4-cp311-cp311-manylinux2010_x86_64.whl (657.8 kB): CPython 3.11, manylinux: glibc 2.12+ x86-64
underthesea_core-1.0.4-cp311-cp311-macosx_11_0_arm64.whl (574.6 kB): CPython 3.11, macOS 11.0+ ARM64
underthesea_core-1.0.4-cp311-cp311-macosx_10_7_x86_64.whl (601.6 kB): CPython 3.11, macOS 10.7+ x86-64
underthesea_core-1.0.4-cp310-none-win_amd64.whl (553.0 kB): CPython 3.10, Windows x86-64
underthesea_core-1.0.4-cp310-cp310-manylinux2010_x86_64.whl (657.8 kB): CPython 3.10, manylinux: glibc 2.12+ x86-64
underthesea_core-1.0.4-cp310-cp310-macosx_11_0_arm64.whl (574.6 kB): CPython 3.10, macOS 11.0+ ARM64
underthesea_core-1.0.4-cp310-cp310-macosx_10_7_x86_64.whl (601.6 kB): CPython 3.10, macOS 10.7+ x86-64
underthesea_core-1.0.4-cp39-none-win_amd64.whl (553.2 kB): CPython 3.9, Windows x86-64
underthesea_core-1.0.4-cp39-cp39-manylinux2010_x86_64.whl (657.9 kB): CPython 3.9, manylinux: glibc 2.12+ x86-64
underthesea_core-1.0.4-cp39-cp39-macosx_11_0_arm64.whl (574.9 kB): CPython 3.9, macOS 11.0+ ARM64
underthesea_core-1.0.4-cp39-cp39-macosx_10_7_x86_64.whl (601.6 kB): CPython 3.9, macOS 10.7+ x86-64
underthesea_core-1.0.4-cp38-none-win_amd64.whl (552.4 kB): CPython 3.8, Windows x86-64
underthesea_core-1.0.4-cp38-cp38-manylinux2010_x86_64.whl (657.9 kB): CPython 3.8, manylinux: glibc 2.12+ x86-64
underthesea_core-1.0.4-cp38-cp38-macosx_11_0_arm64.whl (574.2 kB): CPython 3.8, macOS 11.0+ ARM64
underthesea_core-1.0.4-cp38-cp38-macosx_10_7_x86_64.whl (601.4 kB): CPython 3.8, macOS 10.7+ x86-64
underthesea_core-1.0.4-cp37-none-win_amd64.whl (552.9 kB): CPython 3.7, Windows x86-64
underthesea_core-1.0.4-cp37-cp37m-manylinux2010_x86_64.whl (657.7 kB): CPython 3.7m, manylinux: glibc 2.12+ x86-64
underthesea_core-1.0.4-cp37-cp37m-macosx_10_7_x86_64.whl (601.2 kB): CPython 3.7m, macOS 10.7+ x86-64
