kuzukiri

A simple text segmenter

What's this?

This is a Python library for segmenting Japanese text into sentences.

Features

  • Text segmentation by simple rules:
    • rule-based, with no machine learning,
    • so results are predictable.
  • Comparatively fast: it is written in Rust.
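
To illustrate what "rule-based" means here, the following is a minimal pure-Python sketch of splitting on sentence-ending punctuation. It is not kuzukiri's implementation or rule set: the function name `split_sentences` and the single regular-expression rule are assumptions made for illustration only.

```python
import re
from typing import List

# Hypothetical rule (not taken from kuzukiri): a sentence is a run of
# non-terminator characters followed by one or more terminators (。！？)
# and any closing brackets or quotes. Trailing text with no terminator
# is kept as a final sentence.
_SENTENCE = re.compile(r"[^。！？]*[。！？]+[」』）]*|[^。！？]+$")


def split_sentences(text: str) -> List[str]:
    """Split text into sentences using one fixed regular-expression rule."""
    return [m.group(0) for m in _SENTENCE.finditer(text) if m.group(0).strip()]


print(split_sentences("これはテストです。文分割します。"))
```

Because the rule is fixed, the same input always produces the same output, which is the sense in which a rule-based segmenter's results are predictable.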

Install

From PyPI

pip install kuzukiri

From source code

pip install setuptools-rust
python setup.py install

Usage

import kuzukiri

segmenter = kuzukiri.Segmenter()
text = "これはテストです。文分割します。"
sentences = segmenter.split(text)
print(sentences)  # => ['これはテストです。', '文分割します。']

For details, see the examples and tests directories.
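
When the input spans several lines, one option is to apply a splitter line by line. The sketch below assumes nothing about kuzukiri beyond the `split` call shown in Usage. The helper `split_lines` and the stand-in `demo_split` are hypothetical names, and the README does not say how kuzukiri itself treats newlines.

```python
from typing import Callable, List


def split_lines(text: str, split: Callable[[str], List[str]]) -> List[str]:
    """Apply a sentence splitter to each non-empty line of text."""
    sentences: List[str] = []
    for line in text.splitlines():
        line = line.strip()
        if line:  # skip blank lines between paragraphs
            sentences.extend(split(line))
    return sentences


# Stand-in splitter so the sketch runs without kuzukiri installed.
def demo_split(line: str) -> List[str]:
    return [part + "。" for part in line.split("。") if part]


print(split_lines("これはテストです。\n\n文分割します。次の文です。", demo_split))
```

With kuzukiri installed, `kuzukiri.Segmenter().split` from the Usage example can be passed as the `split` argument in place of `demo_split`.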

License

MIT

Download files

Source Distribution

  • kuzukiri-0.1.3.tar.gz (5.0 kB), source

Built Distributions

  • kuzukiri-0.1.3-cp310-cp310-manylinux1_x86_64.whl (1.1 MB), CPython 3.10
  • kuzukiri-0.1.3-cp310-cp310-macosx_12_0_x86_64.whl (290.8 kB), CPython 3.10, macOS 12.0+ x86-64
  • kuzukiri-0.1.3-cp39-cp39-manylinux1_x86_64.whl (1.1 MB), CPython 3.9
  • kuzukiri-0.1.3-cp39-cp39-macosx_12_0_x86_64.whl (290.9 kB), CPython 3.9, macOS 12.0+ x86-64
  • kuzukiri-0.1.3-cp38-cp38-manylinux1_x86_64.whl (1.1 MB), CPython 3.8
  • kuzukiri-0.1.3-cp38-cp38-macosx_12_0_x86_64.whl (291.0 kB), CPython 3.8, macOS 12.0+ x86-64
  • kuzukiri-0.1.3-cp37-cp37m-manylinux1_x86_64.whl (1.1 MB), CPython 3.7m
  • kuzukiri-0.1.3-cp37-cp37m-macosx_12_0_x86_64.whl (291.2 kB), CPython 3.7m, macOS 12.0+ x86-64
