Skip to main content

A Python Client for CKIP Chinese Word Segmentation System

Project description

CKIP Client

A Python client for the Chinese Word Segmentation System (see ckipsvr.iis.sinica.edu.tw) provided by Academia Sinica Chinese Knowledge and Information Processing (CKIP) Group.

Installation

Simply run tho following command:

pip install ckip-client

If pip is not available, you can also download it manually from PyPI.

Example

In order to use the Chinese Word Segmentation System, you should register an account on ckipsvr.iis.sinica.edu.tw. You will get the server’s IP address and port number when you finish the registration process.

Replace CKIP_* with your real arguments.

from ckipclient import CKIPClient

sample_text = '第二屆內地搖滾音樂祭將在 9 月 24 日登場!今年共有五十五組獨立樂團'\
              '參與演出,包含曾獲得金曲獎的「舒米恩」、「滅火器」等,陣容十分堅強。'

ckip = CKIPClient(CKIP_IP, CKIP_PORT, CKIP_USERNAME, CKIP_PASSWORD)
sample_results = ckip.segment(sample_text)

for sentence in sample_results:
    print('/'.join(' '.join(word) for word in sentence))

The code above will output:

第二 DET/屆 M/內地 N/搖滾 N/音樂 N/祭 Vt/將 ADV/在 P/9 DET/月 N/24 DET/日 M/登場 Vi/! EXCLAMATIONCATEGORY
今年 N/共有 Vt/五十五 DET/組 M/獨立 Vi/樂團 N/參與 Vt/演出 N/, COMMACATEGORY
包含 Vt/曾 ADV/獲得 Vt/金曲獎 N/的 T/「 PARENTHESISCATEGORY/舒米恩 N/」 PARENTHESISCATEGORY/、 PAUSECATEGORY/「 PARENTHESISCATEGORY/滅火器 N/」 PARENTHESISCATEGORY/等 POST/, COMMACATEGORY
陣容 N/十分 ADV/堅強 Vi/。 PERIODCATEGORY

For more details, please refer to the documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ckip-client-0.1.0.tar.gz (3.7 kB view details)

Uploaded Source

File details

Details for the file ckip-client-0.1.0.tar.gz.

File metadata

  • Download URL: ckip-client-0.1.0.tar.gz
  • Upload date:
  • Size: 3.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for ckip-client-0.1.0.tar.gz
Algorithm Hash digest
SHA256 9855b429c9ea9c78740641ac15e58cc887a554fc1a516e290c1762bd9fe98974
MD5 f4c9cd490c8ec64a5b95ccaa05513e13
BLAKE2b-256 e4037fc93c6d40ee2244e650de9ae14d8de76ae73f2df909dfb92486351a456b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page