Skip to main content

The baikal nlp python client library

Project description

What is this?

baikalnlpy is the python 3 library for baikal NLP.

Baikal NLP is a Korean NLP, which provides tokenizing, POS tagging for Korean.

How to install

pip3 install baikalnlpy

How to get baikal NLP

  • Go to https://license.baikal.ai/.
    • With registration, for the first time, you can get a free license for 3 months.
    • If you are a student or a researcher, you can get also a free license for 1 year, which is able to renew after 1 year.
  • Or use docker image.
docker pull baikalai/baikal-nlp:v1.7.3

How to use

import sys
import google.protobuf.text_format as tf
from baikalnlpy import Tagger

# If you have your own localhost baikal NLP. 
my_tagger = Tagger('localhost')
# or if you have your own baikal NLP which is running on 10.8.3.211:15656.
my_tagger = Tagger('10.8.3.211', 15656)
# or with smaller public cloud instance, it may be slow. It is free.
tagger = Tagger()

# print results. 
res = tagger.tags(["안녕하세요.", "반가워요!"])

# get protobuf message.
m = res.msg()
tf.PrintMessage(m, out=sys.stdout, as_utf8=True)
print(tf.MessageToString(m, as_utf8=True))
print(f'length of sentences is {len(m.sentences)}')
## output : 2
print(f'length of tokens in sentences[0] is {len(m.sentences[0].tokens)}')
print(f'length of morphemes of first token in sentences[0] is {len(m.sentences[0].tokens[0].morphemes)}')
print(f'lemma of first token in sentences[0] is {m.sentences[0].tokens[0].lemma}')
print(f'first morph of first token in sentences[0] is {m.sentences[0].tokens[0].morphemes[0]}')
print(f'tag of first morph of first token in sentences[0] is {m.sentences[0].tokens[0].morphemes[0].tag}')
# print number

# get json object
jo = res.as_json()
print(jo)

# get tuple of pos tagging.
pa = res.pos()
print(pa)
# another methods
ma = res.morphs()
print(ma)
na = res.nouns()
print(na)
va = res.verbs()
print(va)

# custom dictionary
cust_dic = tagger.custom_dict("my")
cust_dic.copy_np_set({'내고유명사', '우리집고유명사'})
cust_dic.copy_cp_set({'코로나19'})
cust_dic.copy_cp_caret_set({'코로나^백신', '"독감^백신'})
cust_dic.update()

# laod prev custom dict
cust_dict2 = tagger.custom_dict("my")
cust_dict2.load()

tagger.set_domain('my')
tagger.pos('코로나19는 언제 끝날까요?')

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

baikalnlpy-1.1.0.tar.gz (10.6 kB view details)

Uploaded Source

Built Distribution

baikalnlpy-1.1.0-py3-none-any.whl (11.6 kB view details)

Uploaded Python 3

File details

Details for the file baikalnlpy-1.1.0.tar.gz.

File metadata

  • Download URL: baikalnlpy-1.1.0.tar.gz
  • Upload date:
  • Size: 10.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.14 CPython/3.7.13 Darwin/21.6.0

File hashes

Hashes for baikalnlpy-1.1.0.tar.gz
Algorithm Hash digest
SHA256 ac65b54945babb25b7562abc0e25017a5eed2c053be8041247e0e1dbdd384144
MD5 5dda3f93c05d46d9d1347e34fb857816
BLAKE2b-256 01c73f191e7d9647bfbe3d05b3d97f5c6c1e90b30089fc70d49edcda54a5aff6

See more details on using hashes here.

File details

Details for the file baikalnlpy-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: baikalnlpy-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 11.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.14 CPython/3.7.13 Darwin/21.6.0

File hashes

Hashes for baikalnlpy-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 71aec4938394680e03b278c61d3f78e6137e95fd0cba19d3eb58a4124d205f80
MD5 d2a9bc781ca0f593c81b7c3457dbcc31
BLAKE2b-256 d29c7baf0bec3dad846ad1535c803a8e0ce2f2fe018e0dcd484f9aeedea825b6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page