Sentence Segmentation with sequece tagging
Project description
DeepSegment: A sentence segmenter that actually works!
Note: For the original implementation please use the "master" branch of this repo.
The Demo for deepsegment (en) + deeppunct is available at http://bpraneeth.com/projects/deeppunct
Installation:
pip install --upgrade deepsegment
Supported languages:
en - english (Trained on data from various sources)
fr - french (Only Tatoeba data)
it - italian (Only Tatoeba data)
Usage:
from deepsegment import DeepSegment
# The default language is 'en'
segmenter = DeepSegment('en')
segmenter.segment('I am Batman i live in gotham')
# ['I am Batman', 'i live in gotham']
Using with tf serving docker image
docker pull bedapudi6788/deepsegment_en:v2
docker run -d -p 8500:8500 bedapudi6788/deepsegment_en:v2
from deepsegment import DeepSegment
# The default language is 'en'
segmenter = DeepSegment('en', tf_serving=True)
segmenter.segment('I am Batman i live in gotham')
# ['I am Batman', 'i live in gotham']
Training deepsegment on custom data: https://colab.research.google.com/drive/1CjYbdbDHX1UmIyvn7nDW2ClQPnnNeA_m
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
deepsegment-2.2.2.tar.gz
(6.1 kB
view hashes)
Built Distribution
Close
Hashes for deepsegment-2.2.2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 402eda88f9e14c70aee0ad5e6302c7f6fb63e8e9f1e8322f58f521501d7f94a1 |
|
MD5 | 06db6f614df733599ab71b441a830249 |
|
BLAKE2b-256 | 7083accf178a62a9ec924da4cd793d9e88165033b370333ddee53d5fa10f1e92 |