Sentence Segmentation with sequece tagging
Project description
Deep-Segmentation
Sentence Segmentation of un-punctuated text.
Place holder for the code and pre-trained models for "DeepCorrection 1: Sentence Segmentation of unpunctuated text." as explained in the medium post at https://medium.com/@praneethbedapudi/deepcorrection-1-sentence-segmentation-of-unpunctuated-text-a1dbc0db4e98 .
The pre-trained models is available at https://drive.google.com/open?id=1keUOKjloauUvhAhxErPMZjjkfA2tPnXH
The data is available at https://drive.google.com/open?id=1inDBFHZA8pKhVdFB-I4Vkk3tEuxzt6Dv
Requirements:
seqtag
from deepsegment import DeepSegment
# the config file can be found at in the pre-trained model zip. Change the model paths in the config file before loading.
# Since the complete glove embeddings are not needed for predictions, "glove_path" can be left empty in config file
segmenter = DeepSegment('path_to_config')
segmenter.segment('I am Batman i live in gotham')
['I am Batman', 'i live in gotham']
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
deepsegment-1.0.2.tar.gz
(3.2 kB
view hashes)
Built Distribution
Close
Hashes for deepsegment-1.0.2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 45f208eb95a5865b14a0826ae3cfc880d3c1c6e71d22139f6f9a185c85ae54f4 |
|
MD5 | ab3a25523c41e1aeade62d5014716b1c |
|
BLAKE2b-256 | c9c870b6acde420556b21d0b046925eeaba6eace9541904788946d2d977e2564 |