Sentence Segmentation with sequece tagging
Project description
Deep-Segmentation
Sentence Segmentation of un-punctuated text.
Place holder for the code and pre-trained models for "DeepCorrection 1: Sentence Segmentation of unpunctuated text." as explained in the medium post at https://medium.com/@praneethbedapudi/deepcorrection-1-sentence-segmentation-of-unpunctuated-text-a1dbc0db4e98 .
The pre-trained models is available at https://drive.google.com/open?id=1keUOKjloauUvhAhxErPMZjjkfA2tPnXH
The data is available at https://drive.google.com/open?id=1inDBFHZA8pKhVdFB-I4Vkk3tEuxzt6Dv
Requirements:
seqtag
from deepsegment import DeepSegment
# the config file can be found at in the pre-trained model zip. Change the model paths in the config file before loading.
# Since the complete glove embeddings are not needed for predictions, "glove_path" can be left empty in config file
segmenter = DeepSegment('path_to_config')
segmenter.segment('I am Batman i live in gotham')
['I am Batman', 'i live in gotham']
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
deepsegment-1.0.1.tar.gz
(3.2 kB
view hashes)
Built Distribution
Close
Hashes for deepsegment-1.0.1-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1f09deef94e292dc822b0175b3066bbe14a8b5b1e95dff90244b87d5131babdc |
|
MD5 | da46d403a72c9ff5bca013c3c6a2a2d9 |
|
BLAKE2b-256 | f6de19dcbf899bd6e46216493d63ae8b8626dc49a572576d78090392fdfd9f9e |