Sentence Segmentation with sequence tagging
Project description
Deep-Segmentation
Sentence Segmentation of un-punctuated text.
Placeholder for the code and pre-trained models for "DeepCorrection 1: Sentence Segmentation of unpunctuated text", as explained in the Medium post: https://medium.com/@praneethbedapudi/deepcorrection-1-sentence-segmentation-of-unpunctuated-text-a1dbc0db4e98
The pre-trained models are available at https://drive.google.com/open?id=1keUOKjloauUvhAhxErPMZjjkfA2tPnXH
The data is available at https://drive.google.com/open?id=1inDBFHZA8pKhVdFB-I4Vkk3tEuxzt6Dv
Requirements:
seqtag
from seqtag import predictor
from deepsegment import segment
# The config file is included in the pre-trained model zip. Update the model paths in the config file before loading.
# The complete GloVe embeddings are not needed for prediction, so "glove_path" can be left empty in the config file.
seqtag_model = predictor.load_model(path_to_config_file)
segment('I am Batman i live in gotham', seqtag_model)
# ['I am Batman', 'i live in gotham']
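The example above treats segmentation as a sequence-tagging problem: a tagger labels each word, and the labels are then turned into sentence boundaries. The following is a minimal sketch of that second step only, under stated assumptions. The helper `group_by_tags` and the tag names `B-sent` and `O` are hypothetical and chosen for illustration; they are not taken from the seqtag or deepsegment API, and the tags are written by hand rather than produced by a model.

```python
from typing import List


def group_by_tags(tokens: List[str], tags: List[str], start_tag: str = "B-sent") -> List[str]:
    """Join tokens into sentences, opening a new sentence at each start_tag.

    Hypothetical helper: the tag scheme is an assumption, not the library's.
    """
    if len(tokens) != len(tags):
        raise ValueError("tokens and tags must have the same length")

    sentences: List[List[str]] = []
    for token, tag in zip(tokens, tags):
        # Open a new sentence on a start tag, or if no sentence is open yet.
        if tag == start_tag or not sentences:
            sentences.append([token])
        else:
            sentences[-1].append(token)
    return [" ".join(words) for words in sentences]


tokens = "I am Batman i live in gotham".split()
# Hand-written stand-in for tagger output: "B-sent" marks a sentence's first word.
tags = ["B-sent", "O", "O", "B-sent", "O", "O", "O"]
print(group_by_tags(tokens, tags))  # ['I am Batman', 'i live in gotham']
```

Only the first word of each sentence carries a distinguishing tag, so the input text needs no punctuation or capitalization for the grouping to work.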
Download files
Source Distribution
deepsegment-1.0.0.tar.gz (3.2 kB)
Built Distribution
Hashes for deepsegment-1.0.0-py2.py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 307b4febfb38a02161efcbaf3340208ba1409b6416b7791533fb025b5c8d2001
MD5 | 6b623fda4d16eddc8da9ee933582b098
BLAKE2b-256 | fe4440dff2a47bb2d7c1dfe5d71fd53a104909bbf8ead47d1e6b376cb62e4d31