Fast and Efficient Sentence Segmentation
Fast Sentence Segmentation (fast-sentence-segment)
Usage
from fast_sentence_segment import segment_text

results = segment_text(
    'here is a dr. who says something. and then again, what else? i dont know. Do you?')

# One paragraph (outer list) containing four sentences (inner list)
assert results == [
    [
        'here is a dr. who says something.',
        'and then again, what else?',
        'i dont know.',
        'Do you?'
    ]
]
Why use a nested (double-subscripted) list?
The segmentation process splits text into paragraphs and then into sentences. A paragraph is composed of 1..* sentences, so each inner list corresponds to one paragraph and the outer list holds all paragraphs.
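As an illustration of that structure, the sketch below walks a nested result and labels each sentence with its paragraph and sentence index. It does not call the library: the `results` literal is copied from the usage example above, and `describe` is a hypothetical helper introduced only for this example.

```python
# Nested result in the shape shown in the usage example:
# outer list = paragraphs, inner lists = sentences.
results = [
    [
        'here is a dr. who says something.',
        'and then again, what else?',
        'i dont know.',
        'Do you?',
    ]
]


def describe(paragraphs):
    """Return (paragraph_index, sentence_index, sentence) triples.

    Hypothetical helper, not part of fast-sentence-segment.
    """
    return [
        (p_idx, s_idx, sentence)
        for p_idx, sentences in enumerate(paragraphs)
        for s_idx, sentence in enumerate(sentences)
    ]


rows = describe(results)
for p_idx, s_idx, sentence in rows:
    print(f"paragraph {p_idx}, sentence {s_idx}: {sentence}")
```

Indexing works the same way: `results[0]` is the first paragraph and `results[0][1]` is its second sentence.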
Calling the function with flatten=True
results = segment_text(input_text, flatten=True)
returns a flat list of strings, regardless of paragraph boundaries.
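To make the effect of flatten=True concrete, here is a minimal sketch that flattens a nested result by hand. It assumes flattening simply concatenates the per-paragraph sentence lists in order; the library's internal implementation may differ. `flatten_paragraphs` and the sample data are hypothetical and not part of fast-sentence-segment.

```python
from itertools import chain


def flatten_paragraphs(paragraphs):
    """Concatenate the sentence lists of all paragraphs, preserving order.

    Hypothetical stand-in for what flatten=True is assumed to do.
    """
    return list(chain.from_iterable(paragraphs))


# Made-up nested result with two paragraphs.
nested = [
    ['First sentence.', 'Second sentence.'],
    ['A new paragraph.'],
]

flat = flatten_paragraphs(nested)
print(flat)
```

The nested form is useful when paragraph boundaries matter downstream; the flat form is simpler when only the sentences are needed.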