Skip to main content

Fast and Efficient Sentence Segmentation

Project description

Fast Sentence Segmentation (fast-sentence-segment)

Fast and Efficient Sentence Segmentation

Usage

from fast_sentence_segment import segment_text

results = segment_text(
    'here is a dr. who says something.  and then again, what else?  i dont know.  Do you?')

assert results == [
    [
        'here is a dr. who says something.',
        'and then again, what else?',
        'i dont know.',
        'Do you?'
    ]
]

Why use a double-scripted list?

The segementation process will segment into paragraphs and sentences. A paragraph is composed of 1..* sentences, hence each list of lists is equivalent to a paragraph.

This usage

results = segment_text(input_text, flatten=True)

Will return a list of strings, regardless of paragraph delimitation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fast-sentence-segment-0.1.9.tar.gz (6.1 kB view details)

Uploaded Source

Built Distribution

fast_sentence_segment-0.1.9-py3-none-any.whl (10.3 kB view details)

Uploaded Python 3

File details

Details for the file fast-sentence-segment-0.1.9.tar.gz.

File metadata

  • Download URL: fast-sentence-segment-0.1.9.tar.gz
  • Upload date:
  • Size: 6.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.13 CPython/3.8.5 Windows/10

File hashes

Hashes for fast-sentence-segment-0.1.9.tar.gz
Algorithm Hash digest
SHA256 438879f141ceb324205be34cdcddb30286009119368152adb9a03c993f7b78cb
MD5 ba3dae931bc2c10e284a951f884065b6
BLAKE2b-256 f6e24110c6a824a2a554e5e9e1a2a2de300c0a21f7ab418dca53ade860a9db2c

See more details on using hashes here.

File details

Details for the file fast_sentence_segment-0.1.9-py3-none-any.whl.

File metadata

File hashes

Hashes for fast_sentence_segment-0.1.9-py3-none-any.whl
Algorithm Hash digest
SHA256 92ce6096e2d9f16b643b7a7ba09be47f90d4c4699c0b4b02fd0dc1dd7e414e36
MD5 9ec260e74ee0cd6570c0b6e955113efc
BLAKE2b-256 38ad21d4889dde016cd6507d3e057ba18dc8ba12f4c2e0c7157a4f13e8508190

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page