Skip to main content

Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder

Project description

Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.

This module allows splitting of text paragraphs into sentences. It is based on scripts developed by Philipp Koehn and Josh Schroeder for processing the Europarl corpus.

The module is a port of Lingua::Sentence Perl module with some extra additions (improved non-breaking prefix lists for some languages and added support for Danish, Finnish, Lithuanian, Norwegian (Bokmål), Romanian, and Turkish).

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sentence_splitter-1.4.tar.gz (30.6 kB view details)

Uploaded Source

Built Distribution

sentence_splitter-1.4-py2.py3-none-any.whl (45.0 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file sentence_splitter-1.4.tar.gz.

File metadata

  • Download URL: sentence_splitter-1.4.tar.gz
  • Upload date:
  • Size: 30.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.29.1 CPython/3.5.6

File hashes

Hashes for sentence_splitter-1.4.tar.gz
Algorithm Hash digest
SHA256 3d1d773d07cc733ca2955aa87d0fa1c0a7274c6bdeec1daac5c5e92efb512f63
MD5 a58c1f759d0b8ce0fe1e0c99b150ecb1
BLAKE2b-256 20b386b431fe7002ba006c08b8559d2ad78e1153bfc515a453cc96d2f55a2c40

See more details on using hashes here.

File details

Details for the file sentence_splitter-1.4-py2.py3-none-any.whl.

File metadata

  • Download URL: sentence_splitter-1.4-py2.py3-none-any.whl
  • Upload date:
  • Size: 45.0 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.29.1 CPython/3.5.6

File hashes

Hashes for sentence_splitter-1.4-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 5645a3ad9c348e4287f4bc73bd573d92dccd4139042fddd51fff0591f1376763
MD5 2997a3de186228e9d434f92bceb751ec
BLAKE2b-256 4aae3bd609c760d57849d7ddf223762f1881f3c4df6467f4eadb3a33652b7e0d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page