Skip to main content

convert a long text to sentences

Project description

text2sentences

This module can be used by tts modules or others.

It split input text to sentences according the punctuation of native language, and detects each sentence's language id, sentence in a dialog or not(include in ""), start position in the source text, and if sentence is a new paragraph.

Dependents

and the fasttext dependents wget, so we also dependents on

wget is used to download the model from facebook

Installation

pip install fasttext-langdetect
pip install wget
git clone git@github.com:yumoqing/text2sentences
cd text2sentences
python setup.py install

Usage

see the test/test.py

Test

we only test English, Chinese

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

text2sentences-0.0.1.tar.gz (3.1 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page