Module for conversational and text summarization and much more
Project description
A python module for conversation and text summarization and much more exciting features.
Features provided by this module:
- Text Segmentation using:
- TextTiling with Block Score
- TextTiling with Vocabulary introduction
- Conversational summarization using:
- Cluster Rank
- Text summarization using :
- TextRank
- Random
Installation
Make sure you have Python 2.7/3.3+ and pip(Windows, Linux) installed. Run simply (preferred way):
$ [sudo] pip install readless
Or for the latest version in development:
$ [sudo] pip install git+git://github.com/adichat/read-less.git
ReadLess API
You can use readless like a library in your project.
For quickly summarizing a conversation using ClusterRank algorithm:
# -*- coding: utf8 -*- from readless.Summarization import clusterrank summarizer = clusterrank.ClusterRank() pathToFile = "C:/conversation.in" summary = summarizer.summarizeFile(pathToFile)
For segmenting a text using TextTiling algorithm:
# -*- coding: utf8 -*- from readless.Segmentation import texttiling segmentation = texttiling.TextTiling() pathToFile = "C:/conversation.in" segmentedText = segmentation.segmentFile(pathToFile)
For a detailed list of other API functionalities, see **ReadLess Documentation**.
Contributions
All contributions are welcomed. This module is in development and there are several scopes of improvement. Tests are to be implemented along with other Summarization algorithms with support for web page summarization. For upcoming features, see Future developments.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Filename, size | File type | Python version | Upload date | Hashes |
---|---|---|---|---|
Filename, size readless-1.0.17.zip (14.2 kB) | File type Source | Python version None | Upload date | Hashes View |