A streamlined and approximate implementation of the LexRank algorithm for rapid text summarization.

Project description

FastLexRank

LexRank for large-scale data

The original LexRank implementation uses the power method to compute the eigenvector of the normalized similarity matrix associated with the eigenvalue 1. In the foundational paper, Erkan and Radev [1] show that the normalized similarity matrix is a stochastic matrix and that, when the corresponding Markov chain is irreducible and aperiodic, the power method converges to its stationary distribution.
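
The power iteration described above can be sketched as follows. This is a minimal illustration and not the package's code: the function name `power_method_scores`, the tolerance, the iteration cap, and the toy similarity values are all assumptions made for the example, and the damping and thresholding options of the original paper are left out.

```python
def power_method_scores(similarity, tol=1e-10, max_iter=1000):
    """Stationary distribution of the row-normalized similarity matrix.

    `similarity` is a square list of lists with positive row sums.
    (Hypothetical helper for illustration; not part of FastLexRank.)
    """
    n = len(similarity)
    # Row-normalize so every row sums to 1 (a row-stochastic matrix).
    transition = []
    for row in similarity:
        row_sum = sum(row)
        transition.append([value / row_sum for value in row])

    scores = [1.0 / n] * n  # start from the uniform distribution
    for _ in range(max_iter):
        # One power-method step: scores <- transition^T @ scores.
        updated = [
            sum(transition[i][j] * scores[i] for i in range(n))
            for j in range(n)
        ]
        # Stop once the L1 change between iterations is negligible.
        if sum(abs(u - s) for u, s in zip(updated, scores)) < tol:
            return updated
        scores = updated
    return scores


# Toy symmetric similarity matrix for three sentences (made-up values).
toy_similarity = [
    [1.0, 0.5, 0.2],
    [0.5, 1.0, 0.4],
    [0.2, 0.4, 1.0],
]
toy_scores = power_method_scores(toy_similarity)
print(toy_scores)
```

Each iteration touches every entry of the n × n matrix, which is the cost the rest of this section is concerned with.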

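However, a key challenge with the original LexRank algorithm is its dependence on the power method, which often requires multiple iterations to converge. Each iteration multiplies the n × n similarity matrix by a vector, and the matrix itself grows quadratically with the number of sentences n, so for a large corpus this matrix multiplication can become a bottleneck and slow the computation considerably.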

To address this issue, we introduce an approximate approach that efficiently computes a score for each sentence while preserving the essential property of relative centrality. The modified method makes LexRank calculations significantly faster and still delivers reliable results.
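
The text above does not say how the approximation is computed, so the following is only a sketch of one way such a shortcut can work, not a description of FastLexRank's actual method. It relies on a known property: for a symmetric, non-negative similarity matrix (no threshold, no damping, connected graph), the stationary distribution of the row-normalized matrix is proportional to the row sums. If similarities are dot products of sentence vectors, those row sums can be obtained without ever building the n × n matrix. The function name `row_sum_scores` and the toy vectors are hypothetical.

```python
def row_sum_scores(embeddings):
    """Normalized row sums of the dot-product similarity matrix.

    `embeddings` is a list of equal-length, non-negative sentence vectors.
    The n x n similarity matrix is never materialized: the cost is
    O(n * d) instead of O(n^2 * d).
    (Hypothetical helper for illustration; not part of FastLexRank.)
    """
    dim = len(embeddings[0])
    # Sum all sentence vectors once.
    total = [sum(vec[k] for vec in embeddings) for k in range(dim)]
    # e_i . total = sum_j (e_i . e_j), which is the i-th row sum.
    raw = [sum(a * b for a, b in zip(vec, total)) for vec in embeddings]
    norm = sum(raw)
    return [value / norm for value in raw]


# Toy unit-length sentence vectors (made-up values).
toy_embeddings = [
    [1.0, 0.0],
    [0.6, 0.8],
    [0.0, 1.0],
]
toy_scores = row_sum_scores(toy_embeddings)
print(toy_scores)
```

In this toy input the middle vector is the most similar to the other two, so it receives the highest score. With embeddings that can produce negative similarities, the proportionality argument no longer holds exactly and the scores should be treated as a heuristic.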

Reference

[1] Erkan, G., & Radev, D. R. (2004). LexRank: Graph-based lexical centrality as salience in text summarization. Journal of Artificial Intelligence Research, 22, 457-479.

Release history

0.1.4 (Nov 13, 2023), this version
0.1.3 (Nov 2, 2023)
0.1.2 (Nov 1, 2023)

Download files

Source Distribution

fastlexrank-0.1.4.tar.gz (7.1 kB), uploaded Nov 13, 2023

Built Distribution

fastlexrank-0.1.4-py3-none-any.whl (7.8 kB, Python 3), uploaded Nov 13, 2023

Hashes for fastlexrank-0.1.4.tar.gz

Algorithm	Hash digest
SHA256	`a8515081d4cd709cbc98f9de945a9a3164b370781468393b7be2a4f094f1b02e`
MD5	`323866cffc5eb0187da9c95a6c1ece84`
BLAKE2b-256	`934deab247c8d7330bc2c3b5ad77de01f2a4776e2806bd5ae156cbb504d66510`

Hashes for fastlexrank-0.1.4-py3-none-any.whl

Algorithm	Hash digest
SHA256	`7eec850b844cdbded316bb7474d2705220b666b0069fcfa10ee3381c70b04d38`
MD5	`cf0908157c0fe3670e3e9a88d5f7c6f9`
BLAKE2b-256	`98abe080420ee9618d88f1b984d6163edae596df1eda029435ecce7a3b7241ce`