A Longer Version of Pegasus TF Model For Abstractive Summarization

Project description

This package adds Longformer self-attention to the base Pegasus abstractive summarization model in order to raise its input token limit and improve its performance.

Pegasus is a large Transformer-based encoder-decoder model with a pre-training objective designed for abstractive summarization. This objective, called "Gap Sentence Generation" (GSG), masks important sentences in a document and trains the model to generate those gap-sentences from the remaining text.

Longformer is a Transformer that replaces full self-attention, whose cost grows quadratically with input length, with an attention mechanism that scales linearly with the input sequence length. As a result, Longformer can process sequences of up to 4,096 tokens, 8 times longer than BERT, which is limited to 512 tokens.

This package plugs Longformer's attention mechanism into Pegasus so that it can perform abstractive summarization on long documents. The base modules are built on TensorFlow.
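The difference between quadratic and linear scaling can be illustrated with a small counting sketch. It is not this package's API and does not use TensorFlow. The function names and the `window` parameter are hypothetical, and it models only a local sliding-window pattern, assumed here as the main source of Longformer's linear cost. The global attention that Longformer adds for selected tokens is left out.

```python
# Illustrative sketch only: names and the `window` parameter are assumptions,
# not part of LongPegasus, Pegasus, or Longformer.

def full_attention_pairs(seq_len):
    """Count (query, key) pairs scored by full self-attention: n * n."""
    return seq_len * seq_len


def sliding_window_mask(seq_len, window):
    """Boolean mask where token i may attend to token j only if
    |i - j| <= window // 2 (the token itself is always included)."""
    half = window // 2
    return [[abs(i - j) <= half for j in range(seq_len)]
            for i in range(seq_len)]


def sliding_window_pairs(seq_len, window):
    """Count (query, key) pairs allowed by the sliding-window pattern,
    without materialising the full mask."""
    half = window // 2
    total = 0
    for i in range(seq_len):
        lo = max(0, i - half)            # leftmost key this query can see
        hi = min(seq_len - 1, i + half)  # rightmost key this query can see
        total += hi - lo + 1
    return total


if __name__ == "__main__":
    # Compare the two counts at a few input lengths with a fixed window.
    for n in (512, 1024, 2048, 4096):
        print(n, full_attention_pairs(n), sliding_window_pairs(n, 512))
```

With a fixed window, each additional token adds a constant number of pairs, so the windowed count grows in proportion to the input length, while the full-attention count quadruples each time the length doubles.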

Download files

Source Distribution

LongPegasus-0.3.tar.gz (3.7 kB)

Uploaded Source

File details

Details for the file LongPegasus-0.3.tar.gz.

File metadata

  • Download URL: LongPegasus-0.3.tar.gz
  • Upload date:
  • Size: 3.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.5.0.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.3

File hashes

Hashes for LongPegasus-0.3.tar.gz
Algorithm     Hash digest
SHA256        89724595186b36de9989fe2307db0c12b7fa9a3a761030817b205bd9722f6f7f
MD5           d0cf722366e9b3b57c8717dd9aedce6d
BLAKE2b-256   41be0b1c00041a52a3d9d8748b453587033c4d2a5ccec79a2e6bf3cce9b39381
