Skip to main content
Donate to the Python Software Foundation or Purchase a PyCharm License to Benefit the PSF! Donate Now

Collection of scripts to generate various TACL results and reports

Project description

tacl-extra provides scripts and libraries that make use of the TACL software.

Scripts provided are:

  • int-all: Generates extended and reduced intersect results files for every pair of texts in a supplied corpus.
  • jitc: Generates an HTML report showing the amount of overlap between a set of works, ignoring those parts that overlap with works in a second set of works.
  • lifetime: Generates results data and a report showing the lifetime of n-grams that come into or fall out of use in a group of corpora.
  • paternity: Generates a series of results files giving the n-grams in common between one corpus and each work in a second corpus, that are not present in a third corpus.

The actual work of the scripts is done in library code that can be imported and used by other code.

The code is developed at https://github.com/ajenhl/tacl-extra/ and the documentation is available at http://tacl-extra.readthedocs.io/en/latest/.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
tacl_extra-1.0.1-py3-none-any.whl (27.7 kB) Copy SHA256 hash SHA256 Wheel py3
tacl-extra-1.0.1.tar.gz (19.6 kB) Copy SHA256 hash SHA256 Source None

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page