Skip to main content

Collection of scripts to generate various TACL results and reports

Project description

tacl-extra provides scripts and libraries that make use of the TACL software.

Scripts provided are:

  • int-all: Generates extended and reduced intersect results files for every pair of texts in a supplied corpus.
  • jitc: Generates an HTML report showing the amount of overlap between a set of works, ignoring those parts that overlap with works in a second set of works.
  • lifetime: Generates results data and a report showing the lifetime of n-grams that come into or fall out of use in a group of corpora.
  • paternity: Generates a series of results files giving the n-grams in common between one corpus and each work in a second corpus, that are not present in a third corpus.

The actual work of the scripts is done in library code that can be imported and used by other code.

The code is developed at https://github.com/ajenhl/tacl-extra/ and the documentation is available at http://tacl-extra.readthedocs.io/en/latest/.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for tacl-extra, version 1.0.1
Filename, size File type Python version Upload date Hashes
Filename, size tacl_extra-1.0.1-py3-none-any.whl (27.7 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size tacl-extra-1.0.1.tar.gz (19.6 kB) File type Source Python version None Upload date Hashes View

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page