Skip to main content

Collection of scripts to generate various TACL results and reports

Project description

tacl-extra provides scripts and libraries that make use of the TACL software.

Scripts provided are:

  • int-all: Generates extended and reduced intersect results files for every pair of texts in a supplied corpus.
  • jitc: Generates an HTML report showing the amount of overlap between a set of works, ignoring those parts that overlap with works in a second set of works.
  • lifetime: Generates results data and a report showing the lifetime of n-grams that come into or fall out of use in a group of corpora.
  • paternity: Generates a series of results files giving the n-grams in common between one corpus and each work in a second corpus, that are not present in a third corpus.

The actual work of the scripts is done in library code that can be imported and used by other code.

The code is developed at https://github.com/ajenhl/tacl-extra/ and the documentation is available at http://tacl-extra.readthedocs.io/en/latest/.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tacl-extra-1.0.1.tar.gz (19.6 kB view hashes)

Uploaded source

Built Distribution

tacl_extra-1.0.1-py3-none-any.whl (27.7 kB view hashes)

Uploaded py3

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page