Collection of scripts to generate various TACL results and reports
tacl-extra provides scripts and libraries that make use of the TACL software.
Scripts provided are:
int-all: Generates extended and reduced intersect results files for every pair of texts in a supplied corpus.
jitc: Generates an HTML report showing the amount of overlap between a set of works, ignoring those parts that overlap with works in a second set of works.
lifetime: Generates results data and a report showing the lifetime of n-grams that come into or fall out of use in a group of corpora.
paternity: Generates a series of results files giving the n-grams that one corpus shares with each work in a second corpus and that are not present in a third corpus.
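To make the paternity description concrete, here is a minimal sketch of the underlying set logic in plain Python. It does not use TACL or tacl-extra, and it is not how the script is implemented. The function names (`ngrams`, `corpus_ngrams`, `shared_but_absent`), the whitespace tokenisation, and the representation of a corpus as a list of token lists are all assumptions made for illustration.

```python
from typing import Dict, Iterable, List, Set, Tuple

NGram = Tuple[str, ...]


def ngrams(tokens: List[str], n: int) -> Set[NGram]:
    """Return the set of n-grams of length n found in a token list."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def corpus_ngrams(corpus: Iterable[List[str]], n: int) -> Set[NGram]:
    """Return the union of the n-grams of every work in a corpus."""
    result: Set[NGram] = set()
    for tokens in corpus:
        result |= ngrams(tokens, n)
    return result


def shared_but_absent(
    first: Iterable[List[str]],
    second: Dict[str, List[str]],
    third: Iterable[List[str]],
    n: int,
) -> Dict[str, Set[NGram]]:
    """For each work in `second`, return the n-grams it shares with the
    `first` corpus that do not occur anywhere in the `third` corpus."""
    first_set = corpus_ngrams(first, n)
    third_set = corpus_ngrams(third, n)
    return {
        name: (first_set & ngrams(tokens, n)) - third_set
        for name, tokens in second.items()
    }


# Hypothetical toy data: each work is a whitespace-tokenised string.
first = ["a b c d e".split()]
second = {"w1": "x b c d y".split(), "w2": "c d e z".split()}
third = ["q c d r".split()]

result = shared_but_absent(first, second, third, n=2)
for name, found in sorted(result.items()):
    print(name, sorted(found))
```

In this toy data, w1 keeps only the bigram (b, c) and w2 only (d, e), because (c, d) also occurs in the third corpus and is therefore excluded from both.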
The actual work of the scripts is done in library code that can be imported and used by other code.
The code is developed at https://github.com/ajenhl/tacl-extra/ and the documentation is available at http://tacl-extra.readthedocs.io/en/latest/.
Hashes for tacl_extra-1.1.0-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | ceb78b27a2c777b342d066a76e794ade82e4314779a5af16fcea75fcf663ad79
MD5 | 4a5676b93bdfab22df51551deb1b9b5f
BLAKE2b-256 | e9f6fbd910e8144a2714446603a0ee27ea3b308af7e47b013b0000c09c1bb7f5
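The SHA256 digest listed for the wheel can be checked against a downloaded copy. The following is a minimal sketch using only the Python standard library; the helper name `sha256_of` and the assumption that the wheel sits in the current directory under its published file name are illustrative.

```python
import hashlib
from pathlib import Path

# Digest as published for tacl_extra-1.1.0-py3-none-any.whl.
EXPECTED_SHA256 = "ceb78b27a2c777b342d066a76e794ade82e4314779a5af16fcea75fcf663ad79"


def sha256_of(path: Path, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Assumed location of the downloaded wheel; adjust as needed.
wheel = Path("tacl_extra-1.1.0-py3-none-any.whl")
if wheel.exists():
    status = "OK" if sha256_of(wheel) == EXPECTED_SHA256 else "MISMATCH"
    print(f"{wheel.name}: {status}")
else:
    print(f"{wheel.name} not found; nothing to verify")
```

Reading in chunks keeps memory use constant regardless of file size, which matters little for a small wheel but makes the helper reusable for larger downloads.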