Constructional Complexity Analyzer (CCA) is a tool designed to measure the constructional diversity, constructional elaboration, verb inventory size, and other construction-related indices of English texts.

Constructional Complexity Analyzer


Project Description

The Constructional Complexity Analyzer (CCA) is a tool designed to measure the constructional diversity, constructional elaboration, and verbal diversity of English texts.


Features

CCA outputs various scores for the following indices:

1. Constructional Diversity

Calculates the following diversity measures based on constructions:

  • Root TTR (RTTR; Guiraud, 1960)
  • Herdan’s C (or log TTR; Chotlos, 1944; Herdan, 1960)
  • Maas’ score (Maas, 1972)
  • Moving-average type-token ratio (MATTR; Covington & McFall, 2010)
  • Hypergeometric distribution diversity index (HD-D; McCarthy & Jarvis, 2007)
  • Measure of textual lexical diversity (MTLD-Original; McCarthy, 2005; McCarthy & Jarvis, 2010)
  • Moving-average bidirectional measure of textual lexical diversity (MTLD-MA-BI; McCarthy & Jarvis, 2010)
  • Moving-average wrapped measure of textual lexical diversity (MT
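
To make the simpler indices above concrete, here is a minimal sketch of how RTTR, Herdan's C, Maas' a² and MATTR could be computed over a sequence of construction labels instead of words. It is an illustration only, not CCA's own implementation: the construction labels, the function names, the natural-log base used for Maas, and the window size are all assumptions made for this example.

```python
import math

def rttr(tokens):
    """Root TTR: types / sqrt(tokens)."""
    return len(set(tokens)) / math.sqrt(len(tokens))

def herdan_c(tokens):
    """Herdan's C (log TTR): log(types) / log(tokens). Needs more than one token."""
    if len(tokens) < 2:
        raise ValueError("Herdan's C is undefined for fewer than two tokens")
    return math.log(len(set(tokens))) / math.log(len(tokens))

def maas_a2(tokens):
    """Maas' a^2: (log N - log V) / (log N)^2, natural log assumed here."""
    if len(tokens) < 2:
        raise ValueError("Maas' score is undefined for fewer than two tokens")
    n, v = len(tokens), len(set(tokens))
    return (math.log(n) - math.log(v)) / (math.log(n) ** 2)

def mattr(tokens, window=50):
    """Moving-average TTR: mean TTR over every window of fixed size."""
    if len(tokens) <= window:
        # Text shorter than the window: fall back to plain TTR.
        return len(set(tokens)) / len(tokens)
    ttrs = [
        len(set(tokens[i:i + window])) / window
        for i in range(len(tokens) - window + 1)
    ]
    return sum(ttrs) / len(ttrs)

# Hypothetical construction labels extracted from one text (8 tokens, 4 types).
constructions = [
    "transitive", "intransitive", "transitive", "ditransitive",
    "transitive", "copular", "intransitive", "transitive",
]

print(round(rttr(constructions), 3))
print(round(herdan_c(constructions), 3))
print(round(maas_a2(constructions), 3))
print(round(mattr(constructions, window=4), 3))
```

Note that Maas' score depends on the logarithm base, so values are only comparable when the same base is used throughout.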

2. Constructional Elaboration

Calculates the mean number of dependents (excluding punctuation marks) for individual constructions.
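
As a rough illustration of this index, the sketch below averages the number of non-punctuation dependents per construction type. The input format (a construction label paired with the dependency labels of its dependents), the function name, and the sample data are assumptions made for this example. The `punct` label is the one spaCy's English models use for punctuation, but how CCA itself identifies constructions and their dependents is not shown here.

```python
from collections import defaultdict

def mean_dependents(instances, punct_label="punct"):
    """Mean number of non-punctuation dependents per construction label.

    `instances` is an iterable of (construction_label, dependency_labels)
    pairs, one pair per construction instance found in the text.
    """
    totals = defaultdict(int)   # summed dependent counts per label
    counts = defaultdict(int)   # number of instances per label
    for label, deps in instances:
        totals[label] += sum(1 for dep in deps if dep != punct_label)
        counts[label] += 1
    return {label: totals[label] / counts[label] for label in counts}

# Hypothetical instances: each pair is (construction label, dependents' labels).
instances = [
    ("transitive", ["nsubj", "dobj", "punct"]),
    ("transitive", ["nsubj", "dobj", "advmod", "punct"]),
    ("intransitive", ["nsubj", "punct"]),
]

print(mean_dependents(instances))
```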

3. Verb Inventory Size

Calculates log10(type frequency of verbs + 1) for individual constructions.
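
The formula can be sketched as follows, reading "type frequency of verbs" as the number of distinct verb lemmas attested in a construction. The input format, function name, and sample data are hypothetical and only serve to illustrate the arithmetic.

```python
import math
from collections import defaultdict

def verb_inventory_size(pairs):
    """log10(number of distinct verbs + 1) for each construction label.

    `pairs` is an iterable of (construction_label, verb_lemma) tuples.
    """
    verbs = defaultdict(set)
    for label, lemma in pairs:
        verbs[label].add(lemma)
    return {label: math.log10(len(lemmas) + 1) for label, lemmas in verbs.items()}

# Hypothetical (construction, verb lemma) observations from one text.
pairs = [
    ("transitive", "eat"),
    ("transitive", "read"),
    ("transitive", "eat"),      # repeated verb: counted once as a type
    ("intransitive", "sleep"),
]

print(verb_inventory_size(pairs))
```

Here the transitive construction has two verb types, giving log10(3), and the intransitive construction has one, giving log10(2).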


Installation

1. Install spaCy

pip install spacy
python -m spacy download en_core_web_sm
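
If you want to confirm that this step succeeded before moving on, a small check like the one below can help. It is only a convenience sketch using the Python standard library: the helper name is made up for this example, and it simply tests whether the `spacy` package and the `en_core_web_sm` model package can be found in the current environment.

```python
import importlib.util

def missing_packages(names):
    """Return the top-level package names that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]

# spaCy itself and the English model downloaded in the step above.
required = ["spacy", "en_core_web_sm"]
missing = missing_packages(required)

if missing:
    print("Not installed:", ", ".join(missing))
else:
    print("spaCy and en_core_web_sm are available.")
```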

2. Install Constructional Complexity Analyzer

pip install constructional_complexity_analyzer

3. Run the Analyzer

constructional-analysis /path/to/text_files

Citation

Hwang, H., & Kim, H. (2023). Automatic analysis of constructional diversity as a predictor of EFL students’ writing proficiency. Applied Linguistics, 44(1), 127–147. https://doi.org/10.1093/applin/amac046

Hwang, H. (in revision). Are the diversity measurement approaches proven effective for lexicon also suitable for syntax? Evaluating reliability and validity of ten constructional diversity indices. Journal of Second Language Writing.


License

This project is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0).

Download files

Source Distribution

constructional_complexity_analyzer-0.3.0.tar.gz (9.9 kB view details)

File details

Details for the file constructional_complexity_analyzer-0.3.0.tar.gz.

File hashes

Hashes for constructional_complexity_analyzer-0.3.0.tar.gz

Algorithm    Hash digest
SHA256       8865044cc749c6669e74ecfa61ceb69ee2d0db9395e3e4344d21180e8990c2ad
MD5          4cc6e2d97ce7d59a0222c5bd22836b36
BLAKE2b-256  8e1d32b52ac6952be37b2b475a5049318fedbe42b3e5beaaa6de27ae32923ec1
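
A downloaded file can be checked against the SHA256 digest listed above with a few lines of standard-library Python. This is a generic sketch, not part of CCA: the function name and chunk size are arbitrary choices for this example.

```python
import hashlib

def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

For example, calling `sha256_of_file("constructional_complexity_analyzer-0.3.0.tar.gz")` on the downloaded source distribution should return the SHA256 value listed for that file; a mismatch suggests a corrupted or altered download.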

File details

Details for the file constructional_complexity_analyzer-0.3.0-py3-none-any.whl.

File hashes

Hashes for constructional_complexity_analyzer-0.3.0-py3-none-any.whl

Algorithm    Hash digest
SHA256       5029a265d628168d3fb7b83c1f9b9942c7314e2f075fb2f6c3ae575d770a4370
MD5          ad571a6d2619040b2fbbf3ba952df17d
BLAKE2b-256  b3c03de8e5d2ca7d81b95d4df8ebe8d18d755a3b386c4dcf7465b54911f8ef33
