Skip to main content

Constructional Sophistication Analyzer (CCA) is a tool designed to measure the constructional diversity, constructional sophistication, and verbal diversity of English texts.

Project description

Constructional Complexity Analyzer


Project Description

The Constructional Complexity Analyzer (CCA) is a tool designed to measure the constructional diversity, constructional elaboration, and verbal diversity of English texts.


Features

CCA outputs various scores for the following indices:

1. Constructional Diversity

Calculates the following diversity measures based on constructions:

  • Root TTR (RTTR; Guiraud, 1960)
  • Herdan’s C (or log TTR; Chotlos, 1944; Herdan, 1960)
  • Maas’ score (Maas, 1972)
  • Moving-average type-token ratio (MATTR; Covington & McFall, 2010)
  • Hypergeometric distribution diversity index (HD-D; McCarthy & Jarvis, 2007)
  • Measure of textual lexical diversity (MTLD-Original; McCarthy, 2005; McCarthy & Jarvis, 2010)
  • Moving-average bidirectional measure of textual lexical diversity (MTLD-MA-BI; McCarthy & Jarvis, 2010)
  • Moving-average wrapped measure of textual lexical diversity (MT

2. Constructional Elaboration

Calculates the mean number of dependents (except punctuation marks) for individual constructions

3. Verb Inventory Size

Calculates log 10 [type frequency of verbs + 1 ] for individual constructions


Installation

1. Install spaCy

pip install spacy
python -m spacy download en_core_web_sm

2. Install Constructional Complexity Analyzer

pip install constructional_complexity_analyzer

3. Run the Analyzer

constructional-analysis /path/to/text_files

Citation

Hwang, H., & Kim, H. (2023). Automatic analysis of constructional diversity as a predictor of EFL students’ writing proficiency. Applied Linguistics, 44(1), 127–147. https://doi.org/10.1093/applin/amac046

Hwang, H. (in revision). Are the diversity measurement approaches proven effective for lexicon also suitable for syntax? Evaluating reliability and validity of ten constructional diversity indices. Journal of Second Language Writing.


License

This project is licensed under the
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

constructional_complexity_analyzer-0.2.0.tar.gz (9.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file constructional_complexity_analyzer-0.2.0.tar.gz.

File metadata

File hashes

Hashes for constructional_complexity_analyzer-0.2.0.tar.gz
Algorithm Hash digest
SHA256 6aa27abf88e0519a1abfde596f4ce7dc1ed5cda45e5b0d47cc99c15dc100ef3c
MD5 e70d92b43a50f0ce436dc76423c6039e
BLAKE2b-256 8cc7b40e96274f14bf8433884a2c33de8bb823aeabf30b4883c30d7b8fb0e252

See more details on using hashes here.

File details

Details for the file constructional_complexity_analyzer-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for constructional_complexity_analyzer-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6694b3da3ab5f05fefacfc8bf74994c42d4d8743fa3c8e4e167a5fae1ec8d774
MD5 abf95b7c0843f4105118a03a640d1b5d
BLAKE2b-256 4f305a2b2c0df5153721e4909b40072b73bf99620728da20b4c6da1d871457c9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page