Skip to main content

Constructional Sophistication Analyzer (CCA) is a tool designed to measure the constructional diversity, constructional sophistication, and verbal diversity of English texts.

Project description

Constructional Complexity Analyzer


Project Description

The Constructional Complexity Analyzer (CCA) is a tool designed to measure the constructional diversity, constructional elaboration, and verbal diversity of English texts.


Features

CCA outputs various scores for the following indices:

1. Constructional Diversity

Calculates the following diversity measures based on constructions:

  • Root TTR (RTTR; Guiraud, 1960)
  • Herdan’s C (or log TTR; Chotlos, 1944; Herdan, 1960)
  • Maas’ score (Maas, 1972)
  • Moving-average type-token ratio (MATTR; Covington & McFall, 2010)
  • Hypergeometric distribution diversity index (HD-D; McCarthy & Jarvis, 2007)
  • Measure of textual lexical diversity (MTLD-Original; McCarthy, 2005; McCarthy & Jarvis, 2010)
  • Moving-average bidirectional measure of textual lexical diversity (MTLD-MA-BI; McCarthy & Jarvis, 2010)
  • Moving-average wrapped measure of textual lexical diversity (MT

2. Constructional Elaboration

Calculates the mean number of dependents (except punctuation marks) for individual constructions

3. Verb Inventory Size

Calculates log 10 [type frequency of verbs + 1 ] for individual constructions


Installation

1. Install spaCy

pip install spacy
python -m spacy download en_core_web_sm

2. Install Constructional Complexity Analyzer

pip install constructional_complexity_analyzer

3. Run the Analyzer

constructional-analysis /path/to/text_files

Citation

Hwang, H., & Kim, H. (2023). Automatic analysis of constructional diversity as a predictor of EFL students’ writing proficiency. Applied Linguistics, 44(1), 127–147. https://doi.org/10.1093/applin/amac046

Hwang, H. (in revision). Are the diversity measurement approaches proven effective for lexicon also suitable for syntax? Evaluating reliability and validity of ten constructional diversity indices. Journal of Second Language Writing.


License

This project is licensed under the
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

constructional_complexity_analyzer-0.1.1.tar.gz (10.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file constructional_complexity_analyzer-0.1.1.tar.gz.

File metadata

File hashes

Hashes for constructional_complexity_analyzer-0.1.1.tar.gz
Algorithm Hash digest
SHA256 85a33e3452137110c5856c23a1939a5d239d0b2b63db744874b3c225d1c2631f
MD5 319a4445e6a4ff0157d9f1185763fc8a
BLAKE2b-256 c463e934b30beb2bcfa602fbacb962d26dc0429276274d0fcda0a15a136968c3

See more details on using hashes here.

File details

Details for the file constructional_complexity_analyzer-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for constructional_complexity_analyzer-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 8791e39e3a09b972e091f1f18d9371c85f9f30ed38b56d0973e85452e5a6f01c
MD5 80a7189f50b1c8f661beb4ff05c958ed
BLAKE2b-256 11a032668c60e64540de5235c2556a33cc363032e88154f63077fd9af31de15d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page