Skip to main content

asc-analyzer

Project description

ASC-analyzer

The ASC Analyzer extracts Argument Structure Constructions (ASCs) from raw English texts and computes indices related to ASC usage.

Installation

To ensure stability and compatibility, we recommend installing dependencies in the following order:

  1. Install spaCy:

    pip install spacy
    
  2. Install spaCy-transformers:

    pip install spacy-transformers
    
  3. Download the transformer-based spaCy model:

    python -m spacy download en_core_web_trf
    
  4. Install the ASC analyzer package:

    pip install asc-analyzer
    

Quickstart

Prepare a directory with .txt files (e.g., data/text/). Each file should contain plain English text.

Then run:

asc-analyzer \
  --input-dir data/text \
  --source cow \
  --print-asc \
  --save-asc-output

This command will:

  • Assign ASC tags to each sentence
  • Print the ASC-tagged results directly to the terminal (--print-asc)
  • Save token-level ASC tagging results as *_ASCinfo.txt files (--save-asc-output)
  • Compute ASC usage statistics (e.g., diversity, proportion, frequency, and verb–ASC association strength) and save them in a CSV summary file
  • The --source option determines which reference corpus is used for computing frequency and association measures:
    • cow: uses the COW corpus (web-based, written English)
    • subt: uses the SUBTLEX corpus (subtitle-based, spoken English)
    • Choose the source based on the register that best matches your input data.

Options

Option Description
--input-dir, -i Directory containing .txt files to process (default: asc_analyzer/data/test)
--output-csv, -o Path to save the resulting CSV (default: Written_COW.csv or Spoken_SubT.csv)
--source, -s Reference dataset: cow (written, default) or subt (spoken)
--indices, -x Comma-separated list of index names to include in the CSV (default: all standard indices)
--save-asc-output, -a Save ASC-tagged outputs as *_ASCinfo.txt in the input directory
--print-asc, -p Print ASC-tagged results to the terminal

Output for --print-asc

When using the --print-asc option, the output for each sentence shows aligned token information and its ASC label (None if no ASC applies):

# sent_id = 1
1	The	the	
2	idea	idea	
3	is	be	ATTR
4	trust	trust	

You can save this output to txt files by including --save-asc-output.

Citation

License

This project is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

See the full license here.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

asc_analyzer-0.0.7.tar.gz (12.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

asc_analyzer-0.0.7-py3-none-any.whl (13.4 MB view details)

Uploaded Python 3

File details

Details for the file asc_analyzer-0.0.7.tar.gz.

File metadata

  • Download URL: asc_analyzer-0.0.7.tar.gz
  • Upload date:
  • Size: 12.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.6

File hashes

Hashes for asc_analyzer-0.0.7.tar.gz
Algorithm Hash digest
SHA256 f928d5d570a289a44377f31270dc765501a84809aecb6e77853fe5b55c063628
MD5 f43b453baa6b2b478b3c3a06bc3e7e45
BLAKE2b-256 7ec6f40ee42656e08d9551d32cf38fc9620bd54b9d0868f1e53e42d266cda363

See more details on using hashes here.

File details

Details for the file asc_analyzer-0.0.7-py3-none-any.whl.

File metadata

  • Download URL: asc_analyzer-0.0.7-py3-none-any.whl
  • Upload date:
  • Size: 13.4 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.6

File hashes

Hashes for asc_analyzer-0.0.7-py3-none-any.whl
Algorithm Hash digest
SHA256 a2fcce1ef6ec7b6e957aca08aea80255520fa62f302a632b5b7caf2b6c6f2ce0
MD5 882c3c3cdf5cdab3012d910499a52fc4
BLAKE2b-256 e23d853bf26d337891e50a93f9702a0f89e0a557051cb5107633db3a59648b86

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page