Skip to main content

asc-analyzer

Project description

ASC-analyzer

The ASC Analyzer extracts Argument Structure Constructions (ASCs) from raw English texts and computes indices related to ASC usage.

Installation

pip install asc-analyzer

Quickstart

Prepare a directory with .txt files (e.g., data/text/). Each file should contain plain English text.

Then run:

asc-analyzer \
  --input-dir data/text \
  --source cow \
  --print-asc \
  --save-asc-output

This command will:

  • Assign ASC tags to each sentence
  • Print the ASC-tagged results directly to the terminal (--print-asc)
  • Save token-level ASC tagging results as *_ASCinfo.txt files (--save-asc-output)
  • Compute ASC usage statistics (e.g., diversity, proportion, frequency, and verb–ASC association strength) and save them in a CSV summary file
  • The --source option determines which reference corpus is used for computing frequency and association measures:
    • cow: uses the COW corpus (web-based, written English)
    • subt: uses the SUBTLEX corpus (subtitle-based, spoken English)
    • Choose the source based on the register that best matches your input data.

Options

Option Description
--input-dir, -i Input folder with .txt files (default: data/test)
--output-csv, -o Path to save output CSV (default: data/Written_<source>.csv)
--source, -s Reference dataset: cow (written, default) or subt (spoken)
--indices, -x Comma-separated list of indices to include in CSV
--save-asc-output Save ASC-tagged outputs for each file (*_ASCinfo.txt)
--print-asc Print ASC-tagged output to terminal

Output for --print-asc

When using the --print-asc option, the output for each sentence shows aligned token information and its ASC label (None if no ASC applies):

# sent_id = 1
1	The	the	
2	idea	idea	
3	is	be	ATTR
4	trust	trust	

You can save this output to txt files by including --save-asc-output.

Citation

License

This project is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

See the full license here.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

asc_analyzer-0.0.5.tar.gz (12.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

asc_analyzer-0.0.5-py3-none-any.whl (13.4 MB view details)

Uploaded Python 3

File details

Details for the file asc_analyzer-0.0.5.tar.gz.

File metadata

  • Download URL: asc_analyzer-0.0.5.tar.gz
  • Upload date:
  • Size: 12.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.6

File hashes

Hashes for asc_analyzer-0.0.5.tar.gz
Algorithm Hash digest
SHA256 c8f922f1718df4a9e9be87aa900553d6435a28f821f06c13e2b95cd69cc85254
MD5 b18603a4c566334b9fcb52ba591aaa2a
BLAKE2b-256 72afc2a008e87daf3637ee5af6b0253cb84bc6a90c69af9ef2ed1c3727b7f3c9

See more details on using hashes here.

File details

Details for the file asc_analyzer-0.0.5-py3-none-any.whl.

File metadata

  • Download URL: asc_analyzer-0.0.5-py3-none-any.whl
  • Upload date:
  • Size: 13.4 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.6

File hashes

Hashes for asc_analyzer-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 c8acf62ebdf6317d52366ff994b9e9e88e86af00a30b009518de910ef49ae2d0
MD5 67abe47a056ef076271bc9462d35e490
BLAKE2b-256 72d6996848f4285887cee47c45a378a408ede9ad6a4f8a32296b1546c4a44e7f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page