Skip to main content

Italian ATS Evaluator

Project description

italian-ats-evalautor

This is an open source project to evaluate the performance of an italian ATS (Automatic Text Simplifier) on a set of texts.

You can analyze a single text extracting the following features:

  • Overall:
    • Number of tokens
    • Number of tokens (including punctuation)
    • Number of characters
    • Number of characters (including punctuation)
    • Number of words
    • Number of syllables
    • Number of unique lemmas
    • Number of sentences
  • Readability:
    • Type-Token Ratio (TTR)
    • Gulpease Index
    • Flesch-Vacca Index
    • Lexical Density
  • Part of Speech (POS) distribution
  • Verbs distribution
    • Active Verbs
    • Passive Verbs
  • Italian Basic Vocabulary (NVdB) from Il Nuovo vocabolario di base della lingua italiana, Tullio De Mauro
    • All
    • FO (Fundamentals)
    • AU (High Usage)
    • AD (High Availability)

You can also compare two texts and get the following metrics:

  • Semantic:
    • Semantic Similarity
  • Character diff:
    • Edit Distance
  • Token diff:
    • Amount of tokens added
    • Amount of tokens removed
    • Amount of VdB tokens removed
    • Amount of VdB tokens added

Installation

pip install italian-ats-evaluator

Usage

from italian_ats_evaluator import TextAnalyzer

result = TextAnalyzer(
  text="Il gatto mangia il topo"
)
from italian_ats_evaluator import SimplificationAnalyzer

result =  SimplificationAnalyzer(
    reference_text="Il felino mangia il roditore",
    simplified_text="Il gatto mangia il topo"
)

Development

Create a virtual environment

python3 -m venv venv
source venv/bin/activate

Install the package in editable mode

pip install -e .

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Acknowledgements

This project is part of the research project "VerbACxSS: su verbi analitici, complessità, verbi sintetici, e semplificazione. Per l’accessibilità." funded by the Italian Ministry of University and Research (MUR) under the PRIN 2020 program.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

italian_ats_evaluator-2.0.5.tar.gz (34.1 kB view hashes)

Uploaded Source

Built Distribution

italian_ats_evaluator-2.0.5-py3-none-any.whl (35.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page