AITutor-AssessmentKit is the first open-source toolkit designed to evaluate the pedagogical performance of AI tutors in student mistake remediation tasks. With the growing capabilities of large language models (LLMs), this library provides a systematic approach to assess their teaching potential across multiple dimensions in educational dialogues.

AITutor-AssessmentKit: An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues


AITutor-AssessmentKit is the first open-source library for assessing the pedagogical abilities of large language model (LLM)-powered AI tutors in educational dialogues. This unified framework:

  • Evaluates AI tutor responses across eight comprehensive dimensions in the context of student error remediation tasks in mathematics.
  • Offers a pluggable and customizable interface for integrating models and LLM releases from the community.

By providing an efficient, scalable alternative to costly and subjective human evaluations, AITutor-AssessmentKit facilitates on-the-fly assessment of AI tutors.


📥 Installation

To install AITutor-AssessmentKit with pip, type:

pip install aitutor-assessmentkit
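
To confirm that the install succeeded, one option is to query the installed distribution's version from Python. The following is a minimal sketch that uses only the standard library. The helper name `installed_version` is ours for illustration and is not part of AITutor-AssessmentKit.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Distribution name taken from the pip command above.
    found = installed_version("aitutor-assessmentkit")
    print(found if found else "aitutor-assessmentkit is not installed")
```

If the call returns None, the package is not visible to the interpreter you ran, which usually means pip installed it into a different environment.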

📚 Overview of the AITutor-AssessmentKit

The library comprises three modular components:

  1. autoeval: For automated evaluation.
  2. llmeval: For LLM-based evaluation.
  3. visualizer: For visualization and interpretation of evaluation scores.


📖 Tutorials

We provide several resources to help you get started:

  1. Tutorial Notebook-I: Automated Evaluation with AITutor-AssessmentKit
  2. Tutorial Notebook-II: LLM-Based Evaluation with AITutor-AssessmentKit
  3. Tutorial Notebook-III: Visualizing AI Tutor Performance with AITutor-AssessmentKit

For a quick overview, check out our Demo Notebook.


🚀 Next Steps

Here are the upcoming milestones for the project:

  1. Create and release detailed documentation.
  2. Publish a longer tutorial video.
  3. Develop a GUI-friendly interaction mode.

📜 Citation

If you use AITutor-AssessmentKit in your research, please cite us:

@inproceedings{maurya2024aitutorassessmentkit,
  title={AITutor-AssessmentKit: An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues},
  author={Kaushal Kumar Maurya and Ekaterina Kochmar},
  year={2024}
}

📧 Contact

For any questions or support, feel free to reach out:

📧 Kaushal Kumar Maurya: Kaushal.Maurya@mbzuai.ac.ae


Download files

Download the file for your platform.

Source Distribution

aitutor_assessmentkit-0.1.6.tar.gz (41.3 kB)

Uploaded Source

Built Distribution

aitutor_assessmentkit-0.1.6-py3-none-any.whl (57.6 kB)

Uploaded Python 3

File details

Details for the file aitutor_assessmentkit-0.1.6.tar.gz.

File metadata

  • Download URL: aitutor_assessmentkit-0.1.6.tar.gz
  • Size: 41.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.5

File hashes

Hashes for aitutor_assessmentkit-0.1.6.tar.gz
  • SHA256: cf25d111b419b85804253a67a28589b1d529d70082e90725c2c13ae74f917ddf
  • MD5: d146f3fd9f92e7b448803c2c3fe21da1
  • BLAKE2b-256: a348396536ea46d78fdca15a722887b413a1b3fba4c0b0b150296527dbdb217b

See more details on using hashes here.
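
To check a downloaded archive against its published SHA256 digest, a sketch along the following lines may help. It uses only the standard library `hashlib` module. The helper names `sha256_of` and `matches_expected` are illustrative, and the script assumes the source distribution has been saved to the current directory under its published file name.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for aitutor_assessmentkit-0.1.6.tar.gz.
EXPECTED_SHA256 = "cf25d111b419b85804253a67a28589b1d529d70082e90725c2c13ae74f917ddf"


def sha256_of(path, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with an expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Assumed local path: the archive sits in the current directory.
    archive = Path("aitutor_assessmentkit-0.1.6.tar.gz")
    if archive.exists():
        print("OK" if matches_expected(archive, EXPECTED_SHA256) else "MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

Reading in chunks keeps memory use flat regardless of archive size. The same helpers work for the wheel if the expected digest is swapped for the one listed under that file.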

File details

Details for the file aitutor_assessmentkit-0.1.6-py3-none-any.whl.

File hashes

Hashes for aitutor_assessmentkit-0.1.6-py3-none-any.whl
  • SHA256: 5fe1cb905072419258d014c911849cf617956139c02ccdcb53f06b876ddd59de
  • MD5: 41892d37d2c6e6a74cf9efd12b1cd04a
  • BLAKE2b-256: 5451d9777642cd2f911dfe2009ea2283d794cfa16b8e38962f638e4b86ba31e3

