Skip to main content

AITutor-AssessmentKit is the first open-source toolkit designed to evaluate the pedagogical performance of AI tutors in student mistake remediation tasks. With the growing capabilities of large language models (LLMs), this library provides a systematic approach to assess their teaching potential across multiple dimensions in educational dialogues.

Project description

AITutor-AssessmentKit: An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues


The AITutor-AssessmentKit is the first open-source library to enable the pedagogical abilities assessment of large language model (LLM)-powered AI tutors in educational dialogues. This unified framework:

  • Evaluates AI tutor responses across eight comprehensive dimensions in the context of student error remediation tasks in mathematics.
  • Offers a pluggable and customizable interface for integrating models and LLM releases from the community.

By providing an efficient, scalable alternative to costly and subjective human evaluations, AITutor-AssessmentKit facilitates on-the-fly assessment of AI tutors.


📥 Installation

To install AITutor-AssessmentKit with pip, type:

pip install aitutor-assessmentkit

📚 Overview of the AITutor-AssessmentKit

The library comprises three modular components:

  1. autoeval: For automated evaluation.
  2. llmeval: For LLM-based evaluation.
  3. visualizer: For visualization and interpretation of evaluation scores.


📖 Tutorials

We provide several resources to help you get started:

  1. Tutorial Notebook-I: Automated Evaluation with AITutor-AssessmentKit
  2. Tutorial Notebook-II: LLM-Based Evaluation with AITutor-AssessmentKit
  3. Tutorial Notebook-III: Visualizing AI Tutor Performance with AITutor-AssessmentKit

For a quick overview, check out our Demo Notebook.


🚀 Next Steps

Here are the upcoming milestones for the project:

  1. Create and release detailed documentation.
  2. Publish a longer tutorial video.
  3. Develop a GUI-friendly interaction mode.

📜 Citation

If you use AITutor-AssessmentKit in your research, please cite us:

@inproceedings{maurya2024aitutorassessmentkit,
  title={AITutor-AssessmentKit: An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues},
  author={Kaushal Kumar Maurya and Ekaterina Kochmar},
  year={2024}
}

📧 Contact

For any questions or support, feel free to reach out:

📧 Kaushal Kumar Maurya: Kaushal.Maurya@mbzuai.ac.ae


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aitutor_assessmentkit-0.1.8.tar.gz (41.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aitutor_assessmentkit-0.1.8-py3-none-any.whl (57.6 kB view details)

Uploaded Python 3

File details

Details for the file aitutor_assessmentkit-0.1.8.tar.gz.

File metadata

  • Download URL: aitutor_assessmentkit-0.1.8.tar.gz
  • Upload date:
  • Size: 41.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.5

File hashes

Hashes for aitutor_assessmentkit-0.1.8.tar.gz
Algorithm Hash digest
SHA256 7456ae1ff177d737325b4079c00bb12b60c8e54ae07da93aaa1aa766deb56ba0
MD5 7ade88341c0f0c46f9fa1a9953b47d35
BLAKE2b-256 aa39bf4c5bffaef9f36e42c8f5f24675a7fc646bcae77b4a33bff5634ddb87a1

See more details on using hashes here.

File details

Details for the file aitutor_assessmentkit-0.1.8-py3-none-any.whl.

File metadata

File hashes

Hashes for aitutor_assessmentkit-0.1.8-py3-none-any.whl
Algorithm Hash digest
SHA256 fff918b3c4cfff7da58981d2b7ee54e8b9126cac6d31165b1391efb41f895e39
MD5 3294c7a47f4a5d0075ae18c2a941bbb0
BLAKE2b-256 57e9a10caae7dde8d6863b6d3ef354a04c5f97d27ae11b3379d168badc216951

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page