Skip to main content

AITutor-AssessmentKit is the first open-source toolkit designed to evaluate the pedagogical performance of AI tutors in student mistake remediation tasks. With the growing capabilities of large language models (LLMs), this library provides a systematic approach to assess their teaching potential across multiple dimensions in educational dialogues.

Project description

AITutor-AssessmentKit: An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues


The AITutor-AssessmentKit is the first open-source library to enable the pedagogical abilities assessment of large language model (LLM)-powered AI tutors in educational dialogues. This unified framework:

  • Evaluates AI tutor responses across eight comprehensive dimensions in the context of student error remediation tasks in mathematics.
  • Offers a pluggable and customizable interface for integrating models and LLM releases from the community.

By providing an efficient, scalable alternative to costly and subjective human evaluations, AITutor-AssessmentKit facilitates on-the-fly assessment of AI tutors.


📥 Installation

To install AITutor-AssessmentKit with pip, type:

pip install aitutor-assessmentkit

📚 Overview of the AITutor-AssessmentKit

The library comprises three modular components:

  1. autoeval: For automated evaluation.
  2. llmeval: For LLM-based evaluation.
  3. visualizer: For visualization and interpretation of evaluation scores.


📖 Tutorials

We provide several resources to help you get started:

  1. Tutorial Notebook-I: Automated Evaluation with AITutor-AssessmentKit
  2. Tutorial Notebook-II: LLM-Based Evaluation with AITutor-AssessmentKit
  3. Tutorial Notebook-III: Visualizing AI Tutor Performance with AITutor-AssessmentKit

For a quick overview, check out our Demo Notebook.


🚀 Next Steps

Here are the upcoming milestones for the project:

  1. Create and release detailed documentation.
  2. Publish a longer tutorial video.
  3. Develop a GUI-friendly interaction mode.

📜 Citation

If you use AITutor-AssessmentKit in your research, please cite us:

@inproceedings{maurya2024aitutorassessmentkit,
  title={AITutor-AssessmentKit: An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues},
  author={Kaushal Kumar Maurya and Ekaterina Kochmar},
  year={2024}
}

📧 Contact

For any questions or support, feel free to reach out:

📧 Kaushal Kumar Maurya: Kaushal.Maurya@mbzuai.ac.ae


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aitutor_assessmentkit-0.1.7.tar.gz (41.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aitutor_assessmentkit-0.1.7-py3-none-any.whl (57.6 kB view details)

Uploaded Python 3

File details

Details for the file aitutor_assessmentkit-0.1.7.tar.gz.

File metadata

  • Download URL: aitutor_assessmentkit-0.1.7.tar.gz
  • Upload date:
  • Size: 41.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.5

File hashes

Hashes for aitutor_assessmentkit-0.1.7.tar.gz
Algorithm Hash digest
SHA256 c50facc94e89f26c1fe97bca452deedda1c44e62ce52a3b0d3c8eb36c40af47e
MD5 eb697ab054f2b23ce81e229a1dba2750
BLAKE2b-256 73dffc3359aef0ed04b4db90c35e6393d6bf9003f2ee8fa0cb4bee096074ad3e

See more details on using hashes here.

File details

Details for the file aitutor_assessmentkit-0.1.7-py3-none-any.whl.

File metadata

File hashes

Hashes for aitutor_assessmentkit-0.1.7-py3-none-any.whl
Algorithm Hash digest
SHA256 4f6c43823bcd80732edb63356e9c4520bc281037ae4f326b71e8f25161ecc178
MD5 28ac7d635e9df143c931855b498883b5
BLAKE2b-256 855aec4ce13fa779987e9d3a3b51de4c4043610d733a18144edbbc979b817280

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page