AITutor-AssessmentKit is the first open-source toolkit designed to evaluate the pedagogical performance of AI tutors in student mistake remediation tasks. With the growing capabilities of large language models (LLMs), this library provides a systematic approach to assess their teaching potential across multiple dimensions in educational dialogues.

AITutor-AssessmentKit: An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues


AITutor-AssessmentKit is the first open-source library for assessing the pedagogical abilities of large language model (LLM)-powered AI tutors in educational dialogues. This unified framework:

  • Evaluates AI tutor responses across eight comprehensive dimensions in the context of student error remediation tasks in mathematics.
  • Offers a pluggable and customizable interface for integrating models and LLM releases from the community.

By providing an efficient, scalable alternative to costly and subjective human evaluations, AITutor-AssessmentKit facilitates on-the-fly assessment of AI tutors.


📥 Installation

To install AITutor-AssessmentKit with pip, type:

pip install aitutor-assessmentkit
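
To confirm that the install succeeded, one option is to query the installed distribution's metadata from Python. The sketch below uses only the standard library; the helper name `installed_version` is hypothetical and is not part of AITutor-AssessmentKit.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent.

    Hypothetical helper for illustration; not part of AITutor-AssessmentKit.
    """
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


# The distribution name matches the name used in the pip command above.
version = installed_version("aitutor-assessmentkit")
if version is None:
    print("aitutor-assessmentkit is not installed in this environment")
else:
    print(f"aitutor-assessmentkit {version} is installed")
```

If the first message is printed, the package was most likely installed into a different Python environment than the one running the check.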

📚 Overview of the AITutor-AssessmentKit

The library comprises three modular components:

  1. autoeval: For automated evaluation.
  2. llmeval: For LLM-based evaluation.
  3. visualizer: For visualization and interpretation of evaluation scores.
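
As a quick sanity check, the three components listed above can be looked up without calling any of their functions. This is a minimal sketch under two assumptions that the text above does not confirm: that the import name is `aitutor_assessmentkit`, and that each component is exposed as a submodule with the listed name. The helper `find_components` is hypothetical; consult the tutorial notebooks for the actual usage.

```python
from importlib.util import find_spec
from typing import Dict, Iterable

# Assumption: the import name mirrors the distribution name with underscores.
PACKAGE = "aitutor_assessmentkit"
# Component names as listed in the overview; their module layout is assumed.
COMPONENTS = ("autoeval", "llmeval", "visualizer")


def find_components(package: str, names: Iterable[str]) -> Dict[str, bool]:
    """Report which submodules of `package` can be located.

    Hypothetical helper for illustration. Note that find_spec imports the
    parent package in order to search it for the submodule.
    """
    found: Dict[str, bool] = {}
    for name in names:
        try:
            found[name] = find_spec(f"{package}.{name}") is not None
        except ImportError:
            # Raised (as ModuleNotFoundError) when the parent package is missing.
            found[name] = False
    return found


for component, available in find_components(PACKAGE, COMPONENTS).items():
    status = "found" if available else "not found"
    print(f"{PACKAGE}.{component}: {status}")
```

A "not found" result only means the assumed module path does not exist in this environment; it does not by itself indicate a broken install.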


📖 Tutorials

We provide several resources to help you get started:

  1. Tutorial Notebook-I: Automated Evaluation with AITutor-AssessmentKit
  2. Tutorial Notebook-II: LLM-Based Evaluation with AITutor-AssessmentKit
  3. Tutorial Notebook-III: Visualizing AI Tutor Performance with AITutor-AssessmentKit

For a quick overview, check out our Demo Notebook.


🚀 Next Steps

Here are the upcoming milestones for the project:

  1. Create and release detailed documentation.
  2. Publish a longer tutorial video.
  3. Develop a GUI-friendly interaction mode.

📜 Citation

If you use AITutor-AssessmentKit in your research, please cite us:

@inproceedings{maurya2024aitutorassessmentkit,
  title={AITutor-AssessmentKit: An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues},
  author={Kaushal Kumar Maurya and Ekaterina Kochmar},
  year={2024}
}

📧 Contact

For any questions or support, feel free to reach out:

📧 Kaushal Kumar Maurya: Kaushal.Maurya@mbzuai.ac.ae

