AITutor-AssessmentKit is the first open-source toolkit designed to evaluate the pedagogical performance of AI tutors in student mistake remediation tasks. With the growing capabilities of large language models (LLMs), this library provides a systematic approach to assess their teaching potential across multiple dimensions in educational dialogues.
Project description
AITutor-AssessmentKit: An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues
The AITutor-AssessmentKit is the first open-source library to enable the pedagogical abilities assessment of large language model (LLM)-powered AI tutors in educational dialogues. This unified framework:
- Evaluates AI tutor responses across eight comprehensive dimensions in the context of student error remediation tasks in mathematics.
- Offers a pluggable and customizable interface for integrating models and LLM releases from the community.
By providing an efficient, scalable alternative to costly and subjective human evaluations, AITutor-AssessmentKit facilitates on-the-fly assessment of AI tutors.
📥 Installation
To install AITutor-AssessmentKit with pip, type:
pip install aitutor-assessmentkit
📚 Overview of the AITutor-AssessmentKit
The library comprises three modular components:
autoeval: For automated evaluation.llmeval: For LLM-based evaluation.visualizer: For visualization and interpretation of evaluation scores.
📖 Tutorials
We provide several resources to help you get started:
- Tutorial Notebook-I: Automated Evaluation with AITutor-AssessmentKit
- Tutorial Notebook-II: LLM-Based Evaluation with AITutor-AssessmentKit
- Tutorial Notebook-III: Visualizing AI Tutor Performance with AITutor-AssessmentKit
For a quick overview, check out our Demo Notebook.
🚀 Next Steps
Here are the upcoming milestones for the project:
- Create and release detailed documentation.
- Publish a longer tutorial video.
- Develop a GUI-friendly interaction mode.
📜 Citation
If you use AITutor-AssessmentKit in your research, please cite us:
@inproceedings{maurya2024aitutorassessmentkit,
title={AITutor-AssessmentKit: An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues},
author={Kaushal Kumar Maurya and Ekaterina Kochmar},
year={2024}
}
📧 Contact
For any questions or support, feel free to reach out:
📧 Kaushal Kumar Maurya: Kaushal.Maurya@mbzuai.ac.ae
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file aitutor_assessmentkit-0.1.8.tar.gz.
File metadata
- Download URL: aitutor_assessmentkit-0.1.8.tar.gz
- Upload date:
- Size: 41.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7456ae1ff177d737325b4079c00bb12b60c8e54ae07da93aaa1aa766deb56ba0
|
|
| MD5 |
7ade88341c0f0c46f9fa1a9953b47d35
|
|
| BLAKE2b-256 |
aa39bf4c5bffaef9f36e42c8f5f24675a7fc646bcae77b4a33bff5634ddb87a1
|
File details
Details for the file aitutor_assessmentkit-0.1.8-py3-none-any.whl.
File metadata
- Download URL: aitutor_assessmentkit-0.1.8-py3-none-any.whl
- Upload date:
- Size: 57.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fff918b3c4cfff7da58981d2b7ee54e8b9126cac6d31165b1391efb41f895e39
|
|
| MD5 |
3294c7a47f4a5d0075ae18c2a941bbb0
|
|
| BLAKE2b-256 |
57e9a10caae7dde8d6863b6d3ef354a04c5f97d27ae11b3379d168badc216951
|