AITutor-AssessmentKit is the first open-source toolkit designed to evaluate the pedagogical performance of AI tutors in student mistake remediation tasks. With the growing capabilities of large language models (LLMs), this library provides a systematic approach to assess their teaching potential across multiple dimensions in educational dialogues.

Project description

# AITutor-AssessmentKit: An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues

---

AITutor-AssessmentKit is the first open-source library for assessing the pedagogical abilities of large language model (LLM)-powered AI tutors in educational dialogues. This unified framework:

- Evaluates AI tutor responses across eight comprehensive dimensions in the context of student error remediation tasks in mathematics.
- Offers a pluggable and customizable interface for integrating models and LLM releases from the community.

By providing an efficient, scalable alternative to costly and subjective human evaluations, AITutor-AssessmentKit facilitates on-the-fly assessment of AI tutors.

---

## 📥 Installation

To install AITutor-AssessmentKit with pip, run:

    pip install aitutor-assessmentkit

---

## 📚 Overview of the AITutor-AssessmentKit

The library comprises three modular components:

1. autoeval: For automated evaluation.
2. llmeval: For LLM-based evaluation.
3. visualizer: For visualization and interpretation of evaluation scores.

---

## 📖 Tutorials

We provide several resources to help you get started:

1. Tutorial Notebook-I: Automated Evaluation with AITutor-AssessmentKit
2. Tutorial Notebook-II: LLM-Based Evaluation with AITutor-AssessmentKit
3. Tutorial Notebook-III: Visualizing AI Tutor Performance with AITutor-AssessmentKit

For a quick overview, check out our Demo Notebook.

---

## 🚀 Next Steps

Here are the upcoming milestones for the project:

1. Create and release detailed documentation.
2. Publish a longer tutorial video.
3. Develop a GUI-friendly interaction mode.

---

## 📜 Citation

If you use AITutor-AssessmentKit in your research, please cite us:

    @inproceedings{maurya2024aitutorassessmentkit,
      title={AITutor-AssessmentKit: An Open-Source Library to Measure Pedagogical Ability of AI Tutors in Educational Dialogues},
      author={Kaushal Kumar Maurya and Ekaterina Kochmar},
      year={2024}
    }

---

## 📧 Contact

For any questions or support, feel free to reach out:

📧 Kaushal Kumar Maurya: Kaushal.Maurya@mbzuai.ac.ae

---

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aitutor_assessmentkit-0.1.4.tar.gz (29.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aitutor_assessmentkit-0.1.4-py3-none-any.whl (45.2 kB view details)

Uploaded Python 3
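
The wheel file name above encodes the distribution name, version, and compatibility tags, separated by hyphens. As a rough illustration of that naming convention, the following sketch splits such a name into its parts. The helper `parse_wheel_filename` is hypothetical and written only for this example; it is not part of AITutor-AssessmentKit or pip.

```python
def parse_wheel_filename(filename):
    """Split a wheel file name into its hyphen-separated components.

    Hypothetical helper for illustration only. Wheel names follow the pattern
    {name}-{version}(-{build})?-{python tag}-{abi tag}-{platform tag}.whl
    """
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")

    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        name, version, python_tag, abi_tag, platform_tag = parts
        build = None  # the build tag is optional and absent here
    elif len(parts) == 6:
        name, version, build, python_tag, abi_tag, platform_tag = parts
    else:
        raise ValueError(f"unexpected number of components in {filename!r}")

    return {
        "name": name,
        "version": version,
        "build": build,
        "python_tag": python_tag,
        "abi_tag": abi_tag,
        "platform_tag": platform_tag,
    }


info = parse_wheel_filename("aitutor_assessmentkit-0.1.4-py3-none-any.whl")
print(info)
```

Here `py3` means any Python 3 interpreter, `none` means no ABI requirement, and `any` means the wheel is not tied to a specific platform, which is typical for a pure-Python package.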

File details

Details for the file aitutor_assessmentkit-0.1.4.tar.gz.

File metadata

  • Download URL: aitutor_assessmentkit-0.1.4.tar.gz
  • Upload date:
  • Size: 29.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.5

File hashes

Hashes for aitutor_assessmentkit-0.1.4.tar.gz
| Algorithm | Hash digest |
|---|---|
| SHA256 | f44b2ffdffeed8cb92d6fa78f7ec143a4e1b32960735f6739a8511da45e9f183 |
| MD5 | d805936903dab940a2680ebf5541984d |
| BLAKE2b-256 | 4d77d68cd9a8627fc205302bd00a11ac3f25d6b5d99b0eca74811f76fb8ad9a2 |
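
The digests listed above can be used to check that a downloaded archive was not corrupted or altered. The following is a minimal sketch using only Python's standard `hashlib` module; it assumes the source distribution has been saved in the current directory under its original file name, and the helper names `sha256_of` and `matches_expected` are illustrative rather than part of any tool.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for aitutor_assessmentkit-0.1.4.tar.gz
EXPECTED_SHA256 = "f44b2ffdffeed8cb92d6fa78f7ec143a4e1b32960735f6739a8511da45e9f183"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Compare the file's digest with the expected one (case-insensitive)."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local path: the archive saved under its original name.
    archive = Path("aitutor_assessmentkit-0.1.4.tar.gz")
    if archive.exists():
        print("OK" if matches_expected(archive) else "MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

The same approach applies to the wheel file by substituting its file name and SHA256 digest.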

File details

Details for the file aitutor_assessmentkit-0.1.4-py3-none-any.whl.

File hashes

Hashes for aitutor_assessmentkit-0.1.4-py3-none-any.whl
| Algorithm | Hash digest |
|---|---|
| SHA256 | 13d050b52501302ebd60b4139748d1fb2fcd23389b5ac8632aa4c6e835522332 |
| MD5 | 18d1e700679cc7654f24f0d850d67e7e |
| BLAKE2b-256 | c8d3bf54b113478bb6abd9e8a6706ef353baf0d72835a87232981b318b6dfbee |
