3 projects
graphrag-eval
For assessing question answering systems' final answers and intermediate steps, against a given set of questions, reference answers and steps.
ttyg
Natural language querying for GraphDB using LangGraph agents
ttyg-evaluation
Talk to Your Graph (TTYG) Evaluation is a Python module for evaluating whether LLM agents correctly orchestrate and invoke available tools to answer user questions, based on a Q&A dataset with tool call expectations.