Package for evaluating algebra problems using AI systems.
Project description
Algebra Problems Evaluator (APE)
What is APE?
APE is a framework that simplifies building and running mathematical benchmarks for AI systems.
It was initially meant as a way to evaluate mathematical problems from the field of algebra on whether they are suited to be a target goal for LLM reasoning research (hence the name). However it might be more useful to think about it the other way around - the subjects of the evaluation are LLMs or more broadly - AI systems and they are being evaluated on their ability to solve specific algebra problems. Problems, which solutions are hard to generate, but relatively easy to automatically check for correctness.
APE was created as a part of a Bachelor's project at the University of Warsaw.
User's Guide
[TODO]
Development setup
Install development dependencies:
pip install -r requirements-dev.txt
Install git hooks:
pre-commit install
Run all hooks manually:
pre-commit run --all-files
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ape_framework-0.1.1.tar.gz.
File metadata
- Download URL: ape_framework-0.1.1.tar.gz
- Upload date:
- Size: 9.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ddc0f803434964a03f9a73d6f8cf9769996c6bac411837ce5430e4fcfd70b834
|
|
| MD5 |
b05311274317fbd38125bf3a5a0d87db
|
|
| BLAKE2b-256 |
e0d9238995c0004f82ec06dd54136efa4f6728c9b3e28767a9a5cec7f3e24787
|
File details
Details for the file ape_framework-0.1.1-py3-none-any.whl.
File metadata
- Download URL: ape_framework-0.1.1-py3-none-any.whl
- Upload date:
- Size: 11.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0eb200b0f23c339cbb5d34860720d021f310852a5e7e5acda704d47547804c56
|
|
| MD5 |
e9ba26b4d264006f6d5129f683b4dbb8
|
|
| BLAKE2b-256 |
882cba3aa9616b93b67eb35bbf0cc2d1a3dc3f59c705dbd18398838dbce645da
|