Package for evaluating algebra problems using AI systems.

Algebra Problems Evaluator (APE)

What is APE?

APE is a framework that simplifies building and running mathematical benchmarks for AI systems.

It was initially meant as a way to evaluate whether mathematical problems from the field of algebra are suitable targets for LLM reasoning research (hence the name). However, it may be more useful to think about it the other way around: the subjects of the evaluation are LLMs or, more broadly, AI systems, and they are evaluated on their ability to solve specific algebra problems. These are problems whose solutions are hard to generate but relatively easy to check automatically for correctness.
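To illustrate the "hard to generate, easy to check" property, here is a minimal sketch in plain Python. It does not use APE's API; the names `check_solution` and `system` are hypothetical and exist only for this example. Finding a rational point that satisfies a system of polynomial equations may require real search or reasoning, while verifying a proposed point only requires exact evaluation.

```python
from fractions import Fraction
from typing import Callable, Sequence

# Hypothetical illustration only; none of these names come from APE.
Equation = Callable[[Sequence[Fraction]], Fraction]


def check_solution(equations: Sequence[Equation], candidate: Sequence[int]) -> bool:
    """Return True if every equation evaluates to exactly zero at the candidate point."""
    # Exact rational arithmetic avoids floating-point false negatives.
    point = [Fraction(value) for value in candidate]
    return all(equation(point) == 0 for equation in equations)


# Example system: x^2 + y^2 = 25 and x - y = 1, each written as "expression == 0".
system = [
    lambda p: p[0] ** 2 + p[1] ** 2 - 25,
    lambda p: p[0] - p[1] - 1,
]

print(check_solution(system, [4, 3]))  # True
print(check_solution(system, [3, 4]))  # False: satisfies the circle, not the line
```

The checker runs in time proportional to the size of the system, regardless of how the candidate answer was produced, which is what makes automatic grading of an AI system's answers practical.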

APE was created as part of a Bachelor's project at the University of Warsaw.

User's Guide

[TODO]

Development setup

Install development dependencies:

pip install -r requirements-dev.txt

Install git hooks:

pre-commit install

Run all hooks manually:

pre-commit run --all-files

