
A streamlined version of "LLM-Math" that judges the math solutions of LLMs.

Project description

LLM-MathJudger

A math judge simplified from LLM-Math. This version is used only for checking math answers, so it does not need to load a model.

Usage

  1. basic_check(A, B)

    Checks whether two pure math expressions A and B are equivalent; returns True / False.

  2. check(prompt_type, data_name, target, pred)

    Checks whether pred matches target; returns True / False. target is a single row of the dataset.

  • Supported prompt types: tool-integrated, direct, cot, pal, self-instruct, self-instruct-boxed, tora, wizard_zs, platypus_fs, deepseek-math, kpmath.

  • Supported datasets: gsm8k, math, svamp, asdiv, mawps, tabmwp, mathqa, mmlu_stem, sat_math.
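
As a rough illustration of how a two-argument checker like basic_check(A, B) could be used to score model answers, here is a minimal sketch. It does not import the package, because the source does not state the import path. toy_basic_check and accuracy are hypothetical names invented for this example, and the stand-in only compares plain numbers; it is not the library's comparison logic.

```python
from fractions import Fraction
from typing import Callable, Iterable, Tuple


def toy_basic_check(a: str, b: str) -> bool:
    """Hypothetical stand-in for basic_check(A, B); NOT the library's logic.

    Parses both strings as exact rationals (integers, decimals, or
    fractions such as "1/2") and compares them. If either string is not
    a plain number, falls back to comparing the stripped strings.
    """
    try:
        return Fraction(a.strip()) == Fraction(b.strip())
    except (ValueError, ZeroDivisionError):
        return a.strip() == b.strip()


def accuracy(
    pairs: Iterable[Tuple[str, str]],
    checker: Callable[[str, str], bool] = toy_basic_check,
) -> float:
    """Fraction of (target, pred) pairs that the checker accepts."""
    pairs = list(pairs)
    if not pairs:
        return 0.0
    correct = sum(1 for target, pred in pairs if checker(target, pred))
    return correct / len(pairs)


# Two of the four predictions match their targets exactly.
pairs = [("1/2", "0.5"), ("3", "3.0"), ("7", "8"), ("2/3", "0.66")]
print(accuracy(pairs))  # 0.5
```

With the real package installed, the same loop would presumably take the library's basic_check as the checker argument, since it has the same (A, B) -> True / False shape described above.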

Note

This version uses aggressive modifications to improve speed and has not yet been rigorously tested; use it with caution.

Download files

Source Distribution

llm_mathjudger-0.2.1.tar.gz (6.5 MB)

Uploaded Source

Built Distribution

llm_mathjudger-0.2.1-py3-none-any.whl (6.6 MB)

Uploaded Python 3

File details

Details for the file llm_mathjudger-0.2.1.tar.gz.

File metadata

  • Download URL: llm_mathjudger-0.2.1.tar.gz
  • Size: 6.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: pdm/2.19.1 CPython/3.9.19 Darwin/24.1.0

File hashes

Hashes for llm_mathjudger-0.2.1.tar.gz
Algorithm Hash digest
SHA256 a20bd84679bdd94a592a288b75524eb86ccdf7dca6904aa2aeb923f784e25798
MD5 77343fb0214eaf7c7d67e8d37c0bc1a8
BLAKE2b-256 382b44c3c9ce924106e7e0c935d8ef13601efe0581b16430963f5a16ad829eae

File details

Details for the file llm_mathjudger-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: llm_mathjudger-0.2.1-py3-none-any.whl
  • Size: 6.6 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: pdm/2.19.1 CPython/3.9.19 Darwin/24.1.0

File hashes

Hashes for llm_mathjudger-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 00c498ec8d0a51e4e17de326935edc1f2b5885247ec6d7de2ffd7b70792d64ec
MD5 794aa876c102f0e5164cdeaac2af2236
BLAKE2b-256 04c0d68a591c07cd708ff4bc3fbae6e43aa595e5b4cc123c79c72e7026a8eb84
