A sitreamlined version of "LLM-Math" that judges the math solutions of LLMs.
Project description
LLM-MathJudger
基于 LLM-Math 简化的数学判别器. 该版本仅用于数学检查,因此不需要加载模型.
Usage
-
basic_check(A, B)
检查 A, B 两个纯数学表达式是否一致,返回 True / False.
-
check(prompt_type, data_name, target, pred)
检查 pred 是否与 target 一致,返回 True / False. target 即为数据集的某一行.
-
支持的 prompt 类型:
tool-integrated
,direct
,cot
,pal
,self-instruct
,self-instruct-boxed
,tora
,wizard_zs
,platypus_fs
,deepseek-math
,kpmath
. -
支持的数据集:
gsm8k
,math
,svamp
,asdiv
,mawps
,tabmwp
,mathqa
,mmlu_stem
,sat_math
.
Note
该版本为了提高速度采用了激进的修改,还未经严格测试,请谨慎使用.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
llm_mathjudger-0.2.1.tar.gz
(6.5 MB
view hashes)
Built Distribution
Close
Hashes for llm_mathjudger-0.2.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 00c498ec8d0a51e4e17de326935edc1f2b5885247ec6d7de2ffd7b70792d64ec |
|
MD5 | 794aa876c102f0e5164cdeaac2af2236 |
|
BLAKE2b-256 | 04c0d68a591c07cd708ff4bc3fbae6e43aa595e5b4cc123c79c72e7026a8eb84 |