A sitreamlined version of "LLM-Math" that judges the math solutions of LLMs.
Project description
LLM-MathJudger
基于 LLM-Math 简化的数学判别器. 该版本仅用于数学检查,因此不需要加载模型.
Usage
-
basic_check(A, B)
检查 A, B 两个纯数学表达式是否一致,返回 True / False.
-
check(prompt_type, data_name, target, pred)
检查 pred 是否与 target 一致,返回 True / False. target 即为数据集的某一行.
-
支持的 prompt 类型:
tool-integrated
,direct
,cot
,pal
,self-instruct
,self-instruct-boxed
,tora
,wizard_zs
,platypus_fs
,deepseek-math
,kpmath
. -
支持的数据集:
gsm8k
,math
,svamp
,asdiv
,mawps
,tabmwp
,mathqa
,mmlu_stem
,sat_math
.
Note
该版本为了提高速度采用了激进的修改,还未经严格测试,请谨慎使用.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
llm_mathjudger-0.2.0.tar.gz
(6.5 MB
view hashes)
Built Distribution
Close
Hashes for llm_mathjudger-0.2.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a53ab50915331ce2502f640fe614be18ca89d1788bcc2fc85341a887bc0ee427 |
|
MD5 | 3c38b2b3150fe1efef717575690c252c |
|
BLAKE2b-256 | 71de353682ee479e93874ded2a359717ba091106eb413a44069fe7aa25c5dbd0 |