A sitreamlined version of "LLM-Math" that judges the math solutions of LLMs.
Project description
LLM-MathJudger
基于 LLM-Math 简化的数学判别器. 该版本仅用于数学检查,因此不需要加载模型.
Usage
-
basic_check(A, B)检查 A, B 两个纯数学表达式是否一致,返回 True / False.
-
check(prompt_type, data_name, target, pred)检查 pred 是否与 target 一致,返回 True / False. target 即为数据集的某一行.
-
支持的 prompt 类型:
tool-integrated,direct,cot,pal,self-instruct,self-instruct-boxed,tora,wizard_zs,platypus_fs,deepseek-math,kpmath. -
支持的数据集:
gsm8k,math,svamp,asdiv,mawps,tabmwp,mathqa,mmlu_stem,sat_math.
Note
该版本为了提高速度采用了激进的修改,还未经严格测试,请谨慎使用.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file llm_mathjudger-0.2.1.tar.gz.
File metadata
- Download URL: llm_mathjudger-0.2.1.tar.gz
- Upload date:
- Size: 6.5 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: pdm/2.19.1 CPython/3.9.19 Darwin/24.1.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a20bd84679bdd94a592a288b75524eb86ccdf7dca6904aa2aeb923f784e25798
|
|
| MD5 |
77343fb0214eaf7c7d67e8d37c0bc1a8
|
|
| BLAKE2b-256 |
382b44c3c9ce924106e7e0c935d8ef13601efe0581b16430963f5a16ad829eae
|
File details
Details for the file llm_mathjudger-0.2.1-py3-none-any.whl.
File metadata
- Download URL: llm_mathjudger-0.2.1-py3-none-any.whl
- Upload date:
- Size: 6.6 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: pdm/2.19.1 CPython/3.9.19 Darwin/24.1.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
00c498ec8d0a51e4e17de326935edc1f2b5885247ec6d7de2ffd7b70792d64ec
|
|
| MD5 |
794aa876c102f0e5164cdeaac2af2236
|
|
| BLAKE2b-256 |
04c0d68a591c07cd708ff4bc3fbae6e43aa595e5b4cc123c79c72e7026a8eb84
|