IRT Equating for Unidimensional Mixed Format Test with Python
Project description
EqUMP
IRT Equating for Unidimensional Mixed Format Test with Python
Motivation
Item Respone Theory (IRT) test equating is a critical process for maintaining score comparability across different test forms. However, current implementations often rely on limited software or complex scripting environments. We introduce EqUMP, a Python package designed to streamline and democratize IRT equating for unidimensional mixed format test. Built upon the robust ecosystem of scientific computing in Python (Numpy, Scipy), it provides a flexible framework for various IRT equating methods (e.g., scale transformation, true score equating). The package is openly licensed, pip-installable, and includes comprehensive documentation and test suites, enabling researchers and practitioners to readily reproduce analysis pipelines. By leveraging Python’s accessibility and extensibility, EqUMP facilitates rigorous and reproducible test equating for a wider educational data analysists and psychometricians.
Core Principles
-
Academically faithful code and documentation
EqUMP is designed for researchers and practitioners familiar with standard test equating literature. We follow textbook-style notation and structure—even at the cost of computational speed—to ensure clarity and pedagogical value. -
Practical focus
While many equating methods exist in the literature, some are infeasible in operational testing. EqUMP prioritizes features that reflect real-world usage and operational constraints. Users can request or contribute less common methods. -
Symmetric linking constant support
Symmetry property of equating function is a critical but often neglected aspect of Haebara or Stocking-Lord linking method. EqUMP handles this through carefully implemented two-sided(symmtric) or one-sided loss function.
Installation
pip install EqUMP
Compared to other related packages
- Before we create this package, we analyze the packages below
- As far as we know, there are no Python-based IRT Test Equating packages.
- We found some R packages and commercial programs below. And summarize their features in the table below.
| Package/Program | Author | Category | scale-linking | true-score | observed-score | kernel | link |
|---|---|---|---|---|---|---|---|
| STUIRT | Kim & Kolen, 2004 | Standalone | O | X | X | X | download |
| POLYEQUATE | Kolen, 2004 | Standalone | X | O | O | X | download |
| IRTEQ | Han, 2009 | Standalone | O | O | X | X | homepage |
| kequate | Andersson et al., 2013 | R | O | X | O | O | cran |
| equateIRT | Battauz, 2015 | R | O | O | O | X | cran |
| SNSequate | Jorge Gonzalez, 2024 | R | O | O | O | O | download |
Core API
Item Response Model
from EqUMP.base import IRF
# Dichotomous item
item_2pl = IRF(params={"a": 1.2, "b": 0.5}, model="2PL")
probs = item_2pl.prob(theta=0.0)
print(probs)
# Polytomous item with custom scores
item_gpcm = IRF(
params={"a": 1.0, "b": [-0.5, 0.0, 0.5]},
model="GPCM",
scores=[0, 2, 5, 10]
)
expected = item_gpcm.expected_score(theta=0.0)
Scale Transformation
Mean-Mean
from EqUMP.base import IRF
from EqUMP.linking import mean_mean
# Create IRF objects for new and old forms
items_new = {
1: IRF({"a": 1.2, "b": 0.5}, "2PL"),
2: IRF({"a": 1.0, "b": -0.3}, "2PL"),
3: IRF({"a": 1.5, "b": 0.8}, "2PL"),
}
items_old = {
1: IRF({"a": 1.15, "b": 0.48}, "2PL"),
2: IRF({"a": 0.95, "b": -0.28}, "2PL"),
3: IRF({"a": 1.45, "b": 0.85}, "2PL"),
}
A, B = mean_mean(
items_new=items_new,
items_old=items_old,
common_new=[1, 2, 3],
common_old=[1, 2, 3],
)
print(f"Linking constants: A={A:.4f}, B={B:.4f}")
Mean-Sigma
from EqUMP.base import IRF
from EqUMP.linking import mean_sigma
# Create IRF objects for new and old forms
items_new = {
1: IRF({"a": 1.2, "b": 0.5}, "2PL"),
2: IRF({"a": 1.0, "b": -0.3}, "2PL"),
3: IRF({"a": 1.5, "b": 0.8}, "2PL"),
}
items_old = {
1: IRF({"a": 1.15, "b": 0.48}, "2PL"),
2: IRF({"a": 0.95, "b": -0.28}, "2PL"),
3: IRF({"a": 1.45, "b": 0.85}, "2PL"),
}
A, B = mean_sigma(
items_new=items_new,
items_old=items_old,
common_new=[1, 2, 3],
common_old=[1, 2, 3],
)
print(f"Linking constants: A={A:.4f}, B={B:.4f}")
Haebara
import numpy as np
from EqUMP.base import IRF
from EqUMP.linking import haebara
# Create IRF objects with mixed item types
items_new = {
1: IRF({"a": 1.2, "b": 0.5, "c": 0.2}, "3PL", D=1.7),
2: IRF({"a": 1.0, "b": -0.3, "c": 0.15}, "3PL", D=1.7),
3: IRF({"a": 1.0, "b": [-0.5, 0.0, 0.5]}, "GPCM", D=1.7),
}
items_old = {
1: IRF({"a": 1.15, "b": 0.48, "c": 0.18}, "3PL", D=1.7),
2: IRF({"a": 0.95, "b": -0.28, "c": 0.14}, "3PL", D=1.7),
3: IRF({"a": 0.95, "b": [-0.48, 0.02, 0.52]}, "GPCM", D=1.7),
}
A, B = haebara(
items_new=items_new,
items_old=items_old,
common_new=[1, 2, 3],
common_old=[1, 2, 3],
quadrature="gauss_hermite",
nq=30,
symmetry=True,
)
print(f"Linking constants: A={A:.4f}, B={B:.4f}")
Stocking-Lord
import numpy as np
from EqUMP.base import IRF
from EqUMP.linking import stocking_lord
# Create IRF objects with mixed item types
items_new = {
1: IRF({"a": 1.2, "b": 0.5, "c": 0.2}, "3PL", D=1.7),
2: IRF({"a": 1.0, "b": -0.3, "c": 0.15}, "3PL", D=1.7),
3: IRF({"a": 1.0, "b": [-0.5, 0.0, 0.5]}, "GPCM", D=1.7),
}
items_old = {
1: IRF({"a": 1.15, "b": 0.48, "c": 0.18}, "3PL", D=1.7),
2: IRF({"a": 0.95, "b": -0.28, "c": 0.14}, "3PL", D=1.7),
3: IRF({"a": 0.95, "b": [-0.48, 0.02, 0.52]}, "GPCM", D=1.7),
}
A, B = stocking_lord(
items_new=items_new,
items_old=items_old,
common_new=[1, 2, 3],
common_old=[1, 2, 3],
quadrature="gauss_hermite",
nq=30,
symmetry=True,
)
print(f"Linking constants: A={A:.4f}, B={B:.4f}")
True Score Equating
from EqUMP.base import IRF
from EqUMP.equating.TSE import tse
# Create IRF objects for new and old forms
items_new = {
0: IRF({"a": 1.2, "b": -0.5}, "2PL"),
1: IRF({"a": 1.0, "b": 0.0}, "2PL"),
2: IRF({"a": 1.5, "b": 0.8}, "2PL"),
3: IRF({"a": 0.9, "b": -1.2}, "2PL"),
}
items_old = {
0: IRF({"a": 1.15, "b": -0.47}, "2PL"),
1: IRF({"a": 0.95, "b": 0.05}, "2PL"),
2: IRF({"a": 1.45, "b": 0.85}, "2PL"),
3: IRF({"a": 0.85, "b": -1.15}, "2PL"),
}
# Equate a test score of 2.5 on the new form
ts = 2.5
theta_eq, score_old = tse(
ts=ts,
items_new=items_new,
items_old=items_old,
common_new=[0, 1, 2],
common_old=[0, 1, 2],
anchor="internal"
)
print(f"Theta estimate: {theta_eq:.4f}")
print(f"Equivalent score on old form: {score_old:.4f}")
Observed Score Equating
# in developing
Kernel Methods
# in developing
How to contribute
EqUMP is actively being developed, and upcoming releases will include additional item response models and equating methods.
We welcome contributions of any kind—if you’d like to get involved, please check out our contribution guidelines.
How to cite
``` this section will be filled after publishing this package in a peer reviewed journal ```
Used by
following contents are not real at all. It's just our hope and vision
|
Korea Institute for Curriculum and Evaluation scale linking, true score equating |
💡 Want your organization featured? Open an Issue or Pull Request to be listed.
References
- [1] Kolen, M. J., & Brennan, R. L. (2014). Test equating, scaling, and linking: Methods and practices (3rd ed.). Springer Science + Business Media. https://doi.org/10.1007/978-1-4939-0317-7.
- [2] 김성훈. (2022). 문항반응이론 검사 동등화. 공동체.
- [3] Andersson, B., Bränberg, K., & Wiberg, M. (2013). Performing the Kernel Method of Test Equating with the Package kequate. Journal of Statistical Software, 55(6), 1–25. https://doi.org/10.18637/jss.v055.i06
- [4] Battauz, M. (2015). equateIRT: An R Package for IRT Test Equating. Journal of Statistical Software, 68(7), 1–22. https://doi.org/10.18637/jss.v068.i0
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file eqump-0.3.5.tar.gz.
File metadata
- Download URL: eqump-0.3.5.tar.gz
- Upload date:
- Size: 63.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
718bdef46101f8e666fea9af81a84ec46bde0bd53ad2bd3cf30fbf31ecaa0ac4
|
|
| MD5 |
05df91afd5a3d7ae4ee08c35b19b20d4
|
|
| BLAKE2b-256 |
13d99235f3875b541057ce953de0d4182f903b94af22b666176abb8ba8f38e12
|
Provenance
The following attestation bundles were made for eqump-0.3.5.tar.gz:
Publisher:
publish-pypi.yml on huni1023/EqUMP
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
eqump-0.3.5.tar.gz -
Subject digest:
718bdef46101f8e666fea9af81a84ec46bde0bd53ad2bd3cf30fbf31ecaa0ac4 - Sigstore transparency entry: 707968650
- Sigstore integration time:
-
Permalink:
huni1023/EqUMP@bf5264ef5b70a1426cd0f72a316e73aaad8c5666 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/huni1023
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-pypi.yml@bf5264ef5b70a1426cd0f72a316e73aaad8c5666 -
Trigger Event:
workflow_run
-
Statement type:
File details
Details for the file eqump-0.3.5-py3-none-any.whl.
File metadata
- Download URL: eqump-0.3.5-py3-none-any.whl
- Upload date:
- Size: 85.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
28987404c0488f73797b47621fd0e7202d3d13388aaeaee546eae6c6ff402793
|
|
| MD5 |
a5c5ad9ccb8b82315c5b06faad827df9
|
|
| BLAKE2b-256 |
053cc911ee014aa3d29fa6419abba77a57e8f2e0e31d18fa32f1dd65783ec266
|
Provenance
The following attestation bundles were made for eqump-0.3.5-py3-none-any.whl:
Publisher:
publish-pypi.yml on huni1023/EqUMP
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
eqump-0.3.5-py3-none-any.whl -
Subject digest:
28987404c0488f73797b47621fd0e7202d3d13388aaeaee546eae6c6ff402793 - Sigstore transparency entry: 707968658
- Sigstore integration time:
-
Permalink:
huni1023/EqUMP@bf5264ef5b70a1426cd0f72a316e73aaad8c5666 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/huni1023
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-pypi.yml@bf5264ef5b70a1426cd0f72a316e73aaad8c5666 -
Trigger Event:
workflow_run
-
Statement type: