Generalizability theory fully crossed and nested
Project description
GeneralizIT: Generalizability Theory Analysis in Python
Overview
GeneralizIT is a Python-based library designed for conducting Generalizability Theory (GT) analyses. The library supports multiple research designs and provides tools to calculate ANOVA tables, generalizability coefficients (G coefficients), and decision (D) studies.
Generalizability Theory extends classical test theory by estimating multiple sources of error variance, providing a more flexible and detailed approach to reliability assessment. This makes it particularly useful for researchers and practitioners who work with multi-faceted designs and want to quantify the reliability and generalizability of their measurements across different facets (e.g., raters, items, occasions).
Features
- Support for Various Designs: Handles different research designs (fully crossed, partially nested) with user-friendly syntax
- Robust to Data Issues: Supports unbalanced designs and can handle missing data
- Automated Data Cleaning: Prepares your data by dropping unnecessary columns and normalizing names.
- ANOVA Calculation: Produces ANOVA tables specific to your design using Analogous ANOVA based on Henderson's Method I
- G Coefficients: Computes generalizability coefficients to assess reliability.
- D Studies: Performs decision studies with customizable levels for facets to optimize measurement protocols
- Summaries: Provides concise summaries for ANOVA, G coefficients, and D studies
Installation
pip install generalizit
It is recommended to create a Python environment with Python 3.8 or higher.
The following dependencies are installed alongside the package:
pandasnumpyscipy
Usage
Initializing GeneralizIT
from generalizit import GeneralizIT
# Initialize with:
# - `data`: A pandas DataFrame containing your data.
# - `input_str`: A string describing the research design (e.g., "Person x i x o").
# - `response`: The column name for the response variable.
GT = GeneralizIT(data=formatted_df, input_str='Person x i x o', response='Response')
Example Workflow
# Import the package
from generalizit import GeneralizIT
import pandas as pd
# Load data
data = pd.read_csv("your_data.csv")
# Initialize with your research design
GT = GeneralizIT(data=data, input_str='Person x i x o', response='Response')
# 1. Calculate ANOVA
GT.calculate_anova()
GT.anova_summary()
# 2a. Compute G Coefficients with Random Facets
GT.calculate_g_coefficients()
GT.g_coefficients_summary()
# 2b. Compute G Coefficients with a Fixed Facet (i.e. Items 'i')
GT.calculate_g_coefficients(fixed_facets=['i'])
GT.g_coefficients_summary()
# 3. Perform a D Study with different facet levels
GT.calculate_d_study(levels={'Person': [8], 'i': [4, 8], 'o': [1, 2]})
GT.d_study_summary()
# 4. Calculate Confidence Intervals
GT.calculate_confidence_intervals(alpha=0.05)
GT.confidence_intervals_summary()
Input Data Format
The data should be structureda as a pandas DataFrame in a long format, where each row corresponds to a unique combination of the facets and the response variable.
| Person | i | o | Response |
|--------|---|---|----------|
| 1 | 1 | 1 | 2 |
| 1 | 2 | 1 | 6 |
| 1 | 3 | 1 | 7 |
| 1 | 4 | 1 | 5 |
| 1 | 1 | 2 | 2 |
| 1 | 2 | 2 | 5 |
| 1 | 3 | 2 | 5 |
| 1 | 4 | 2 | 5 |
...
| 10 | 1 | 1 | 6 |
| 10 | 2 | 1 | 8 |
| 10 | 3 | 1 | 7 |
| 10 | 4 | 1 | 6 |
| 10 | 1 | 2 | 6 |
| 10 | 2 | 2 | 8 |
| 10 | 3 | 2 | 8 |
| 10 | 4 | 2 | 6 |
Conversely, if the design was nested such as person x (rater:item), raters are nested under item and should be identified uniquely either by delineation item1_rater1 or unique numbering as below:
| Person | item | rater | Response |
|--------|------|-------|----------|
| 1 | 1 | 1 | 2 |
| 1 | 1 | 2 | 6 |
| 1 | 1 | 3 | 7 |
| 1 | 1 | 4 | 5 |
| 1 | 2 | 5 | 2 |
| 1 | 2 | 6 | 5 |
| 1 | 2 | 7 | 5 |
| 1 | 2 | 8 | 5 |
| 1 | 3 | 9 | 6 |
| 1 | 3 | 10 | 8 |
| 1 | 3 | 11 | 7 |
| 1 | 3 | 12 | 6 |
| 2 | 1 | 1 | 6 |
| 2 | 1 | 2 | 8 |
| 2 | 1 | 3 | 8 |
| 2 | 1 | 4 | 6 |
...
Example Datasets
Synthetic Data from Brennan (2001)
The package includes examples of synthetic data used to demonstrate the functionality of the library. These are located in the 'tests' directory. You can adapt these examples to your own datasets.
Research Design Syntax
The input_str parameter specifies the research design. Supported formats include:
- Crossed Designs:
"Person x i x o" - Nested Designs:
"Person x (r:t)"
Output
- ANOVA Table: Provides variance component estimates for each facet.
- G Coefficients:
- Phi (Φ): Absolute coefficient for dependability (also known as Ga)
- Rho² (ρ²): Relative coefficient for generalizability (also known as Gr)
- D Studies: Offers predictions of generalizability for specified facet levels to optimize measurement protocols
- Confidence Intervals: Calculates confidence intervals for means of each facet.
- Summaries: Concise summaries of ANOVA, G coefficients, D studies, and Confidence Intervals.
Advanced Tutorials
Advanced package usage and G-Theory methodology tutorials are available in our GitHub repository:
Documentation
The full API documentation is available at GeneralizIT Documentation.
Notes
- Include only the necessary facets and the response variable.
- Column names should match those specified in the research design.\
- For unbalanced designs, GeneralizIT will automatically use appropriate computational methods
Citation
If you use GeneralizIT in your research, please cite the following paper:
@misc{smith2025generalizit,
author = {Smith, Tyler J. and Kline, Theresa J. B. and Kline, Adrienne Sarah},
title = {{Generalizit: A Python Solution for Generalizability Theory Computations}},
year = {2025},
note = {Preprint submitted to \textit{SoftwareX}},
howpublished = {\url{https://ssrn.com/abstract=5209876}},
doi = {10.2139/ssrn.5209876}
}
License
This library is licensed under the MIT License.
Contributions
Contributions to improve functionality or expand supported designs are welcome! Please fork the repository and create a pull request.
Contact
For questions or support, raise an issue in the github repo!
Happy analyzing with GeneralizIT!
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file generalizit-0.1.2.tar.gz.
File metadata
- Download URL: generalizit-0.1.2.tar.gz
- Upload date:
- Size: 31.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
49b3fc67a85ba2ff5589c549858cc50b5d1fa50de54be22db34a8bf08b4f763a
|
|
| MD5 |
464edd3a489f0994094cbe4d71b0c3fd
|
|
| BLAKE2b-256 |
be84db0af24497026e369b1cb1a4c9c0de8e431ae6affa3691223ca76525118e
|
File details
Details for the file generalizit-0.1.2-py3-none-any.whl.
File metadata
- Download URL: generalizit-0.1.2-py3-none-any.whl
- Upload date:
- Size: 29.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
00019a5594178393e46a4479006a6719d0b31fed37e86603dc4d3fc115ada3bf
|
|
| MD5 |
4d9cc91ed967ce8c95d8fee499f2bea9
|
|
| BLAKE2b-256 |
b5d4c3e2642009168f46ff5c7dfec8f9cc6521149fc8f9555bc492deda114b34
|