Python library for detecting Insufficient Effort Responding (IER) in survey data

Maintainers

cameronlyons

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

IER

A Python package for detecting Insufficient Effort Responding (IER) in survey data using various statistical indices and methods.

Overview

Participants in online surveys sometimes respond to items without regard to their content. Such responses, known as insufficient effort responding (IER) or careless responding, degrade data quality and can distort data analysis and hypothesis testing.

The ier package helps detect such responses by computing indices proposed in the literature. For a comprehensive review of these methods, see Curran (2016).

Features

Multiple Detection Methods: Supports 20+ indices for detecting careless responding
Flexible Input: Works with lists, numpy arrays, pandas DataFrames, and polars DataFrames
Robust Implementation: Handles missing data and edge cases
Type Hints: Full type annotations for IDE support

Installation

From PyPI

pip install insufficient-effort

From Source

git clone https://github.com/Cameron-Lyons/ier.git
cd ier
pip install -e .

Optional Dependencies

For enhanced functionality (e.g., chi-squared outlier detection):

pip install "insufficient-effort[full]"

Quick Start

import numpy as np
from ier import irv, mahad, longstring

# Sample survey data (rows = participants, columns = items)
data = np.array([
    [1, 2, 3, 4, 5, 6, 7, 8],  # Normal responding
    [3, 3, 3, 3, 3, 3, 3, 3],  # Straightlining
    [1, 5, 1, 5, 1, 5, 1, 5],  # Alternating pattern
])

# Intra-individual response variability (low = straightlining)
print("IRV:", irv(data))

# Mahalanobis distance (high = outlier)
print("Mahad:", mahad(data))

# Longest string of identical responses
print("Longstring:", longstring(data))

Available Functions

Consistency Indices

`evenodd(x, factors, diag=False)`

Computes even-odd consistency by correlating responses to even vs odd items within each factor.

from ier import evenodd

data = [[1, 2, 3, 4, 5, 6], [2, 3, 4, 5, 6, 7]]
factors = [3, 3]  # Two factors with 3 items each
scores = evenodd(data, factors)
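
As a rough illustration of the idea (not the package's implementation), the sketch below computes an even-odd correlation for a single respondent in plain Python. The helper names `pearson` and `evenodd_sketch` are hypothetical, and the sketch leaves out any reliability correction a full implementation might apply.

```python
from math import sqrt

def pearson(a, b):
    """Pearson correlation; returns None when either side has zero variance."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)

def evenodd_sketch(row, factors):
    """Correlate odd-item and even-item subscale means across factors."""
    odd_means, even_means = [], []
    start = 0
    for size in factors:
        items = row[start:start + size]
        odd_items = items[0::2]   # 1st, 3rd, ... item of this factor
        even_items = items[1::2]  # 2nd, 4th, ... item of this factor
        odd_means.append(sum(odd_items) / len(odd_items))
        even_means.append(sum(even_items) / len(even_items))
        start += size
    return pearson(odd_means, even_means)

# Three factors with four items each (illustrative data)
consistent = [1, 1, 1, 1, 3, 3, 3, 3, 5, 5, 5, 5]
inconsistent = [1, 5, 1, 5, 3, 3, 3, 3, 5, 1, 5, 1]
print(evenodd_sketch(consistent, [4, 4, 4]))
print(evenodd_sketch(inconsistent, [4, 4, 4]))
```

With only two factors the correlation can only be 1, -1, or undefined, so this kind of index is more informative when the survey has several subscales.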

`psychsyn(x, critval=0.60, anto=False, diag=False)`

Identifies highly correlated item pairs and computes within-person correlations.

from ier import psychsyn, psychant

data = [[1, 2, 3, 4], [2, 3, 4, 5], [3, 4, 5, 6]]
scores = psychsyn(data, critval=0.5)  # Synonyms
scores = psychant(data, critval=-0.5)  # Antonyms

`individual_reliability(x, n_splits=100, random_seed=None)`

Estimates response consistency using repeated split-half correlations.

from ier import individual_reliability, individual_reliability_flag

data = [[1, 2, 1, 2, 1, 2], [1, 5, 2, 4, 3, 3]]
reliability = individual_reliability(data, n_splits=50)
flags = individual_reliability_flag(data, threshold=0.3)

`person_total(x, na_rm=True)`

Correlates each person's responses with the sample mean response pattern.

from ier import person_total

data = [[1, 2, 3, 4, 5], [5, 4, 3, 2, 1], [1, 2, 3, 4, 5]]
scores = person_total(data)  # [1.0, -1.0, 1.0]
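
The following plain-Python sketch shows one way such a person-total correlation could be computed. It is an illustration under stated assumptions, not the package's code: `person_total_sketch` is a hypothetical name, and the sketch keeps each person's own responses in the sample mean, which the package may or may not do.

```python
from math import sqrt

def pearson(a, b):
    """Pearson correlation; returns None when either side has zero variance."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)

def person_total_sketch(data):
    """Correlate every row with the vector of item means."""
    n_items = len(data[0])
    item_means = [sum(row[j] for row in data) / len(data) for j in range(n_items)]
    return [pearson(row, item_means) for row in data]

data = [[1, 2, 3, 4, 5], [5, 4, 3, 2, 1], [1, 2, 3, 4, 5]]
print(person_total_sketch(data))
```

A respondent whose pattern runs against the sample trend, like the second row here, receives a negative correlation.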

`semantic_syn(x, item_pairs, anto=False)` / `semantic_ant(x, item_pairs)`

Computes consistency for predefined semantic synonym/antonym pairs.

from ier import semantic_syn, semantic_ant

data = [[1, 1, 5, 5], [1, 2, 5, 4]]
pairs = [(0, 1), (2, 3)]  # Predefined synonym pairs
scores = semantic_syn(data, pairs)

`guttman(x, na_rm=True, normalize=True)`

Counts response reversals relative to item difficulty ordering.

from ier import guttman, guttman_flag

data = [[1, 2, 3, 4, 5], [5, 4, 3, 2, 1]]
errors = guttman(data)
flags = guttman_flag(data, threshold=0.5)
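
To make the idea of a reversal concrete, here is a minimal sketch that assumes item difficulty is approximated by the sample mean of each item (a higher mean is treated as an easier item). The function name `guttman_errors_sketch` and the pairwise counting rule are illustrative assumptions, not the package's implementation.

```python
from itertools import combinations

def guttman_errors_sketch(data, normalize=True):
    """Count pairs where a person scores higher on a harder item than on an easier one."""
    n_items = len(data[0])
    means = [sum(row[j] for row in data) / len(data) for j in range(n_items)]
    # Easiest (highest mean) items first; ties keep their original order
    order = sorted(range(n_items), key=lambda j: -means[j])
    n_pairs = n_items * (n_items - 1) // 2
    scores = []
    for row in data:
        ordered = [row[j] for j in order]
        errors = sum(1 for easy, hard in combinations(ordered, 2) if hard > easy)
        scores.append(errors / n_pairs if normalize else errors)
    return scores

# Illustrative data: the last respondent reverses the sample ordering
data = [[5, 4, 3, 2], [5, 4, 3, 2], [5, 4, 3, 1], [2, 3, 4, 5]]
print(guttman_errors_sketch(data))
print(guttman_errors_sketch(data, normalize=False))
```

Normalizing by the number of item pairs keeps the score between 0 and 1, which makes a single threshold usable across surveys of different lengths.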

`mad(x, positive_items, negative_items, scale_max=None)`

Mean Absolute Difference between positively and negatively worded items. High MAD indicates careless responding (not attending to item direction).

from ier import mad, mad_flag

# Columns 0,2 are positively worded; columns 1,3 are negatively worded
data = [
    [5, 1, 5, 1],  # Attentive: high on pos, low on neg
    [5, 5, 5, 5],  # Careless: ignores item direction
]
scores = mad(data, positive_items=[0, 2], negative_items=[1, 3], scale_max=5)
scores, flags = mad_flag(data, positive_items=[0, 2], negative_items=[1, 3])
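
One plausible reading of this index, sketched below, reverse-scores the negatively worded items and then compares the two item-set means. The function `mad_sketch`, its `scale_min` parameter, and the exact formula are assumptions made for illustration; the package may compute the difference differently.

```python
def mad_sketch(row, positive_items, negative_items, scale_min=1, scale_max=5):
    """Absolute gap between positive items and reverse-scored negative items."""
    pos = [row[i] for i in positive_items]
    # Reverse-score negatively worded items so both sets point the same way
    neg = [scale_min + scale_max - row[i] for i in negative_items]
    return abs(sum(pos) / len(pos) - sum(neg) / len(neg))

attentive = [5, 1, 5, 1]
careless = [5, 5, 5, 5]
print(mad_sketch(attentive, [0, 2], [1, 3]))  # 0.0
print(mad_sketch(careless, [0, 2], [1, 3]))   # 4.0
```

After reverse-scoring, an attentive respondent's two means agree, while a respondent who ignores item direction ends up with a large gap.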

Response Pattern Indices

`longstring(x, avg=False)`

Computes the longest (or average) run of identical consecutive responses.

from ier import longstring

# Single string
longstring("AAABBBCCDAA")  # ('A', 3)

# Matrix of responses
data = [[1, 1, 1, 2, 3], [1, 2, 3, 4, 5]]
longstring(data)  # [('1', 3), ('1', 1)]
longstring(data, avg=True)  # [1.67, 1.0]
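
The run-length logic behind this index can be sketched with `itertools.groupby` from the standard library. The helper names below are hypothetical and the sketch returns only run lengths, not the (value, length) tuples shown above.

```python
from itertools import groupby

def run_lengths(row):
    """Lengths of each run of identical consecutive responses."""
    return [len(list(group)) for _, group in groupby(row)]

def longstring_sketch(row, avg=False):
    """Longest run, or the mean run length when avg=True."""
    runs = run_lengths(row)
    return sum(runs) / len(runs) if avg else max(runs)

print(run_lengths([1, 1, 1, 2, 3]))              # [3, 1, 1]
print(longstring_sketch([1, 1, 1, 2, 3]))        # 3
print(round(longstring_sketch([1, 1, 1, 2, 3], avg=True), 2))  # 1.67
```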

`irv(x, na_rm=True, split=False, num_split=1)`

Computes intra-individual response variability (standard deviation).

from ier import irv

data = [[1, 2, 3, 4, 5], [3, 3, 3, 3, 3]]
scores = irv(data)  # High for varied, low for straightlining

# Split-half IRV
scores = irv(data, split=True, num_split=2)
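
For intuition, IRV is a within-person standard deviation, which the sketch below computes with the standard library. It assumes the sample standard deviation (n - 1 denominator) and equal-sized consecutive chunks for the split variant; both choices, and the function names, are assumptions rather than documented behavior of the package.

```python
from statistics import stdev

def irv_sketch(row):
    """Sample standard deviation of one respondent's answers."""
    return stdev(row)

def irv_split_sketch(row, num_split):
    """IRV computed separately on consecutive, equal-sized chunks of the row."""
    size = len(row) // num_split  # leftover items at the end are ignored
    return [stdev(row[k * size:(k + 1) * size]) for k in range(num_split)]

print(irv_sketch([1, 2, 3, 4, 5]))
print(irv_sketch([3, 3, 3, 3, 3]))
print(irv_split_sketch([1, 2, 3, 3, 3, 3], num_split=2))
```

Computing IRV per chunk can reveal a respondent who answered carefully at first and began straightlining partway through the survey.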

`u3_poly(x, scale_min=None, scale_max=None)`

Proportion of extreme responses (at scale endpoints).

from ier import u3_poly

data = [[1, 5, 1, 5, 3], [3, 3, 3, 3, 3]]
extreme = u3_poly(data, scale_min=1, scale_max=5)

`midpoint_responding(x, scale_min=None, scale_max=None, tolerance=0.0)`

Proportion of midpoint responses.

from ier import midpoint_responding

data = [[1, 2, 3, 4, 5], [3, 3, 3, 3, 3]]
mid = midpoint_responding(data, scale_min=1, scale_max=5)  # [0.2, 1.0]
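
Both the extreme and the midpoint proportions reduce to simple counting. The sketch below uses hypothetical function names and assumes the midpoint is the arithmetic center of the scale.

```python
def extreme_proportion(row, scale_min, scale_max):
    """Share of responses at either scale endpoint."""
    return sum(1 for v in row if v in (scale_min, scale_max)) / len(row)

def midpoint_proportion(row, scale_min, scale_max, tolerance=0.0):
    """Share of responses within `tolerance` of the scale midpoint."""
    midpoint = (scale_min + scale_max) / 2
    return sum(1 for v in row if abs(v - midpoint) <= tolerance) / len(row)

print(extreme_proportion([1, 5, 1, 5, 3], 1, 5))   # 0.8
print(midpoint_proportion([1, 2, 3, 4, 5], 1, 5))  # 0.2
print(midpoint_proportion([3, 3, 3, 3, 3], 1, 5))  # 1.0
```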

`response_pattern(x, scale_min=None, scale_max=None)`

Returns multiple response style indices at once.

from ier import response_pattern

data = [[1, 2, 3, 4, 5], [3, 3, 3, 3, 3]]
patterns = response_pattern(data, scale_min=1, scale_max=5)
# Returns dict with: extreme, midpoint, acquiescence, variability

Statistical Outlier Detection

`mahad(x, flag=False, confidence=0.95, na_rm=False, method='chi2')`

Computes Mahalanobis distance for multivariate outlier detection.

from ier import mahad, mahad_summary

data = [[1, 2, 3], [2, 3, 4], [3, 4, 5], [10, 10, 10]]
distances = mahad(data)
distances, flags = mahad(data, flag=True, confidence=0.95)

# Methods: 'chi2', 'iqr', 'zscore'
distances, flags = mahad(data, flag=True, method='iqr')
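
To show what the distance measures, the sketch below computes squared Mahalanobis distances for the two-item case, where the covariance matrix can be inverted in closed form. It is a simplified illustration with a hypothetical function name; the general case needs a full matrix inverse, and the flagging step is not covered here.

```python
def mahalanobis_2d_sketch(points):
    """Squared Mahalanobis distance of each (x, y) point from the sample centroid."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    # Sample covariance matrix entries (n - 1 denominator)
    sxx = sum((x - mean_x) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - mean_y) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points) / (n - 1)
    det = sxx * syy - sxy ** 2
    if det == 0:
        raise ValueError("covariance matrix is singular")
    distances = []
    for x, y in points:
        dx, dy = x - mean_x, y - mean_y
        # d^2 = [dx dy] S^-1 [dx dy]^T with the 2x2 inverse written out
        distances.append((syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det)
    return distances

points = [(1, 2), (2, 1), (3, 4), (4, 3), (10, 2)]  # last point is far from the rest
d2 = mahalanobis_2d_sketch(points)
print(d2.index(max(d2)))  # 4
```

The singular-covariance check matters in practice: when items are perfectly collinear, or there are fewer respondents than items, the distance is undefined.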

`lz(x, difficulty=None, discrimination=None, theta=None, model='2pl')`

Standardized log-likelihood (lz) person-fit statistic based on Item Response Theory. Large negative values (for example, below -1.96) indicate aberrant response patterns.

from ier import lz, lz_flag

# Binary response data (0/1)
data = [
    [1, 1, 1, 0, 0, 0],  # Normal pattern
    [0, 0, 0, 1, 1, 1],  # Aberrant pattern (fails easy, passes hard)
]
scores = lz(data)  # Negative = suspicious
scores, flags = lz_flag(data, threshold=-1.96)

# Use 1PL (Rasch) model
scores = lz(data, model='1pl')

# Provide custom item parameters
scores = lz(data, difficulty=[-1, -0.5, 0, 0.5, 1, 1.5])
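
The statistic itself can be written in a few lines once item parameters and an ability value are given. The sketch below follows the standard lz definition (observed log-likelihood minus its expectation, divided by its standard deviation) under a 2PL model. The function name is hypothetical, theta is supplied by the caller rather than estimated, and how the package estimates missing parameters is not shown.

```python
from math import exp, log, sqrt

def lz_sketch(responses, difficulty, theta=0.0, discrimination=None):
    """Standardized log-likelihood of a 0/1 response vector under a 2PL model."""
    if discrimination is None:
        discrimination = [1.0] * len(responses)  # equal slopes, as in a 1PL model
    observed = expected = variance = 0.0
    for x, a, b in zip(responses, discrimination, difficulty):
        p = 1.0 / (1.0 + exp(-a * (theta - b)))  # probability of a correct answer
        q = 1.0 - p
        observed += x * log(p) + (1 - x) * log(q)
        expected += p * log(p) + q * log(q)
        variance += p * q * log(p / q) ** 2
    return (observed - expected) / sqrt(variance)

difficulty = [-1, -0.5, 0, 0.5, 1, 1.5]
print(lz_sketch([1, 1, 1, 0, 0, 0], difficulty))  # positive: fits the model
print(lz_sketch([0, 0, 0, 1, 1, 1], difficulty))  # strongly negative: aberrant
```

Note that the variance term is zero when every item's difficulty equals theta, so a production implementation would need to guard against division by zero.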

Response Time Indices

`response_time(times, metric='median')`

Computes response time statistics per person.

from ier import response_time, response_time_flag, response_time_consistency

times = [[2.1, 3.4, 2.8], [0.5, 0.4, 0.6], [2.5, 2.3, 2.7]]

avg_times = response_time(times, metric='mean')
med_times = response_time(times, metric='median')
min_times = response_time(times, metric='min')

# Flag fast responders
flags = response_time_flag(times, threshold=1.0)

# Coefficient of variation (low = suspiciously uniform)
cv = response_time_consistency(times)
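
These statistics are simple to reproduce with the standard library. In the sketch below the function names are hypothetical, flagging is assumed to compare the median time against the threshold, and the coefficient of variation uses the sample standard deviation; the package may use different conventions.

```python
from statistics import mean, median, stdev

def is_fast_responder(times, threshold):
    """Flag a respondent whose median item time falls below the threshold."""
    return median(times) < threshold

def coefficient_of_variation(times):
    """Standard deviation of item times relative to their mean."""
    return stdev(times) / mean(times)

times = [[2.1, 3.4, 2.8], [0.5, 0.4, 0.6], [2.5, 2.3, 2.7]]
print([median(t) for t in times])                  # [2.8, 0.5, 2.5]
print([is_fast_responder(t, 1.0) for t in times])  # [False, True, False]
print([round(coefficient_of_variation(t), 2) for t in times])
```

The median is less sensitive than the mean to a single long pause, which is why it is a common default for per-person response time.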

Composite Index

`composite(x, indices=None, method='mean', standardize=True)`

Combines multiple IER indices into a single composite score. Higher scores indicate greater likelihood of careless responding.

from ier import composite, composite_flag, composite_summary

data = [
    [1, 2, 3, 4, 5, 4, 3, 2, 1, 2],  # Normal
    [3, 3, 3, 3, 3, 3, 3, 3, 3, 3],  # Straightliner
]

# Default: combines IRV, longstring, Mahalanobis, psychsyn, person-total
scores = composite(data)

# Select specific indices
scores = composite(data, indices=['irv', 'longstring', 'mahad'])

# Different combination methods
scores = composite(data, method='sum')   # Sum of z-scores
scores = composite(data, method='max')   # Maximum z-score

# Flag careless responders
scores, flags = composite_flag(data, threshold=1.5)
scores, flags = composite_flag(data, percentile=95.0)

# Detailed summary with individual index scores
summary = composite_summary(data)
print(summary['indices_used'])  # ['irv', 'longstring', 'mahad', ...]
print(summary['indices'])       # Dict of individual index scores
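
The combination step can be pictured as follows: each index is standardized across respondents, oriented so that higher means more careless, and then averaged. The sketch uses only two indices (IRV and longest run) and hypothetical helper names; the package's choice of indices, orientation, and standardization may differ.

```python
from itertools import groupby
from statistics import mean, pstdev, stdev

def zscores(values):
    """Standardize values across respondents; all zeros if there is no spread."""
    center, spread = mean(values), pstdev(values)
    return [0.0 if spread == 0 else (v - center) / spread for v in values]

def composite_sketch(data):
    """Mean of z-scored IRV (sign-flipped) and z-scored longest run."""
    irv_scores = [stdev(row) for row in data]
    longest_runs = [max(len(list(g)) for _, g in groupby(row)) for row in data]
    z_irv = [-z for z in zscores(irv_scores)]  # low variability = more careless
    z_long = zscores(longest_runs)             # long runs = more careless
    return [mean(pair) for pair in zip(z_irv, z_long)]

data = [
    [1, 2, 3, 4, 5, 4],
    [2, 3, 4, 5, 4, 3],
    [3, 3, 3, 3, 3, 3],  # straightliner
]
scores = composite_sketch(data)
print(scores.index(max(scores)))  # 2
```

Flipping the sign of IRV before averaging is the key design point: indices only combine meaningfully once they all point in the same direction.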

Working with DataFrames

The package works with pandas and polars DataFrames:

import pandas as pd
import polars as pl
from ier import irv

# Pandas
df_pandas = pd.DataFrame([[1, 2, 3], [4, 5, 6]])
scores = irv(df_pandas)

# Polars
df_polars = pl.DataFrame([[1, 2, 3], [4, 5, 6]], orient="row")  # each inner list is one participant
scores = irv(df_polars)

Handling Missing Data

Most functions accept NaN values; those with an `na_rm` parameter can exclude missing responses from the calculation:

import numpy as np
from ier import irv, mahad

data = np.array([
    [1, 2, np.nan, 4],
    [np.nan, 2, 3, 4],
    [1, 2, 3, 4]
])

irv_scores = irv(data, na_rm=True)
mahad_scores = mahad(data, na_rm=True)

Contributing

Contributions are welcome! Please open an issue first to discuss changes.

License

MIT License - see LICENSE for details.

Citation

@software{ier2026,
  title={IER: Python package for detecting Insufficient Effort Responding},
  author={Lyons, Cameron},
  year={2026},
  url={https://github.com/Cameron-Lyons/ier}
}

References

Curran, P. G. (2016). Methods for the detection of carelessly invalid responses in survey data. Journal of Experimental Social Psychology, 66, 4-19.
Dunn, A. M., Heggestad, E. D., Shanock, L. R., & Theilgard, N. (2018). Intra-individual response variability as an indicator of insufficient effort responding. Journal of Business and Psychology, 33(1), 105-121.

Release history

1.6.2 (Feb 20, 2026)
1.6.0 (Feb 10, 2026)
1.5.0 (Jan 29, 2026)
1.4.2 (Jan 22, 2026), this version
1.4.1 (Jan 22, 2026)
1.4.0 (Jan 12, 2026)
1.3.0 (Jan 12, 2026)
1.2.1 (Jan 10, 2026)
1.2.0 (Jan 9, 2026)
1.1.4 (Jan 2, 2026)
1.1.2 (Jan 2, 2026)

Download files

Source Distribution

insufficient_effort-1.4.2.tar.gz (36.3 kB)

Uploaded Jan 22, 2026 Source

Built Distribution

insufficient_effort-1.4.2-py3-none-any.whl (35.9 kB)

Uploaded Jan 22, 2026 Python 3

File details

Details for the file insufficient_effort-1.4.2.tar.gz.

File metadata

Download URL: insufficient_effort-1.4.2.tar.gz
Upload date: Jan 22, 2026
Size: 36.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for insufficient_effort-1.4.2.tar.gz
Algorithm	Hash digest
SHA256	`da4151aa82c8a871eef1470a59b4182aaa2050eaf20844a861edf5380066ccc3`
MD5	`7ad99dff1ff76f61f50bdb872098fb30`
BLAKE2b-256	`0e62ea287596749f3f420b66868bb42bfdc073ac4e05fcfbdb5ede57956f03af`

Provenance

The following attestation bundles were made for insufficient_effort-1.4.2.tar.gz:

Publisher: python-publish.yml on Cameron-Lyons/ier

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: insufficient_effort-1.4.2.tar.gz
- Subject digest: da4151aa82c8a871eef1470a59b4182aaa2050eaf20844a861edf5380066ccc3
- Sigstore transparency entry: 844591524
- Sigstore integration time: Jan 22, 2026
Source repository:
- Permalink: Cameron-Lyons/ier@b89946ea01956c108e2a6de9d91c7396eda1f258
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Cameron-Lyons
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@b89946ea01956c108e2a6de9d91c7396eda1f258
- Trigger Event: push

File details

Details for the file insufficient_effort-1.4.2-py3-none-any.whl.

File metadata

Download URL: insufficient_effort-1.4.2-py3-none-any.whl
Upload date: Jan 22, 2026
Size: 35.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for insufficient_effort-1.4.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`34a6eea513fea277b9ee236e2e45dda306a50dace83644cd1301b6ebd84c5312`
MD5	`10739cb9828f0b6195043df185054698`
BLAKE2b-256	`93b37829987c77d4803a2ecbc2719053396a4906b414400ba4e5498d7bb4c4dc`

Provenance

The following attestation bundles were made for insufficient_effort-1.4.2-py3-none-any.whl:

Publisher: python-publish.yml on Cameron-Lyons/ier

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: insufficient_effort-1.4.2-py3-none-any.whl
- Subject digest: 34a6eea513fea277b9ee236e2e45dda306a50dace83644cd1301b6ebd84c5312
- Sigstore transparency entry: 844591529
- Sigstore integration time: Jan 22, 2026
Source repository:
- Permalink: Cameron-Lyons/ier@b89946ea01956c108e2a6de9d91c7396eda1f258
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Cameron-Lyons
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@b89946ea01956c108e2a6de9d91c7396eda1f258
- Trigger Event: push
