Utilities for design research analysis workflows
Project description
design-research-analysis
[!IMPORTANT] Current monthly release: Mellon Metrics - May 2026
Due: May 1, 2026
Tracks: April 2026 work
design-research-analysis is the unified-table analysis layer in the cmudrc design research ecosystem.
It provides typed, reusable workflows for sequence, language, embedding-map, and statistical analysis over recurring event logs.
Overview
This package centers on reproducible analysis workflows with a small top-level API:
- Unified-table coercion, validation, and mapper-based derived columns
- Dataset profiling, schema checks, and codebook generation
- Sequence modeling (Markov chains, discrete HMM, Gaussian HMM)
- Language analysis (semantic convergence trajectories, topic modeling, sentiment scoring)
- Embedding maps (PCA, t-SNE, UMAP, PaCMAP, TriMap) with clustering, comparison, and trajectory-plotting helpers
- Statistical wrappers (group comparisons, OLS regression, mixed-effects models, nonparametrics, and power)
- Runtime provenance capture for reproducibility manifests
- A thin CLI for deterministic pipeline runs
Quickstart
Requires Python 3.12+.
Maintainer workflows target Python 3.12 (.python-version).
Install from PyPI:
python -m pip install --upgrade pip
pip install design-research-analysis
Common install profiles:
pip install "design-research-analysis[seq]"
pip install "design-research-analysis[lang,embeddings]"
pip install "design-research-analysis[maps]"
pip install "design-research-analysis[stats,data]"
pip install "design-research-analysis[all]"
For contributor workflows:
python -m venv .venv
source .venv/bin/activate
make dev
make test
Run a compact end-to-end example:
PYTHONPATH=src python examples/basic_usage.py
For dependency profiles and release-check guidance, see Dependencies and Extras.
CLI
The package installs a design-research-analysis CLI:
design-research-analysis validate-table --input data/events.csv --summary-json artifacts/validate.json
design-research-analysis run-sequence --input data/events.csv --summary-json artifacts/sequence.json --mode markov
design-research-analysis run-language --input data/events.csv --summary-json artifacts/language.json --trajectory-csv artifacts/language_trajectory.csv
design-research-analysis run-embedding-maps --input data/events.csv --summary-json artifacts/embedding_maps.json --map-csv artifacts/embedding_maps.csv
design-research-analysis run-stats --input data/events.csv --summary-json artifacts/stats.json --mode regression --x-columns x1,x2 --y-column y
The Python API can start from files too at the main ingestion points, for example
coerce_unified_table("data/events.csv") and profile_dataframe("data/events.csv").
Examples
Start with examples/README.md for runnable scripts across all analysis families.
Docs
See the published documentation for quickstart, workflow guidance, schema details, CLI reference, and API docs.
Build docs locally with:
make docs
Public API
The supported public surface is whatever is exported from design_research_analysis.__all__.
Top-level exports include:
- Package metadata:
__version__ - Table contracts:
UnifiedTableConfig,UnifiedTableValidationReport,coerce_unified_table,derive_columns,validate_unified_table - Sequence:
fit_markov_chain_from_table,fit_discrete_hmm_from_table,fit_text_gaussian_hmm_from_table,decode_hmm, plotting helpers, and result types - Language:
compute_language_convergence,compute_semantic_distance_trajectory,fit_topic_model,score_sentiment - Embedding maps:
embed_records,build_embedding_map,cluster_embedding_map,compare_embedding_maps,plot_embedding_map,plot_embedding_map_grid - Statistics:
compare_groups,fit_regression,fit_mixed_effects,permutation_test,bootstrap_ci, power helpers - Dataset + runtime:
profile_dataframe,validate_dataframe,generate_codebook,capture_run_context,attach_provenance,write_run_manifest
Contributing
Contribution workflow and validation gates are documented in CONTRIBUTING.md.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file design_research_analysis-0.1.1.tar.gz.
File metadata
- Download URL: design_research_analysis-0.1.1.tar.gz
- Upload date:
- Size: 95.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3eadc62fb69049eae4a6d9c8a6bcdbcc2a18155117114c73834e8415742904b0
|
|
| MD5 |
3e56f195856bf9b8fa5657569695ec2f
|
|
| BLAKE2b-256 |
7088fa104335be745904d78964bbacc872a290955801e4e255bb40649c14c1d2
|
Provenance
The following attestation bundles were made for design_research_analysis-0.1.1.tar.gz:
Publisher:
workflow.yml on cmudrc/design-research-analysis
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
design_research_analysis-0.1.1.tar.gz -
Subject digest:
3eadc62fb69049eae4a6d9c8a6bcdbcc2a18155117114c73834e8415742904b0 - Sigstore transparency entry: 1203974807
- Sigstore integration time:
-
Permalink:
cmudrc/design-research-analysis@42dff15b254365dfc4498e4348259642543ad239 -
Branch / Tag:
refs/tags/v0.1.1 - Owner: https://github.com/cmudrc
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
workflow.yml@42dff15b254365dfc4498e4348259642543ad239 -
Trigger Event:
release
-
Statement type:
File details
Details for the file design_research_analysis-0.1.1-py3-none-any.whl.
File metadata
- Download URL: design_research_analysis-0.1.1-py3-none-any.whl
- Upload date:
- Size: 72.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ab713585902526a2664e980e58f7bc4636a8e6e80cd361ba89d73b191a4afda0
|
|
| MD5 |
e538df103cedecec7685dcf330c2fe85
|
|
| BLAKE2b-256 |
8433b625c798d00a1b0b3c556a6bde452c9751b0f0e26d08a090c9b8eaf645aa
|
Provenance
The following attestation bundles were made for design_research_analysis-0.1.1-py3-none-any.whl:
Publisher:
workflow.yml on cmudrc/design-research-analysis
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
design_research_analysis-0.1.1-py3-none-any.whl -
Subject digest:
ab713585902526a2664e980e58f7bc4636a8e6e80cd361ba89d73b191a4afda0 - Sigstore transparency entry: 1203974820
- Sigstore integration time:
-
Permalink:
cmudrc/design-research-analysis@42dff15b254365dfc4498e4348259642543ad239 -
Branch / Tag:
refs/tags/v0.1.1 - Owner: https://github.com/cmudrc
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
workflow.yml@42dff15b254365dfc4498e4348259642543ad239 -
Trigger Event:
release
-
Statement type: