Skip to main content

Utilities for design research analysis workflows

Project description

design-research-analysis

CI Coverage Examples Passing Public API In Examples Docs

[!IMPORTANT] Current monthly release: Mellon Metrics - May 2026
Due: May 1, 2026
Tracks: April 2026 work

design-research-analysis is the unified-table analysis layer in the cmudrc design research ecosystem.

It provides typed, reusable workflows for sequence, language, embedding-map, and statistical analysis over recurring event logs.

Overview

This package centers on reproducible analysis workflows with a small top-level API:

  • Unified-table coercion, validation, and mapper-based derived columns
  • Dataset profiling, schema checks, and codebook generation
  • Sequence modeling (Markov chains, discrete HMM, Gaussian HMM)
  • Language analysis (semantic convergence trajectories, topic modeling, sentiment scoring)
  • Embedding maps (PCA, t-SNE, UMAP, PaCMAP, TriMap) with clustering, comparison, and trajectory-plotting helpers
  • Statistical wrappers (group comparisons, OLS regression, mixed-effects models, nonparametrics, and power)
  • Runtime provenance capture for reproducibility manifests
  • A thin CLI for deterministic pipeline runs

Quickstart

Requires Python 3.12+. Maintainer workflows target Python 3.12 (.python-version).

Install from PyPI:

python -m pip install --upgrade pip
pip install design-research-analysis

Common install profiles:

pip install "design-research-analysis[seq]"
pip install "design-research-analysis[lang,embeddings]"
pip install "design-research-analysis[maps]"
pip install "design-research-analysis[stats,data]"
pip install "design-research-analysis[all]"

For contributor workflows:

python -m venv .venv
source .venv/bin/activate
make dev
make test

Run a compact end-to-end example:

PYTHONPATH=src python examples/basic_usage.py

For dependency profiles and release-check guidance, see Dependencies and Extras.

CLI

The package installs a design-research-analysis CLI:

design-research-analysis validate-table --input data/events.csv --summary-json artifacts/validate.json
design-research-analysis run-sequence --input data/events.csv --summary-json artifacts/sequence.json --mode markov
design-research-analysis run-language --input data/events.csv --summary-json artifacts/language.json --trajectory-csv artifacts/language_trajectory.csv
design-research-analysis run-embedding-maps --input data/events.csv --summary-json artifacts/embedding_maps.json --map-csv artifacts/embedding_maps.csv
design-research-analysis run-stats --input data/events.csv --summary-json artifacts/stats.json --mode regression --x-columns x1,x2 --y-column y

The Python API can start from files too at the main ingestion points, for example coerce_unified_table("data/events.csv") and profile_dataframe("data/events.csv").

Examples

Start with examples/README.md for runnable scripts across all analysis families.

Docs

See the published documentation for quickstart, workflow guidance, schema details, CLI reference, and API docs.

Build docs locally with:

make docs

Public API

The supported public surface is whatever is exported from design_research_analysis.__all__.

Top-level exports include:

  • Package metadata: __version__
  • Table contracts: UnifiedTableConfig, UnifiedTableValidationReport, coerce_unified_table, derive_columns, validate_unified_table
  • Sequence: fit_markov_chain_from_table, fit_discrete_hmm_from_table, fit_text_gaussian_hmm_from_table, decode_hmm, plotting helpers, and result types
  • Language: compute_language_convergence, compute_semantic_distance_trajectory, fit_topic_model, score_sentiment
  • Embedding maps: embed_records, build_embedding_map, cluster_embedding_map, compare_embedding_maps, plot_embedding_map, plot_embedding_map_grid
  • Statistics: compare_groups, fit_regression, fit_mixed_effects, permutation_test, bootstrap_ci, power helpers
  • Dataset + runtime: profile_dataframe, validate_dataframe, generate_codebook, capture_run_context, attach_provenance, write_run_manifest

Contributing

Contribution workflow and validation gates are documented in CONTRIBUTING.md.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

design_research_analysis-0.1.1.tar.gz (95.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

design_research_analysis-0.1.1-py3-none-any.whl (72.1 kB view details)

Uploaded Python 3

File details

Details for the file design_research_analysis-0.1.1.tar.gz.

File metadata

  • Download URL: design_research_analysis-0.1.1.tar.gz
  • Upload date:
  • Size: 95.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for design_research_analysis-0.1.1.tar.gz
Algorithm Hash digest
SHA256 3eadc62fb69049eae4a6d9c8a6bcdbcc2a18155117114c73834e8415742904b0
MD5 3e56f195856bf9b8fa5657569695ec2f
BLAKE2b-256 7088fa104335be745904d78964bbacc872a290955801e4e255bb40649c14c1d2

See more details on using hashes here.

Provenance

The following attestation bundles were made for design_research_analysis-0.1.1.tar.gz:

Publisher: workflow.yml on cmudrc/design-research-analysis

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file design_research_analysis-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for design_research_analysis-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ab713585902526a2664e980e58f7bc4636a8e6e80cd361ba89d73b191a4afda0
MD5 e538df103cedecec7685dcf330c2fe85
BLAKE2b-256 8433b625c798d00a1b0b3c556a6bde452c9751b0f0e26d08a090c9b8eaf645aa

See more details on using hashes here.

Provenance

The following attestation bundles were made for design_research_analysis-0.1.1-py3-none-any.whl:

Publisher: workflow.yml on cmudrc/design-research-analysis

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page