Skip to main content

High-performance Python package for analyzing heterodyne scattering in X-ray Photon Correlation Spectroscopy (XPCS) with 14-parameter two-component model, automatic migration from legacy configs, and comprehensive parameter validation (v1.0.0)

Project description

Heterodyne Scattering Analysis Package

License: MIT Python PyPI version Documentation NumPy SciPy Numba DOI Research

⚠️ Dataset Size Limitation: This heterodyne analysis package is not recommended for large datasets exceeding 4M data points due to over-subsampling effects and reduced performance without adequate subsampling. For optimal results, use datasets with fewer than 4M data points or enable aggressive subsampling configurations.

Overview

heterodyne-analysis is a research-grade Python package for analyzing two-component heterodyne X-ray Photon Correlation Spectroscopy (XPCS) under nonequilibrium conditions. This package implements the 14-parameter heterodyne scattering model from He et al. PNAS 2024 (Equation S-95), which characterizes transport properties through separate reference and sample field correlations.

The package provides comprehensive analysis of nonequilibrium dynamics by modeling distinct transport behaviors in reference and sample components, with time-dependent fraction mixing that captures complex phenomena in soft matter systems, biological fluids, colloids, and active matter.

Key Features

Analysis Capabilities

  • Heterodyne Scattering Model (14 parameters): Two-component heterodyne with separate reference and sample correlations
    • Reference transport (3 params): $D_{0,\text{ref}}$, $\alpha_{\text{ref}}$, $D_{\text{offset,ref}}$
    • Sample transport (3 params): $D_{0,\text{sample}}$, $\alpha_{\text{sample}}$, $D_{\text{offset,sample}}$
    • Velocity (3 params): $v_0$, $\beta$, $v_{\text{offset}}$
    • Fraction (4 params): $f_0$, $f_1$, $f_2$, $f_3$
    • Flow angle (1 param): $\phi_0$
  • Multiple optimization methods: Classical (Nelder-Mead, Powell), Robust (Wasserstein DRO, Scenario-based, Ellipsoidal)
  • Parameter validation: Physical constraints ensure valid heterodyne parameters

Performance

  • Numba JIT compilation: 3-5x speedup for core calculations
  • Vectorized operations: Optimized NumPy array processing
  • Memory optimization: Ellipsoidal optimization with 90% memory limit for large datasets (8M+ data points)
  • Smart caching: Intelligent data caching with automatic dimension validation

Data Format Support

  • APS and APS-U formats: Auto-detection and loading of both old and new synchrotron data formats
  • Cached data compatibility: Automatic cache dimension adjustment and validation
  • Configuration templates: Comprehensive templates with documented subsampling strategies

Validation and Quality

  • Statistical validation: Cross-validation and bootstrap analysis for parameter reliability
  • Experimental validation: Tested at synchrotron facilities (APS Sector 8-ID-I)
  • Regression testing: Comprehensive test suite with performance benchmarks

Quick Start

# Install
pip install heterodyne-analysis[all]

# Create heterodyne configuration (14 parameters)
cp heterodyne/config/template.json my_config.json
# Edit my_config.json with your experimental parameters

# Run analysis
heterodyne --config my_config.json --method all

# Run robust optimization for noisy data
heterodyne --config my_config.json --method robust

Quick Reference (v1.0.0)

Most Common Workflows

1. First-time Setup:

# Create virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install with all dependencies
pip install heterodyne-analysis[all]

# Generate configuration template
heterodyne-config --sample my_experiment

2. Standard Analysis (Clean Data):

# Edit my_config.json with your experimental parameters
# Then run classical optimization
heterodyne --config my_config.json --method classical

# Results saved to: ./heterodyne_results/

3. Robust Analysis (Noisy Data):

# For data with outliers or measurement noise
heterodyne --config my_config.json --method robust --verbose

# Compare both methods
heterodyne --config my_config.json --method all

4. Python API Usage:

import numpy as np
import json
from heterodyne.analysis.core import HeterodyneAnalysisCore
from heterodyne.optimization.classical import ClassicalOptimizer
from heterodyne.data.xpcs_loader import load_xpcs_data

# Load config and initialize
config_file = "my_config.json"
with open(config_file, 'r') as f:
    config = json.load(f)
core = HeterodyneAnalysisCore(config)

# Load experimental XPCS data using the data loader
c2_data, time_length, phi_angles, num_angles = load_xpcs_data(config_file)

# Run optimization
optimizer = ClassicalOptimizer(core, config)
params, results = optimizer.run_classical_optimization_optimized(
    phi_angles=phi_angles,
    c2_experimental=c2_data
)

# Extract parameters (14 total)
D0_ref, alpha_ref, D_offset_ref = params[0:3]
D0_sample, alpha_sample, D_offset_sample = params[3:6]
v0, beta, v_offset = params[6:9]
f0, f1, f2, f3 = params[9:13]
phi0 = params[13]

print(f"D₀_ref = {D0_ref:.3e} Ų/s")
print(f"D₀_sample = {D0_sample:.3e} Ų/s")
print(f"χ² = {results.chi_squared:.6e}")

5. Troubleshooting:

# Enable verbose logging for debugging
heterodyne --config my_config.json --verbose

# Run test suite to verify installation
pytest -v -m "not slow"

# Check frame counting (v1.0.0 critical fix)
pytest heterodyne/tests/test_time_length_calculation.py -v

Installation

Standard Installation

pip install heterodyne-analysis[all]

Research Environment Setup

# Create isolated research environment
conda create -n heterodyne-research python=3.12
conda activate heterodyne-research

# Install with all scientific dependencies
pip install heterodyne-analysis[all]

# For development and method extension
git clone https://github.com/imewei/heterodyne.git
cd heterodyne
pip install -e .[dev]

Optional Dependencies

  • Performance: pip install heterodyne-analysis[performance] (numba, psutil, performance monitoring)
  • Robust optimization: pip install heterodyne-analysis[robust] (cvxpy, gurobipy, mosek)
  • Development: pip install heterodyne-analysis[dev] (testing, linting, documentation tools)

High-Performance Configuration

# Optimize for computational performance
export OMP_NUM_THREADS=8
export HETERODYNE_PERFORMANCE_MODE=1

# Enable advanced optimization (requires license)
pip install heterodyne-analysis[gurobi]

Commands

Main Analysis Command

heterodyne [OPTIONS]

Key Options:

  • --method {classical,robust,all} - Analysis method (default: classical)
  • --config CONFIG - Configuration file (default: ./heterodyne_config.json)
  • --output-dir DIR - Output directory (default: ./heterodyne_results)
  • --verbose - Debug logging
  • --quiet - File logging only
  • --plot-experimental-data - Generate data validation plots
  • --plot-simulated-data - Plot theoretical correlations

Examples:

# Basic analysis
heterodyne --method classical
heterodyne --method robust --verbose
heterodyne --method all

# Data validation
heterodyne --plot-experimental-data
heterodyne --plot-simulated-data --contrast 1.5 --offset 0.1

# Custom configuration
heterodyne --config experiment.json --output-dir ./results

Configuration Generator

heterodyne-config [OPTIONS]

Options:

  • --output OUTPUT - Output file (default: my_config.json)
  • --sample SAMPLE - Sample name
  • --author AUTHOR - Author name
  • --experiment EXPERIMENT - Experiment description

Examples:

# Generate heterodyne config
heterodyne-config

# With metadata
heterodyne-config --sample protein --author "Your Name"

Shell Completion

The package supports shell completion for bash, zsh, and fish shells:

# For bash
heterodyne --help  # Shows all available options

# Tab completion works for:
heterodyne --method <TAB>     # classical, robust, all
heterodyne --config <TAB>     # Completes file paths
heterodyne --output-dir <TAB> # Completes directory paths

Note: Shell completion is built into the CLI and works automatically in most modern shells. For advanced completion features, you may need to install optional completion dependencies.

Scientific Background

Heterodyne Scattering Model

The package implements the 14-parameter heterodyne scattering model from He et al. PNAS 2024 Equation S-95. This model analyzes two-component heterodyne X-ray photon correlation spectroscopy (XPCS) with separate reference and sample field correlations under nonequilibrium conditions.

Mathematical Foundation

Heterodyne Correlation Function (Equation S-95):

The two-time correlation function for heterodyne scattering:

$$ c_2(\vec{q}, t_1, t_2, \phi) = 1 + \frac{\beta}{f^2} \left[ \left[x_r(t_1) x_r(t_2)\right]^2 \exp\left(-q^2 \int_{t_1}^{t_2} J_r(t) , dt\right) + \left[x_s(t_1) x_s(t_2)\right]^2 \exp\left(-q^2 \int_{t_1}^{t_2} J_s(t) , dt\right) + 2 x_r(t_1) x_r(t_2) x_s(t_1) x_s(t_2) \exp\left(-\frac{1}{2} q^2 \int_{t_1}^{t_2} [J_s(t) + J_r(t)] , dt\right) \cos\left[q \cos(\phi) \int_{t_1}^{t_2} v(t) , dt\right] \right] $$

where:

$$ f^2 = \left[x_s(t_1)^2 + x_r(t_1)^2\right] \left[x_s(t_2)^2 + x_r(t_2)^2\right] $$

Key Notation:

  • Two-time structure: Fractions evaluated at BOTH $t_1$ and $t_2$
    • $x_s(t_1)$, $x_s(t_2)$: Sample fraction at time 1 and time 2
    • $x_r(t_1) = 1 - x_s(t_1)$: Reference fraction at time 1
    • $x_r(t_2) = 1 - x_s(t_2)$: Reference fraction at time 2
  • Normalization: $f^2$ uses fractions at both times
  • Integrals: All integrals from $t_1$ to $t_2$
  • Transport coefficients: $J_r(t)$, $J_s(t)$ for reference and sample
  • Velocity: $v(t)$ for flow-induced decorrelation
  • Angles:
    • $\phi$ in equation represents relative angle between flow and scattering directions
    • Implementation: $\phi = \phi_0 - \phi_{\text{scattering}}$ (flow angle minus scattering angle)
    • $\phi_0$: Flow direction parameter (14th parameter)
  • Contrast: $\beta$ (implicit in experimental measurements)
  • Baseline: 1 (zero-delay limit)

Physical Components:

  1. Transport Coefficients (separate for reference and sample):

$$ J_r(t) = J_{0,\text{ref}} \times t^{\alpha_{\text{ref}}} + J_{\text{offset,ref}} $$

$$ J_s(t) = J_{0,\text{sample}} \times t^{\alpha_{\text{sample}}} + J_{\text{offset,sample}} $$

Note: Parameters labeled "D" in code are transport coefficients J. For equilibrium: $J = 6D$.

  1. Velocity Coefficient (shared between components):

$$ v(t) = v_0 \times t^\beta + v_{\text{offset}} $$

  1. Sample Fraction Function:

$$ x_s(t) = f_0 \times \exp\left[f_1 \times (t - f_2)\right] + f_3 $$

where $0 \leq x_s(t) \leq 1$, and $x_r(t) = 1 - x_s(t)$

14-Parameter Complete Reference

# Parameter Symbol Units
Reference Transport (3 parameters)
1 D₀_ref $J_{0,\text{ref}}$ Ų/s
2 α_ref $\alpha_{\text{ref}}$
3 D_offset_ref $J_{\text{offset,ref}}$ Ų/s
Sample Transport (3 parameters)
4 D₀_sample $J_{0,\text{sample}}$ Ų/s
5 α_sample $\alpha_{\text{sample}}$
6 D_offset_sample $J_{\text{offset,sample}}$ Ų/s
Velocity (3 parameters)
7 v₀ $v_0$ Å/s
8 β $\beta$
9 v_offset $v_{\text{offset}}$ Å/s
Fraction (4 parameters)
10 f₀ $f_0$
11 f₁ $f_1$ s⁻¹
12 f₂ $f_2$ s
13 f₃ $f_3$
Flow Angle (1 parameter)
14 φ₀ $\phi_0$ degrees

Parameter Implementation:

# Extract 14 parameters
D0_ref, alpha_ref, D_offset_ref = parameters[0:3]           # Reference transport
D0_sample, alpha_sample, D_offset_sample = parameters[3:6]  # Sample transport
v0, beta, v_offset = parameters[6:9]                        # Velocity
f0, f1, f2, f3 = parameters[9:13]                           # Fraction
phi0 = parameters[13]                                        # Flow angle

Key Physical Features

  • Independent transport dynamics: Reference and sample components exhibit different diffusive behaviors (separate $J_r$ and $J_s$)
  • Three correlation terms:
    • Reference autocorrelation: $[x_r(t_1) x_r(t_2)]^2 \exp(-q^2 \int J_r , dt)$
    • Sample autocorrelation: $[x_s(t_1) x_s(t_2)]^2 \exp(-q^2 \int J_s , dt)$
    • Cross-correlation: $2 x_r(t_1) x_r(t_2) x_s(t_1) x_s(t_2) \exp(-\frac{1}{2} q^2 \int [J_s + J_r] , dt) \cos[q \cos(\phi) \int v , dt]$
  • Cross-term velocity modulation: Cosine factor captures flow-induced decorrelation between components
  • Time-dependent mixing: Exponential fraction evolution $x_s(t)$ captures compositional changes
  • Power-law transport: Generalizes equilibrium diffusion to nonequilibrium regimes ($\alpha \neq 0$, superdiffusion/subdiffusion)
  • Two-time correlation: Full $c_2(t_1, t_2)$ matrix structure preserves nonequilibrium dynamics

Frame Counting Convention

Overview

The package uses 1-based inclusive frame counting in configuration files, which is then converted to 0-based Python array indices for processing.

Frame Counting Formula

time_length = end_frame - start_frame + 1  # Inclusive counting

Examples:

  • start_frame=1, end_frame=100time_length=100 (not 99!)
  • start_frame=401, end_frame=1000time_length=600 (not 599!)
  • start_frame=1, end_frame=1time_length=1 (single frame)

Convention Details

Config Convention (1-based, inclusive):

  • start_frame=1 means "start at first frame"
  • end_frame=100 means "include frame 100"
  • Both boundaries are inclusive: [start_frame, end_frame]

Python Slice Convention (0-based, exclusive end):

  • Internally converted using: python_start = start_frame - 1
  • python_end = end_frame (kept as-is for exclusive slice)
  • Array slice [python_start:python_end] gives exactly time_length elements

Example Conversion:

# Config values
start_frame = 401  # 1-based
end_frame = 1000   # 1-based

# Convert to Python indices
python_start = 400  # 0-based (401 - 1)
python_end = 1000   # 0-based, exclusive

# Slice gives correct number of frames
data_slice = full_data[:, 400:1000, 400:1000]  # 600 frames
time_length = 1000 - 401 + 1  # = 600 ✓

Cached Data Compatibility

Cache Filename Convention:

  • Cache files use config values: cached_c2_isotropic_frames_{start_frame}_{end_frame}.npz
  • Example: cached_c2_isotropic_frames_401_1000.npz contains 600 frames

Cache Dimension Validation: The analysis core automatically detects and adjusts for dimension mismatches:

# Automatic adjustment if cache dimensions differ
if c2_experimental.shape[1] != self.time_length:
    logger.info(f"Auto-adjusting time_length to match cached data")
    self.time_length = c2_experimental.shape[1]

Utility Functions

Two centralized utility functions ensure consistency:

from heterodyne.core.io_utils import calculate_time_length, config_frames_to_python_slice

# Calculate time_length from config frames
time_length = calculate_time_length(start_frame=401, end_frame=1000)
# Returns: 600

# Convert for Python array slicing
python_start, python_end = config_frames_to_python_slice(401, 1000)
# Returns: (400, 1000) for use in data[400:1000]

Conditional Angle Subsampling

Strategy

The package automatically preserves angular information when the number of available angles is small:

# Automatic angle preservation
if n_angles < 4:
    # Use all available angles to preserve angular information
    angle_subsample_size = n_angles
else:
    # Subsample for performance (default: 4 angles)
    angle_subsample_size = config.get("n_angles", 4)

Impact

  • Before fix: 2 angles → 1 angle (50% angular information loss)
  • After fix: 2 angles → 2 angles (100% preservation)
  • Time subsampling still applied: ~16x performance improvement with <10% χ² degradation

Configuration

All configuration templates include subsampling documentation:

{
  "subsampling": {
    "n_angles": 4,
    "n_time_points": 16,
    "comment": "Conditional: n_angles preserved when < 4 for angular info retention"
  }
}

Optimization Methods

Classical Methods

  1. Nelder-Mead Simplex: Derivative-free optimization for robust convergence
  2. Gurobi Quadratic Programming: High-performance commercial solver with trust region methods

Robust Optimization Framework

Advanced uncertainty-aware optimization for noisy experimental data:

  1. Distributionally Robust Optimization (DRO):

    • Wasserstein uncertainty sets for data distribution robustness
    • Optimal transport-based uncertainty quantification
  2. Scenario-Based Optimization:

    • Bootstrap resampling for statistical robustness
    • Monte Carlo uncertainty propagation
  3. Ellipsoidal Uncertainty Sets:

    • Bounded uncertainty with confidence ellipsoids
    • Analytical uncertainty bounds
    • Memory optimization: 90% limit for large datasets

Usage Guidelines:

  • Use --method robust for noisy data with outliers
  • Use --method classical for clean, low-noise data
  • Use --method all to run both and compare results

Python API

Basic Usage

import numpy as np
import json
from heterodyne.analysis.core import HeterodyneAnalysisCore
from heterodyne.optimization.classical import ClassicalOptimizer
from heterodyne.data.xpcs_loader import load_xpcs_data

# Load configuration file
config_file = "config_heterodyne.json"
with open(config_file, 'r') as f:
    config = json.load(f)

# Initialize analysis core
core = HeterodyneAnalysisCore(config)

# Load experimental XPCS data using config file
c2_data, time_length, phi_angles, num_angles = load_xpcs_data(config_file)

# Run classical optimization
classical = ClassicalOptimizer(core, config)
optimal_params, results = classical.run_classical_optimization_optimized(
    phi_angles=phi_angles,
    c2_experimental=c2_data
)

print(f"Optimal parameters: {optimal_params}")
print(f"Chi-squared: {results.chi_squared:.6e}")
print(f"Method: {results.best_method}")

Research Workflow

import numpy as np
import json
from heterodyne.analysis.core import HeterodyneAnalysisCore
from heterodyne.optimization.classical import ClassicalOptimizer
from heterodyne.optimization.robust import RobustHeterodyneOptimizer
from heterodyne.data.xpcs_loader import load_xpcs_data

# Load experimental configuration
config_file = "config_heterodyne.json"
with open(config_file, 'r') as f:
    config = json.load(f)

# Initialize analysis core
core = HeterodyneAnalysisCore(config)

# Load XPCS correlation data using config file
c2_data, time_length, phi_angles, num_angles = load_xpcs_data(config_file)

# Classical analysis for clean data
classical = ClassicalOptimizer(core, config)
classical_params, classical_results = classical.run_classical_optimization_optimized(
    phi_angles=phi_angles,
    c2_experimental=c2_data
)

# Robust analysis for noisy data
robust = RobustHeterodyneOptimizer(core, config)
robust_result_dict = robust.optimize(
    phi_angles=phi_angles,
    c2_experimental=c2_data,
    method="wasserstein",  # Options: "wasserstein", "scenario", "ellipsoidal"
    epsilon=0.1  # Uncertainty radius for DRO
)

# Extract results
robust_params = robust_result_dict['parameters']
robust_chi2 = robust_result_dict['chi_squared']

print(f"Classical D₀_ref: {classical_params[0]:.3e} Ų/s")
print(f"Classical χ²: {classical_results.chi_squared:.6e}")
print(f"\nRobust D₀_ref: {robust_params[0]:.3e} Ų/s")
print(f"Robust χ²: {robust_chi2:.6e}")

Performance Benchmarking

import time
import numpy as np
import json
from heterodyne.analysis.core import HeterodyneAnalysisCore
from heterodyne.optimization.classical import ClassicalOptimizer
from heterodyne.data.xpcs_loader import load_xpcs_data

# Load configuration
config_file = "config_heterodyne.json"
with open(config_file, 'r') as f:
    config = json.load(f)

# Initialize
core = HeterodyneAnalysisCore(config)
c2_data, time_length, phi_angles, num_angles = load_xpcs_data(config_file)

# Benchmark classical optimization
classical = ClassicalOptimizer(core, config)
start_time = time.perf_counter()
params, results = classical.run_classical_optimization_optimized(
    phi_angles=phi_angles,
    c2_experimental=c2_data
)
elapsed_time = time.perf_counter() - start_time

print(f"Classical optimization completed in {elapsed_time:.2f} seconds")
print(f"Chi-squared: {results.chi_squared:.6e}")
print(f"Best method: {results.best_method}")

Configuration

Creating Configurations

# Generate templates
heterodyne-config --sample protein_01
heterodyne-config --sample microgel

Analysis Mode

The package uses the 14-parameter heterodyne model for all analyses:

{
  "analysis_settings": {
    "model_description": "g₂ = heterodyne correlation with separate g₁_ref and g₁_sample field correlations (He et al. PNAS 2024 Eq. S-95). 14-parameter model: 3 reference transport + 3 sample transport + 3 velocity + 4 fraction + 1 flow angle"
  }
}

Subsampling Configuration

{
  "subsampling": {
    "n_angles": 4,
    "n_time_points": 16,
    "strategy": "conditional",
    "preserve_angular_info": true
  }
}

Performance Impact:

  • Time subsampling: ~16x speedup
  • Angle subsampling: Conditional based on available angles
  • Combined impact: 20-50x speedup with <10% χ² degradation

Data Formats and Standards

XPCS Correlation Data Format:

  • Time correlation functions: $c_2(q, \phi, t_1, t_2)$ as HDF5 or NumPy arrays
  • Scattering angles: $\phi$ values in degrees [0°, 360°)
  • Time delays: $\tau = t_2 - t_1$ in seconds
  • Wavevector magnitude: $q$ in Å⁻¹

Configuration Schema:

{
  "analysis_settings": {
    "angle_filtering": true,
    "optimization_method": "all"
  },
  "experimental_parameters": {
    "q_magnitude": 0.0045,
    "gap_height": 50000.0,
    "temperature": 293.15,
    "viscosity": 1.0e-3
  },
  "frame_settings": {
    "start_frame": 401,
    "end_frame": 1000,
    "time_length_comment": "Calculated as end_frame - start_frame + 1 = 600"
  },
  "optimization_bounds": {
    "comment": "Simplified example - see heterodyne/config/template.json for complete 14-parameter bounds",
    "D0_ref": [1.0, 1e6],
    "alpha_ref": [-2.0, 2.0],
    "D_offset_ref": [-100.0, 100.0],
    "D0_sample": [1.0, 1e6],
    "alpha_sample": [-2.0, 2.0],
    "D_offset_sample": [-100.0, 100.0],
    "v0": [-10.0, 10.0],
    "beta": [-2.0, 2.0],
    "v_offset": [-1.0, 1.0],
    "f0": [0.0, 1.0],
    "f1": [-1.0, 1.0],
    "f2": [0.0, 200.0],
    "f3": [0.0, 1.0],
    "phi0": [-360.0, 360.0]
  }
}

Output Structure

When running heterodyne --method all, the complete analysis produces a comprehensive results directory with all optimization methods:

heterodyne_results/
├── heterodyne_analysis_results.json    # Summary with all methods
├── run.log                           # Detailed execution log
│
├── classical/                        # Classical optimization results
│   ├── nelder_mead/                  # Nelder-Mead simplex method
│   │   ├── parameters.json           # Optimal parameters with metadata
│   │   ├── fitted_data.npz          # Fitted correlation functions + experimental metadata
│   │   ├── analysis_results_nelder_mead.json  # Complete results + chi-squared
│   │   ├── convergence_metrics.json  # Iterations, function evaluations, diagnostics
│   │   └── c2_heatmaps_phi_*.png    # Experimental vs fitted comparison plots
│   └── gurobi/                       # Gurobi quadratic programming (if available)
│       ├── parameters.json
│       ├── fitted_data.npz
│       ├── analysis_results_gurobi.json
│       ├── convergence_metrics.json
│       └── c2_heatmaps_phi_*.png
│
├── robust/                           # Robust optimization results
│   ├── wasserstein/                  # Distributionally Robust Optimization (DRO)
│   │   ├── parameters.json           # Robust optimal parameters
│   │   ├── fitted_data.npz          # Fitted correlations with uncertainty bounds
│   │   ├── analysis_results_wasserstein.json  # DRO results + uncertainty radius
│   │   ├── convergence_metrics.json  # Optimization convergence info
│   │   └── c2_heatmaps_phi_*.png    # Robust fit comparison plots
│   ├── scenario/                     # Scenario-based bootstrap optimization
│   │   ├── parameters.json
│   │   ├── fitted_data.npz
│   │   ├── analysis_results_scenario.json
│   │   ├── convergence_metrics.json
│   │   └── c2_heatmaps_phi_*.png
│   └── ellipsoidal/                  # Ellipsoidal uncertainty sets
│       ├── parameters.json
│       ├── fitted_data.npz
│       ├── analysis_results_ellipsoidal.json
│       ├── convergence_metrics.json
│       └── c2_heatmaps_phi_*.png
│
└── comparison_plots/                 # Method comparison visualizations
    ├── method_comparison_phi_*.png   # Classical vs Robust comparison
    └── parameter_comparison.png      # Parameter values across methods

Key Output Files

heterodyne_analysis_results.json: Main summary containing:

  • Analysis timestamp and methods run
  • Experimental parameters (q, dt, gap size, frames)
  • Optimization results for all methods:
    • classical_nelder_mead, classical_gurobi, classical_best
    • robust_wasserstein, robust_scenario, robust_ellipsoidal, robust_best

fitted_data.npz: NumPy compressed archive with:

  • Experimental metadata: wavevector_q, dt, stator_rotor_gap, start_frame, end_frame
  • Correlation data: c2_experimental, c2_theoretical_raw, c2_theoretical_scaled
  • Scaling parameters: contrast_params, offset_params
  • Quality metrics: residuals

analysis_results_{method}.json: Method-specific detailed results:

  • Optimized parameters with names
  • Chi-squared and reduced chi-squared values
  • Experimental metadata
  • Scaling parameters for each angle
  • Success status and timestamp

convergence_metrics.json: Optimization diagnostics:

  • Number of iterations
  • Function evaluations
  • Convergence message
  • Final chi-squared value

Performance

Environment Optimization

export OMP_NUM_THREADS=8
export HETERODYNE_PERFORMANCE_MODE=1

Optimizations

  • Numba JIT: 3-5x speedup for core calculations
  • Vectorized operations: Optimized array processing
  • Memory efficiency: Smart caching and allocation
  • Batch processing: Vectorized chi-squared calculation
  • Conditional subsampling: 20-50x speedup with minimal accuracy loss

Benchmarking Results

Performance Comparison (Intel Xeon, 8 cores):

Data Size Pure Python Numba JIT Speedup
100 points 2.3 s 0.7 s 3.3×
500 points 12.1 s 3.2 s 3.8×
1000 points 45.2 s 8.9 s 5.1×
5000 points 892 s 178 s 5.0×

Memory Optimization:

Dataset Size Before After Improvement
8M data points Memory error 90% limit success Enabled
4M data points 85% usage 75% usage 12% reduction

Testing

Quick Test Suite (Development)

# Fast test suite excluding slow tests (recommended for development)
pytest -v -m "not slow"

# Run frame counting regression tests (v1.0.0 formula)
pytest heterodyne/tests/test_time_length_calculation.py -v

# Run with coverage
pytest -v --cov=heterodyne --cov-report=html -m "not slow"

Comprehensive Test Suite (CI/CD)

# Full test suite including slow performance tests
pytest heterodyne/tests/ -v

# Performance benchmarks only
pytest heterodyne/tests/ -v -m performance

# Run with parallel execution for speed
pytest -v -n auto

Testing Guide

For comprehensive testing documentation including:

  • Frame counting convention (v1.0.0 changes)
  • Test markers and categorization
  • Temporary file management best practices
  • Writing new tests for v1.0.0

See TESTING.md for detailed testing guidelines.

Citation

If you use this software in your research, please cite the original paper:

@article{he2024transport,
  title={Transport coefficient approach for characterizing nonequilibrium dynamics in soft matter},
  author={He, Hongrui and Liang, Hao and Chu, Miaoqi and Jiang, Zhang and de Pablo, Juan J and Tirrell, Matthew V and Narayanan, Suresh and Chen, Wei},
  journal={Proceedings of the National Academy of Sciences},
  volume={121},
  number={31},
  pages={e2401162121},
  year={2024},
  publisher={National Academy of Sciences},
  doi={10.1073/pnas.2401162121}
}

For the software package:

@software{heterodyne_analysis,
  title={heterodyne-analysis: High-performance XPCS analysis with robust optimization},
  author={Chen, Wei and He, Hongrui},
  year={2024-2025},
  url={https://github.com/imewei/heterodyne_analysis},
  version={1.0.0},
  institution={Argonne National Laboratory}
}

Development

Development setup:

git clone https://github.com/imewei/heterodyne_analysis.git
cd heterodyne
pip install -e .[dev]

# Run tests
pytest heterodyne/tests/ -v

# Code quality
ruff check heterodyne/
ruff format heterodyne/
black heterodyne/
isort heterodyne/
mypy heterodyne/

See DEVELOPMENT.md for detailed development guidelines.

License

This research software is distributed under the MIT License, enabling open collaboration while maintaining attribution requirements for academic use.

Research Use: Freely available for academic research with proper citation Commercial Use: Permitted under MIT License terms Modification: Encouraged with contribution back to the community


Contact Information:

  • Primary Investigator: Wei Chen (wchen@anl.gov)
  • Technical Support: GitHub Issues
  • Research Collaboration: Argonne National Laboratory, X-ray Science Division

Authors: Wei Chen, Hongrui He (Argonne National Laboratory) License: MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

heterodyne_analysis-1.0.1.tar.gz (622.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

heterodyne_analysis-1.0.1-py3-none-any.whl (479.0 kB view details)

Uploaded Python 3

File details

Details for the file heterodyne_analysis-1.0.1.tar.gz.

File metadata

  • Download URL: heterodyne_analysis-1.0.1.tar.gz
  • Upload date:
  • Size: 622.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for heterodyne_analysis-1.0.1.tar.gz
Algorithm Hash digest
SHA256 bf06ddf239146b4bcf9ae90fe23e90076ec262b91678bda4d7639e483774e664
MD5 182c9c9fdf4451798383e6ce60a15465
BLAKE2b-256 bfc06c78cc72eef2462925227f57686f758a911aacb2be3a0a1db904af5f0df3

See more details on using hashes here.

Provenance

The following attestation bundles were made for heterodyne_analysis-1.0.1.tar.gz:

Publisher: release.yml on imewei/heterodyne-analysis

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file heterodyne_analysis-1.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for heterodyne_analysis-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 2538ee0ab6ed83bfbd6da5bdba22e4cc5e645941aadbf2c7d94cad52cf070dcf
MD5 6875bc4e0106fef026be01ef4d6b10a6
BLAKE2b-256 e910899db3b7155a51faa0adfc36c9341885f330d39c03d9ba5f4352cf2f0fb8

See more details on using hashes here.

Provenance

The following attestation bundles were made for heterodyne_analysis-1.0.1-py3-none-any.whl:

Publisher: release.yml on imewei/heterodyne-analysis

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page