Calibrating microdata

Project description

MicroCalibrate

MicroCalibrate is a Python package for calibrating survey weights to match population targets, with advanced features including L0 regularization for sparsity, hyperparameter tuning, and robustness evaluation.

Features

Survey Weight Calibration: The package adjusts sample weights to match known population totals.
L0 Regularization: The system creates sparse weights to reduce dataset size while maintaining accuracy.
Automatic Hyperparameter Tuning: The optimization module automatically finds optimal regularization parameters using cross-validation.
Robustness Evaluation: The evaluation tools assess calibration stability using holdout validation.
Target Assessment: The analysis features help identify which targets complicate calibration.
Performance Monitoring: The system tracks calibration progress with detailed logging.
Interactive Dashboard: Users can visualize calibration performance at https://microcalibrate.vercel.app/.

Installation

pip install microcalibrate

The package requires the following dependencies:

Python version 3.13 or higher is required.
PyTorch version 2.7.0 or higher is needed.
Additional required packages include NumPy, Pandas, Optuna, and L0-python.

Quick start

Basic calibration

from microcalibrate import Calibration
import numpy as np
import pandas as pd

# Create sample data for calibration
n_samples = 1000
weights = np.ones(n_samples)  # Initial weights are set to one

# Create an estimate matrix that represents the contribution of each record to targets
estimate_matrix = pd.DataFrame({
    'total_income': np.random.normal(50000, 15000, n_samples),
    'total_employed': np.random.binomial(1, 0.6, n_samples),
})

# Set the target values to achieve through calibration
targets = np.array([
    50_000_000,  # This is the total income target
    600,         # This is the total employed target
])

# Initialize the calibration object and configure the optimization parameters
cal = Calibration(
    weights=weights,
    targets=targets,
    estimate_matrix=estimate_matrix,
    epochs=500,
    learning_rate=1e-3,
)

# Perform the calibration to adjust weights
performance_df = cal.calibrate()

# Retrieve the calibrated weights from the calibration object
new_weights = cal.weights

API reference

Calibration class

The Calibration class is the main class for weight calibration.

Parameters:

weights: The initial weights array for each record.
targets: The target values to match during calibration.
estimate_matrix: A DataFrame containing the contribution of each record to targets.
estimate_function: An alternative to estimate_matrix that uses a custom function.
epochs: The number of optimization iterations to perform (default is 32).
learning_rate: The optimization learning rate (default is 1e-3).
noise_level: The amount of noise added for robustness (default is 10.0).
dropout_rate: The dropout rate for regularization (default is 0).
regularize_with_l0: This parameter enables L0 regularization (default is False).
l0_lambda: The L0 regularization strength parameter (default is 5e-6).
init_mean: The initial proportion of non-zero weights (default is 0.999).
temperature: The sparsity control parameter (default is 0.5).

Methods:

calibrate(): This method performs the weight calibration process.
tune_l0_hyperparameters(): This method automatically tunes L0 parameters using cross-validation.
evaluate_holdout_robustness(): This method assesses calibration stability using holdout validation.
assess_analytical_solution(): This method analyzes the difficulty of achieving target combinations.
summary(): This method returns a summary of the calibration results.

Examples and documentation

For detailed examples and interactive notebooks, see the documentation.

Contributing

Contributions are welcome to the project. Please feel free to submit a Pull Request with your improvements.

Project details

Release history Release notifications | RSS feed

This version

0.22.1

Apr 28, 2026

0.22.0

Apr 18, 2026

0.21.3

Apr 18, 2026

0.21.2

Feb 24, 2026

0.21.1

Jan 6, 2026

0.21.0

Aug 22, 2025

0.20.0

Aug 22, 2025

0.19.1

Aug 11, 2025

0.19.0

Aug 4, 2025

0.18.0

Jul 25, 2025

0.17.0

Jul 25, 2025

0.16.0

Jul 25, 2025

0.15.0

Jul 15, 2025

0.14.1

Jul 7, 2025

0.14.0

Jul 7, 2025

0.13.5

Jun 30, 2025

0.13.4

Jun 30, 2025

0.13.3

Jun 30, 2025

0.13.2

Jun 26, 2025

0.13.1

Jun 26, 2025

0.13.0

Jun 26, 2025

0.12.0

Jun 25, 2025

0.11.0

Jun 25, 2025

0.10.0

Jun 25, 2025

0.9.0

Jun 24, 2025

0.8.0

Jun 24, 2025

0.7.0

Jun 24, 2025

0.6.0

Jun 24, 2025

0.5.0

Jun 23, 2025

0.4.0

Jun 23, 2025

0.3.0

Jun 20, 2025

0.2.0

Jun 19, 2025

0.1.0

Jun 2, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

microcalibrate-0.22.1.tar.gz (216.6 kB view details)

Uploaded Apr 28, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

microcalibrate-0.22.1-py3-none-any.whl (31.6 kB view details)

Uploaded Apr 28, 2026 Python 3

File details

Details for the file microcalibrate-0.22.1.tar.gz.

File metadata

Download URL: microcalibrate-0.22.1.tar.gz
Upload date: Apr 28, 2026
Size: 216.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for microcalibrate-0.22.1.tar.gz
Algorithm	Hash digest
SHA256	`9f8ba0b2fd130767b939c0e64e476a72cba0751d5143e2b04d683b23595862af`
MD5	`1fed76164ae801e98bfd4c00a6da2c98`
BLAKE2b-256	`6903b357a9c0eff5c6a69f3c55a42ecfe09adf755c30ae2bd6769f16e3c02ab3`

See more details on using hashes here.

File details

Details for the file microcalibrate-0.22.1-py3-none-any.whl.

File metadata

Download URL: microcalibrate-0.22.1-py3-none-any.whl
Upload date: Apr 28, 2026
Size: 31.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for microcalibrate-0.22.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`42d35c59a936c87653231fb79d9fb0e8ba96259eb1951e30c244c4608eaae954`
MD5	`15bc45dc6528a7c3ffc42d85b16d60b9`
BLAKE2b-256	`0e8b5fc573f3148efec05bbf90e5392a1fb04637c73a9d1e8259b638223f763c`

See more details on using hashes here.

microcalibrate 0.22.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

MicroCalibrate

Features

Installation

Quick start

Basic calibration

API reference

Calibration class

Examples and documentation

Contributing

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes