Optimize hyperparameter search using Latin Hypercube Sampling principles

These details have not been verified by PyPI

Project links

Homepage

Project description

GridSearchReductor

⚠️ Disclaimer: This project was almost purely vibecoded with assistance from aider.chat. While it includes comprehensive pytest tests, the author doesn't have the mathematical expertise to verify that the Latin Hypercube Sampling implementation is mathematically sound. Use at your own discretion for production workloads.

A Python package for optimizing hyperparameter search using Latin Hypercube Sampling principles.

Inspired by NightHawkInLight's video on Taguchi arrays.

Do fewer experiments than grid search, but do the right ones using Latin Hypercube Sampling!

Why use GridSearchReductor?

This library is designed to work seamlessly with scikit-learn's ParameterGrid, providing a drop-in replacement that can significantly reduce your hyperparameter search space.

When tuning machine learning models, traditional grid search can require an exponentially large number of experiments. GridSearchReductor helps reduce the number of experiments needed while still effectively exploring the parameter space.

Instead of testing every possible combination of parameters (which can be computationally expensive), this package uses Latin Hypercube Sampling principles to:

Reduce the number of experiments needed
Maintain excellent coverage of the parameter space through stratified sampling
Ensure each parameter dimension is sampled uniformly
Provide better space-filling properties than random sampling
Generate deterministic results by default - the same parameter grid will always produce the same reduced combinations

Getting started

Installation

From PyPI:
- Via uv: uv pip install GridSearchReductor
- Via pip: pip install GridSearchReductor
From GitHub:
- Clone this repo then pip install .

Basic Usage

from sklearn.model_selection import ParameterGrid
from GridSearchReductor import GridSearchReductor

# Default uses 20% of the full grid size
grid_converter = GridSearchReductor()

# Or specify a custom reduction factor (must be between 0 and 1)
grid_converter = GridSearchReductor(reduction_factor=0.1)  # Use 10% of full grid

sample_grid = {
    'kernel': ['linear', 'rbf', 'poly'],
    'C': [0.1, 1, 10],
    'gamma': ['scale', 'auto'],
    'verbose': [True],  # also handles length 1 lists for fixed params
}

full_grid = ParameterGrid(sample_grid)

reduced_grid = grid_converter.fit_transform(sample_grid)
# Alternative way:
# reduced_grid = grid_converter.fit_transform(full_grid)

# Use the reduced grid in your experiments
for params in reduced_grid:
    # Your training/evaluation code here
    print(params)

The reduced experiments list will be significantly smaller than the full grid while maintaining good parameter space coverage through Latin Hypercube Sampling.

The full experiments list would have been 18 combinations (3×3×2×1), but the reduced grid provides effective coverage with fewer experiments! By default, GridSearchReductor uses 20% of the full grid size, so this example would generate approximately 3-4 experiments instead of 18.

Advanced Usage

Reproducible Results

GridSearchReductor is deterministic by default (using random_state=42). The same parameter grid will always produce the same reduced combinations.

# Default behavior - deterministic results
grid_converter = GridSearchReductor()
reduced_grid = grid_converter.fit_transform(sample_grid)

# Use a different random_state if needed
grid_converter = GridSearchReductor(random_state=123)
reduced_grid = grid_converter.fit_transform(sample_grid)

# Use global random state (non-deterministic)
grid_converter = GridSearchReductor(random_state=None)
reduced_grid = grid_converter.fit_transform(sample_grid)

Controlling Reduction Factor

The reduction_factor parameter controls what fraction of the full parameter grid to sample:

# Use 10% of the full grid (more aggressive reduction)
grid_converter = GridSearchReductor(reduction_factor=0.1)

# Use 30% of the full grid (less aggressive reduction)
grid_converter = GridSearchReductor(reduction_factor=0.3)

# Default is 20% of the full grid
grid_converter = GridSearchReductor()  # Same as reduction_factor=0.2

Important notes about reduction_factor:

Must be between 0 and 1 (exclusive)
The actual number of samples will be at least 2 * number_of_variable_parameters to ensure reasonable coverage
The reduction must result in fewer samples than the full grid, otherwise a ValueError is raised
Smaller values mean fewer experiments but potentially less thorough parameter space exploration

Verbose Logging

# Enable verbose logging to see the sampling process
grid_converter = GridSearchReductor(verbose=True)
reduced_grid = grid_converter.fit_transform(sample_grid)

How it works

The converter takes a parameter grid (similar to scikit-learn's ParameterGrid) and:

Separates fixed parameters (single values) from variable parameters
Determines the number of levels for each variable parameter
Calculates the target number of samples based on the reduction_factor (default 20% of full grid)
Generates Latin Hypercube Samples in normalized [0,1] space
Maps these samples to discrete parameter indices
Creates a reduced set ensuring uniform coverage across all parameter dimensions
Removes duplicate combinations and ensures the result is smaller than the full grid

Latin Hypercube Sampling Benefits

Latin Hypercube Sampling (LHS) provides superior space-filling properties compared to random sampling:

Stratified sampling: Each parameter dimension is divided into equally probable intervals
Uniform coverage: Exactly one sample per interval ensures no clustering
Better convergence: More efficient exploration of the parameter space
Reproducible: When using a fixed random_state

This approach is particularly useful when:

You have limited computational resources
You need comprehensive parameter space exploration with fewer experiments
You want better coverage than random search
You need reproducible hyperparameter optimization results

Dependencies

numpy
scikit-learn
joblib

This project was almost purely vibecoded with assistance from aider.chat.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.0.2

Dec 15, 2025

1.0.1

Sep 15, 2025

This version

1.0.0

Sep 15, 2025

0.3.2

Sep 15, 2025

0.3.1

Sep 15, 2025

0.3.0

Sep 15, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gridsearchreductor-1.0.0.tar.gz (26.9 kB view details)

Uploaded Sep 15, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gridsearchreductor-1.0.0-py3-none-any.whl (26.4 kB view details)

Uploaded Sep 15, 2025 Python 3

File details

Details for the file gridsearchreductor-1.0.0.tar.gz.

File metadata

Download URL: gridsearchreductor-1.0.0.tar.gz
Upload date: Sep 15, 2025
Size: 26.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.11

File hashes

Hashes for gridsearchreductor-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`d9b1ce53e555f5e16f483dd8b04bef4128e6a45a651bcc3080340be517faafc5`
MD5	`54cd0073bd1a231388cad63f15b82008`
BLAKE2b-256	`19173d33a56a0e1eb0fe4b98f5733aea240d901e43412fe15502ae18902ad201`

See more details on using hashes here.

File details

Details for the file gridsearchreductor-1.0.0-py3-none-any.whl.

File metadata

Download URL: gridsearchreductor-1.0.0-py3-none-any.whl
Upload date: Sep 15, 2025
Size: 26.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.11

File hashes

Hashes for gridsearchreductor-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`25a118ad74250e91743045569d1272ab92a33074b8bee1f8d25be01ef4349ed1`
MD5	`7bd84773a9aff1393a8f9a9621784a4c`
BLAKE2b-256	`cb9a5503762c7b6891ecbe445781806907a7926a50d220c2f12105a11128512b`

See more details on using hashes here.

GridSearchReductor 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

GridSearchReductor

Why use GridSearchReductor?

Getting started

Installation

Basic Usage

Advanced Usage

Reproducible Results

Controlling Reduction Factor

Verbose Logging

How it works

Latin Hypercube Sampling Benefits

Dependencies

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes