A workflow for hyperparamter optimization using the ATLAS grid resources

These details have not been verified by PyPI

Project links

Homepage

Project description

Hyperparameter Optimization on the Grid

This package provides a framework for performing hyperparameter optimization (HPO) using the ATLAS grid resources.

Basic Workflow
Getting the code
Setup and Installation
- Using the default conda environment
- Inside a custom virtual environment
Quick Start
Managing Configuration Files
Adapting the training script for hyperparameter optimization
- Training model as an objective function
- Training model as a trainable class
Running Hyperparameter Optimization Jobs
Monitoring Job Status
Visualizing Hyperparamter Optimization Results
Command line options
Important Notes
How the Hyperparameter Optimization is done
- Hyperparameter Optimization Algorithms
- Hyperparameter Optimization Schedulers

Basic Workflow

The execution of the above hyperparameter optimization workflow can be done entirely using the hpogrid tool provided by this repository. The sample usage and instruction for using the hpogrid tool is given in the next few sections.

The workflow can be divided into the following steps:

Step 1: Prepare the configuration files for a hyperparameter optimization task which will submitted to the ATLAS grid site. A total of four configuration files are required. They are:

HPO Configuration: Configurations that define how the hyperparameter optimization is performed. This may include the algorithm of hyperparameter optimization, the scheduling method for choosing the next hyperparameter points, the number of hyperparameter points to be evaluated and so on.
Search Space Configuration: Configurations that define the hyperparameter search space. This includes the sampling method for a particular hyperparameter (such as uniform or normal sampling or logirthmic based uniform sampling) and its range of allowed values.
Model Configuration: Configurations that contains information of the training model that is called by the hyperparameter optimization alogrithm. This should include
- The name of the training script which contains the class/function defining the training model
- The name of the class/function that defines the training model
- The parameters that should be passed to the training model. For details please refer to the section (Adaptation of Training Script)
Grid Configuration: Configurations that define settings for grid job submission. This may include the container inside which the scripts are run, the name of input and output datasets and the name of the grid site where the hyperparameter optimization jobs are run.

Step 2: Upload the input dataset via rucio which will be retrieved by the grid site when the hyperparameter optimization task is executed.

Step 3: Adapt the training script(s) to conform with the format required by the hyperparameter optimization library (Ray Tune).

Step 4: Submit the hyperparamter optimization task and monitor its progress.

Step 5: Retrieve the hyperparamter optimization results after completion. The results can be output into various formats supported by the hpogrid tool for visualization.

Getting the code

To get the code, use the following command:

git clone ssh://git@gitlab.cern.ch:7999/aml/hyperparameter-optimization/alkaid-qt/hpogrid/hpogrid.git

Setup and Installation

Using the default conda environment

To setup just use the command (from the base path of the project):

source setupenv.sh

This will setup the default conda environment ml-base which contains all necessary machine-learning packages and the latest version of ROOT.

Inside a custom virtual environment

Activate your custom environment then use the command (from the base path of the project):

source setupenv.sh no-conda

Then install the hpogrid package:

pip install hpogrid

Quick start

For details of how to adapt a training script to the HPO library, please refer to this section
For details of how to create, update, display a configuration file, please refer to this section
Make sure to source setupenv.sh before running the examples below

Example 1: Optimizing a simple objective function

Given a simple objective function with the evaluation metric defined by loss = (height - 14)**2 - abs(width - 3) with hyperparameters -100 < height < 100 and 0 < width < 20. The goal is to minimize the metric loss. The training script with the implementation of the above objective function can be found in example/scripts/simple_objective.py.

To run this example, just do:

# Create Configuration Files
hpogrid model_config create simple_objective_model --script simple_objective.py --model simple_objective
hpogrid search_space create simple_objective_space '{"height":{"method":"uniform","dimension":{"low":-100,"high":100}}, "width":{"method":"uniform","dimension":{"low":0,"high":20}}}'
hpogrid hpo_config create random_search_min_loss --algorithm random --metric loss --mode min --trials 200
hpogrid grid_config create manc_standard --site ANALY_MANC_GPU_TEST

# Create a Project with the Configuration Files
hpogrid project create simple_objective --scripts_path ${HPOGRID_BASE_PATH}/example/scripts/simple_objective.py --model_config simple_objective_model --search_space simple_objective_space --hpo_config random_search_min_loss --grid_config manc_standard

# Submit Grid Jobs corresponding to the Project
hpogrid run simple_objective --n_jobs 3

Example 2: Optimizing a simple trainable class

Given a simple trainble class with the evaluation metric defined by loss = alpha*tanh(1/beta) with hyperpraamters -10 < alpha < 10 and beta ∈ [-1,0,1,2,3,4]. The goal is to minimize the metric loss. The training script with the above trainable class can be found in example/scripts/simple_trainable.py.

To run this example, just do:

# Create Configuration Files
hpogrid model_config create simple_trainable_model --script simple_trainable.py --model MyTrainableClass
hpogrid search_space create simple_trainable_space '{"alpha":{"method":"uniform","dimension":{"low":-10,"high":10}}, "beta": {"method":"categorical","dimension":{"categories":[-1,0,1,2,3,4,5]}}}'
hpogrid hpo_config create bayesian_min_loss --algorithm bayesian --metric loss --mode min --trials 200
# If you have already created the grid config manc_standard, you may skip the following line
hpogrid grid_config create manc_standard --site ANALY_MANC_GPU_TEST

# Create a Project with the Configuration Files
hpogrid project create simple_trainable --scripts_path ${HPOGRID_BASE_PATH}/example/scripts/simple_trainable.py --model_config simple_trainable_model --search_space simple_trainable_space --hpo_config bayesian_min_loss --grid_config manc_standard

# Submit Grid Jobs corresponding to the Project
hpogrid run simple_trainable --n_jobs 3

Example 3: Optimizing a convolutional neural network for the MNIST dataset

The training script with the implementation of a convolutional neural network for the MNIST dataset (for classifying digits) can be found in exmple/scripts/mnist.py.

To run this example, just do:

# Create Configuration Files
hpogrid model_config create mnist_cnn --script mnist.py --model MNISTTrainable
hpogrid search_space create mnist_cnn_space '{"hidden": {"method": "categorical", "dimension": {"categories": [32, 64, 128]}},"batchsize": {"method": "categorical", "dimension": {"categories": [32, 64, 128, 256, 512]}}, "lr": {"method": "loguniform", "dimension": {"low": 4.5e-05, "high": 1.83e-2}}, "beta": {"method": "uniform", "dimension": {"low": 0.5, "high": 1.0}}}'
hpogrid hpo_config create hyperopt_min_loss --algorithm hyperopt --mode min --trials 200

# If you have already created the grid config manc_standard, you may skip the following line
hpogrid grid_config create manc_standard --site ANALY_MANC_GPU_TEST

# Create a Project with the Configuration Files
hpogrid project create mnist_cnn --scripts_path ${HPOGRID_BASE_PATH}/example/scripts/mnist.py --model_config mnist_cnn --search_space mnist_cnn_space --hpo_config hyperopt_min_loss --grid_config manc_standard

# Submit Grid Jobs corresponding to the Project
hpogrid run simple_trainable --n_jobs 3

Managing Configuration Files

In general, the command for managing configuraton file takes the form:

hpogrid <config_type> <action> <config_name> [<options>]

The <config_type> argument specifies the type of configuration to be handled. The avaliable types are

hpo_config : Configuration for hyperparamter optimization
grid_config : Configuration for grid job submission
model_config : Configuration for the machine learning model (which the hyperparameters are to be optimized)
search_space : Configuration for the hyperparameter search space

The <action> argument specifies the action to be performed. The available actions are

create : Create a new configuration
recreate : Recreate an existing configuration (the old configuration will be overwritten)
update : Update an existing configuration (the old configuration except those to be updated will be kept)
remove : Remove an existing configuration
list : List the name of existing configurations (the <config_name> argument is omitted)
show : Display the content of an existing configuration

The <config_name> argument specifies name given to a configuration file.

The [<options>] arguments specify the configuration settings for the corresponding configuration type. The available options are explained below.

The configuration files are stored in the directory config/

HPO Configuration

These are configurations that define how the hyperparameter optimization is performed.

The command for managing hpo configuration is:

hpogrid hpo_config <action> <config_name> [<options>]

For creation, modification of the configuration file, the following options are available

Option	Description	Default	Choices
`algortihm`	Algorithm for hyperparameter optimization	'random'	'hyperopt', 'skopt', 'bohb', 'ax', 'tune', 'random', 'bayesian'
`metric`	Evaluation metric to be optimized	'accuracy'	-
`mode`	Optimization mode (either 'min' or 'max')	'max'	'max', 'min'
`scheduler`	Trial scheduling method for hyperparameter optimization	'asynchyperband'	'asynchyperband', 'bohbhyperband', 'pbt'
`trials`	Number of trials (search points)	100	-
`log_dir`	Logging directory	"./log"	-
`verbose`	Check to enable verbosity	-	-
`stop`	Stopping criteria	'{"training_iteration": 1}'	-
`scheduler_param`	extra parameters for the trial scheduler	'{"max_concurrent": 4}'	-
`algorithm_param`	extra parameters for hyperparameter optimization algorithm	{}	-

For details about the hyperparamter optimization algorithm, please refer to here
For how scheduler works in hyperparameter optimization, please refer to here
Here the key training_iteration for the option stop refers to how many times the training function is called before a trial is finished.
Here the key max_concurrent for the option scheduler_param refers to the maximum number of hyperparameter points to be run concurrently (if parallelization is allowed)

Example of creating an hpo configuration:

hpogrid hpo_config create my_example_config --algorithm random --metric loss --mode min --trials 100

Example of modifying an existing hpo configuration (old settings will be kept):

hpogrid hpo_config update my_example_config --algorithm bayesian

Example of recreating an existing hpo configuration (old settings will not be kept):

hpogrid hpo_config recreate --algorithm bayesian --metric loss --mode min --trials 100

To list all hpo configurations:

hpogrid hpo_config list

To show the content of an existing hpo configuration:

hpogrid hpo_config show my_example_config

Grid Configuration

These are configurations that define settings for grid job submission.

The command for managing grid configuration is:

hpogrid grid_config <action> <config_name> [<options>]

For creation, modification of the configuration file, the following options are available

Option	Description	Default
`site`	Grid site where the jobs are submitted	ANALY_MANC_GPU_TEST
`container`	Docker or singularity container which the jobs are run	/cvmfs/unpacked.cern.ch/gitlab-registry.cern.ch/aml/hyperparameter-optimization/alkaid-qt/hpogrid:latest
`retry`	Check to enable retrying faild jobs	-
`inDS`	Name of input dataset	-
`outDS`	Name of output dataset	user.${{RUCIO_ACCOUNT}}.hpogrid.{HPO_PROJECT_NAME}.out.$(date +%Y%m%d%H%M%S)

Example of creating a grid configuration:

hpogrid grid_config create my_example_config --site ANALY_MANC_GPU_TEST --inDS user.${RUCIO_ACCOUNT}.mydataset01

Example of modifying an existing grid configuration (old settings will be kept):

hpogrid grid_config update my_example_config --container docker://gitlab-registry.cern.ch/aml/hyperparameter-optimization/alkaid-qt/hpogrid:latest

Example of recreating an existing grid configuration (old settings will not be kept):

hpogrid grid_config recreate my_example_config --site ANALY_BNL_GPU_ARC --inDS user.${RUCIO_ACCOUNT}.mydataset02

To list all grid configurations:

hpogrid grid_config list

To show the content of an existing grid configuration:

hpogrid grid_config show my_example_config

Model Configuration

Configurations that contains information of the training model that is called by the hyperparameter optimization alogrithm.

The command for managing model configuration is:

hpogrid model_config <action> <config_name> [<options>]

For creation, modification of the configuration file, the following options are available

Option	Description
`script`	Name of the training script where the function or class that defines the training model will be called to perform the training
`model`	Name of the function or class that defines the training model
`param`	Extra parameters to be passed to the training model

Example of creating a model configuration:

hpogrid model_config create my_example_config --script train.py --model advanced_rnn --param '{"input_path": "mypath/signl_point/", "epochs": 100}'

Example of modifying an existing model configuration (old settings will be kept):

hpogrid model_config update --model revised_rnn

Example of recreating an existing model configuration (old settings will not be kept):

hpogrid model_config recreate my_example_config --script train.py --model revised_rnn --param '{"input_path": "mypath/bkg_point/", "epochs": 200}'

To list all model configurations:

hpogrid model_config list

To show the content of an existing model configuration:

hpogrid model_config show my_example_config

Search Space Configuration

These are configurations that defines hyperparameter search space.

The command for managing search space configuration is:

hpogrid search_space <action> <config_name> [<options>]

For creation, modification of the configuration file, the command takes the form:

hpogrid search_space <action> <config_name> <search_space_definition>

The format for defining a search space in command line is through a json decodable string:

'{"NAME_OF_HYPERPARAMETER":{"method":"SAMPLING_METHOD","dimension":{"DIMENSION":"VALUE"}},
"NAME_OF_HYPERPARAMETER":{"method":"SAMPLING_METHOD","dimension":{"DIMENSION":"VALUE"}}, ...}'

Supported sampling methods for a hyperparameter:

Method	Description	Dimension
`categorical`	Returns one of the values in `categories`, which should be a list. If `grid_search`is set to 1, each value must be sampled once.	`categories`, `grid_search`
`uniform`	Returns a value uniformly between `low` and `high`	`low`, `high`
`uniformint`	Returns an integer value uniformly between `low` and `high`	`low`, `high`
`quniform`	Returns a value like round(uniform(`low`, `high`) / `q`) * `q`	`low`, `high`, `q`
`loguniform`	Returns a value drawn according to exp(uniform(`low`, `high`)) so that the logarithm of the return value is uniformly distributed.	`low`, `high`, `base`
`qloguniform`	Returns a value like round(exp(uniform(`low`, `high`)) / `q`) * `q`	`low`, `high`, `base`, `q`
`normal`	Returns a real value that's normally-distributed with mean `mu` and standard deviation `sigma`.	`mu`, `sigma`
`qnormal`	Returns a value like round(normal(`mu`, `sigma`) / `q`) * `q`	`mu`, `sigma`, `q`
`lognormal`	Returns a value drawn according to exp(normal(`mu`, `sigma`)) so that the logarithm of the return value is normally distributed.	`mu`, `sigma`, `base`
`qlognormal`	Returns a value like round(exp(normal(`mu`, `sigma`)) / `q`) * `q`	`mu`, `sigma`, `base`, `q`

Example of creating a search space configuration:

hpogrid search_space create my_search_space '{ "lr":{"method":"loguniform","dimension":{"low":1e-5,"high":1e-2, "base":10}},\
"batchsize":{"method":"categorical","dimension":{"categories":[32,64,128,256,512,1024]}},\
"num_layers":{"method":"uniformint","dimension":{"low":3,"high":10}},\
"momentum":{"method":"uniform","dimension":{"low":0.5,"high":1.0}} }'

Example of modifying a search space configuration (old settings will be kept):

 hpogrid search_space update my_search_space '{"new_hp":{"method":"lognormal","dimension":{"mu":1,"sigma":1}}}'

Example of recreating a search space configuration (old settings will not be kept):

hpogrid search_space recreate my_search_space '{ "lr":{"method":"categorical", "dimension":{"categories": [1e-4,1e-3,1e-2,1e-1}},\
"batchsize":{"method":"categorical","dimension":{"categories":[32,64,128,256,512,1024]}}}'

To list all search space configurations:

hpogrid search_space list

To show the content of an existing search space configuration:

hpogrid search_space show my_example_config

Adapting the training script for hyperparameter optimization

The hpogrid workflow uses the Ray Tune library for carrying out hyperparameter optimization. There are two ways to adapt your training script to fit the format accepted by the library.

Method 1: Training model as an objective function

Suppose you have a training script named train.py which contains an objective function called my_objective. Then for each trial of the hyperparameter optimization, the Ray Tune library will generate a set of hyperparameters which will then be fed into this objective function. The objective function should then return a value for the evaluation metric which will allow the optimization algorithm to determine the next hyperparameter point.

Example training script:

def my_objective(config, reporter):
    height = config["height"]
    width = config["width"]
    loss = (height - 14)**2 - abs(width - 3)
    reporter(loss=loss)

The objective function should contain the config argument. It is a dictionary where the values of all hyperparameters and other model parameters (i.e. any parameters you want to put in) are stored. Simply use config['parameter_name'] to reference the value.
The objective function should contain the repoorter argument. It is a function that reports the value of the evaluation metric (loss in this case) at the end of the training.

You can also put whatever you want inside the training script:

kMyConstant = 1

def calculate_loss(height, width):
    loss = (height - 14)**2 - abs(width - 3)
    loss *= kMyConstant
    return loss

def my_objective(config, reporter):
    height = config["height"]
    width = config["width"]
    loss = calculate_loss(height, width)
    reporter(loss=loss)

For the above two examples, you should create your model configuration like this:

hpogrid model_config create test_model --script train.py --model my_objective

Method 2: Training model as a trainable class

Suppose you have a training script named train.py which contains a class called my_trainable. Then for each trial of the hyperparameter optimization, the Ray Tune library will do the following:

Generate a set of hyperparameters which will then be fed into the class object.
Initialize the class object by calling the _setup function inside the class.
For each training_iteration (as defined in the hpo configuration), the _train function will be called which should return a dictionary containing information about the training result (e.g. value of the evaluation metric) for that training iteration.
(this step depends on the scheduler used as defined in the hpo configuration) If after some training iterations the value of the evaluation does not look promising, the model will be saved via the _save function and the training will be temporarily or permanently stopped and a new set of hyperparameters will run. If the algorithm thinks the old set of hyperparameters may worth continue training, then the _restore function will be called to continue the previously stopped training.
The trial is said to be finished if either
- The stopping criteria is reached, e.g. training_iteration reached a certain value, or
- The training is stopped by the scheduler (due to low performance of the set of hyperparameters)

Example training script:

import numpy as np

class MyTrainableClass(ray.tune.Trainable):

    def _setup(self, config):
        self.timestep = 0

    def _train(self):
        self.timestep += 1
        alpha = self.config.get("alpha", 1) # or alpha = config['alpha']
        beta = self.config.get("beta", 1) # or beta = config["beta"]
        loss = self.calculate_loss(alpha, beta)
        return {"loss": loss, 'timestep': self.timestep}}

    def _save(self, checkpoint_dir):
        path = os.path.join(checkpoint_dir, "checkpoint")
        with open(path, "w") as f:
            f.write(json.dumps({"timestep": self.timestep}))
        return path

    def _restore(self, checkpoint_path):
        with open(checkpoint_path) as f:
            self.timestep = json.loads(f.read())["timestep"]

    def calculate_loss(self, alpha, beta):
        loss = np.tanh(float(self.timestep) / alpha)
        loss *= beta
        return loss

The trainable class must inherit from the ray.Tune.Trainable class for the Ray Tune library to recognize the class structure.
The functions _save and _restore are optional if you only train for one training iteration (i.e. _train is called only once)

If you do not need to make use of the scheduler to speed up your training by terminating low performant trainings, or your training cannot be split into successive training iterations. You can leave out the _save and _restore functions. For example:

import numpy as np

class MyTrainableClass(ray.tune.Trainable):

    def _train(self):
        alpha = self.config.get("alpha", 1) # or alpha = config['alpha']
        beta = self.config.get("beta", 1) # or beta = config["beta"]
        loss = self.calculate_loss(alpha, beta)
        return {"loss": loss}

    def calculate_loss(self, alpha, beta):
        loss = np.tanh(1/alpha)
        loss *= beta
        return loss

Running Hyperparameter Optimization Jobs

Step 1: Create a custom project with the configuration files:

hpogrid project create <project_name> [--options]

Option	Description
`scripts_path`	the path to where the training scripts (or the directory containing the training scripts) are located
`hpo_config`	the hpo configuration to use for this project
`grid_config`	the grid configuration to use for this project
`model_config`	the model configuration to use for this project
`search_space`	the search space configuration to use for this project

Step 2: Run the project:

To run locally:

hpogrid local_run <project_name>

To run on the grid:

hpogrid run <project_name> [--options]

Option	Description	Default
`n_jobs`	the number of grid jobs to be submitted (useful for random search, i.e. to run a single search point per job)	1
`site`	the site to where the jobs are submitted (this will override the site setting in the grid configuration	-

Monitoring Job Status

To get the status of recent grid jobs:

hpogrid tasks show [--options]

Option	Description	Default
`username`	filter tasks by username	1
`limit`	the maximum number of tasks to query	1000
`days`	filter tasks within the recent `N` days	30
`taskname`	filter tasks by taskname (accept wildcards)	-
`jeditaskid`	only show the task with the specified jeditaskid	-
`metadata`	print out the metadata of a task	False
`sync`	force no caching on the PanDA server	False
`range`	filter tasks by jeditaskid range	-
`output`	output result with the filename if specified	-
`outcol`	data columns to be saved in output	'jeditaskid', 'status', 'taskname', 'computingsite', 'metastruct'

Visualizing Hyperparamter Optimization Results

To get the hpo result for a specific project:

hpogrid report <project_name> [--options]

Option	Description	Default	Choices
`limit`	the maximum number of tasks to query	1000	-
`days`	filter tasks within the recent `N` days	30	-
`taskname`	filter tasks by taskname (accept wildcards)	-	-
`range`	filter tasks by jeditaskid range	-	-
`extra`	extra data columns to be displayed and saved	-	'site', 'task_time_s', 'time_s', 'taskid'
`outname`	output file name (excluding extension)	hpo_result	-
`to_json`	output result to a json file	False	-
`to_html`	output result to an html file	False	-
`to_csv`	output result to a csv file	False	-
`to_pcp`	output result as a parallel coordinate plot	False	-

Command Line Options

To kill a job by jeditaskid

hpogrid tasks kill <jeditaskid>

To retry a job by jeditaskid

hpogrid tasks retry <jeditaskid>

To see a list of available GPU sites

hpogrid sites

Important Notes

Working Directory of Container

By default, the working directory (where the training scripts are located) of the container is /hpogrid/project/

Location of Input Dataset

By default, the location of input dataset inside the container is /hpogrid/project/ (same as working directory)

Digest of Grid Site Errors

To be updated

How the Hyperparameter Optimization is done

Hyperparameter Optimization Algorithms

To be updated

Hyperparameter Optimization Schedulers

To be updated

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.3.3

Jan 16, 2023

1.3.2

Nov 18, 2022

1.3.1

Jul 21, 2022

1.3.0

Apr 19, 2022

1.2.9

Apr 9, 2022

1.2.8.1

Mar 26, 2022

1.2.8

Mar 26, 2022

1.2.7

Mar 25, 2022

1.2.6

Mar 25, 2022

1.2.5

Mar 22, 2022

1.2.4

Mar 21, 2022

1.2.3

Mar 21, 2022

1.2.2

Mar 20, 2022

1.2.1

Mar 20, 2022

1.2.0

Mar 20, 2022

1.1.4

Jun 29, 2021

1.1.3

Jun 26, 2021

1.1.2 yanked

Jun 26, 2021

Reason this release was yanked:

buggy

1.1.1 yanked

Jun 26, 2021

Reason this release was yanked:

buggy

1.1.0 yanked

Jun 26, 2021

Reason this release was yanked:

buggy

1.0.9

Mar 3, 2021

1.0.8

Mar 3, 2021

1.0.7

Feb 22, 2021

1.0.6

Feb 4, 2021

1.0.5

Jan 29, 2021

1.0.4

Jan 29, 2021

1.0.3

Jan 29, 2021

1.0.2

Jan 29, 2021

1.0.1

Jan 29, 2021

1.0.0

Jan 29, 2021

0.8.4.2

Jan 27, 2021

0.8.4.1

Jan 26, 2021

0.8.4.0

Jan 26, 2021

0.8.3.9

Jan 26, 2021

0.8.3.8

Jan 26, 2021

0.8.3.7

Jan 22, 2021

0.8.3.6

Jan 22, 2021

0.8.3.4

Jan 22, 2021

0.8.3.3

Jan 22, 2021

0.8.3.2

Jan 22, 2021

0.8.3.1

Jan 22, 2021

0.8.3.0

Dec 29, 2020

0.8.2.9

Dec 29, 2020

0.8.2.8

Dec 29, 2020

0.8.2.7

Dec 29, 2020

0.8.2.6

Dec 29, 2020

0.8.2.4

Nov 19, 2020

0.8.2.3

Nov 18, 2020

0.8.2.2

Nov 17, 2020

0.8.2.1

Oct 31, 2020

0.8.2

Oct 29, 2020

0.8.0

Oct 19, 2020

0.7.9

Oct 12, 2020

0.7.8

Oct 11, 2020

0.7.7

Oct 8, 2020

0.7.6

Sep 24, 2020

0.7.4

Sep 20, 2020

0.7.3

Sep 17, 2020

0.7.2

Sep 16, 2020

0.7.1

Sep 16, 2020

0.7.0

Sep 16, 2020

0.6.9

Sep 16, 2020

0.6.8

Sep 16, 2020

0.6.7

Sep 15, 2020

0.6.6

Sep 14, 2020

0.6.5

Sep 11, 2020

0.6.4

Sep 9, 2020

0.6.3

Sep 8, 2020

0.6.2

Sep 8, 2020

0.6.1

Sep 8, 2020

0.6.0

Sep 8, 2020

0.5.0.1

Sep 5, 2020

0.5.0

Sep 5, 2020

0.4.9

Sep 4, 2020

0.4.8

Sep 4, 2020

0.4.7

Sep 3, 2020

0.4.6

Sep 3, 2020

0.4.5

Sep 3, 2020

0.4.4

Sep 3, 2020

0.4.3

Sep 2, 2020

0.4.2

Sep 2, 2020

0.4.0.1

Jul 23, 2020

0.4.0

Sep 2, 2020

0.3.8

Jul 20, 2020

0.3.4

Jul 14, 2020

0.3.3

Jul 13, 2020

0.3.0

Jul 11, 2020

0.2.9

Jul 7, 2020

0.2.8

Jul 6, 2020

0.2.7

Jul 6, 2020

0.2.6

Jul 5, 2020

0.2.5

Jul 5, 2020

0.2.4

Jul 5, 2020

0.2.3

Jul 5, 2020

0.2.2

Jul 4, 2020

0.2.1

Jul 4, 2020

This version

0.2.0

Jul 3, 2020

0.1.9

Jun 27, 2020

0.1.8

Jun 9, 2020

0.1.7

Jun 9, 2020

0.1.6

Jun 8, 2020

0.1.4

Jun 8, 2020

0.1.2

Jun 6, 2020

0.1.1

Jun 6, 2020

0.1.0

Jun 6, 2020

0.0.5

Jun 3, 2020

0.0.4

Jun 2, 2020

0.0.3

Jun 2, 2020

0.0.2

Jun 1, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hpogrid-0.2.0.tar.gz (60.5 kB view hashes)

Uploaded Jul 3, 2020 Source

Built Distribution

hpogrid-0.2.0-py3-none-any.whl (72.3 kB view hashes)

Uploaded Jul 3, 2020 Python 3

Hashes for hpogrid-0.2.0.tar.gz

Hashes for hpogrid-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`aa423a1cf71d05fa799f9fd278ef1ceb2621aadf166873ef603c7cec96a71e98`
MD5	`fa961e74d986126e8689bf9f6f7bcbdd`
BLAKE2b-256	`99d916234ebb19d52ed82ce9a81464df20a457cc7832c0295522ec1bf5960baf`

Hashes for hpogrid-0.2.0-py3-none-any.whl

Hashes for hpogrid-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b897c36d14a4ca049c9ce2aacaabe20f0c1264d3019ce7c5abc651e6c0d32799`
MD5	`6aa24a1e55d42abf73c6ed56b57d000f`
BLAKE2b-256	`5f8371c51f049797ceec019c1756d34cab625239f1fef0a8ed92f2465d8448b8`

hpogrid 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Hyperparameter Optimization on the Grid

Table of Contents

Basic Workflow

Getting the code

Setup and Installation

Using the default conda environment

Inside a custom virtual environment

Quick start

Example 1: Optimizing a simple objective function

Example 2: Optimizing a simple trainable class

Example 3: Optimizing a convolutional neural network for the MNIST dataset

Managing Configuration Files

HPO Configuration

Example of creating an hpo configuration:

Example of modifying an existing hpo configuration (old settings will be kept):

Example of recreating an existing hpo configuration (old settings will not be kept):

To list all hpo configurations:

To show the content of an existing hpo configuration:

Grid Configuration

Example of creating a grid configuration:

Example of modifying an existing grid configuration (old settings will be kept):

Example of recreating an existing grid configuration (old settings will not be kept):

To list all grid configurations:

To show the content of an existing grid configuration:

Model Configuration

Example of creating a model configuration:

Example of modifying an existing model configuration (old settings will be kept):

Example of recreating an existing model configuration (old settings will not be kept):

To list all model configurations:

To show the content of an existing model configuration:

Search Space Configuration

Example of creating a search space configuration:

Example of modifying a search space configuration (old settings will be kept):

Example of recreating a search space configuration (old settings will not be kept):

To list all search space configurations:

To show the content of an existing search space configuration:

Adapting the training script for hyperparameter optimization

Method 1: Training model as an objective function

Method 2: Training model as a trainable class

Running Hyperparameter Optimization Jobs

Monitoring Job Status

Visualizing Hyperparamter Optimization Results

Command Line Options

Important Notes

Working Directory of Container

Location of Input Dataset

Digest of Grid Site Errors

How the Hyperparameter Optimization is done

Hyperparameter Optimization Algorithms

Hyperparameter Optimization Schedulers

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution