MetaGen: A framework for metaheuristic development and hyperparameter optimization in machine and deep learning.
Machine and deep learning have transformed the field of computing in recent decades, delivering impressive accuracy results when processing large datasets. However, a common challenge arises when implementing these algorithms: setting the hyperparameters. Many algorithms are sensitive to these adjustments, and the quality of the results heavily depends on the choices made. Metaheuristics are a widely used strategy for finding optimal hyperparameter settings for a specific algorithm.
MetaGen is a comprehensive and user-friendly metaheuristic development framework that provides tools to define and solve hyperparameter optimization in machine and deep learning.
Installation
MetaGen requires only Python (>=3.10), as the library is written entirely in Python.
The easiest way to install it is with pip:
pip install pymetagen-datalabupo
API reference
The official documentation is available at https://pymetagen.readthedocs.io.
Development
New contributors of all experience levels are welcome. To contribute, you can open an issue or send a pull request.
To run the test suite, execute the following command:
pytest test
In this document:
- MetaGen Features
- MetaGen vs. other proposals
- How to implement a metaheuristic with MetaGen
- How to perform a hyperparameter optimization with MetaGen
- Resources
MetaGen features
- Metaheuristic development framework.
It provides convenient solution management for the developer and isolates them from the intricacies of the search space.
- Deployment platform for metaheuristic execution.
It offers an effortless problem definition process that doesn't require advanced programming skills.
- Standard interface between metaheuristic developers and users.
Both developers and users can "speak the same language" through the MetaGen package.
- Specialized tools for deep learning hyperparameter tuning.
It provides optimization capabilities not just at the layer level but also at the architecture level.
- Dynamic deep learning architecture adjustment.
Users are not restricted to optimizing a fixed architecture (e.g., a fixed number of layers); they can also adjust specific hyperparameters per layer in a variable architecture.
- Python type hints annotations.
Implemented using Python type hints annotations, it provides developers and users with an efficient environment for coding, debugging, and maintenance.
- Full compatibility with Python machine and deep learning packages.
Thanks to its standard and flexible interface, it is compatible with Python machine and deep learning packages such as scikit-learn, Keras, and TensorFlow.
- RS, GA and CVOA metaheuristic implementations.
It offers two popular metaheuristic implementations: Random Search and a Genetic Algorithm. Furthermore, it includes an implementation of CVOA, a metaheuristic inspired by the propagation model of SARS-CoV-2, the virus responsible for COVID-19. CVOA was initially used to optimize long short-term memory (LSTM) networks for electricity demand forecasting, but its positive results have inspired its use for other machine and deep learning models.
MetaGen vs. other proposals
| Feature | MetaGen | kerastuner | optuna | chocolate | hyperopt and hyperas | sherpa | btb | talos | test-tube | advisor | ray-tune |
|---|---|---|---|---|---|---|---|---|---|---|---|
| A new framework for developing custom metaheuristics with a simplified coding process | Yes | No | Yes | No | No | Yes | No | No | No | No | No |
| Representation of problems and solutions implementation, allowing metaheuristic developers to provide a standard and intuitive interface | Yes | No | No | No | No | No | No | No | No | No | No |
| Specific mechanisms for optimizing deep learning architectures and hyperparameters | Yes | Yes | No | No | No | No | No | No | No | No | No |
| Dynamic changes to a deep learning model’s architecture allow the number of layers and hyperparameters to adjust during metaheuristic executions | Yes | No | No | Yes | No | No | No | No | No | No | No |
| User-friendly and accessible to the general public with limited programming skills | Yes | No | No | No | No | No | No | No | No | No | Yes |
| Python-type hints implementation to help third-party developers and ease debugging | Yes | No | No | No | No | No | No | No | No | No | No |
| Mechanisms to improve metaheuristics execution across distributed systems | No | Yes | Yes | No | Yes | Yes | No | Yes | Yes | No | Yes |
| The package includes visualization tools to analyze the performance of metaheuristics | No | Yes | Yes | No | No | No | No | No | Yes | Yes | No |
How to implement a metaheuristic with MetaGen
The random search algorithm generates a search space with a specific number of potential solutions, then alters these potential solutions a specified number of times. The potential solution with the lowest fitness function value after each iteration is considered the global solution of the random search.
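The logic just described can be sketched in plain Python, independently of MetaGen. The sample, mutate, and fitness callables below are illustrative stand-ins for the framework's Domain and Solution machinery, not part of the MetaGen API.

```python
import random
from copy import deepcopy

def random_search_sketch(sample, mutate, fitness, search_space_size=30, iterations=20):
    # Build the search space: search_space_size randomly generated potential solutions.
    search_space = [sample() for _ in range(search_space_size)]
    # Keep a copy of the best (lowest-fitness) potential solution found so far.
    global_solution = deepcopy(min(search_space, key=fitness))
    # Alter every potential solution a fixed number of times, tracking the best.
    for _ in range(iterations):
        for i in range(len(search_space)):
            search_space[i] = mutate(search_space[i])
            if fitness(search_space[i]) < fitness(global_solution):
                global_solution = deepcopy(search_space[i])
    return global_solution

# Toy problem: minimize f(x) = x^2 over real x in [-10, 10].
random.seed(0)  # for reproducibility of the sketch
best = random_search_sketch(
    sample=lambda: random.uniform(-10, 10),
    mutate=lambda x: x + random.uniform(-1, 1),
    fitness=lambda x: x * x,
)
```

MetaGen's own RandomSearch, built below, follows the same loop but delegates sampling and mutation to Solution objects constrained by a Domain.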
The Domain and Solution classes are required, so they are imported from the metagen.framework package. The Callable and List types are imported for typing management, and the deepcopy method from the standard copy package is used to preserve a consistent copy of the global solution at each iteration.
```python
from copy import deepcopy
from typing import Callable, List

from metagen.framework import Domain, Solution
```
Prototype
The RandomSearch metaheuristic takes as input a problem definition, which is composed of a Domain object and a fitness function that takes a Solution object as input and returns a float. The method also has two arguments that control the metaheuristic: the search space size (default 30) and the number of iterations (default 20). The search space size controls the number of potential solutions generated, and the number of iterations determines the number of times the search space will be altered. The result of a RandomSearch run is a Solution object that assigns the variables of the Domain object in a way that optimizes the fitness function.
In order to encapsulate all these characteristics, a RandomSearch class is defined.
```python
class RandomSearch:

    def __init__(self, domain: Domain, fitness: Callable[[Solution], float],
                 search_space_size: int = 30, iterations: int = 20) -> None:
        self.domain = domain
        self.fitness = fitness
        self.search_space_size = search_space_size
        self.iterations = iterations
```
Search space building
The first step of the method involves constructing the search space, which consists of search_space_size potential solutions, i.e., Solution objects. Each potential solution is randomly generated using Solution's init method, which creates a Solution object from a Domain. The newly created Solution is then evaluated with the evaluate method, passing the fitness function, and all the potential solutions are stored in a List of Solution. A copy of the Solution with the minimum fitness value is also kept as the global solution.
```python
    def run(self) -> Solution:
        search_space: List[Solution] = list()
        for _ in range(self.search_space_size):
            initial_solution: Solution = Solution(self.domain)
            initial_solution.evaluate(self.fitness)
            search_space.append(initial_solution)
        global_solution: Solution = deepcopy(min(search_space))
```
Altering the search space
The final step involves modifying the potential solutions in the search space over the specified number of iterations.
Each Solution is modified using the mutate
method, which modifies the variable values of
a Solution while taking into account
the Domain.
The method randomly selects a variable in
the Solution to modify and changes its value
randomly while also respecting
the Domain. If the
modified Solution is better than the current
global Solution, the latter is updated with a
copy of the former.
Finally, the global_solution is returned.
```python
        for _ in range(self.iterations):
            for ps in search_space:
                ps.mutate()
                ps.evaluate(self.fitness)
                if ps < global_solution:
                    global_solution = deepcopy(ps)

        return global_solution
```
How to perform a hyperparameter optimization with MetaGen
Consider a typical machine learning regression problem, where the goal is to find the hyperparameters that build the best model for a training set. To do this, a Domain object is constructed by defining a variable for each hyperparameter to optimize.
Defining the Domain
In this case, three variables are defined: a REAL variable called alpha, with values in the range [0.0001, 0.001], is defined using the define_real method; an INTEGER variable called iterations, with values in the range [5, 200], is defined using the define_integer method; and a CATEGORICAL variable called loss is defined using the define_categorical method, with the possible values squared_error, huber, and epsilon_insensitive.
```python
from metagen.framework import Domain

regression_domain = Domain()
regression_domain.define_real("alpha", 0.0001, 0.001)
regression_domain.define_integer("iterations", 5, 200)
regression_domain.define_categorical("loss", ["squared_error", "huber", "epsilon_insensitive"])
```
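To make the three variable types concrete, here is a plain-Python sketch of drawing one random configuration from this domain. Uniform sampling and the sample_configuration helper are assumptions for illustration only; MetaGen handles sampling internally through the Domain object.

```python
import random

def sample_configuration():
    # Hypothetical illustration: one random point from the domain defined above.
    return {
        "alpha": random.uniform(0.0001, 0.001),   # REAL in [0.0001, 0.001]
        "iterations": random.randint(5, 200),     # INTEGER in [5, 200]
        "loss": random.choice(["squared_error", "huber", "epsilon_insensitive"]),  # CATEGORICAL
    }

config = sample_configuration()
```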
Implementing the fitness function
The fitness function must then construct a regression model using the training dataset and the hyperparameters of the potential solution. In this case, the sklearn package is used for the machine learning operations.
A synthetic training dataset with 1000 instances and 4 features is generated using the make_regression method from the sklearn.datasets package, and it is loaded into two variables: the inputs X and the expected outputs y.
```python
from sklearn.datasets import make_regression

X, y = make_regression(n_samples=1000, n_features=4)
```
The function regression_fitness is defined with a Solution object as an input parameter. The values of the loss, iterations, and alpha hyperparameters are obtained through Python's bracket operator.
A regression model using stochastic gradient descent is constructed using the SGDRegressor class from the sklearn.linear_model package and the obtained values. Cross-validation training is performed using the cross_val_score function from the sklearn.model_selection package by passing the configured model and the training dataset (X and y). The cross-validation process is set to return the negative value of the mean absolute percentage error (mape), which is specified in the scoring argument.
To find the solution with the least error (i.e., the smallest mape), the resulting mape value must be multiplied by -1.
```python
from metagen.framework import Solution
from sklearn.linear_model import SGDRegressor
from sklearn.model_selection import cross_val_score

def regression_fitness(solution: Solution):
    loss = solution["loss"]  # The bracket operator returns the built-in value of the variable.
    iterations = solution["iterations"]
    alpha = solution["alpha"]
    model = SGDRegressor(loss=loss, alpha=alpha, max_iter=iterations)
    mape = cross_val_score(model, X, y,
                           scoring="neg_mean_absolute_percentage_error").mean() * -1
    return mape
```
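As a quick sanity check of the sign convention, here is the mean absolute percentage error computed by hand on made-up values. scikit-learn's neg_mean_absolute_percentage_error scorer returns the negated error so that higher scores always mean better models, which is why the fitness function flips the sign back before minimizing.

```python
def mape(y_true, y_pred):
    # Mean absolute percentage error: mean of |(y - y_hat) / y|.
    return sum(abs((t - p) / t) for t, p in zip(y_true, y_pred)) / len(y_true)

# The scorer would return the negated error (larger is better)...
score = -mape([100.0, 200.0], [110.0, 190.0])
# ...so regression_fitness multiplies by -1 to recover an error to minimize.
error = score * -1
```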
To conclude, the regression_domain and regression_fitness elements are passed to the RandomSearch metaheuristic, obtaining a hyperparameter solution for this problem by calling the run method.
```python
regression_solution: Solution = RandomSearch(regression_domain, regression_fitness).run()
```
Finally, the regression_solution is printed.
```python
print(regression_solution)
```
See also
- Domain API
- Solution API
- Google Colab Notebook of Optimizing the hyperparameters of a regression model
Resources