Multi-start and hyper-parameter optimization in pure Julia
Description
Hyperopt.jl is a Julia library that implements a number of hyperparameter optimization algorithms which can be used to optimize the starting point of the optimization. NonconvexHyperopt.jl allows the use of the algorithms in Hyperopt.jl as meta-algorithms using the HyperoptAlg struct.
Quick start
Given a model model and an initial solution x0, the following can be used to optimize the model using Hyperopt.
using Nonconvex
Nonconvex.@load Hyperopt
alg = HyperoptAlg(IpoptAlg())
options = HyperoptOptions(sub_options = IpoptOptions(), sampler = GPSampler())
result = optimize(model, alg, x0, options = options)Hyperopt is an optional dependency of Nonconvex so you need to import it in order to use it. HyperoptAlg can wrap any other algorithm in Nonconvex, e.g. IpoptAlg(). When the algorithm is a HyperoptAlg, the options keyword argument must of type HyperoptOptions. For more on the options available see below.
Construct an instance
To construct an instance of the Hyperopt + Ipopt algorithm, use:
alg = HyperoptAlg(IpoptAlg())HyperoptAlg can wrap any other algorithm in Nonconvex, e.g. NLoptAlg(:LD_MMA) or AugLag().
Options
The options keyword argument to the optimize function shown above must be an instance of the HyperoptOptions struct when the algorihm is a HyperoptAlg. To specify options, use keyword arguments in the constructor of HyperoptOptions. The sampler keyword argument determines the sampling algorithm used to propose new starting points in the multi-start procedure. The sub_options keyword argument can be used to pass in the options for the sub-optimizer. There are 2 different ways to pass the sub-options depending on the sampler type.
The sampler argument can be of type:
- RandomSampler
- LHSampler
- CLHSampler
- GPSampler
- Hyperband
When optimizing the starting point, the upper and lower bounds on the initial solution must be finite, or finite bounds must be passed in to the options constructor. All the options that can be passed to the HyperoptOptions constructor are listed below:
NonconvexMultistart.HyperoptOptions — TypeHyperoptOptions: options performing starting point optimization using Hyperopt.jl- sub_options: options for the sub-optimizer.
- lb: Lower bound of starting point, if don't specify it, the default value will be- nothing, then will end up be replaced by the lower bound of optimization problem.
- ub: Upper bound of starting point, same as above.
- searchspace_size::Integer: How many potential starting points we generate.
- iters::Integer: Among all generated potential starting points, how many of them will be evaluated.
- sampler::Hyperopt.Sampler: An instance of 'Hyperopt.Sampler', which decides search algorithm.
- ctol: infeasibility tolerance for accepting a solution as feasible
- keep_all: if true, all the solutions of the sub-problems will be saved
Sampler choice
RandomSampler, LHSampler, CLHSampler and GPSampler
All the sampler constructors are functions defined in Nonconvex wrapping the Hyperopt alternatives to define defaults. For GPSampler, Hyperopt.Min is always used by default in Nonconvex so you should not pass this argument. All the other arguments that can be passed to the sampler constructor can be found in the Hyperopt documentation. Example:
options = HyperoptOptions(sub_options = IpoptOptions(), sampler = GPSampler())Hyperband
The Hyperband algorithm in Hyperopt requires a different way to pass in the sub-options. The Hyperband algorithm tries to optimize the allocation of resources. The sub_options argument must be a function with input as the "resources" and output as the sub-solver options. The Hyperband constructor accepts 3 arguments:
- The maximum resources R
- ηwhich roughly determines the proportion of trials discarded between each round of successive halving
- innerwhich specifies an inner sampler of type- RandomSampler,- LHSampleror- CLHSampler.
Example:
options = HyperoptOptions(
    sub_options = max_iter -> IpoptOptions(max_iter = max_iter), 
    sampler = Hyperband(R=100, η=3, inner=RandomSampler()),
)