NAME
PDL::Opt::ParticleSwarm - Particle Swarm Optimization (object oriented)
SYNOPSIS
use PDL::Opt::ParticleSwarm;
my $pso = PDL::Opt::ParticleSwarm->new (
-fitFunc => \&calcFit,
-dimensions => 1,
);
my $bestFit = $pso->optimize();
my $bestPos = $pso->getBestPos();
print "Fit $bestFit at $bestPos\n";
sub calcFit {
my $vec = shift;
my $x = $vec->slice('(0)');
# The parabola (x+3)^2 - 5 has a minima at x=-3:
return (($x+3)**2 - 5);
}
Description
The Particle Swarm Optimization technique uses communication of the current best position found between a number of particles moving over a hyper surface as a technique for locating the best location on the surface (where 'best' is the minimum of some fitness function). For a Wikipedia discussion of PSO see http://en.wikipedia.org/wiki/Particle_swarm_optimization.
This pure Perl module is an implementation of the Particle Swarm Optimization technique for finding minima of hyper surfaces using the PDL module for accelerated vector calculations. It presents an object oriented interface that facilitates easy configuration of the optimization parameters and (in principle) allows the creation of derived classes to reimplement all aspects of the optimization engine (a future version will describe the replaceable engine components).
This implementation allows communication of a local best point between a selected number of neighbours.
This module and its documentation is based on AI::ParticleSwarmOptimization by Peter Jaquiery to support PDL objects and additional features such as search space scaling and initial guess options.
Methods
PDL::Opt::ParticleSwarm provides the following public methods. The parameter lists shown for the methods denote optional parameters by showing them in [].
- new(%parameters)
-
Create an optimization object. The following parameters may be used:
- -initialGuess: a vector of initial "best" values to start with.
-
This must be a PDL object. As long as it is correctly broadcasts, this can take any PDL representation.
- -searchSize: a scalar multiple to control the search distance from
-initialGuess
-
For example, if your
initialGuess
is "5",posMin
/posMax
range is 0-10, andsearchSize
is 0.5 then it will initialize the particles to search in the region between 2.5 and 7.5.If undefined, then search the entire space between
posMin
andposMax
Default value: undef
- -dimensions: positive number, semi-required
-
The number of dimensions of the hypersurface being searched. This can be omitted if
-intitialGuess
is provided. - -exitFit: number, optional
-
If provided, -exitFit allows early termination of optimize if the fitness value becomes equal or less than -exitFit.
- -fitFunc: required
-
-fitFunc is a reference to the fitness function used by the search. If extra parameters need to be passed to the fitness function an array ref may be used with the code ref as the first array element and parameters to be passed into the fitness function as following elements. User provided parameters are passed as the first parameters to the fitness function when it is called:
my $pso = PDL::Opt::ParticleSwarm->new ( fitFunc => [\&calcFit, $context], dimensions => 3, ); ... sub calcFit { my ($context, $pdl_vec_to_optimize) = @_; ... do something with $pdl_vec_to_optimize return $fitness; }
In addition to any user provided parameters the list of values representing the current particle position in the hyperspace is passed in. There is one value per hyperspace dimension.
- -logFunc: log function callback, optional
-
This function is called after each iteration with the current
bestPos
andbestFit
values as follows:$self->{logFunc}->($self->getBestPos(), $self->getBestFit());
- -inertia: positive or zero number, optional
-
Determines what proportion of the previous velocity is carried forward to the next iteration. This can be a PDL object, so it should work in any dimension so long as it works in the broadcast sense.
Defaults to 0.9
See also -meWeight and -themWeight.
- -iterations: number, optional
-
Number of optimization iterations to perform. Defaults to 1000.
- -meWeight: number, optional
-
Coefficient determining the influence of the current local best position on the next iterations velocity. This can be a PDL object, so it should work in any dimension so long as it works in the broadcast sense.
Defaults to 0.5.
See also -inertia and -themWeight.
- -numNeighbors: positive number, optional
-
Number of local particles considered to be part of the neighbourhood of the current particle.
Defaults to the square root of the total number of particles.
Background: "The basic version of the algorithm uses the global topology as the swarm communication structure [when (numNeighbors=numParticles-1)]. This topology allows all particles to communicate with all the other particles, thus the whole swarm share the same best position from a single particle. However, this approach might lead the swarm to be trapped into a local minimum." [ https://en.wikipedia.org/wiki/Particle_swarm_optimization ]
Thus, as
numNeighbors
approachesnumParticles-1
, there is a greater risk of getting stuck in a local minima.The neighbor selection algorithm in PDL::Opt::ParticleSwarm uses
numNeighbors
particles following the Nth particle index in the particle piddle being evaluated for its neighbors. The term "neighbor" is perhaps a misnomer in the sense that it is not a vector distance, but an index distance.Future work could include adding other neighbor selection algorithms (ie, random, vector distance, ring, others).
- -numParticles: positive number, optional
-
Number of particles in the swarm. Defaults to 10 times the number of dimensions.
- -posMax: number, optional
-
Maximum coordinate value for any dimension in the hyper space. This can be a PDL object, so it should work in any dimension so long as it works in the broadcast sense. For example, different dimensions could have different posMax values by passing a vector of length
dimension
.Defaults to 100.
- -posMin: number, optional
-
Minimum coordinate value for any dimension in the hyper space. This can be a PDL object, so it should work in any dimension so long as it works in the broadcast sense. For example, different dimensions could have different posMax values by passing a vector of length
dimension
.Defaults to --posMax (if -posMax is negative -posMin should be set more negative).
- -randStartVelocity: boolean, optional
-
Set true to initialize particles with a random velocity. Otherwise particle velocity is set to 0 on initalization.
A range based on 1/100th of --posMax - -posMin is used for the initial speed in each dimension of the velocity vector if a random start velocity is used.
- -stallSpeed: positive number, optional
-
Speed below which a particle is considered to be stalled and is repositioned to a new random location with a new initial speed. This can be a PDL object, so it should work in any dimension so long as it works in the broadcast sense.
By default -stallSpeed is undefined but particles with a speed of 1e-9 will be repositioned.
- -stallSearchScale: positive number, optional
-
If a particle stalls, then jump to a random location that exists +/-
searchSize%
of the current location, but increasesearchSize%
for that particle bystallSearchScale
after each stall:$searchSize = $searchSize * ($stallSearchScale ** $numStalls)
Default: 1 (does not change
searchSize
).A recommended value is that which will slightly increase
searchSize
over time. For example, ifstallSearchScale = 1.1
andsearchSize = 0.5
thensearchSize
will change for that particle as follows:Stalls searchSize ------ ---------- 0 0.5 # initial configured value 1 0.55 2 0.605 3 0.6655 5 0.8053 7 0.9744
In this example,
searchSize
will be capped at a value of 1.0 after the particle stalls 8 times.See also: -searchSize
- -themWeight: number, optional
-
Coefficient determining the influence of the neighbourhod best position on the next iterations velocity. This can be a PDL object, so it should work in any dimension so long as it works in the broadcast sense.
Defaults to 0.5.
See also -inertia and -meWeight.
- -exitPlateau: boolean, optional
-
Set true to have the optimization check for plateaus (regions where the fit hasn't improved much for a while) during the search. The optimization ends when a suitable plateau is detected following the burn in period.
Defaults to undefined (option disabled).
- -exitPlateauDP: number, optional
-
Specify the number of decimal places to compare between the current fitness function value and the mean of the previous -exitPlateauWindow values.
Defaults to 10.
- -exitPlateauWindow: number, optional
-
Specify the size of the window used to calculate the mean for comparison to the current output of the fitness function. Correlates to the minimum size of a plateau needed to end the optimization.
Defaults to 10% of the number of iterations (-iterations).
- -exitPlateauBurnin: number, optional
-
Determines how many iterations to run before checking for plateaus.
Defaults to 50% of the number of iterations (-iterations).
- -verbose: flags, optional
-
If set to a non-zero value -verbose determines the level of diagnostic print reporting that is generated during optimization.
The following constants may be bitwise ored together to set logging options:
kLogBetter
prints particle details when its fit becomes bebtter than its previous best.
kLogStall
prints particle details when its velocity reaches 0 or falls below the stall threshold.
kLogIter
Shows the current iteration number.
- setParams(%parameters)
-
Set or change optimization parameters. See -new above for a description of the parameters that may be supplied.
- init()
-
Reinitialize the optimization. init() will be called during the first call to optimize() if it hasn't already been called.
- optimize()
-
Runs the minimization optimization. Returns the fit value of the best fit found. The best possible fit is negative infinity.
optimize() may be called repeatedly to continue the fitting process. The fit processing on each subsequent call will continue from where the last call left off.
Use
getBestPos()
andgetBestFit()
after optimization to get the optimized result. - getBestPos()
-
Return the best position vector that has been found so far, as determined by
fitFunc
. Use this after callingoptimize()
. - getBestFit()
-
Return the fit value that has been found so far, as returned by
fitFunc
Use this after callingoptimize()
. - getIterationCount()
-
Return the number of iterations performed. This may be useful when the -exitFit criteria has been met or where multiple calls to optimize have been made.
- getStallCount()
-
Returns a PDL of stall counts per particle.
Hint: to get a total stall count, call `$pso->getStallCount()->sum()` .
SEE ALSO
- PDL::Opt::ParticleSwarm::Simple - Use names for Particle Swarm-optimized values
- PDL::Opt::Simplex - A PDL implementation of the Simplex optimization algorithm
- PDL::Opt::Simplex::Simple - Use names for Simplex-optimized values
- http://en.wikipedia.org/wiki/Particle_swarm_optimization
- AI::ParticleSwarmOptimization
- AI::PSO
ACKNOWLEDGEMENTS
This PDL implementation is based on AI::ParticleSwarmOptimization by Peter Jaquiery (GRANDPA), which in turn was based on the AI::PSO originally created by Kyle Schlansker (KYLESCH).
AUTHORS
- Copyright (C) 2023 by Eric Wheeler (EWHEELER)
- Copyright (C) 2011 by Peter Jaquiery (GRANDPA) as AI::ParticleSwarmOptimization
- Copyright (C) 2006 by W. Kyle Schlansker (KYLESCH) as AI::PSO
All rights reserved.
LICENSE
This module is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This module is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this module. If not, see <http://www.gnu.org/licenses/>.