https://en.wikipedia.org/wiki/Mathematical_optimization
https://en.wikipedia.org/wiki/Optimization_%28disambiguation%29
http://www.benfrederickson.com/numerical-optimization/
Good learning resource: book + lectures.
See intro slides. Often an n-dimensional real number space.
Constrained optimization refers to optimization where the optimization space is some subspace of an "underlying space". This is a bit arbitrary, but the underlying space is most often assumed to be and the constraints then define a subset of through inequalities and equalities.
All of these are mostly refering to continuous optimization
A mathematical optimization problem, or optimization problem.
One can also have equality constraints, apart from inequalities.
Andrew Ng video – General optimization problem – Dual problem, uses Max–min inequality. Conditions for the primal and dual problems being equivalent (includes Karush–Kuhn–Tucker conditions)
Has applications in Support vector machines. The reasons we use the dual problem is that it has many useful properties, I think it is convex?
Used in Operations research. Can be used to solve (either exactly or approximately) some discrete optimization problems. See Linear programming relaxation
Objective and constraint functins are non-linear. Some special cases:
Often for global nonlinear optimization, we need to use brute force methods, with Computational complexity exponential in the dimension of th optimzation space (space of optimization variables). For approximate but faster solutions, we can use local optimizal optimization, or heuristics.
http://mat.gsia.cmu.edu/classes/QUANT/NOTES/chap4/node6.html
Good explanation of inequality constraints using an extension of Lagrange multipliers. Think of planes in 3D as constraint surfaces for intuition on equation! Delta f should be perpendicular to hypersurface defined by constraints.
Can formulate discrete optimization as an Energy minimization problem on a set of random variables which can take values in a discrete set. This can be formulated as a Graphical model. Energy-based model, Ising model..
Find local optima, as a method to approximately solve hard nonlinear optimization problems. Main methods:
THE EFFECT OF GRADIENT NOISE ON THE ENERGY LANDSCAPE OF DEEP NETWORKS
EXPLORATIONS ON HIGH DIMENSIONAL LANDSCAPES
Gradient-based Hyperparameter Optimization through Reversible Learning
See links here
https://en.wikipedia.org/wiki/Inferential_programming
Many applications in Science, Engineering, Statistics...