Optimisation
Multidimensional Taylor Series
The scalar case of the taylor series is an expansion of the function about the point :
This can be generalised to a matrix case. Let be a scalar function of a column vector . The taylor series expansion of about the point is:
This result is a scalar. Consider the first three terms:
is a row vector with it's gradient evaluated at the point :
is the matrix of second derivatives, called the Hessian matrix, evaluated at point
- The Hessian matrix is generally symmetric
- Matrix of mixed partial derivatives
Taylor series can be used to approximate multidimensional functions:
- Let be a scalar function of an vector ,
- Expand about a point , assuming displacements about
- The first term is a linear form
- Second term a quadratic form
- Higher order terms are ignored
Multidimensional Optimisation
Optimisation tasks involve finding such that is at an extremum (max/min).
Consider a continuous function , expanded about the point , with a vector as the displacement from :
The point is an extremum if the gradient vector when . The homogenous nonlinear equation therefore defines an extremum .
If , then:
Therefore:
This is the important result that defines the extremum of a function
To determine the nature of the extremum, the sign of the must be determined. By the spectral resolution, this is determined by the eigenvalues of . It is said that the sign definiteness of is determined by
Let be a quadratic form m, where is a symmetric matrix. The eigenvalues of are . The definiteness is determined by all of the eigenvalues:
Definiteness of | Nature of Point | |
---|---|---|
Positive Definite | Minimum | |
Positive Semidefinite | Probable Valley | |
and | Indefinite | Saddle Point |
Negative Semidefinite | Probable Ridge | |
Negative Definite | Maximum |
Extrema of a Multivariate Quadratic
For a quadratic , the extremum is at the point where :
A maximum/minimum exists at the point if the matrix is positive/negative definite.
Example 1
Find the extremum (and it's nature) of the quadratic function:
Put into matrix form:
The extremum of this quadratic form exists at the point where:
To determine the nature, find the eigenvalues of :
The eigenvalues lie either side of zero, which makes indefinite, and the extremum is therefore a saddle point.
Example 2
Find the stationary points (and their nature) for the function:
The stationary points lie where :
The solutions are therefore:
Two solutions:
To determine the nature of the extremum, we need the hessian matrix:
The eigenvalues at each point will give the nature. For :
The hessian matrix is positive semidefinite, so the point is probably a valley, but further analysis is required to determine the nature of the point.
For :
The hessian matrix is indefinite, so the point is a saddle point.