Low-rank approximation

In mathematics, low-rank approximation is a minimization problem, in which the cost function measures the fit between a given matrix and an approximating matrix, subject to a constraint that the approximating matrix has reduced rank. The problem is used for mathematical modeling and data compression. The rank constraint is related to a constraint on the complexity of a model that fits the data. In applications, often there are other constraints on the approximating matrix apart from the rank constraint, e.g., non-negativity and Hankel structure.
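Low-rank approximation is closely related to numerous other techniques, including principal component analysis, factor analysis, total least squares, latent semantic analysis, orthogonal regression, and dynamic mode decomposition.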

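Definition

Given

a structure specification $\mathcal{S} : \mathbb{R}^{n_p} \to \mathbb{R}^{m \times n}$,
a vector of structure parameters $p \in \mathbb{R}^{n_p}$,
a norm $\|\cdot\|$, and
a desired rank $r$,

the low-rank approximation problem is

$$\text{minimize over } \hat p \quad \|p - \hat p\| \quad \text{subject to} \quad \operatorname{rank}\big(\mathcal{S}(\hat p)\big) \le r.$$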
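
As an illustration of a structure specification, the sketch below builds a 3-by-3 Hankel matrix from a parameter vector of length 5 using the built-in `hankel` function. The parameter values are an assumption chosen for illustration: a geometric sequence, for which the structured matrix happens to have rank 1.

```matlab
% Parameter vector p (illustrative values: a geometric sequence).
p = [1 2 4 8 16];
% Structure specification S: R^5 -> R^(3x3), Hankel structure.
% hankel(c, r) uses c as the first column and r as the last row.
S = @(p) hankel(p(1:3), p(3:5));
D = S(p);        % constant along anti-diagonals
r = rank(D);     % every row is a multiple of [1 2 4]
```

Perturbing a single entry of `p` generally raises the rank of `S(p)`, which is the situation in which a structured low-rank approximation would be sought.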

Applications

Linear system identification, in which case the approximating matrix is Hankel structured.
Machine learning, in which case the approximating matrix is nonlinearly structured.
Recommender systems, in which case the data matrix has missing values and the approximation is categorical.
Distance matrix completion, in which case there is a positive definiteness constraint.
Natural language processing, in which case the approximation is nonnegative.
Computer algebra, in which case the approximation is Sylvester structured.

Basic low-rank approximation problem

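The unstructured problem with fit measured by the Frobenius norm, i.e.,
$$\text{minimize over } \hat D \quad \|D - \hat D\|_F \quad \text{subject to} \quad \operatorname{rank}(\hat D) \le r,$$
has an analytic solution in terms of the singular value decomposition of the data matrix. The result is referred to as the matrix approximation lemma or Eckart–Young–Mirsky theorem. Let
$$D = U \Sigma V^\top \in \mathbb{R}^{m \times n}$$
be the singular value decomposition of $D$, with singular values $\sigma_1 \ge \cdots \ge \sigma_{\min(m,n)} \ge 0$, and partition $U$, $\Sigma$, and $V$ as follows:
$$U =: \begin{bmatrix} U_1 & U_2 \end{bmatrix}, \quad \Sigma =: \begin{bmatrix} \Sigma_1 & 0 \\ 0 & \Sigma_2 \end{bmatrix}, \quad V =: \begin{bmatrix} V_1 & V_2 \end{bmatrix},$$
where $U_1$ is $m \times r$, $\Sigma_1$ is $r \times r$, and $V_1$ is $n \times r$. Then the rank-$r$ matrix obtained from the truncated singular value decomposition,
$$\hat D^* = U_1 \Sigma_1 V_1^\top,$$
is such that
$$\|D - \hat D^*\|_F = \min_{\operatorname{rank}(\hat D) \le r} \|D - \hat D\|_F = \sqrt{\sigma_{r+1}^2 + \cdots + \sigma_{\min(m,n)}^2}.$$
The minimizer $\hat D^*$ is unique if and only if $\sigma_{r+1} \ne \sigma_r$.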
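
The statement above can be checked numerically. The following is a minimal sketch, assuming an arbitrary illustrative data matrix `D` and target rank `r`; it forms the truncated singular value decomposition and compares the attained Frobenius error with the square root of the sum of the squared discarded singular values.

```matlab
% Illustrative data matrix (values are arbitrary) and target rank.
D = [1 2 3; 4 5 6; 7 8 10; 2 1 0];
r = 2;
[U, S, V] = svd(D);
s = diag(S);                                  % singular values, descending
Dh = U(:, 1:r) * S(1:r, 1:r) * V(:, 1:r)';    % truncated SVD, rank r
err = norm(D - Dh, 'fro');                    % attained approximation error
bound = sqrt(sum(s(r+1:end).^2));             % sqrt of discarded sigma_i^2
```

Up to rounding, `err` and `bound` agree.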

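Proof of Eckart–Young–Mirsky theorem (for spectral norm)

Let $A \in \mathbb{R}^{m \times n}$ be a real (possibly rectangular) matrix with $m \le n$. Suppose that
$$A = U \Sigma V^\top$$
is the singular value decomposition of $A$. Recall that $U$ and $V$ are orthogonal matrices, and $\Sigma$ is an $m \times n$ diagonal matrix with entries $(\sigma_1, \sigma_2, \ldots, \sigma_m)$ such that $\sigma_1 \ge \sigma_2 \ge \cdots \ge \sigma_m \ge 0$.
We claim that the best rank-$k$ approximation to $A$ in the spectral norm, denoted by $\|\cdot\|_2$, is given by
$$A_k := \sum_{i=1}^{k} \sigma_i u_i v_i^\top,$$
where $u_i$ and $v_i$ denote the $i$th column of $U$ and $V$, respectively.
First, note that we have
$$\|A - A_k\|_2 = \Big\| \sum_{i=1}^{m} \sigma_i u_i v_i^\top - \sum_{i=1}^{k} \sigma_i u_i v_i^\top \Big\|_2 = \Big\| \sum_{i=k+1}^{m} \sigma_i u_i v_i^\top \Big\|_2 = \sigma_{k+1}.$$
Therefore, we need to show that if $B_k = X Y^\top$, where $X$ and $Y$ have $k$ columns, then $\|A - A_k\|_2 = \sigma_{k+1} \le \|A - B_k\|_2$.
Since $Y$ has $k$ columns, there must be a nontrivial linear combination of the first $k+1$ columns of $V$, i.e.,
$$w = \gamma_1 v_1 + \cdots + \gamma_{k+1} v_{k+1},$$
such that $Y^\top w = 0$. Without loss of generality, we can scale $w$ so that $\|w\|_2 = 1$ or, equivalently, $\gamma_1^2 + \cdots + \gamma_{k+1}^2 = 1$. Therefore,
$$\|A - B_k\|_2^2 \ge \|(A - B_k) w\|_2^2 = \|A w\|_2^2 = \gamma_1^2 \sigma_1^2 + \cdots + \gamma_{k+1}^2 \sigma_{k+1}^2 \ge \sigma_{k+1}^2.$$
The result follows by taking the square root of both sides of the above inequality.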
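
A small numerical check of the spectral-norm claim may help. This is a sketch under assumed data: the matrix `A` and the rank-`k` competitor `Bk` are arbitrary illustrative choices, not taken from the text.

```matlab
A = [1 2 3 4; 2 0 1 5; 3 1 4 1];             % illustrative 3x4 matrix, m <= n
k = 1;
[U, S, V] = svd(A);
s = diag(S);                                  % singular values, descending
Ak = U(:, 1:k) * S(1:k, 1:k) * V(:, 1:k)';    % truncated SVD of rank k
errk = norm(A - Ak, 2);                       % spectral-norm error of A_k
% An arbitrary competitor of rank k: X and Y each have k columns.
X = [1; 1; 1];  Y = [1; 1; 2; 3];
Bk = X * Y';
errB = norm(A - Bk, 2);                       % spectral-norm error of B_k
```

Here `errk` coincides with `s(k+1)` up to rounding, and `errB` is no smaller, as the proof requires for any choice of `X` and `Y`.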

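Proof of Eckart–Young–Mirsky theorem (for Frobenius norm)

Let $A \in \mathbb{R}^{m \times n}$ be a real (possibly rectangular) matrix with $m \le n$. Suppose that
$$A = U \Sigma V^\top$$
is the singular value decomposition of $A$.
We claim that the best rank-$k$ approximation to $A$ in the Frobenius norm, denoted by $\|\cdot\|_F$, is given by
$$A_k = \sum_{i=1}^{k} \sigma_i u_i v_i^\top,$$
where $u_i$ and $v_i$ denote the $i$th column of $U$ and $V$, respectively.
First, note that we have
$$\|A - A_k\|_F^2 = \Big\| \sum_{i=k+1}^{m} \sigma_i u_i v_i^\top \Big\|_F^2 = \sum_{i=k+1}^{m} \sigma_i^2.$$
Therefore, we need to show that if $B_k = X Y^\top$, where $X$ and $Y$ have $k$ columns, then
$$\|A - A_k\|_F^2 = \sum_{i=k+1}^{m} \sigma_i^2 \le \|A - B_k\|_F^2.$$
By the triangle inequality with the spectral norm, if $A = A' + A''$ then $\sigma_1(A) \le \sigma_1(A') + \sigma_1(A'')$. Suppose $A'_k$ and $A''_k$ respectively denote the rank-$k$ approximation to $A'$ and $A''$ by the SVD method described above. Then, for any $i, j \ge 1$,
$$\sigma_i(A') + \sigma_j(A'') = \sigma_1(A' - A'_{i-1}) + \sigma_1(A'' - A''_{j-1}) \ge \sigma_1(A - A'_{i-1} - A''_{j-1}) \ge \sigma_1(A - A_{i+j-2}) = \sigma_{i+j-1}(A),$$
where the last inequality holds because $\operatorname{rank}(A'_{i-1} + A''_{j-1}) \le i + j - 2$.
Since $\sigma_{k+1}(B_k) = 0$, when $A' = A - B_k$ and $A'' = B_k$ we conclude that for $i \ge 1$, $j = k + 1$,
$$\sigma_i(A - B_k) \ge \sigma_{k+i}(A).$$
Therefore,
$$\|A - B_k\|_F^2 = \sum_{i=1}^{m} \sigma_i(A - B_k)^2 \ge \sum_{i=1}^{m-k} \sigma_i(A - B_k)^2 \ge \sum_{i=k+1}^{m} \sigma_i(A)^2 = \|A - A_k\|_F^2,$$
as required.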
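
The key step of this proof is the singular value inequality between the residual and the original matrix, shifted by the rank of the competitor. The sketch below checks it numerically; the matrix `A` (a 4-by-4 magic square) and the factors `X` and `Y` are assumptions chosen only for illustration.

```matlab
A = magic(4);                         % illustrative 4x4 matrix
k = 2;
X = [1 0; 0 1; 1 1; 2 1];             % arbitrary factors with k columns
Y = [1 2; 0 1; 3 0; 1 1];
Bk = X * Y';                          % competitor of rank at most k
sA = svd(A);                          % singular values of A, descending
sE = svd(A - Bk);                     % singular values of A - Bk, descending
m = numel(sA);
% Key step: i-th singular value of A - Bk >= (k+i)-th singular value of A.
gap = sE(1:m-k) - sA(k+1:m);
% Consequence: squared Frobenius error of Bk >= sum of discarded sigma_i^2.
lhs = norm(A - Bk, 'fro')^2;
rhs = sum(sA(k+1:m).^2);
```

All entries of `gap` are nonnegative up to rounding, and `lhs` is at least `rhs`.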

Weighted low-rank approximation problems

The Frobenius norm weights uniformly all elements of the approximation error $D - \hat D$. Prior knowledge about the distribution of the errors can be taken into account by considering the weighted low-rank approximation problem
$$\text{minimize over } \hat D \quad \operatorname{vec}^\top(D - \hat D)\, W \operatorname{vec}(D - \hat D) \quad \text{subject to} \quad \operatorname{rank}(\hat D) \le r,$$
where $\operatorname{vec}(A)$ vectorizes the matrix $A$ column-wise and $W$ is a given positive definite weight matrix.
The general weighted low-rank approximation problem does not admit an analytic solution in terms of the singular value decomposition and is solved by local optimization methods, which provide no guarantee that a globally optimal solution is found.
Inspired by the Netflix Prize application, the weighted low-rank approximation problem can also be formulated element-wise: for a non-negative matrix $W$ (here of the same size as the data) and a matrix $A$, we want to minimize $\sum_{i,j} \big(W_{i,j}(A_{i,j} - B_{i,j})\big)^2$ over matrices $B$ of rank at most $r$.
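
The element-wise formulation can be sketched as follows. The ratings matrix, the convention that 0 marks a missing entry, and the rank-1 candidate `B` are all assumptions made for illustration; binary weights make the cost ignore the missing entries.

```matlab
% Illustrative ratings matrix; 0 marks a missing entry (assumed convention).
A = [5 3 0; 4 0 1; 0 2 5];
W = double(A ~= 0);                   % weight 1 if observed, 0 if missing
% A rank-1 candidate B (values chosen only for illustration).
B = [2; 1; 1] * [2 1 1];
% Element-wise weighted cost: sum over i,j of (W_ij * (A_ij - B_ij))^2.
cost = sum(sum((W .* (A - B)).^2));
% Unweighted squared Frobenius error, which also counts missing entries.
full = norm(A - B, 'fro')^2;
```

For this data `cost` is 23, whereas `full` is 32 because it also penalizes the entries treated as missing.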

Entry-wise $L_p$ low-rank approximation problems

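Let $\|A\|_p = \big(\sum_{i,j} |A_{i,j}|^p\big)^{1/p}$. For $p = 2$, the fastest algorithm runs in $\operatorname{nnz}(A) + n \cdot \operatorname{poly}(k/\epsilon)$ time, where $k$ is the target rank and $\epsilon$ is the accuracy parameter. One of the important ideas used is the oblivious subspace embedding (OSE), first proposed by Sarlos.
For $p = 1$, it is known that the entry-wise L1 norm is more robust than the Frobenius norm in the presence of outliers and is indicated in models where Gaussian assumptions on the noise may not apply. It is natural to seek to minimize $\|B - A\|_1$. For $p = 0$ and $p \ge 1$, there are some algorithms with provable guarantees.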
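
The robustness remark can be illustrated with a toy residual matrix. This is only a sketch: the helper `entrynorm` and the residual values are hypothetical, and the comparison is between the entry-wise L1 cost and the squared Frobenius cost that least squares methods minimize.

```matlab
% Entry-wise p-norm of a matrix (p >= 1); p = 2 gives the Frobenius norm.
entrynorm = @(A, p) sum(abs(A(:)).^p)^(1/p);
E = [1 -1; 1 -1];            % small residuals
Eo = E;  Eo(1, 1) = 10;      % the same residuals with one outlier
l1 = [entrynorm(E, 1), entrynorm(Eo, 1)];          % L1 cost: 4 and 13
l2sq = [entrynorm(E, 2)^2, entrynorm(Eo, 2)^2];    % squared cost: 4 and 103
% Share of the total cost that is due to the outlier alone.
share1 = 10 / l1(2);
share2 = 100 / l2sq(2);
```

The single outlier accounts for a larger share of the squared Frobenius cost than of the L1 cost, so it pulls a least squares fit more strongly.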

Distance low-rank approximation problem

Let $P = \{p_1, \ldots, p_m\}$ and $Q = \{q_1, \ldots, q_n\}$ be two point sets in an arbitrary metric space. Let $A$ represent the $m \times n$ matrix where $A_{i,j} = \operatorname{dist}(p_i, q_j)$. Such distance matrices are commonly computed in software packages and have applications to learning image manifolds, handwriting recognition, and multi-dimensional unfolding. In an attempt to reduce their description size, one can study low-rank approximation of such matrices.
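
As a concrete special case, the sketch below builds a matrix of squared Euclidean distances between two small point sets in the plane. The Euclidean setting, the squaring, and the coordinates are assumptions for illustration and are narrower than the arbitrary metric space considered in the text. In this special case the matrix has rank at most d + 2 for points in d dimensions, because each entry expands into a sum of d + 2 products.

```matlab
% Assumed setting: points in the Euclidean plane (d = 2), squared distances.
P = [0 0; 1 0; 0 1; 1 1; 2 1; 3 2];           % m = 6 points, one per row
Q = [1 2; 2 2; 0 3; 3 0; 4 1];                % n = 5 points, one per row
% A(i,j) = |p_i|^2 + |q_j|^2 - 2 * p_i' * q_j
A = bsxfun(@plus, sum(P.^2, 2), sum(Q.^2, 2)') - 2 * P * Q';
rk = rank(A);                                  % at most d + 2 = 4 here
```

Such a matrix can therefore be stored exactly through factors with 4 columns instead of all 30 entries; general metrics need not admit an exact low-rank form, which is why approximation is studied.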

Distributed/Streaming low-rank approximation problem

Low-rank approximation problems have also been considered in the distributed and streaming settings.

Image and kernel representations of the rank constraints

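Using the equivalences
$$\operatorname{rank}(\hat D) \le r \quad \iff \quad \text{there are } P \in \mathbb{R}^{m \times r} \text{ and } L \in \mathbb{R}^{r \times n} \text{ such that } \hat D = PL$$
and
$$\operatorname{rank}(\hat D) \le r \quad \iff \quad \text{there is a full row rank } R \in \mathbb{R}^{(m-r) \times m} \text{ such that } R \hat D = 0,$$
the weighted low-rank approximation problem becomes equivalent to the parameter optimization problems
$$\text{minimize over } \hat D,\ P \text{ and } L \quad \operatorname{vec}^\top(D - \hat D)\, W \operatorname{vec}(D - \hat D) \quad \text{subject to} \quad \hat D = PL$$
and
$$\text{minimize over } \hat D \text{ and } R \quad \operatorname{vec}^\top(D - \hat D)\, W \operatorname{vec}(D - \hat D) \quad \text{subject to} \quad R \hat D = 0 \ \text{ and } \ R R^\top = I_{m-r},$$
where $I_{m-r}$ is the identity matrix of size $m - r$.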
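
Both representations can be read off a singular value decomposition. The following sketch, on an assumed illustrative matrix, constructs image factors `P` and `L` and a kernel matrix `R` with orthonormal rows for a rank-`r` matrix `Dh`.

```matlab
D = [1 2 3; 4 5 6; 7 8 10; 2 1 0];            % illustrative 4x3 data
r = 2;  m = size(D, 1);
[U, S, V] = svd(D);
Dh = U(:, 1:r) * S(1:r, 1:r) * V(:, 1:r)';    % a matrix of rank r
% Image representation: Dh = P * L with P (m x r) and L (r x n).
P = U(:, 1:r) * S(1:r, 1:r);
L = V(:, 1:r)';
% Kernel representation: R * Dh = 0 with R ((m-r) x m), orthonormal rows.
R = U(:, r+1:m)';
```

The factors are not unique: for any nonsingular r-by-r matrix `T`, the pair `P * T` and `T \ L` represents the same `Dh`.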

Alternating projections algorithm

The image representation of the rank constraint suggests a parameter optimization method in which the cost function is minimized alternately over one of the variables ($P$ or $L$) with the other one fixed. Although simultaneous minimization over both $P$ and $L$ is a difficult biconvex optimization problem, minimization over one of the variables alone is a linear least squares problem and can be solved globally and efficiently.
The resulting optimization algorithm is globally convergent with a linear convergence rate to a locally optimal solution of the weighted low-rank approximation problem. A starting value for the $P$ (or $L$) parameter should be given. The iteration is stopped when a user-defined convergence condition is satisfied.
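
The two least squares subproblems can be written out explicitly. The derivation below is a sketch that assumes the weight matrix $W$ is positive definite and that $P$ and $L$ have full rank $r$, so that the inverses exist; it relies on the standard identity relating the vectorization of a product to Kronecker products.

```latex
% Vectorization identity used in both steps:
\operatorname{vec}(PL) = (I_n \otimes P)\,\operatorname{vec}(L)
                       = (L^\top \otimes I_m)\,\operatorname{vec}(P)

% Minimization over L with P fixed (weighted least squares):
\operatorname{vec}(L) = \big( (I_n \otimes P)^\top W (I_n \otimes P) \big)^{-1}
                        (I_n \otimes P)^\top W \operatorname{vec}(D)

% Minimization over P with L fixed (weighted least squares):
\operatorname{vec}(P) = \big( (L^\top \otimes I_m)^\top W (L^\top \otimes I_m) \big)^{-1}
                        (L^\top \otimes I_m)^\top W \operatorname{vec}(D)
```

The Kronecker products $I_n \otimes P$ and $L^\top \otimes I_m$ play the role of the matrices named `bp` and `bl` in the Matlab code.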
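Matlab implementation of the alternating projections algorithm for weighted low-rank approximation:

function [dh, f] = wlra_ap(d, w, p, tol, maxiter)
% d: m-by-n data matrix, w: mn-by-mn weight matrix, p: m-by-r starting value
[m, n] = size(d); r = size(p, 2); f = inf;
for i = 2:maxiter
    % minimization over L
    bp = kron(eye(n), p);
    vl = (bp' * w * bp) \ (bp' * w * d(:));
    l  = reshape(vl, r, n);
    % minimization over P
    bl = kron(l', eye(m));
    vp = (bl' * w * bl) \ (bl' * w * d(:));
    p  = reshape(vp, m, r);
    % check exit condition: stop when the cost decrease is below tol
    dh = p * l; dd = d - dh;
    f(i) = dd(:)' * w * dd(:);
    if abs(f(i - 1) - f(i)) < tol, break, end
end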

Variable projections algorithm

The alternating projections algorithm exploits the fact that the low-rank approximation problem, parameterized in the image form, is bilinear in the variables $P$ and $L$. The bilinear nature of the problem is effectively used in an alternative approach, called variable projections.
Consider again the weighted low-rank approximation problem, parameterized in the image form. Minimization with respect to the $L$ variable (a linear least squares problem) leads to the closed-form expression of the approximation error as a function of $P$:
$$f(P) = \sqrt{\operatorname{vec}^\top(D) \Big( W - W (I_n \otimes P) \big( (I_n \otimes P)^\top W (I_n \otimes P) \big)^{-1} (I_n \otimes P)^\top W \Big) \operatorname{vec}(D)}.$$
The original problem is therefore equivalent to the nonlinear least squares problem of minimizing $f(P)$ with respect to $P$. For this purpose, standard optimization methods, e.g., the Levenberg–Marquardt algorithm, can be used.
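
The closed-form error can be evaluated directly for a given `P`. The sketch below assumes uniform weights (identity `W`) and an arbitrary illustrative data matrix; with `P` taken from the leading left singular vectors, the value should match the truncated SVD error of the unweighted problem.

```matlab
D = [1 2 3; 4 5 6; 7 8 10; 2 1 0];            % illustrative 4x3 data
[m, n] = size(D);  r = 2;
W = eye(m * n);                                % assumption: uniform weights
[U, S, ~] = svd(D);  s = diag(S);
P = U(:, 1:r);                                 % candidate P from the SVD
Bp = kron(eye(n), P);                          % vec(P * L) = Bp * vec(L)
d = D(:);
vl = (Bp' * W * Bp) \ (Bp' * W * d);           % optimal vec(L) for this P
fP = sqrt(d' * W * (d - Bp * vl));             % error after eliminating L
svderr = sqrt(sum(s(r+1:end).^2));             % truncated SVD error
```

For a general positive definite `W` the same lines evaluate the cost, but the minimizing `P` is then no longer given by the SVD and has to be found iteratively.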
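Matlab implementation of the variable projections algorithm for weighted low-rank approximation:

function [dh, f] = wlra_varpro(d, w, p, tol, maxiter)
% d: m-by-n data matrix, w: mn-by-mn weight matrix, p: m-by-r starting value
prob = optimset(); prob.solver = 'lsqnonlin';
prob.options = optimset('MaxIter', maxiter, 'TolFun', tol);
prob.x0 = p; prob.objective = @(p) cost_fun(p, d, w);
[p, f] = lsqnonlin(prob);
[f, vl] = cost_fun(p, d, w);
dh = p * reshape(vl, size(p, 2), size(d, 2));

function [f, vl] = cost_fun(p, d, w)
% eliminates L by least squares; f is the weighted squared error for this p
bp = kron(eye(size(d, 2)), p);
vl = (bp' * w * bp) \ (bp' * w * d(:));
f = d(:)' * w * (d(:) - bp * vl);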

The variable projections approach can be applied also to low rank approximation problems parameterized in the kernel form. The method is effective when the number of eliminated variables is much larger than the number of optimization variables left at the stage of the nonlinear least squares minimization. Such problems occur in system identification, parameterized in the kernel form, where the eliminated variables are the approximating trajectory and the remaining variables are the model parameters. In the context of linear time-invariant systems, the elimination step is equivalent to Kalman smoothing.

A Variant: convex-restricted low rank approximation

Usually, we want the new solution not only to be of low rank but also to satisfy other convex constraints due to application requirements. The problem of interest is
$$\text{minimize over } \hat p \quad \|p - \hat p\| \quad \text{subject to} \quad \operatorname{rank}\big(\mathcal{S}(\hat p)\big) \le r \ \text{ and } \ g(\hat p) \le 0.$$
This problem has many real-world applications, including recovering a good solution from an inexact relaxation. If the additional constraint is linear, for example when all elements are required to be nonnegative, the problem is called structured low-rank approximation. The more general form is named convex-restricted low-rank approximation.
The problem is challenging because it combines convex and nonconvex constraints. Different techniques have been developed based on different realizations of the constraint $g(\hat p) \le 0$. The Alternating Direction Method of Multipliers (ADMM) can be applied to nonconvex problems with a convex objective function, rank constraints, and other convex constraints, and is thus suitable for the problem above. Moreover, unlike for general nonconvex problems, ADMM is guaranteed to converge to a feasible solution as long as its dual variable converges over the iterations.
