Farkas' lemma

From Wikimization

Jump to: navigation, search

Farkas' lemma is a result used in the proof of the Karush-Kuhn-Tucker (KKT) theorem from nonlinear programming.

It states that if \,A\, is a matrix and \,b a vector, then exactly one of the following two systems has a solution:

  • A^Ty\succeq0 for some y\, such that b^Ty<0~~

or in the alternative

  • Ax=b\, for some x\succeq0

where the notation x\succeq0 means that all components of the vector x are nonnegative.

The lemma was originally proved by Farkas in 1902. The above formulation is due to Albert W. Tucker in the 1950s.

It is an example of a theorem of the alternative; a theorem stating that of two systems, one or the other has a solution, but not both.

Contents

Proof

(Dattorro) Define a convex cone

  • \mathcal{K}=\{y~|~A^Ty\succeq0\}\quad

whose dual cone is

  • \quad\mathcal{K}^*=\{A_{}x~|~x\succeq0\}

From the definition of dual cone \,\mathcal{K}^*\!=\{b~|~b^{\rm T}y\!\geq\!0~~\forall~y\!\in_{}\!\mathcal{K}\} we get

y\in\mathcal{K}~\Leftrightarrow~b^Ty\geq0~~\forall~b\in\mathcal{K}^*

rather,

A^Ty\succeq0~\Leftrightarrow~b^Ty\geq0~~\forall~b\in\{A_{}x~|~x\succeq0\}

Given some {\displaystyle b} vector and y\!\in\!\mathcal{K}~, then {\displaystyle b^Ty\!<\!0} can only mean b\notin\mathcal{K}^*.

An alternative system is therefore simply b\in\mathcal{K}^* and so the stated result follows.

Geometrical Interpretation

Farkas' lemma simply states that either vector \,b belongs to convex cone \mathcal{K}^* or it does not.

When b\notin\mathcal{K}^*, then there is a vector \,y\!\in\!\mathcal{K} normal to a hyperplane separating point \,b from cone \mathcal{K}^*.

References

  • Gyula Farkas, Über die Theorie der Einfachen Ungleichungen, Journal für die Reine und Angewandte Mathematik, volume 124, pages 1–27, 1902.

http://gdz.sub.uni-goettingen.de/no_cache/dms/load/img/?IDDOC=261361


Extended Farkas' lemma

For any closed convex cone \mathcal J in the Hilbert space (\mathbb H,\langle\cdot,\cdot\rangle) (\,in particular we can have \mathbb H=\mathbb R^n), denote by \mathcal J^\circ the polar cone of \mathcal J.

Let \mathcal K be an arbitrary closed convex cone in \mathbb H.

Then, the extended Farkas' lemma asserts that \mathcal K^{\circ\circ}=\mathcal K.

Hence, denoting \mathcal L=\mathcal K^\circ, it follows that \mathcal L^\circ=\mathcal K.

Therefore, the cones \mathcal K and \mathcal L are called mutually polar pair of cones.

notes

For definition of convex cone see in finite dimension see Convex cones, Wikimization.

For definition of polar cone see Moreau's theorem.

Proof of extended Farkas' lemma

(Sándor Zoltán Németh) Let z\in\mathbb H be arbitrary. Then, by Moreau's theorem we have


z=P_{\mathcal K}z+P_{\mathcal K^\circ}z

and


z=P_{\mathcal K^\circ}z+P_{\mathcal K^{\circ\circ}}z.

Therefore,

P_{\mathcal K}z=P_{\mathcal K^{\circ\circ}}z=z-P_{\mathcal K^\circ}z.

In particular, for any z\in K we have z=P_{\mathcal K}z=P_{\mathcal K^{\circ\circ}}z\in\mathcal K^{\circ\circ}. Hence, \mathcal K \subset \mathcal K^{\circ\circ}. Similarly, for any z\in K^{\circ\circ} we have z=P_{\mathcal K^{\circ\circ}}z= P_{\mathcal K}z\in\mathcal K. Hence, \mathcal K^{\circ\circ}\subset\mathcal K. Therefore, \mathcal K^{\circ\circ}=\mathcal K.

Personal tools