3.6 Fenchel duality

Introduces the concepts of an epigraph and a conjugate function necessary to set up Fenchel dual problems.

Convexity and conjugate functions

We begin by reviewing two notions of convexity: for sets and for functions.

Definition 1 A subset $C \subseteq X$ is called convex if $z = α x + (1 - α) y \in C$ for every $x, y \in C$ and $α \in [0, 1]$ ; $z$ is called a convex combination of $x$ and $y$ .

Definition 2 Let $C$ be a convex set. A functional $f : C \to R$ is convex on $C$ if $f (α x + (1 - α) y) \leq α f (x) + (1 - α) f (y)$ for all $x, y \in C$ and $α \in [0, 1]$ . If the inequality is strict whenever $x \neq y$ and $α \in (0, 1)$ , the functional is said to be strictly convex . A functional $g$ is called concave (strictly concave) if $- g$ is convex (strictly convex).

We will denote the region above the function $f$ defined over a convex set $C$ as $[f, C]$ , sometimes called an epigraph , as illustrated in [link] .

Definition 3 Let $f$ be a convex functional on a convex set $C$ . The conjugate set $C^{*}$ is defined as

C^{*} = {x^{*} \in X : sup_{x \in C} [⟨ x, x^{*} ⟩ - f (x)] < \infty},

and the conjugate functional $f^{*} : C^{*} \to R$ is defined as

f^{*} (x^{*}) = sup_{x \in C} [⟨ x, x^{*} ⟩ - f (x)] .

There is a geometric intuition behind the definition of the conjugate functional. Consider the illustration below, where the horizontal axis represents the space $X$ and the vertical axis represents the scalar field. A hyperplane in this space contains all points $(x, r) \in X \times R$ for which $r = ⟨ x, x^{*} ⟩ - k$ for some value of $k \in R$ ; the vector $x^{*}$ determines the orientation of the hyperplane and the value $k$ determines the shift from the origin (i.e., the hyperplane intercepts the axis $R$ at $- k$ ). The value of the functional $f^{*} (x^{*})$ corresponds to the supremum value of $k$ for which the hyperplane intersects $[f, C]$ , and is finite only for $x^{*} \in C^{*}$ ; this is illustrated in [link] .

The conjugate function of a convex epigraph.

Note that $C^{*}$ is convex and $f^{*}$ is convex. This definition is easily extended to concave functionals.
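As a small numerical illustration of Definition 3, the sketch below approximates the conjugate of $f (x) = x^{2} / 2$ on $C = R$, a functional chosen here only as an assumption for illustration (it does not appear in the text above). For this choice the supremum is attained at $x = x^{*}$, giving $f^{*} (x^{*}) = (x^{*})^{2} / 2$ and $C^{*} = R$. The helper name `conjugate_on_grid` and the finite grid are hypothetical, and the grid maximum only approximates the true supremum.

```python
def conjugate_on_grid(f, x_star, grid):
    """Approximate sup_x [x * x_star - f(x)] by a maximum over a finite grid."""
    return max(x * x_star - f(x) for x in grid)


def f(x):
    # Illustrative convex functional (an assumption, not from the text).
    return 0.5 * x * x


# Grid on [-10, 10] with step 1e-3; it stands in for the set C = R.
grid = [i / 1000.0 for i in range(-10000, 10001)]

for x_star in (-2.0, 0.0, 1.5):
    approx = conjugate_on_grid(f, x_star, grid)
    exact = 0.5 * x_star ** 2  # closed form of f^* for this particular f
    print(f"x* = {x_star:5.2f}  grid sup = {approx:.6f}  closed form = {exact:.6f}")
```

The grid value agrees with the closed form as long as the maximizer $x = x^{*}$ lies inside the grid; for larger $| x^{*} |$ the grid would have to be widened.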

Definition 4 Let $g$ be a concave functional on a convex set $D$ . The conjugate set $D^{*}$ is defined as

D^{*} = {x^{*} \in X : inf_{x \in D} [⟨ x, x^{*} ⟩ - g (x)] > - \infty},

and the conjugate functional $g^{*} : D^{*} \to R$ is defined as

g^{*} (x^{*}) = inf_{x \in D} [⟨ x, x^{*} ⟩ - g (x)] .

Note that $D^{*}$ is convex and $g^{*}$ is concave.
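To see Definition 4 at work, the following sketch uses the concave functional $g (x) = \sqrt{x}$ on $D = [0, \infty)$, which is an assumption made only for illustration. Setting the derivative of $x x^{*} - \sqrt{x}$ to zero gives the minimizer $x = 1 / (4 (x^{*})^{2})$ and $g^{*} (x^{*}) = - 1 / (4 x^{*})$ for $x^{*} > 0$, while for $x^{*} \leq 0$ the infimum is $- \infty$, so $D^{*} = (0, \infty)$ in this case. The helper name `concave_conjugate_on_grid` is hypothetical.

```python
import math


def concave_conjugate_on_grid(g, x_star, grid):
    """Approximate inf_x [x * x_star - g(x)] by a minimum over a finite grid."""
    return min(x * x_star - g(x) for x in grid)


def g(x):
    # Illustrative concave functional on D = [0, inf) (an assumption).
    return math.sqrt(x)


# Grid on [0, 100] with step 1e-3, standing in for D.
grid = [i / 1000.0 for i in range(0, 100001)]

for x_star in (0.5, 1.0):
    approx = concave_conjugate_on_grid(g, x_star, grid)
    exact = -1.0 / (4.0 * x_star)  # closed form for this particular g
    print(f"x* = {x_star:.2f}  grid inf = {approx:.6f}  closed form = {exact:.6f}")

# For x* <= 0 the infimum is not finite: widening the grid keeps lowering it.
wide_grid = [float(i) for i in range(0, 10001)]  # [0, 10000] with step 1
narrow = concave_conjugate_on_grid(g, -0.1, grid)
wide = concave_conjugate_on_grid(g, -0.1, wide_grid)
print(f"x* = -0.10  on [0, 100]: {narrow:.1f}   on [0, 10000]: {wide:.1f}")
```

The last line illustrates why $x^{*} = - 0.1$ is excluded from $D^{*}$: the grid minimum is not stable but decreases without bound as the grid grows.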

Fenchel duality

The following theorem will allow us to convert an optimization problem with a convex objective function into a dual problem with a concave objective function.

Theorem 1 (Fenchel) Assume that $f$ and $g$ are convex and concave functions, respectively, on convex sets $C$ and $D$ in a normed space $X$ . Assume that $C \cap D$ contains points in the relative interior of $C$ and $D$ and that either $[f, C]$ or $[g, D]$ has a nonempty interior. Suppose further that

μ = inf_{x \in C \cap D} {f (x) - g (x)}

is finite. Then,

μ = inf_{x \in C \cap D} {f (x) - g (x)} = max_{x^{*} \in C^{*} \cap D^{*}} {g^{*} (x^{*}) - f^{*} (x^{*})},

where the maximum is achieved by some $x_{0}^{*} \in C^{*} \cap D^{*}$ .

In this theorem, $g (x)$ is usually set to zero. From a geometrical point of view, the theorem states that there are two ways to compute the minimum vertical distance between the two epigraphs $[f, C]$ and $[g, D]$ shown below: one in terms of the original functions $f, g$ and one in terms of the conjugates $f^{*}, g^{*}$ . In the latter, we look for the two parallel tangent hyperplanes for $f$ and $g$ that are maximally separated from one another, as illustrated in [link] .

Illustration of the Fenchel dual problem on a conjugate function.

If the infimum on the left is achieved by $x_{0}$ , then

\begin{matrix} max_{x \in C} [⟨ x, x_{0}^{*} ⟩ - f (x)] = ⟨ x_{0}, x_{0}^{*} ⟩ - f (x_{0}), \\ min_{x \in D} [⟨ x, x_{0}^{*} ⟩ - g (x)] = ⟨ x_{0}, x_{0}^{*} ⟩ - g (x_{0}) . \end{matrix}
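Theorem 1 and the optimality conditions above can be checked numerically on a one-dimensional instance. The sketch below assumes $f (x) = x^{2} / 2$ and $g (x) = - (x - 1)^{2} / 2$ on $C = D = R$; these functions are chosen only for illustration and are not part of the text. For them, $f^{*} (x^{*}) = (x^{*})^{2} / 2$ and $g^{*} (x^{*}) = x^{*} - (x^{*})^{2} / 2$ follow by setting derivatives to zero, and both the primal infimum and the dual maximum equal $1 / 4$, attained at $x_{0} = 1 / 2$ and $x_{0}^{*} = 1 / 2$.

```python
def f(x):
    # Illustrative convex functional (an assumption).
    return 0.5 * x * x


def g(x):
    # Illustrative concave functional (an assumption).
    return -0.5 * (x - 1.0) ** 2


def f_conj(y):
    # Closed form of sup_x [x*y - f(x)] for the f above.
    return 0.5 * y * y


def g_conj(y):
    # Closed form of inf_x [x*y - g(x)] for the g above.
    return y - 0.5 * y * y


# Grid on [-5, 5] with step 1e-3, used for both x and x^*.
grid = [i / 1000.0 for i in range(-5000, 5001)]

# Primal: inf over x of f(x) - g(x).  Dual: max over x^* of g^*(x^*) - f^*(x^*).
primal_value = min(f(x) - g(x) for x in grid)
dual_value = max(g_conj(y) - f_conj(y) for y in grid)
x_opt = min(grid, key=lambda x: f(x) - g(x))
y_opt = max(grid, key=lambda y: g_conj(y) - f_conj(y))

# Optimality conditions: x_opt attains the max for f and the min for g at y_opt.
x_from_f = max(grid, key=lambda x: x * y_opt - f(x))
x_from_g = min(grid, key=lambda x: x * y_opt - g(x))

print("primal value:", primal_value, "at x =", x_opt)
print("dual value:  ", dual_value, "at x* =", y_opt)
print("maximizer for f:", x_from_f, " minimizer for g:", x_from_g)
```

In this instance the same point $x_{0}$ attains the maximum in the expression for $f^{*} (x_{0}^{*})$ and the minimum in the expression for $g^{*} (x_{0}^{*})$, which is what the pair of conditions states.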

Example 1 (Allocation) Assume that we have a capital $x_{0}$ available for investment with $n$ different funds. There is a predicted gain $g_{i} (x_{i})$ to having stock worth $x_{i}$ at fund $i$ , where the functions $g_{i}$ are concave. We aim to find the optimal allocation of the capital $x = (x_{1}, ..., x_{n})$ that maximizes the total gain $g (x) = \sum_{i = 1}^{n} g_{i} (x_{i})$ .

To appeal to duality, we have the concave function $g (x)$ and must define a convex function, e.g., $f (x) = 0$ . The constraint set can be written as the intersection $C \cap D$ with

\begin{matrix} C & = {x : \sum_{i = 1}^{n} x_{i} = x_{0}} = {x : ⟨ 1, x ⟩ = x_{0}}, \\ D & = {x : x_{i} \geq 0, i = 1, ..., n} . \end{matrix}

Therefore, we can write our optimization problem as

min_{x \in C \cap D} {f (x) - g (x)} .

We consider the conjugate sets. First, we have

\begin{matrix} C^{*} & = {x^{*} \in X : sup_{x \in C} [⟨ x, x^{*} ⟩ - f (x)] < \infty} \\ & = {x^{*} \in X : sup_{x \in C} ⟨ x, x^{*} ⟩ < \infty} . \end{matrix}

We want to define $C^{*}$ more explicitly. Let $x^{*} \in C^{*}$ be written as $x^{*} = λ 1 + w$ , where $w ⊥ 1$ . Then,

⟨ x^{*}, w ⟩ = λ ⟨ 1, w ⟩ + ⟨ w, w ⟩ = {∥ w ∥}^{2},

which is strictly positive whenever $w \neq 0$ . Now let $x = \frac{x_{0}}{n} 1 + t w$ with $t \in R$ ; it is easy to check that $x \in C$ for all $t \in R$ , since $⟨ 1, x ⟩ = x_{0} + t ⟨ 1, w ⟩ = x_{0}$ . Then $⟨ x, x^{*} ⟩ = λ x_{0} + t {∥ w ∥}^{2}$ , which can be made arbitrarily large by letting $t \to \infty$ if $w \neq 0$ . Since $x^{*} \in C^{*}$ requires that ${sup}_{x \in C} ⟨ x, x^{*} ⟩ < \infty$ , we must have that $w = 0$ and so $x^{*} \in span ({1})$ . Since $x^{*} \in C^{*}$ was arbitrary, $C^{*} \subseteq span ({1})$ .

It is also easy to see that $span ({1}) \subseteq C^{*}$ : if $x^{*} = λ 1$ then $⟨ x, x^{*} ⟩ = λ x_{0}$ for every $x \in C$ , which is finite. Therefore, $C^{*} = span ({1})$ .
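The argument for $C^{*} = span ({1})$ can be illustrated with a few numbers. The sketch below picks $n = 3$, $x_{0} = 6$, and a dual vector with a component $w$ orthogonal to $1$ (all values are assumptions chosen for illustration), then moves along $w$ inside $C$ with a step size `t`. The inner product grows linearly in `t`, so the supremum over $C$ is not finite unless $w = 0$.

```python
def inner(u, v):
    """Standard inner product on R^n."""
    return sum(a * b for a, b in zip(u, v))


def point_on_C(x0, w, t):
    """Return (x0 / n) * 1 + t * w, which lies in C whenever w is orthogonal to 1."""
    n = len(w)
    return [x0 / n + t * w_i for w_i in w]


x0 = 6.0               # total capital (illustrative value)
lam = 2.0              # component of x^* along the all-ones vector
w = [1.0, -1.0, 0.0]   # component of x^* orthogonal to the all-ones vector
x_star = [lam + w_i for w_i in w]

steps = (0.0, 10.0, 100.0)
values = [inner(point_on_C(x0, w, t), x_star) for t in steps]
sums = [sum(point_on_C(x0, w, t)) for t in steps]

for t, value, total in zip(steps, values, sums):
    print(f"t = {t:6.1f}  sum of entries = {total:.1f}  <x, x*> = {value:.1f}")

# With w = 0 the inner product is lam * x0 regardless of the point chosen in C.
flat_value = inner(point_on_C(x0, w, 50.0), [lam] * len(w))
print("w = 0 case:", flat_value)
```

Every point generated this way sums to $x_{0}$, so it stays in $C$, while the inner product equals $λ x_{0} + t {∥ w ∥}^{2}$ and increases with `t`.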

For $D$ , we have a conjugate set

\begin{matrix} D^{*} & = {y : inf_{x \in D} [⟨ x, y ⟩ - g (x)] > - \infty} \\ & = {y : inf_{x \in D} ⟨ x, y ⟩ > - \infty}, \end{matrix}

since $g (x) \leq x_{0}$ . Now $D \subseteq D^{*}$ since all vectors in $D$ have nonnegative entries, so that $⟨ x, y ⟩ \geq 0$ for all $x, y \in D$ . Conversely, fix $y \in D^{*}$ ; if $y \notin D$ then there is some negative entry $y_{i} < 0$ among $i = 1, ..., n$ . For such $i$ let $x_{λ} = λ e_{i} \in D$ for some $λ > 0$ , where $e_{i}$ denotes the $i$-th canonical basis vector; then we get $⟨ x_{λ}, y ⟩ = λ y_{i}$ , which can be made arbitrarily negative (i.e., as $λ \to \infty$ we have $⟨ x_{λ}, y ⟩ \to - \infty$ ). Thus $y \notin D^{*}$ , a contradiction. Therefore, if $y \in D^{*}$ then $y \in D$ and so $D^{*} \subseteq D$ . We have therefore shown that $D^{*} = D$ .

The conjugate functionals can be written as

f^{*} (x^{*}) = sup_{x \in C} ⟨ x, x^{*} ⟩ = λ x_{0},

since each $x^{*} \in C^{*}$ can be written as $x^{*} = λ 1$ . Therefore, $f^{*}$ can be written as a function of a single variable. Similarly,

g^{*} (x^{*}) = inf_{x \in D} [⟨ x, x^{*} ⟩ - g (x)] = inf_{x \in D} \sum_{i = 1}^{n} [x_{i} x_{i}^{*} - g_{i} (x_{i})] = \sum_{i = 1}^{n} g_{i}^{*} (x_{i}^{*}),

where we write

g_{i}^{*} (x_{i}^{*}) = inf_{x_{i} > 0} (x_{i} x_{i}^{*} - g_{i} (x_{i})) .

For $x^{*} \in C^{*} \cap D^{*}$ we can write $x_{i}^{*} = λ \geq 0$ for all $i = 1, ..., n$ and so

g_{i}^{*} (x_{i}^{*}) = g_{i}^{*} (λ) = inf_{x_{i} > 0} (λ x_{i} - g_{i} (x_{i})) .

Therefore, maximizing $g^{*} (x^{*}) - f^{*} (x^{*})$ as in Theorem 1 can be reformulated as the following single-variable problem:

λ^{*} = arg min_{λ \geq 0} [λ x_{0} - \sum_{i = 1}^{n} g_{i}^{*} (λ)] .

This is due to $x^{*} = λ 1 \in C^{*} \cap D^{*}$ if and only if $λ \geq 0$ . Once $λ^{*}$ is found, we can find each $x_{i}$ as the minimizer in $g_{i}^{*} (λ^{*})$ , cf. [link] .
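A minimal sketch of this single-variable procedure follows, under the assumption that the gains are $g_{i} (x_{i}) = a_{i} \sqrt{x_{i}}$ with weights $a_{i} > 0$. This gain model, the weights, and the budget are hypothetical and chosen only because the conjugates have closed forms: $g_{i}^{*} (λ) = - a_{i}^{2} / (4 λ)$ for $λ > 0$, with minimizer $x_{i} = (a_{i} / (2 λ))^{2}$. The one-dimensional minimization uses a golden-section search, which is one possible choice rather than a method prescribed by the text.

```python
import math


def conjugate_gain(a_i, lam):
    """g_i^*(lam) = inf_{x >= 0} [lam * x - a_i * sqrt(x)], valid for lam > 0."""
    return -a_i * a_i / (4.0 * lam)


def conjugate_minimizer(a_i, lam):
    """The x_i attaining the infimum in g_i^*(lam)."""
    return (a_i / (2.0 * lam)) ** 2


def dual_objective(lam, a, budget):
    """lam * x_0 - sum_i g_i^*(lam), the quantity minimized over lam."""
    return lam * budget - sum(conjugate_gain(a_i, lam) for a_i in a)


def golden_section_min(h, lo, hi, tol=1e-9):
    """Minimize a unimodal function h on [lo, hi] by golden-section search."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    c = b - inv_phi * (b - a)
    d = a + inv_phi * (b - a)
    while b - a > tol:
        if h(c) < h(d):
            b = d
        else:
            a = c
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
    return 0.5 * (a + b)


a = [1.0, 2.0, 2.0]   # hypothetical gain weights a_i
budget = 4.0          # hypothetical capital x_0

lam_star = golden_section_min(lambda lam: dual_objective(lam, a, budget), 1e-6, 100.0)
allocation = [conjugate_minimizer(a_i, lam_star) for a_i in a]
total_gain = sum(a_i * math.sqrt(x_i) for a_i, x_i in zip(a, allocation))

print("lambda* =", lam_star)
print("allocation =", allocation)
print("capital used =", sum(allocation))
print("total gain =", total_gain, " dual objective =", dual_objective(lam_star, a, budget))
```

For these values the closed-form solution is $λ^{*} = \frac{1}{2} \sqrt{\sum_{i} a_{i}^{2} / x_{0}} = 0.75$, the recovered allocation uses the whole capital, and the total gain matches the dual objective value $\sqrt{x_{0} \sum_{i} a_{i}^{2}} = 6$, as Theorem 1 predicts.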


Source: OpenStax, Signal theory. OpenStax CNX. Oct 18, 2013 Download for free at http://legacy.cnx.org/content/col11542/1.3
