Consider . Its convex conjugate is the function given by
[5 points] Compute when .
[5 points] Compute when for .
[5 points] Prove that for all ,
[15 points] Assume that is a convex function. Show that has -Lipschitz gradients if and only if is -strongly convex.
Given a probability distribution on , its covariance matrix is defined by
where is a random vector sampled from .
Recall that a probability density on is an integrable function such that
And the associated probability measure is defined on measurable subsets by
Recall also the Minkowski sum of two sets :
[5 points] For a function and a distribution , we define the pushforward measure by
For a given measure , define an affine function such that is isotropic. Recall that an affine function is of the form for .
(You may assume that the covariance matrix is invertible.)
[EXTRA CREDIT, 10 points] Recall that is log-concave if the function is concave. Say that a distribution is log-concave if it holds that for all measurable subsets ,
The Brunn-Minkowski inequality asserts that the volume measure on is log-concave.
Show that if is a log-concave density, then the associated probability measure is log-concave in the sense of .
[10 points] (You may assume that part (B) is true for this part!)
Suppose that the distribution is given by the density
where is a closed, bounded, convex set.
Show that if is an affine function, then is log-concave in the sense of .
Suppose that is convex and $\mathcal{C}^2$. Assume also that for some ,
In class we showed that if we define the gradient desent steps by
then we have the rate of convergence
where . We also showed that is decreasing as increases. In this problem, you will try to obtain an exponential rate of decrease for .
[10 points] Show that
[15 points] Using the mirror-descent analysis for inspiration, prove that
Partial credit will be given for proving the slightly weaker bound with a decrease of at every iteration.