Publications

Home   |   Publications   |   tcsmath   [old]
Home   |   Publications   |   tcsmath   [old]

[entropy]

[ abs | bib ]   Multiscale entropic regularization for MTS on general metric spaces, with F. Ebrahimnejad (ITCS'22)

We present an \(O((\log n)^2)\)-competitive algorithm for metrical task systems (MTS) on any \(n\)-point metric space that is also \(1\)-competitive for service costs. This matches the competitive ratio achieved by Bubeck, Cohen, Lee, and Lee (2019) and the refined competitive ratios obtained by Coester and Lee (2019). Those algorithms work by first randomly embedding the metric space into an ultrametric and then solving MTS there. In contrast, our algorithm is cast as regularized gradient descent where the regularizer is a multiscale metric entropy defined directly on the metric space. This answers an open question of Bubeck (Highlights of Algorithms, 2019).

@inproceedings{EL22,
     AUTHOR = {Farzam Ebrahimnejad and James R. Lee},
  BIBSOURCE = {dblp computer science bibliography, https://dblp.org},
     BIBURL = {https://dblp.org/rec/conf/innovations/EbrahimnejadL22.bib},
  BOOKTITLE = {13th Innovations in Theoretical Computer Science Conference, {ITCS}
                  2022, January 31 - February 3, 2022, Berkeley, CA,
                  {USA}},
        DOI = {10.4230/LIPIcs.ITCS.2022.60},
     EDITOR = {Mark Braverman},
      PAGES = {60:1--60:21},
  PUBLISHER = {Schloss Dagstuhl - Leibniz-Zentrum f{\"{u}}r Informatik},
     SERIES = {LIPIcs},
  TIMESTAMP = {Wed, 26 Jan 2022 14:53:11 +0100},
      TITLE = {Multiscale Entropic Regularization for {MTS} on General Metric
                  Spaces},
        URL = {https://doi.org/10.4230/LIPIcs.ITCS.2022.60},
     VOLUME = {215},
       YEAR = {2022}
}
[ abs | bib ]   Pure entropic regularization for metrical task systems, with C. Coester (COLT'19; Theory Comput.)

There is a randomized online algorithm for metrical task systems (MTS) that is \(1\)-competitive for service costs and \(O(\log n)\)-competitive for movement costs. In general, these refined guarantees are optimal up to the implicit constant. While an \(O(\log n)\)-competitive algorithm for MTS on HST metrics was developed in BCLL19, that approach could only establish an \(O((\log n)^2)\)-competitive ratio when the service costs are required to be \(O(1)\)-competitive. Our algorithm is an instantiation of online mirror descent with the regularizer derived from a multiscale conditional entropy.

In fact, our algorithm satisfies a set of even more refined guarantees; we are able to exploit this property to combine it with known random embedding theorems and obtain, for any \(n\)-point metric space, a randomized algorithm that is \(1\)-competitive for service costs and \(O((\log n)^2)\)-competitive for movement costs.

@article{CLMTS22,
     AUTHOR = {Coester, Christian and Lee, James R.},
        DOI = {10.4086/toc.2022.v018a023},
   FJOURNAL = {Theory of Computing. An Open Access Journal},
    JOURNAL = {Theory Comput.},
      PAGES = {1--24},
      TITLE = {Pure entropic regularization for metrical task systems},
     VOLUME = {18},
       YEAR = {2022}
}
[ abs | bib ]   Metrical task systems on trees via mirror descent and unfair gluing, with S. Bubeck, M. B. Cohen, and Y. T. Lee (SODA'19; SICOMP)

We consider metrical task systems on tree metrics, and present an $O(\mathrm{depth} \times \log n)$-competitive randomized algorithm based on the mirror descent framework. For the special case of HSTs, we refine the standard approach based on combining unfair metrical task systems, yielding an $O(\log n)$-competitive algorithm. This is tight up to the implicit constant factor. Using known random embeddings of finite metric spaces into HSTs, this yields an $O((\log n)^2)$-competitive algorithm for general $n$-point spaces.

@article{BCLL21,
     AUTHOR = {Bubeck, S\'{e}bastien and Cohen, Michael B. and Lee, James R. and
                  Lee, Yin Tat},
   FJOURNAL = {SIAM Journal on Computing},
       ISSN = {0097-5397},
    JOURNAL = {SIAM J. Comput.},
     NUMBER = {3},
      PAGES = {909--923},
      TITLE = {Metrical task systems on trees via mirror descent and unfair
                  gluing},
     VOLUME = {50},
       YEAR = {2021}
}
[ abs | bib ]   Fusible HSTs and the randomized k-server conjecture (FOCS'18)

We give a poly(log k)-competitive randomized algorithm for the k-server problem on general metric spaces. The best previous result independent of the underlying metric space is the 2k-1 competitive ratio established for the deterministic work function algorithm by Koutsoupias and Papadimitriou (1995).

Since deterministic algorithms can do no better than k on any metric space with at least k+1 points, this establishes that for every metric space on which the problem is non-trivial, randomized algorithms give an exponential improvement over deterministic algorithms. The approach employs our previous algorithm for the fractional k-server problem on HSTs.

[This paper has been withdrawn]
@incollection{LeeFOCS18,
     AUTHOR = {Lee, James R.},
  BOOKTITLE = {59th {A}nnual {IEEE} {S}ymposium on {F}oundations of {C}omputer
                  {S}cience---{FOCS} 2018},
      PAGES = {438--449},
  PUBLISHER = {IEEE Computer Soc., Los Alamitos, CA},
      TITLE = {Fusible {HST}s and the randomized {$k$}-server conjecture},
       YEAR = {2018}
}
[ abs | bib ]   k-server via multiscale entropic regularization, with S. Bubeck, M. B. Cohen, Y. T. Lee, and A. Madry (STOC'18)

We present an \(O((\log k)^2)\)-competitive randomized algorithm for the k-server problem on hierarchically separated trees (HSTs). This is the first o(k)-competitive randomized algorithm for which the competitive ratio is independent of the size of the underlying HST. Our algorithm is designed in the framework of online mirror descent where the mirror map is a multiscale entropy.

When combined with Bartal’s static HST embedding reduction, this leads to an \(O((\log k)^2 \log n)\)-competitive algorithm on any n-point metric space. We give a new dynamic HST embedding that yields an \(O((\log k)^3 \log \Delta)\)-competitive algorithm on any metric space where the ratio of the largest to smallest non-zero distance is at most \(\Delta\).

@inproceedings{BCLLM18,
  author    = {S{\'{e}}bastien Bubeck and
               Michael B. Cohen and
               Yin Tat Lee and
               James R. Lee and
               Aleksander Madry},
  title     = {k-server via multiscale entropic regularization},
  booktitle = {Proceedings of the 50th Annual {ACM} {SIGACT} Symposium on Theory
               of Computing, {STOC} 2018, Los Angeles, CA, USA, June 25-29, 2018},
  pages     = {3--16},
  year      = {2018},
}
[ abs | bib ]   Transport-entropy inequalities and curvature for discrete-space Markov chains, with R. Eldan and J. Lehec (Matousek, 2017)

We show that if the random walk on a graph has positive coarse Ricci curvature in the sense of Ollivier, then the stationary measure satisfies a \(W^1\) transport-entropy inequality. Peres and Tetali have conjectured a stronger consequence, that a modified log-Sobolev inequality (MLSI) should hold, in analogy with the setting of Markov diffusions. We discuss how our entropy interpolation approach suggests a natural attack on the MLSI conjecture.

@incollection {ELL17,
    AUTHOR = {Eldan, Ronen and Lee, James R. and Lehec, Joseph},
     TITLE = {Transport-entropy inequalities and curvature in discrete-space
              {M}arkov chains},
 BOOKTITLE = {A journey through discrete mathematics},
     PAGES = {391--406},
 PUBLISHER = {Springer},
      YEAR = {2017},
}
[ abs | bib ]   Covering the large spectrum and generalized Riesz products (SIDMA, 2016)

Entropy-regularized gradient descent is used to reprove some results from additive combinatorics on the structure of the large Fourier spectrum of dense subsets of the integers.

@article {LeeSpectrum17,
    AUTHOR = {Lee, James R.},
     TITLE = {Covering the large spectrum and generalized {R}iesz products},
   JOURNAL = {SIAM J. Discrete Math.},
  FJOURNAL = {SIAM Journal on Discrete Mathematics},
    VOLUME = {31},
      YEAR = {2017},
    NUMBER = {1},
     PAGES = {562--572},
}
[ abs | bib ]   Lower bounds on the size of semidefinite programming relaxations, with P. Raghavendra and D. Steurer (STOC'15)

[credit: Bernd Sturmfels]

We introduce a method for proving lower bounds on the efficacy of semidefinite programming (SDP) relaxations for combinatorial problems. In particular, we show that the cut, TSP, and stable set polytopes on n-vertex graphs are not the linear image of the feasible region of any SDP (i.e., any spectrahedron) of dimension less than \(2^{n^c}\), for some constant \(c > 0\). This result yields the first super-polynomial lower bounds on the semidefinite extension complexity of any explicit family of polytopes.

Our results follow from a general technique for proving lower bounds on the positive semidefinite rank of a matrix. To this end, we establish a close connection between arbitrary SDPs and those arising from the sum-of-squares SDP hierarchy. For approximating maximum constraint satisfaction problems, we prove that SDPs of polynomial-size are equivalent in power to those arising from degree-O(1) sum-of-squares relaxations. This result implies, for instance, that no family of polynomial-size SDP relaxations can achieve better than a 7/8-approximation for MAX-3-SAT.

[ PPT slides ]

Best paper award, STOC 2015

@inproceedings{LRS15,
     AUTHOR = {James R. Lee and Prasad Raghavendra and David Steurer},
  BOOKTITLE = {Proceedings of the Forty-Seventh Annual {ACM} on Symposium on Theory
                  of Computing, {STOC} 2015, Portland, OR, USA, June
                  14-17, 2015},
       NOTE = {Full version at \verb|https://arxiv.org/abs/1411.6317|},
      PAGES = {567--576},
      TITLE = {Lower Bounds on the Size of Semidefinite Programming Relaxations},
       YEAR = {2015}
}
[ abs | bib ]   Regularization under diffusion and anti-concentration of the information content, with R. Eldan (FOCS'15; Duke)

We show that under the Ornstein-Uhlenbeck semigroup (i.e., the natural diffusion process) on n-dimensional Gaussian space, any nonnegative, measurable function exhibits a uniform tail bound better than that implied by Markov’s inequality and conservation of mass. This confirms positively the Gaussian limiting case of Talagrand’s convolution conjecture (1989).

Video: Talagrand’s convolution conjecture and geometry via coupling (IAS)

@article{ElDuke18,
     AUTHOR = {Eldan, Ronen and Lee, James R.},
   FJOURNAL = {Duke Mathematical Journal},
    JOURNAL = {Duke Math. J.},
     NUMBER = {5},
      PAGES = {969--993},
      TITLE = {Regularization under diffusion and anticoncentration of the
                  information content},
     VOLUME = {167},
       YEAR = {2018}
}
[ abs | bib ]   Approximate constraint satisfaction requires large LP relaxations, with S. O. Chan, P. Raghavendra, and D. Steurer (FOCS'13; JACM)

We prove super-polynomial lower bounds on the size of linear programming relaxations for approximation versions of constraint satisfaction problems. We show that for these problems, polynomial-sized linear programs are exactly as powerful as programs arising from a constant number of rounds of the Sherali-Adams hierarchy. In particular, any polynomial-sized linear program for Max Cut has an integrality gap of 1/2 and any such linear program for Max 3-Sat has an integrality gap of 7/8.

Video: Linear programming and constraint satisfaction (Simons Institute)

@article {CLRS16,
    AUTHOR = {Chan, Siu On and Lee, James R. and Raghavendra, Prasad and
              Steurer, David},
     TITLE = {Approximate constraint satisfaction requires large {LP}
              relaxations},
   JOURNAL = {J. ACM},
  FJOURNAL = {Journal of the ACM},
    VOLUME = {63},
      YEAR = {2016},
    NUMBER = {4},
     PAGES = {Art. 34, 22},
}
[ abs | bib ]   Adversarial hypothesis testing and a quantum Stein's Lemma for restricted measurements, with F. G. S. L. Brandao, A. W. Harrow, and Y. Peres (IEEE TIT; ITCS'14)

Recall the classical hypothesis testing setting with two convex sets of probability distributions $P$ and $Q$. One receives either $n$ i.i.d. samples from a distribution $p \in P$ or from a distribution $q \in Q$ and wants to decide from which set the points were sampled. It is known that the optimal exponential rate at which errors decrease can be achieved by a simple maximum-likelihood ratio test which does not depend on $p$ or $q$, but only on the sets $P$ and $Q$. We consider an adaptive generalization of this model where the choice of $p \in P$ and $q \in Q$ can change in each sample in some way that depends arbitrarily on the previous samples. In other words, in the $k$th round, an adversary, having observed all the previous samples in rounds $1,…,k-1$, chooses $p_k \in P$ and $q_k \in Q$, with the goal of confusing the hypothesis test. We prove that even in this case, the optimal exponential error rate can be achieved by a simple maximum-likelihood test that depends only on $P$ and $Q$.

We then show that the adversarial model has applications in hypothesis testing for quantum states using restricted measurements. For example, it can be used to study the problem of distinguishing entangled states from the set of all separable states using only measurements that can be implemented with local operations and classical communication (LOCC). The basic idea is that in our setup, the deleterious effects of entanglement can be simulated by an adaptive classical adversary. We prove a quantum Stein’s Lemma in this setting: In many circumstances, the optimal hypothesis testing rate is equal to an appropriate notion of quantum relative entropy between two states. In particular, our arguments yield an alternate proof of Li and Winter’s recent strengthening of strong subadditivity for quantum relative entropy.

@article{BHLP20,
  author    = {F. G. S. L. {Brandão} and A. W. {Harrow} and J. R. {Lee} and Y. {Peres}},
  journal   = {IEEE Transactions on Information Theory}, 
  title     = {Adversarial Hypothesis Testing and a Quantum Stein’s Lemma for Restricted Measurements}, 
  year      = {2020},
  volume    = {66},
  number    = {8},
  pages     = {5037-5054},
}
@inproceedings{BHLP14,
  author    = {Fernando G. S. L. Brand{\~{a}}o and
               Aram Wettroth Harrow and
               James R. Lee and
               Yuval Peres},
  title     = {Adversarial hypothesis testing and a quantum stein's lemma for restricted
               measurements},
  booktitle = {Innovations in Theoretical Computer Science, ITCS'14, Princeton, NJ,
               USA, January 12-14, 2014},
  pages     = {183--194},
  year      = {2014},
}