Publications

Home   |   Publications   |   tcsmath   [old]
Home   |   Publications   |   tcsmath   [old]

[algorithms]

[ abs | ]   Sparsifying generalized linear models, with A. Jambulapati, Y. Liu, and A. Sidford (to appear, STOC'24)

We consider the sparsification of sums \(F : \mathbb{R}^n \to \mathbb{R}_+\) where \(F(x) = f_1(\langle a_1,x\rangle) + \cdots + f_m(\langle a_m,x\rangle)\) for vectors $a_1,\ldots,a_m \in \mathbb{R}^n$ and functions $f_1,\ldots,f_m : \mathbb{R} \to \mathbb{R}_+$. We show that $(1+\varepsilon)$-approximate sparsifiers of $F$ with support size $\frac{n}{\varepsilon^2} (\log \frac{n}{\varepsilon})^{O(1)}$ exist whenever the functions $f_1,\ldots,f_m$ are symmetric, monotone, and satisfy natural growth bounds. Additionally, we give efficient algorithms to compute such a sparsifier assuming each $f_i$ can be evaluated efficiently.

Our results generalize the classic case of $\ell_p$ sparsification, where $f_i(z) = |z|^p$, for $p \in (0, 2]$, and give the first near-linear size sparsifiers in the well-studied setting of the Huber loss function and its generalizations, e.g., $f_i(z) = \min{|z|^p, |z|^2}$ for $0 < p \leq 2$. Our sparsification algorithm can be applied to give near-optimal reductions for optimizing a variety of generalized linear models including $\ell_p$ regression for $p \in (1, 2]$ to high accuracy, via solving $(\log n)^{O(1)}$ sparse regression instances with $m \le n(\log n)^{O(1)}$, plus runtime proportional to the number of nonzero entries in the vectors $a_1, \dots, a_m$.

[ abs | bib ]   Sparsifying sums of norms, with A. Jambulapati, Y. Liu, and A. Sidford (FOCS'23)

For any norms $N_1,\ldots,N_m$ on $\mathbb{R}^n$ and $N(x) := N_1(x)+\cdots+N_m(x)$, we show there is a sparsified norm $\tilde{N}(x) = w_1 N_1(x) + \cdots + w_m N_m(x)$ such that $|N(x) - \tilde{N}(x)| \leq \epsilon N(x)$ for all $x \in \mathbb{R}^n$, where $w_1,\ldots,w_m$ are non-negative weights, of which only $O(\epsilon^{-2} n \log(n/\epsilon) (\log n)^{2.5} )$ are non-zero.

Additionally, we show that such weights can be found with high probability in time $O(m (\log n)^{O(1)} + \mathrm{poly}(n)) T$, where $T$ is the time required to evaluate a norm $N_i(x)$, assuming that $N(x)$ is $\mathrm{poly}(n)$-equivalent to the Euclidean norm. This immediately yields analogous statements for sparsifying sums of symmetric submodular functions. More generally, we show how to sparsify sums of $p$th powers of norms when the sum is $p$-uniformly smooth.

@inproceedings{JLLS23,
     AUTHOR = {Arun Jambulapati and James R. Lee and Yang P. Liu and Aaron
                  Sidford},
  BIBSOURCE = {dblp computer science bibliography, https://dblp.org},
     BIBURL = {https://dblp.org/rec/conf/focs/JambulapatiLLS23.bib},
  BOOKTITLE = {64th {IEEE} Annual Symposium on Foundations of Computer Science,
                  {FOCS} 2023, Santa Cruz, CA, USA, November 6-9,
                  2023},
        DOI = {10.1109/FOCS57990.2023.00119},
      PAGES = {1953--1962},
  PUBLISHER = {{IEEE}},
  TIMESTAMP = {Tue, 02 Jan 2024 15:09:54 +0100},
      TITLE = {Sparsifying Sums of Norms},
        URL = {https://doi.org/10.1109/FOCS57990.2023.00119},
       YEAR = {2023}
}
[ abs | bib ]   Spectral hypergraph sparsification via chaining (STOC'23)

In a hypergraph on $n$ vertices where $D$ is the maximum size of a hyperedge, there is a weighted hypergraph spectral $\varepsilon$-sparsifier with at most $O(\varepsilon^{-2} \log(D) \cdot n \log n)$ hyperedges. This improves over the bound of Kapralov, Krauthgamer, Tardos and Yoshida (2021) who achieve $O(\varepsilon^{-4} n (\log n)^3)$, as well as the bound $O(\varepsilon^{-2} D^3 n \log n)$ obtained by Bansal, Svensson, and Trevisan (2019).

@inproceedings{Lee23,
     AUTHOR = {James R. Lee},
  BOOKTITLE = {Proceedings of the 55th Annual {ACM} Symposium on Theory of
                  Computing, {STOC} 2023, Orlando, FL, USA, June
                  20-23, 2023},
        DOI = {10.1145/3564246.3585165},
     EDITOR = {Barna Saha and Rocco A. Servedio},
      PAGES = {207--218},
  PUBLISHER = {{ACM}},
      TITLE = {Spectral Hypergraph Sparsification via Chaining},
        URL = {https://doi.org/10.1145/3564246.3585165},
       YEAR = {2023}
}
[ abs | bib ]   Multiscale entropic regularization for MTS on general metric spaces, with F. Ebrahimnejad (ITCS'22)

We present an \(O((\log n)^2)\)-competitive algorithm for metrical task systems (MTS) on any \(n\)-point metric space that is also \(1\)-competitive for service costs. This matches the competitive ratio achieved by Bubeck, Cohen, Lee, and Lee (2019) and the refined competitive ratios obtained by Coester and Lee (2019). Those algorithms work by first randomly embedding the metric space into an ultrametric and then solving MTS there. In contrast, our algorithm is cast as regularized gradient descent where the regularizer is a multiscale metric entropy defined directly on the metric space. This answers an open question of Bubeck (Highlights of Algorithms, 2019).

@inproceedings{EL22,
     AUTHOR = {Farzam Ebrahimnejad and James R. Lee},
  BIBSOURCE = {dblp computer science bibliography, https://dblp.org},
     BIBURL = {https://dblp.org/rec/conf/innovations/EbrahimnejadL22.bib},
  BOOKTITLE = {13th Innovations in Theoretical Computer Science Conference, {ITCS}
                  2022, January 31 - February 3, 2022, Berkeley, CA,
                  {USA}},
        DOI = {10.4230/LIPIcs.ITCS.2022.60},
     EDITOR = {Mark Braverman},
      PAGES = {60:1--60:21},
  PUBLISHER = {Schloss Dagstuhl - Leibniz-Zentrum f{\"{u}}r Informatik},
     SERIES = {LIPIcs},
  TIMESTAMP = {Wed, 26 Jan 2022 14:53:11 +0100},
      TITLE = {Multiscale Entropic Regularization for {MTS} on General Metric
                  Spaces},
        URL = {https://doi.org/10.4230/LIPIcs.ITCS.2022.60},
     VOLUME = {215},
       YEAR = {2022}
}
[ abs | bib ]   Non-existence of annular separators in geometric graphs, with F. Ebrahimnejad (Discrete Comp. Geom., 2023)

Benjamini and Papasoglou (2011) showed that planar graphs with uniform polynomial volume growth admit $1$-dimensional annular separators: The vertices at graph distance $R$ from any vertex can be separated from those at distance $2R$ by removing at most $O(R)$ vertices. They asked whether geometric $d$-dimensional graphs with uniform polynomial volume growth similarly admit $(d − 1)$-dimensional annular separators when $d > 2$. We show that this fails in a strong sense: For any $d > 3$ and every $s > 1$, there is a collection of interior-disjoint spheres in $\mathbb{R}^d$ whose tangency graph $G$ has uniform polynomial growth, but such that all annular separators in $G$ have cardinality at least $R^s$.

@article{EL23DCG,
     AUTHOR = {Ebrahimnejad, Farzam and Lee, James R.},
        DOI = {10.1007/s00454-023-00519-8},
   FJOURNAL = {Discrete \& Computational Geometry. An International Journal of
                  Mathematics and Computer Science},
       ISSN = {0179-5376,1432-0444},
    JOURNAL = {Discrete Comput. Geom.},
     NUMBER = {2},
      PAGES = {627--645},
      TITLE = {Non-{E}xistence of {A}nnular {S}eparators in {G}eometric {G}raphs},
        URL = {https://doi.org/10.1007/s00454-023-00519-8},
     VOLUME = {71},
       YEAR = {2024}
}
[ abs | bib ]   Pure entropic regularization for metrical task systems, with C. Coester (COLT'19; Theory Comput.)

There is a randomized online algorithm for metrical task systems (MTS) that is \(1\)-competitive for service costs and \(O(\log n)\)-competitive for movement costs. In general, these refined guarantees are optimal up to the implicit constant. While an \(O(\log n)\)-competitive algorithm for MTS on HST metrics was developed in BCLL19, that approach could only establish an \(O((\log n)^2)\)-competitive ratio when the service costs are required to be \(O(1)\)-competitive. Our algorithm is an instantiation of online mirror descent with the regularizer derived from a multiscale conditional entropy.

In fact, our algorithm satisfies a set of even more refined guarantees; we are able to exploit this property to combine it with known random embedding theorems and obtain, for any \(n\)-point metric space, a randomized algorithm that is \(1\)-competitive for service costs and \(O((\log n)^2)\)-competitive for movement costs.

@article{CLMTS22,
     AUTHOR = {Coester, Christian and Lee, James R.},
        DOI = {10.4086/toc.2022.v018a023},
   FJOURNAL = {Theory of Computing. An Open Access Journal},
    JOURNAL = {Theory Comput.},
      PAGES = {1--24},
      TITLE = {Pure entropic regularization for metrical task systems},
     VOLUME = {18},
       YEAR = {2022}
}
[ abs | bib ]   Metrical task systems on trees via mirror descent and unfair gluing, with S. Bubeck, M. B. Cohen, and Y. T. Lee (SODA'19; SICOMP)

We consider metrical task systems on tree metrics, and present an $O(\mathrm{depth} \times \log n)$-competitive randomized algorithm based on the mirror descent framework. For the special case of HSTs, we refine the standard approach based on combining unfair metrical task systems, yielding an $O(\log n)$-competitive algorithm. This is tight up to the implicit constant factor. Using known random embeddings of finite metric spaces into HSTs, this yields an $O((\log n)^2)$-competitive algorithm for general $n$-point spaces.

@article{BCLL21,
     AUTHOR = {Bubeck, S\'{e}bastien and Cohen, Michael B. and Lee, James R. and
                  Lee, Yin Tat},
   FJOURNAL = {SIAM Journal on Computing},
       ISSN = {0097-5397},
    JOURNAL = {SIAM J. Comput.},
     NUMBER = {3},
      PAGES = {909--923},
      TITLE = {Metrical task systems on trees via mirror descent and unfair
                  gluing},
     VOLUME = {50},
       YEAR = {2021}
}
[ abs | bib ]   Flow-cut gaps and face covers in planar graphs, with R. Krauthgamer and H. Rika (SODA'19)

We establish new and improved bounds on the flow-cut gap in planar flow networks where the terminals lie on a small number of faces. The classical Okamura-Seymour theorem (1981) shows that when all terminals lie on a single face, the flow-cut gap is exactly 1. We show that if the terminals lie on $\gamma$ faces, the flow-cut gap is $O(\log \gamma)$. This is proved by embedding the terminal metric into a distribution over trees with distortion $O(\log \gamma)$, which is tight up to the implied constant factor.

@inproceedings{KLR19,
     AUTHOR = {Krauthgamer, Robert and Lee, James R. and Rika, Havana},
  BOOKTITLE = {Proceedings of the {T}hirtieth {A}nnual {ACM}-{SIAM} {S}ymposium on
                  {D}iscrete {A}lgorithms},
      PAGES = {525--534},
  PUBLISHER = {SIAM, Philadelphia, PA},
      TITLE = {Flow-cut gaps and face covers in planar graphs},
       YEAR = {2019}
}
[ abs | bib ]   Fusible HSTs and the randomized k-server conjecture (FOCS'18)

We give a poly(log k)-competitive randomized algorithm for the k-server problem on general metric spaces. The best previous result independent of the underlying metric space is the 2k-1 competitive ratio established for the deterministic work function algorithm by Koutsoupias and Papadimitriou (1995).

Since deterministic algorithms can do no better than k on any metric space with at least k+1 points, this establishes that for every metric space on which the problem is non-trivial, randomized algorithms give an exponential improvement over deterministic algorithms. The approach employs our previous algorithm for the fractional k-server problem on HSTs.

[This paper has been withdrawn]
@incollection{LeeFOCS18,
     AUTHOR = {Lee, James R.},
  BOOKTITLE = {59th {A}nnual {IEEE} {S}ymposium on {F}oundations of {C}omputer
                  {S}cience---{FOCS} 2018},
      PAGES = {438--449},
  PUBLISHER = {IEEE Computer Soc., Los Alamitos, CA},
      TITLE = {Fusible {HST}s and the randomized {$k$}-server conjecture},
       YEAR = {2018}
}
[ abs | bib ]   k-server via multiscale entropic regularization, with S. Bubeck, M. B. Cohen, Y. T. Lee, and A. Madry (STOC'18)

We present an \(O((\log k)^2)\)-competitive randomized algorithm for the k-server problem on hierarchically separated trees (HSTs). This is the first o(k)-competitive randomized algorithm for which the competitive ratio is independent of the size of the underlying HST. Our algorithm is designed in the framework of online mirror descent where the mirror map is a multiscale entropy.

When combined with Bartal’s static HST embedding reduction, this leads to an \(O((\log k)^2 \log n)\)-competitive algorithm on any n-point metric space. We give a new dynamic HST embedding that yields an \(O((\log k)^3 \log \Delta)\)-competitive algorithm on any metric space where the ratio of the largest to smallest non-zero distance is at most \(\Delta\).

@inproceedings{BCLLM18,
  author    = {S{\'{e}}bastien Bubeck and
               Michael B. Cohen and
               Yin Tat Lee and
               James R. Lee and
               Aleksander Madry},
  title     = {k-server via multiscale entropic regularization},
  booktitle = {Proceedings of the 50th Annual {ACM} {SIGACT} Symposium on Theory
               of Computing, {STOC} 2018, Los Angeles, CA, USA, June 25-29, 2018},
  pages     = {3--16},
  year      = {2018},
}
[ abs | bib ]   Separators in region intersection graphs (ITCS'17)

A region intersection graph over a base graph \(G_0\) is a graph \(G\) whose vertices correspond to connected subsets of \(G_0\) with an edge between two vertices if the corresponding regions intersect. We show that if \(G_0\) excludes the complete graph \(K_h\) as a minor, then every region intersection graph over \(G_0\) with m edges has a balanced separator with \(O(\sqrt{m})\) nodes. If \(G\) has uniformly bounded vertex degrees, we show the separator is found by spectral partitioning.

A string graph is the intersection graph of continuous arcs in the plane. The preceding result implies that every string graph with m edges has a balanced separator of size \(O(\sqrt{m})\). This bound is optimal, as it generalizes the planar separator theorem. It confirms a conjecture of Fox and Pach (2010) and improves over the \(O(\sqrt{m} \log m)\) bound of Matousek (2013).

@inproceedings{LeeStrings17,
    author    = {James R. Lee},
    title     = {Separators in Region Intersection Graphs},
    booktitle = {8th Innovations in Theoretical Computer Science Conference, {ITCS}
    2017, January 9-11, 2017, Berkeley, CA, {USA}},
    pages     = {1:1--1:8},
    year      = {2017},
}
[ abs | bib ]   Lower bounds on the size of semidefinite programming relaxations, with P. Raghavendra and D. Steurer (STOC'15)

[credit: Bernd Sturmfels]

We introduce a method for proving lower bounds on the efficacy of semidefinite programming (SDP) relaxations for combinatorial problems. In particular, we show that the cut, TSP, and stable set polytopes on n-vertex graphs are not the linear image of the feasible region of any SDP (i.e., any spectrahedron) of dimension less than \(2^{n^c}\), for some constant \(c > 0\). This result yields the first super-polynomial lower bounds on the semidefinite extension complexity of any explicit family of polytopes.

Our results follow from a general technique for proving lower bounds on the positive semidefinite rank of a matrix. To this end, we establish a close connection between arbitrary SDPs and those arising from the sum-of-squares SDP hierarchy. For approximating maximum constraint satisfaction problems, we prove that SDPs of polynomial-size are equivalent in power to those arising from degree-O(1) sum-of-squares relaxations. This result implies, for instance, that no family of polynomial-size SDP relaxations can achieve better than a 7/8-approximation for MAX-3-SAT.

[ PPT slides ]

Best paper award, STOC 2015

@inproceedings{LRS15,
     AUTHOR = {James R. Lee and Prasad Raghavendra and David Steurer},
  BOOKTITLE = {Proceedings of the Forty-Seventh Annual {ACM} on Symposium on Theory
                  of Computing, {STOC} 2015, Portland, OR, USA, June
                  14-17, 2015},
       NOTE = {Full version at \verb|https://arxiv.org/abs/1411.6317|},
      PAGES = {567--576},
      TITLE = {Lower Bounds on the Size of Semidefinite Programming Relaxations},
       YEAR = {2015}
}
[ abs | bib ]   On the power of symmetric LP and SDP relaxations, with P. Raghavendra, D. Steurer, and N. Tan (CCC'14)

We study the computational power of general symmetric relaxations for combinatorial optimization problems, both in the linear programming (LP) and semidefinite programming (SDP) case. We show new connections to explicit LP and SDP relaxations, like those obtained from standard hierarchies.

@inproceedings{DBLP:conf/coco/LeeRST14,
  author    = {James R. Lee and
               Prasad Raghavendra and
               David Steurer and
               Ning Tan},
  title     = {On the Power of Symmetric {LP} and {SDP} Relaxations},
  booktitle = {{IEEE} 29th Conference on Computational Complexity, {CCC} 2014, Vancouver,
               BC, Canada, June 11-13, 2014},
  pages     = {13--21},
  year      = {2014},
}
[ abs | bib ]   Approximate constraint satisfaction requires large LP relaxations, with S. O. Chan, P. Raghavendra, and D. Steurer (FOCS'13; JACM)

We prove super-polynomial lower bounds on the size of linear programming relaxations for approximation versions of constraint satisfaction problems. We show that for these problems, polynomial-sized linear programs are exactly as powerful as programs arising from a constant number of rounds of the Sherali-Adams hierarchy. In particular, any polynomial-sized linear program for Max Cut has an integrality gap of 1/2 and any such linear program for Max 3-Sat has an integrality gap of 7/8.

Video: Linear programming and constraint satisfaction (Simons Institute)

@article {CLRS16,
    AUTHOR = {Chan, Siu On and Lee, James R. and Raghavendra, Prasad and
              Steurer, David},
     TITLE = {Approximate constraint satisfaction requires large {LP}
              relaxations},
   JOURNAL = {J. ACM},
  FJOURNAL = {Journal of the ACM},
    VOLUME = {63},
      YEAR = {2016},
    NUMBER = {4},
     PAGES = {Art. 34, 22},
}
[ abs | bib ]   A node-capacitated Okamura-Seymour theorem, with M. Mendel and M. Moharrami (STOC'13; Math. Prog. Ser. A.)

The classical Okamura-Seymour theorem states that for an edge-capacitated, multi-commodity flow instance in which all terminals lie on a single face of a planar graph, there exists a feasible concurrent flow if and only if the cut conditions are satisfied. Simple examples show that a similar theorem is impossible in the node-capacitated setting. Nevertheless, we prove that an approximate flow/cut theorem does hold: For some universal c > 0, if the node cut conditions are satisfied, then one can simultaneously route a c-fraction of all the demands.

This answers an open question of Chekuri and Kawarabayashi. More generally, we show that this holds in the setting of multi-commodity polymatroid networks introduced by Chekuri, et. al. Our approach employs a new type of random metric embedding in order to round the convex programs corresponding to these more general flow problems.

@article {LMM15,
    AUTHOR = {Lee, James R. and Mendel, Manor and Moharrami, Mohammad},
     TITLE = {A node-capacitated {O}kamura-{S}eymour theorem},
   JOURNAL = {Math. Program.},
  FJOURNAL = {Mathematical Programming},
    VOLUME = {153},
      YEAR = {2015},
    NUMBER = {2, Ser. A},
     PAGES = {381--415},
}
[ abs | bib ]   Genus and the geometry of the cut graph, with A. Sidiropoulos (SODA'10)

We study the quantitative geometry of graphs in terms of their genus, using the structure of certain cut graphs. These are subgraphs whose removal leaves a planar graph.

@inproceedings{DBLP:conf/soda/LeeS10,
  author    = {James R. Lee and
               Anastasios Sidiropoulos},
  title     = {Genus and the Geometry of the Cut Graph},
  booktitle = {Proceedings of the Twenty-First Annual {ACM-SIAM} Symposium on Discrete
               Algorithms, {SODA} 2010, Austin, Texas, USA, January 17-19, 2010},
  pages     = {193--201},
  year      = {2010},
}
[ abs | bib ]   Randomly removing $g$ handles at once, with G. Borradaile and A. Sidiropoulos (SoCG'09; CGTA)

Indyk and Sidiropoulos (2007) proved that any orientable graph of genus $g$ can be probabilistically embedded into a graph of genus $g−1$ with constant distortion. Viewing a graph of genus $g$ as embedded on the surface of a sphere with $g$ handles attached, Indyk and Sidiropoulos’ method gives an embedding into a distribution over planar graphs with distortion $O(g)$ by iteratively removing the handles. By removing all g handles at once, we present a probabilistic embedding with distortion $O(g^2)$ for both orientable and non-orientable graphs. Our result is obtained by showing that the nimum-cut graph of Erickson and Har Peled (2004) has low dilation, and then randomly cutting this graph out of the surface using the Peeling Lemma of Lee and Sidiropoulos (2009).

@article{BLS10,
     AUTHOR = {Borradaile, Glencora and Lee, James R. and Sidiropoulos,
                  Anastasios},
   FJOURNAL = {Computational Geometry. Theory and Applications},
    JOURNAL = {Comput. Geom.},
     NUMBER = {8},
      PAGES = {655--662},
      TITLE = {Randomly removing {$g$} handles at once},
     VOLUME = {43},
       YEAR = {2010}
}
[ abs | bib ]   Vertex cuts, random walks, and dimension reduction in series-parallel graphs, with B. Brinkman and A. Karagiozova (STOC'07)

We consider questions about vertex cuts in graphs, random walks in metric spaces, and dimension reduction in $\ell_1$ and $\ell_2$; these topics are intimately connected because they can each be reduced to the existence of various families of real-valued Lipschitz maps on certain metric spaces. We view these issues through the lens of shortest-path metrics on series-parallel graphs, and we discuss the implications for a variety of well-known open problems.

@inproceedings{DBLP:conf/stoc/BrinkmanKL07,
     AUTHOR = {Bo Brinkman and Adriana Karagiozova and James R. Lee},
  BOOKTITLE = {Proceedings of the 39th Annual {ACM} Symposium on Theory of
                  Computing, San Diego, California, USA, June 11-13,
                  2007},
        DOI = {10.1145/1250790.1250882},
      PAGES = {621--630},
      TITLE = {Vertex cuts, random walks, and dimension reduction in series-
                  parallel graphs},
       YEAR = {2007}
}
[ abs | bib ]   \(L_p\) metrics on the Heisenberg group and the Goemans-Linial conjecture, with A. Naor (FOCS'06)

[credit: Patrick Massot]

We prove that the function \(d : \mathbb{R}^3 \times \mathbb{R}^3 \to [0,\infty)\) given by

\begin{align*} d(&(x,y,z),(t,u,v)) \\ &= \left(\left[\left((t-x)^2+(u-y)^2\right)^2 + (v-z+2xu-2yt)^2\right]^{1/2} + (t-x)^2 + (u-y)^2\right)^{1/2} \end{align*}

is a metric on \(\mathbb{R}^3\) such that \((\mathbb{R}^3, \sqrt{d})\) is isometric to a subset of Hilbert space, yet \((\mathbb{R}^3,d)\) does not admit a bi-Lipschitz embedding into \(L_1\). This yields a new simple counter example to the Goemans-Linial conjecture on the integrality gap of the SDP relaxation of the Sparsest Cut problem.

Our methods involve Fourier analytic techniques, combined with a breakthrough of Cheeger and Kleiner, along with classical results of Pansu on the differentiability of Lispchitz functions on the Heisenberg group.

@inproceedings{LN06,
  author    = {James R. Lee and
               Assaf Naor},
  title     = {Lp metrics on the Heisenberg group and the Goemans-Linial conjecture},
  booktitle = {47th Annual {IEEE} Symposium on Foundations of Computer Science {(FOCS}
               2006), 21-24 October 2006, Berkeley, California, USA, Proceedings},
  pages     = {99--108},
  year      = {2006},
}
[ abs | bib ]   Volume distortion for subsets of Euclidean spaces (SoCG'06, DCG)

We present an $O(\sqrt{\log n} \log \log n)$ upper bound on the $(n-1)$-dimensional volume distortion of $n$-point subsets of a Euclidean space, nearly resolving the main open questions of [Dunagan-Vempala 2001] and [Krauthgamer-Linial-Magen 2004]. The best previous bound was $O(\log n)$ and our bound is nearly tight, as even the $2$-dimensional volume distortion of an $n$-vertex path is $\Omega(\sqrt{\log n})$.

@article{LeeVolume09,
     AUTHOR = {Lee, James R.},
   FJOURNAL = {Discrete \& Computational Geometry. An International Journal of
                  Mathematics and Computer Science},
    JOURNAL = {Discrete Comput. Geom.},
     NUMBER = {4},
      PAGES = {590--615},
      TITLE = {Volume distortion for subsets of {E}uclidean spaces},
     VOLUME = {41},
       YEAR = {2009}
}
[ abs | bib ]   Algorithms on negatively curved spaces, with R. Krauthgamer (FOCS'06)

We explore approximation algorithms on negatively curved metric spaces in the sense of Gromov. For instance, we show that the Traveling Salesman Problem admits a PTAS when the set of points lies in hyperbolic $d$-space.

@inproceedings{KL06,
  author    = {Robert Krauthgamer and
               James R. Lee},
  title     = {Algorithms on negatively curved spaces},
  booktitle = {47th Annual {IEEE} Symposium on Foundations of Computer Science {(FOCS}
               2006), 21-24 October 2006, Berkeley, California, USA, Proceedings},
  pages     = {119--132},
  year      = {2006},
}
[ abs | bib ]   An improved approximation ratio for the minimum linear arrangement problem, with U. Feige (IPL 2007)

We observe that combining the techniques behind the Sparsest Cut SDP analysis of Arora, Rao, and Vazirani, with the rounding algorithm of Rao and Richa yields an $O(\sqrt{\log n} \log \log n)$-approximation for the minimum linear arrangement problem, improving over the previous bound of $O(\log n)$.

@article {FL07,
    AUTHOR = {Feige, Uriel and Lee, James R.},
     TITLE = {An improved approximation ratio for the minimum linear
              arrangement problem},
   JOURNAL = {Inform. Process. Lett.},
  FJOURNAL = {Information Processing Letters},
    VOLUME = {101},
      YEAR = {2007},
    NUMBER = {1},
     PAGES = {26--29},
}
[ abs | bib ]   Euclidean distortion and the Sparsest Cut, with S. Arora and A. Naor (STOC'05; JAMS)

We prove that every \(n\)-point metric space of negative type (and, in particular, every \(n\)-point subset of \(L_1\)) embeds into a Euclidean space with distortion \(O(\sqrt{\log n} \log \log n)\), a result which is tight up to the iterated logarithm factor. As a consequence, we obtain the best known polynomial-time approximation algorithm for the Sparsest Cut problem with general demands. If the demand is supported on a set of size \(k\), we achieve an approximation ratio of \(O(\sqrt{\log k} \log \log k)\).

@article {LN08,
    AUTHOR = {Arora, Sanjeev and Lee, James R. and Naor, Assaf},
     TITLE = {Euclidean distortion and the sparsest cut},
   JOURNAL = {J. Amer. Math. Soc.},
  FJOURNAL = {Journal of the American Mathematical Society},
    VOLUME = {21},
      YEAR = {2008},
    NUMBER = {1},
     PAGES = {1--21},
}
[ abs | bib ]   Improved approximation algorithms for minimum-weight vertex separators, with M.-T. Hajiaghayi and U. Feige (STOC'05; SICOMP)

We develop the algorithmic theory of vertex separators, and its relation to embeddings of certain metric spaces. As a consequence, we obtain an \(O(\sqrt{\log n})\)-approximation for min-ratio vertex cuts in general graphs based on a new SDP relaxation and a tight analysis of its integrality gap.

Our results allow yield algorithms for constructing approximately optimal tree decompositions of graphs. If a graph has treewidth \(k\), we find a tree decomposition of width at most \(O(k \sqrt{\log k})\). If, additionally, the input graph excludes a fixed graph as a minor, we produce a tree decomposition of width \(O(k)\).

Related: A similar method gives improved bounds for the minimum linear arrangement problem.

@article {FHL08,
    AUTHOR = {Feige, Uriel and Hajiaghayi, Mohammadtaghi and Lee, James R.},
     TITLE = {Improved approximation algorithms for minimum weight vertex
              separators},
   JOURNAL = {SIAM J. Comput.},
  FJOURNAL = {SIAM Journal on Computing},
    VOLUME = {38},
      YEAR = {2008},
    NUMBER = {2},
     PAGES = {629--657},
}
[ abs | bib ]   Distance scales, embeddings, and metrics of negative type (SODA'05)

[Note: This is the (unpublished) full version of a paper appearing in the SODA 2005 conference under the name Distance scales, embeddings, and efficient relaxations of the cut cone.]

We introduce a new number of new techniques for the construction of low-distortion embeddings of a finite metric space. These include a generic Gluing Lemma which avoids the overhead typically incurred from the naı̈ve concatenation of maps for different scales of a space. We also give a significantly improved and quantitatively optimal version of the main structural theorem of Arora, Rao, and Vazirani on separated sets in metrics of negative type. The latter result offers a simple hyperplane rounding algorithm for the computation of an \(O(\sqrt{\log n})\)-approximation to the Sparsest Cut problem with uniform demands, and has a number of other applications to embeddings and approximation algorithms.

For a more elegant and simpler proof of the “Big Core” Theorem, one can consult these notes of Rothvoss.

@inproceedings{DBLP:conf/soda/Lee05,
     AUTHOR = {James R. Lee},
  BOOKTITLE = {Proceedings of the Sixteenth Annual {ACM-SIAM} Symposium on Discrete
                  Algorithms, {SODA} 2005, Vancouver, British
                  Columbia, Canada, January 23-25, 2005},
      PAGES = {92--101},
      TITLE = {On distance scales, embeddings, and efficient relaxations of the cut
                  cone},
       YEAR = {2005}
}
[ abs | bib ]   The black-box complexity of nearest-neighbor search, with R. Krauthgamer (ICALP'04; TCS)

This paper establishes that a metric space $X$ supports approximate nearest-neighbor queries against any $n$-point subset with at most $\mathrm{poly}(\log n)$ query time if and only if one has certain quantitative estimates on the doubling constant of $n$-point subsets.

@article {KL05,
    AUTHOR = {Krauthgamer, Robert and Lee, James R.},
     TITLE = {The black-box complexity of nearest-neighbor search},
   JOURNAL = {Theoret. Comput. Sci.},
  FJOURNAL = {Theoretical Computer Science},
    VOLUME = {348},
      YEAR = {2005},
    NUMBER = {2-3},
     PAGES = {262--276},
}
[ abs | bib ]   Navigating nets: Simple algorithms for proximity search, with R. Krauthgamer (SODA'04)

We present simple deterministic data structures for maintaining a set of points in a metric space under insertions and deletions, with the ability to make fast nearest-neighbor queries. The running-time of the operations scales with the doubling constant of the underlying metric space.

@inproceedings {KL04soda,
    AUTHOR = {Krauthgamer, Robert and Lee, James R.},
     TITLE = {Navigating nets: simple algorithms for proximity search},
 BOOKTITLE = {Proceedings of the {F}ifteenth {A}nnual {ACM}-{SIAM}
              {S}ymposium on {D}iscrete {A}lgorithms},
     PAGES = {798--807},
 PUBLISHER = {ACM, New York},
      YEAR = {2004},
   MRCLASS = {68P05},
  MRNUMBER = {2290969},
}
[ abs | bib ]   Bounded geometries, fractals, and low-distortion embeddings, with A. Gupta and R. Krauthgamer (FOCS'03)

We study the quantitative ditsortion required to embed finite, uniformly doubling metric spaces into normed spaces.

@inproceedings {GKL03,
  author    = {Anupam Gupta and
               Robert Krauthgamer and
               James R. Lee},
  title     = {Bounded Geometries, Fractals, and Low-Distortion Embeddings},
  booktitle = {44th Symposium on Foundations of Computer Science {(FOCS} 2003), 11-14
               October 2003, Cambridge, MA, USA, Proceedings},
  pages     = {534--543},
  year      = {2003},
  doi       = {10.1109/SFCS.2003.1238226},
}
[ abs | bib ]   Hardness of approximation for vertex-connectivity network design problems, with G. Kortsarz and R. Krauthgamer (APPROX'02, SICOMP)

We establish lower bounds on the polynomial-time approximability of the vertex-connectivity variant of the Survivable Network Design Problem (SNDP) under the assumption that \(\mathsf{NP}\) does not have quasipolynomial-time algorithms. Some related hardness of approximation results are established, e.g., for the edge and vertex-connectivity augmention problems.

@article {KKL04,
    AUTHOR = {Kortsarz, Guy and Krauthgamer, Robert and Lee, James R.},
     TITLE = {Hardness of approximation for vertex-connectivity network
              design problems},
   JOURNAL = {SIAM J. Comput.},
  FJOURNAL = {SIAM Journal on Computing},
    VOLUME = {33},
      YEAR = {2004},
    NUMBER = {3},
     PAGES = {704--720},
}