Difference between revisions of "Open Problems:7"

From Open Problems in Sublinear Algorithms
Jump to: navigation, search
(Created page with "{{DISPLAYTITLE:Problem 7: Estimating Earth-Mover Distance}} {{Infobox |label1 = Proposed by |data1 = Piotr Indyk |label2 = Source |data2 = [[Workshops:Kanpur_2006|Kanpur 2006]...")
 
 
(3 intermediate revisions by one other user not shown)
Line 1: Line 1:
{{DISPLAYTITLE:Problem 7: Estimating Earth-Mover Distance}}
+
{{Header
{{Infobox
+
|source=kanpur06
|label1 = Proposed by
+
|who=Piotr Indyk
|data1 = Piotr Indyk
 
|label2 = Source
 
|data2 = [[Workshops:Kanpur_2006|Kanpur 2006]]
 
|label3 = Short link
 
|data3 = http://sublinear.info/7
 
 
}}
 
}}
 
Consider a stream of red points $R$ and blue points $B$ from a 2-dimensional
 
Consider a stream of red points $R$ and blue points $B$ from a 2-dimensional
Line 28: Line 23:
 
distortion {{cite|NaorS-06}}.
 
distortion {{cite|NaorS-06}}.
 
So one would need to do something else to get $O(1)$-approximation.
 
So one would need to do something else to get $O(1)$-approximation.
 +
 +
==Update==
 +
 +
It was shown that the decision version of the problem (distinguish the cases, when the EMD distance is at most $1$ vs. at least $D$ for $D = O(1)$) does not allow sketches of the constant size {{cite|AndoniKR-14}}.

Latest revision as of 21:26, 11 November 2014

Suggested by Piotr Indyk
Source Kanpur 2006
Short link https://sublinear.info/7

Consider a stream of red points $R$ and blue points $B$ from a 2-dimensional grid $[\Delta]^2$, in an arbitrary order. We assume $|R|=|B|=n$. The Earth-Mover Distance (EMD) between $R$ and $B$ is the value of the min-cost matching between $R$ and $B$, i.e., \[ \mbox{EMD}(R,B)= \min_{\pi: R \to B} \sum_{p \in R} \|p-\pi(p)\|, \] where $\pi$ is a one-to-one mapping, and $\|\cdot\|$ is (say) the $L_1$ norm.

What are the space vs. approximation tradeoffs achievable by streaming algorithms for this problem? In particular, is there an $O(1)$-approximation algorithm using $(\log n+\log \Delta)^{O(1)}$ space?

It is known that that there is an $O(\log \Delta)$-approximation algorithm using that much space [Indyk-04]. That algorithm proceeds essentially by embedding EMD into $L_1$ [Charikar-02,IndykT-03]. However, any such embedding must incur at least $\Omega( \sqrt{\log \Delta})$ distortion [NaorS-06]. So one would need to do something else to get $O(1)$-approximation.

Update[edit]

It was shown that the decision version of the problem (distinguish the cases, when the EMD distance is at most $1$ vs. at least $D$ for $D = O(1)$) does not allow sketches of the constant size [AndoniKR-14].