Almost Optimal Bounds for Sublinear-Time Sampling of $k$-Cliques: Sampling Cliques is Harder Than Counting
December 07, 2020 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Talya Eden, Dana Ron, Will Rosenbaum
arXiv ID
2012.04090
Category
cs.DS: Data Structures & Algorithms
Citations
10
Venue
arXiv.org
Last Checked
4 months ago
Abstract
In this work, we consider the problem of sampling a $k$-clique in a graph from an almost uniform distribution in sublinear time in the general graph query model. Specifically the algorithm should output each $k$-clique with probability $(1\pm Ξ΅)/n_k$, where $n_k$ denotes the number of $k$-cliques in the graph and $Ξ΅$ is a given approximation parameter. We prove that the query complexity of this problem is \[ Ξ^*\left(\max\left\{ \left(\frac{(nΞ±)^{k/2}}{ n_k}\right)^{\frac{1}{k-1}} ,\; \min\left\{nΞ±,\frac{nΞ±^{k-1}}{n_k} \right\}\right\}\right). \] where $n$ is the number of vertices in the graph, $Ξ±$ is its arboricity, and $Ξ^*$ suppresses the dependence on $(\log n/Ξ΅)^{O(k)}$. Interestingly, this establishes a separation between approximate counting and approximate uniform sampling in the sublinear regime. For example, if $k=3$, $Ξ±= O(1)$, and $n_3$ (the number of triangles) is $Ξ(n)$, then we get a lower bound of $Ξ©(n^{1/4})$ (for constant $Ξ΅$), while under these conditions, a $(1\pm Ξ΅)$-approximation of $n_3$ can be obtained by performing $\textrm{poly}(\log(n/Ξ΅))$ queries (Eden, Ron and Seshadhri, SODA20). Our lower bound follows from a construction of a family of graphs with arboricity $Ξ±$ such that in each graph there are $n_k$ cliques (of size $k$), where one of these cliques is "hidden" and hence hard to sample. Our upper bound is based on defining a special auxiliary graph $H_k$, such that sampling edges almost uniformly in $H_k$ translates to sampling $k$-cliques almost uniformly in the original graph $G$. We then build on a known edge-sampling algorithm (Eden, Ron and Rosenbaum, ICALP19) to sample edges in $H_k$, where the challenge is simulate queries to $H_k$ while being given access only to $G$.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Data Structures & Algorithms
π
π
The Cartographer
R.I.P.
π»
Ghosted
Route Planning in Transportation Networks
R.I.P.
π»
Ghosted
Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration
R.I.P.
π»
Ghosted
Hierarchical Clustering: Objective Functions and Algorithms
R.I.P.
π»
Ghosted
Graph Isomorphism in Quasipolynomial Time
π
π
The Cartographer
Simulation optimization: A review of algorithms and applications
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted