Making mean-estimation more efficient using an MCMC trace variance approach: DynaMITE

November 22, 2020 Β· Declared Dead Β· πŸ› arXiv.org

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Cyrus Cousins, Shahrzad Haddadan, Eli Upfal arXiv ID 2011.11129 Category cs.DS: Data Structures & Algorithms Citations 8 Venue arXiv.org Last Checked 4 months ago
Abstract
We introduce a novel statistical measure for MCMC-mean estimation, the inter-trace variance ${\rm trv}^{(Ο„_{rel})}({\cal M},f)$, which depends on a Markov chain ${\cal M}$ and a function $f:S\to [a,b]$. The inter-trace variance can be efficiently estimated from observed data and leads to a more efficient MCMC-mean estimator. Prior MCMC mean-estimators receive, as input, upper-bounds on $Ο„_{mix}$ or $Ο„_{rel}$, and often also the stationary variance, and their performance is highly dependent to the sharpness of these bounds. In contrast, we introduce DynaMITE, which dynamically adjusts the sample size, it is less sensitive to the looseness of input upper-bounds on $Ο„_{rel}$, and requires no bound on $v_Ο€$. Receiving only an upper-bound ${\cal T}_{rel}$ on $Ο„_{rel}$, DynaMITE estimates the mean of $f$ in $\tilde{\cal{O}}\bigl(\smash{\frac{{\cal T}_{rel} R}{\varepsilon}}+\frac{Ο„_{rel}\cdot {\rm trv}^{(Ο„{rel})}}{\varepsilon^{2}}\bigr)$ steps, without a priori bounds on the stationary variance $v_Ο€$ or the inter-trace variance ${\rm trv}^{(Ο„rel)}$. Thus we depend minimally on the tightness of ${\cal T}_{mix}$, as the complexity is dominated by $Ο„_{rel}\rm{trv}^{(Ο„{rel})}$ as $\varepsilon \to 0$. Note that bounding $Ο„_{\rm rel}$ is known to be prohibitively difficult, however, DynaMITE is able to reduce its principal dependence on ${\cal T}_{rel}$ to $Ο„_{rel}$, simply by exploiting properties of the inter-trace variance. To compare our method to known variance-aware bounds, we show ${\rm trv}^{(Ο„{rel})}({\cal M},f) \leq v_Ο€$. Furthermore, we show when $f$'s image is distributed (semi)symmetrically on ${\cal M}$'s traces, we have ${\rm trv}^{({Ο„{rel}})}({\cal M},f)=o(v_Ο€(f))$, thus DynaMITE outperforms prior methods in these cases.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Data Structures & Algorithms

Died the same way β€” πŸ‘» Ghosted