Families of costs with zero and nonnegative MTW tensor in optimal transport and the c-divergences
January 01, 2024 · Declared Dead · Information Geometry
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors: Du Nguyen
arXiv ID: 2401.00953
Category: math.AP
Cross-listed: cs.IT, cs.LG, stat.ML
Citations: 0
Venue: Information Geometry
Last Checked: 1 month ago
Abstract
We study the information geometry of $\mathsf{c}$-divergences from families of costs of the form $\mathsf{c}(x, \bar{x}) = \mathsf{u}(x^{\mathfrak{t}}\bar{x})$ through the optimal transport point of view. Here, $\mathsf{u}$ is a scalar function with inverse $\mathsf{s}$, and $x^{\mathfrak{t}}\bar{x}$ is a nondegenerate bilinear pairing of vectors $x, \bar{x}$ belonging to an open subset of $\mathbb{R}^n$. We compute explicitly the MTW tensor (or cross curvature) for the optimal transport problem on $\mathbb{R}^n$ with this cost. The condition that the MTW tensor vanishes on null vectors under the Kim-McCann metric is a fourth-order nonlinear ODE, which can be reduced to a linear ODE of the form $\mathsf{s}^{(2)} - S\mathsf{s}^{(1)} + P\mathsf{s} = 0$ with constant coefficients $P$ and $S$. The resulting inverse functions include \emph{Lambert} and \emph{generalized inverse hyperbolic/trigonometric} functions. The square Euclidean metric and $\log$-type costs are equivalent to instances of these solutions. The optimal map may be written explicitly in terms of the potential function. For cost functions of a similar form on a hyperboloid model of the hyperbolic space and on the unit sphere, we also express this tensor in terms of algebraic expressions in derivatives of $\mathsf{s}$ using the Gauss-Codazzi equation, obtaining new families of strictly regular costs for these manifolds, including new families of \emph{power function costs}. We express the divergence geometry of the $\mathsf{c}$-divergence in terms of the Kim-McCann metric, including a $\mathsf{c}$-Crouzeix identity and a formula for the primal connection. We analyze the $\sinh$-type hyperbolic cost, providing examples of $\mathsf{c}$-convex functions, which are used to construct a new \emph{local form} of the $\alpha$-divergences on probability simplices. We apply the optimal maps to sample the multivariate $t$-distribution.
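The reduced equation $\mathsf{s}^{(2)} - S\mathsf{s}^{(1)} + P\mathsf{s} = 0$ quoted in the abstract is a standard constant-coefficient linear ODE. Its solutions are determined by the roots of the characteristic polynomial $r^2 - Sr + P$. The sketch below is illustrative only and is not taken from the paper: the names `solution` and `residual`, the argument name `t`, and the constants `a`, `b` are assumptions made for this example. It builds one solution for each sign of the discriminant $S^2 - 4P$ and checks the ODE numerically with central finite differences. Which branch corresponds to which named cost family is not stated in the abstract, so no such mapping is claimed here.

```python
import math


def solution(S, P, a=1.0, b=1.0):
    """Return one solution s(t) of s'' - S s' + P s = 0.

    The branch is chosen by the discriminant of r^2 - S r + P.
    a and b are arbitrary integration constants (hypothetical defaults).
    """
    disc = S * S - 4.0 * P
    if disc > 0:
        # Two distinct real roots: sum of two exponentials.
        r1 = (S + math.sqrt(disc)) / 2.0
        r2 = (S - math.sqrt(disc)) / 2.0
        return lambda t: a * math.exp(r1 * t) + b * math.exp(r2 * t)
    if disc == 0:
        # Double root: polynomial-times-exponential solution.
        r = S / 2.0
        return lambda t: (a + b * t) * math.exp(r * t)
    # Complex conjugate roots: damped or growing oscillation.
    alpha = S / 2.0
    beta = math.sqrt(-disc) / 2.0
    return lambda t: math.exp(alpha * t) * (
        a * math.cos(beta * t) + b * math.sin(beta * t)
    )


def residual(s, S, P, t, h=1e-4):
    """Approximate s''(t) - S s'(t) + P s(t) with central differences."""
    d1 = (s(t + h) - s(t - h)) / (2.0 * h)
    d2 = (s(t + h) - 2.0 * s(t) + s(t - h)) / (h * h)
    return d2 - S * d1 + P * s(t)


if __name__ == "__main__":
    # One (S, P) pair per branch: distinct real, double, and complex roots.
    for S, P in [(3.0, 2.0), (2.0, 1.0), (0.0, 1.0)]:
        s = solution(S, P)
        print(S, P, abs(residual(s, S, P, 0.3)) < 1e-4)
```

The three branches give exponential, polynomial-times-exponential, and trigonometric solutions for $\mathsf{s}$. This is consistent with the abstract's remark that the inverse functions include Lambert-type and generalized inverse hyperbolic/trigonometric functions, although the precise correspondence is established in the paper, not in this sketch.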
Similar Papers
In the same crypt: math.AP
Consistency of Lipschitz learning with infinite unlabeled data and finite labeled data (R.I.P., Ghosted)
Properly-weighted graph Laplacian for semi-supervised learning (R.I.P., Ghosted)
Quantum optimal transport is cheaper (R.I.P., Ghosted)
Graph clustering, variational image segmentation methods and Hough transform scale detection for object measurement in images (R.I.P., Ghosted)
The limit shape of convex hull peeling (R.I.P., Ghosted)
Died the same way: Ghosted
Language Models are Few-Shot Learners (R.I.P., Ghosted)
PyTorch: An Imperative Style, High-Performance Deep Learning Library (R.I.P., Ghosted)
XGBoost: A Scalable Tree Boosting System (R.I.P., Ghosted)