Noisy, sparse, nonlinear: Navigating the Bermuda Triangle of physical inference with deep filtering
November 19, 2019 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Carl Poelking, Yehia Amar, Alexei Lapkin, Lucy Colwell
arXiv ID
1912.04345
Category
physics.chem-ph
Cross-listed
cs.LG,
stat.ML
Citations
1
Venue
arXiv.org
Last Checked
3 months ago
Abstract
Capturing the microscopic interactions that determine molecular reactivity poses a challenge across the physical sciences. Even a basic understanding of the underlying reaction mechanisms can substantially accelerate materials and compound design, including the development of new catalysts or drugs. Given the difficulties routinely faced by both experimental and theoretical investigations that aim to improve our mechanistic understanding of a reaction, recent advances have focused on data-driven routes to derive structure-property relationships directly from high-throughput screens. However, even these high-quality, high-volume data are noisy, sparse and biased -- placing them in a regime where machine-learning is extremely challenging. Here we show that a statistical approach based on deep filtering of nonlinear feature networks results in physicochemical models that are more robust, transparent and generalize better than standard machine-learning architectures. Using diligent descriptor design and data post-processing, we exemplify the approach using both literature and fresh data on asymmetric catalytic hydrogenation, Palladium-catalyzed cross-coupling reactions, and drug-drug synergy. We illustrate how the sparse models uncovered by the filtering help us formulate physicochemical reaction ``pharmacophores'', investigate experimental bias and derive strategies for mechanism detection and classification.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β physics.chem-ph
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Machine learning for molecular simulation
R.I.P.
π
404 Not Found
TorchMD: A deep learning framework for molecular simulations
R.I.P.
π»
Ghosted
Coarse-Graining Auto-Encoders for Molecular Dynamics
R.I.P.
π»
Ghosted
Sampling molecular conformations and dynamics in a multi-user virtual reality framework
R.I.P.
π»
Ghosted
A Self-Attention Ansatz for Ab-initio Quantum Chemistry
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Language Models are Few-Shot Learners
R.I.P.
π»
Ghosted
PyTorch: An Imperative Style, High-Performance Deep Learning Library
R.I.P.
π»
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
π»
Ghosted