Unsupervised Learning of Molecular Embeddings for Enhanced Clustering and Emergent Properties for Chemical Compounds
October 25, 2023 · Declared Dead · arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Jaiveer Gill, Ratul Chakraborty, Reetham Gubba, Amy Liu, Shrey Jain, Chirag Iyer, Obaid Khwaja, Saurav Kumar
arXiv ID
2310.18367
Category
physics.chem-ph
Cross-listed
cs.AI, cs.CV, cs.LG
Citations
0
Venue
arXiv.org
Last Checked
3 months ago
Abstract
The detailed analysis of molecular structures and properties holds great potential for drug development discovery through machine learning. Developing an emergent property in the model to understand molecules would broaden the horizons for development with a new computational tool. We introduce various methods to detect and cluster chemical compounds based on their SMILES data. Our first method, analyzing the graphical structures of chemical compounds using embedding data, employs vector search to meet our threshold value. The results yielded pronounced, concentrated clusters, and the method produced favorable results in querying and understanding the compounds. We also used natural language description embeddings stored in a vector database with GPT3.5, which outperforms the base model. Thus, we introduce a similarity search and clustering algorithm to aid in searching for and interacting with molecules, enhancing efficiency in chemical exploration and enabling future development of emergent properties in molecular property prediction models.
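The abstract mentions vector search against a threshold value and a similarity search and clustering algorithm over SMILES-derived embeddings, but gives no details and links no code. The following is only a minimal sketch of that general idea, not the authors' method: it assumes cosine similarity as the metric, a greedy single-pass clustering rule, and made-up 3-dimensional vectors in place of real embeddings. The names `threshold_search`, `threshold_clusters`, and `TOY_INDEX` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def threshold_search(query, index, threshold):
    """Return (key, score) pairs scoring at least `threshold`, best first."""
    scored = [(key, cosine_similarity(query, vec)) for key, vec in index.items()]
    hits = [(key, score) for key, score in scored if score >= threshold]
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


def threshold_clusters(index, threshold):
    """Greedy single-pass clustering (an assumption, not the paper's algorithm).

    Each item joins the first cluster whose seed vector is similar enough;
    otherwise it starts a new cluster with itself as the seed.
    """
    clusters = []  # list of (seed_vector, member_keys)
    for key, vec in index.items():
        for seed, members in clusters:
            if cosine_similarity(seed, vec) >= threshold:
                members.append(key)
                break
        else:
            clusters.append((vec, [key]))
    return [members for _, members in clusters]


# Made-up embeddings keyed by SMILES strings; the values are illustrative only.
TOY_INDEX = {
    "CCO": [0.9, 0.1, 0.0],         # ethanol
    "CCCO": [0.8, 0.2, 0.1],        # 1-propanol
    "c1ccccc1": [0.0, 0.1, 0.95],   # benzene
}

hits = threshold_search([0.9, 0.1, 0.0], TOY_INDEX, threshold=0.9)
clusters = threshold_clusters(TOY_INDEX, threshold=0.9)
print(hits)
print(clusters)
```

With these toy vectors the two alcohols land in one cluster and benzene in another. A real pipeline would replace the hand-written vectors with learned embeddings and the linear scan with a vector database, as the abstract describes.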
Similar Papers
In the same crypt: physics.chem-ph
R.I.P. · Ghosted
R.I.P. · Ghosted
Machine learning for molecular simulation
R.I.P. · 404 Not Found
TorchMD: A deep learning framework for molecular simulations
R.I.P. · Ghosted
Coarse-Graining Auto-Encoders for Molecular Dynamics
R.I.P. · Ghosted
Sampling molecular conformations and dynamics in a multi-user virtual reality framework
R.I.P. · Ghosted
A Self-Attention Ansatz for Ab-initio Quantum Chemistry
Died the same way: Ghosted
R.I.P. · Ghosted
Language Models are Few-Shot Learners
R.I.P. · Ghosted
PyTorch: An Imperative Style, High-Performance Deep Learning Library
R.I.P. · Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P. · Ghosted