R.I.P.
๐ป
Ghosted
PDB-Struct: A Comprehensive Benchmark for Structure-based Protein Design
November 30, 2023 ยท Declared Dead ยท ๐ arXiv.org
Repo contents: README.md
Authors
Chuanrui Wang, Bozitao Zhong, Zuobai Zhang, Narendra Chaudhary, Sanchit Misra, Jian Tang
arXiv ID
2312.00080
Category
q-bio.QM
Cross-listed
cs.LG
Citations
8
Venue
arXiv.org
Repository
https://github.com/WANG-CR/PDB-Struct
โญ 6
Last Checked
1 month ago
Abstract
Structure-based protein design has attracted increasing interest, with numerous methods being introduced in recent years. However, a universally accepted method for evaluation has not been established, since the wet-lab validation can be overly time-consuming for the development of new algorithms, and the $\textit{in silico}$ validation with recovery and perplexity metrics is efficient but may not precisely reflect true foldability. To address this gap, we introduce two novel metrics: refoldability-based metric, which leverages high-accuracy protein structure prediction models as a proxy for wet lab experiments, and stability-based metric, which assesses whether models can assign high likelihoods to experimentally stable proteins. We curate datasets from high-quality CATH protein data, high-throughput $\textit{de novo}$ designed proteins, and mega-scale experimental mutagenesis experiments, and in doing so, present the $\textbf{PDB-Struct}$ benchmark that evaluates both recent and previously uncompared protein design methods. Experimental results indicate that ByProt, ProteinMPNN, and ESM-IF perform exceptionally well on our benchmark, while ESM-Design and AF-Design fall short on the refoldability metric. We also show that while some methods exhibit high sequence recovery, they do not perform as well on our new benchmark. Our proposed benchmark paves the way for a fair and comprehensive evaluation of protein design methods in the future. Code is available at https://github.com/WANG-CR/PDB-Struct.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ q-bio.QM
R.I.P.
๐ป
Ghosted
GuacaMol: Benchmarking Models for De Novo Molecular Design
R.I.P.
๐ป
Ghosted
DeepConv-DTI: Prediction of drug-target interactions via deep learning with convolution on protein sequences
R.I.P.
๐ป
Ghosted
ProtVec: A Continuous Distributed Representation of Biological Sequences
R.I.P.
๐ป
Ghosted
A Perspective on Deep Imaging
R.I.P.
๐
404 Not Found
Deep learning in bioinformatics: introduction, application, and perspective in big data era
Died the same way โ ๐ Death by README
R.I.P.
๐
Death by README
Momentum Contrast for Unsupervised Visual Representation Learning
R.I.P.
๐
Death by README
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model
R.I.P.
๐
Death by README
Revisiting Graph based Collaborative Filtering: A Linear Residual Graph Convolutional Network Approach
R.I.P.
๐
Death by README