RAFP-Pred: Robust Prediction of Antifreeze Proteins using Localized Analysis of n-Peptide Compositions
September 25, 2018 Β· Declared Dead Β· π IEEE/ACM Transactions on Computational Biology & Bioinformatics
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Shujaat Khan, Imran Naseem, Roberto Togneri, Mohammed Bennamoun
arXiv ID
1809.09620
Category
q-bio.BM
Cross-listed
cs.LG,
stat.AP
Citations
31
Venue
IEEE/ACM Transactions on Computational Biology & Bioinformatics
Last Checked
1 month ago
Abstract
In extreme cold weather, living organisms produce Antifreeze Proteins (AFPs) to counter the otherwise lethal intracellular formation of ice. Structures and sequences of various AFPs exhibit a high degree of heterogeneity, consequently the prediction of the AFPs is considered to be a challenging task. In this research, we propose to handle this arduous manifold learning task using the notion of localized processing. In particular an AFP sequence is segmented into two sub-segments each of which is analyzed for amino acid and di-peptide compositions. We propose to use only the most significant features using the concept of information gain (IG) followed by a random forest classification approach. The proposed RAFP-Pred achieved an excellent performance on a number of standard datasets. We report a high Youden's index (sensitivity+specificity-1) value of 0.75 on the standard independent test data set outperforming the AFP-PseAAC, AFP\_PSSM, AFP-Pred and iAFP by a margin of 0.05, 0.06, 0.14 and 0.68 respectively. The verification rate on the UniProKB dataset is found to be 83.19\% which is substantially superior to the 57.18\% reported for the iAFP method.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β q-bio.BM
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Protein secondary structure prediction using deep convolutional neural fields
R.I.P.
π»
Ghosted
Protein structure generation via folding diffusion
R.I.P.
π
404 Not Found
LinearFold: linear-time approximate RNA folding by 5'-to-3' dynamic programming and beam search
R.I.P.
π»
Ghosted
What is a meaningful representation of protein sequences?
R.I.P.
π»
Ghosted
Protein Secondary Structure Prediction Using Cascaded Convolutional and Recurrent Neural Networks
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Language Models are Few-Shot Learners
R.I.P.
π»
Ghosted
PyTorch: An Imperative Style, High-Performance Deep Learning Library
R.I.P.
π»
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
π»
Ghosted