TY - JOUR
T1 - An effective machine learning-based model for the prediction of protein–protein interaction sites in health systems
AU - Tahir, Muhammad
AU - Khan, Fazlullah
AU - Hayat, Maqsood
AU - Alshehri, Mohammad Dahman
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.
PY - 2024/1
Y1 - 2024/1
N2 - Protein is a vital biomolecule that accomplishes distinct biological activities by interacting with other proteins in complex biological systems. The protein–protein interaction (PPI) sites hot spot characterization holds preliminary importance in drug discovery as well as in the comprehension of the cellular signaling phenomenon. Looking at the significance of PPIs, an intelligent prediction system based on the notion of fuzzy logic “PPIs-FuzzyKNN” is developed for PPI sites identification. Here, protein sequences are transformed into an equal length of numerical descriptors by using physicochemical properties of amino acids and a position-specific scoring matrix. Here, we have utilized conventional machine learning algorithms as well as fuzzy k-nearest neighbors. The results of the model are assessed via a tenfold cross-validation test. The proposed model PPIs-FuzzyKNN obtained 91.20, 92.65, and 93.50% of accuracy on the three different datasets, namely Dtestset72, PDBtestset164, and Dset186, respectively. The results exhibited that the outcomes of the proposed model are outstanding and persistent in all datasets, so far, compared to the literature. Consequently, it will not only play a leading role in the accurate identification of PPI sites but also becomes a rudimentary tool for the research community.
AB - Protein is a vital biomolecule that accomplishes distinct biological activities by interacting with other proteins in complex biological systems. The protein–protein interaction (PPI) sites hot spot characterization holds preliminary importance in drug discovery as well as in the comprehension of the cellular signaling phenomenon. Looking at the significance of PPIs, an intelligent prediction system based on the notion of fuzzy logic “PPIs-FuzzyKNN” is developed for PPI sites identification. Here, protein sequences are transformed into an equal length of numerical descriptors by using physicochemical properties of amino acids and a position-specific scoring matrix. Here, we have utilized conventional machine learning algorithms as well as fuzzy k-nearest neighbors. The results of the model are assessed via a tenfold cross-validation test. The proposed model PPIs-FuzzyKNN obtained 91.20, 92.65, and 93.50% of accuracy on the three different datasets, namely Dtestset72, PDBtestset164, and Dset186, respectively. The results exhibited that the outcomes of the proposed model are outstanding and persistent in all datasets, so far, compared to the literature. Consequently, it will not only play a leading role in the accurate identification of PPI sites but also becomes a rudimentary tool for the research community.
KW - Fuzzy KNN
KW - PPIs
KW - PSSM
KW - Physicochemical properties
UR - http://www.scopus.com/inward/record.url?scp=85124830729&partnerID=8YFLogxK
U2 - 10.1007/s00521-022-07024-8
DO - 10.1007/s00521-022-07024-8
M3 - Article
AN - SCOPUS:85124830729
SN - 0941-0643
VL - 36
SP - 65
EP - 75
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 1
ER -