Predicting Primary Sequence-Based Protein-Protein Interactions Using a Mercer Series Representation of Nonlinear Support Vector Machine

Omid Chatrabgoun, Alireza Daneshkhah, Mohsen Esmaeilbeigi, Nader Sohrabi Safa, Ali H. Alenezi, Arafatur Rahman

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)
75 Downloads (Pure)

Abstract

The prediction of protein-protein interactions (PPIs) is essential to understand the cellular processes from a medical perspective. Among the various machine learning techniques, kernel-based Support Vector Machine (SVM) has been commonly employed to discriminate between interacting and non-interacting protein pairs. The main drawback of employing the kernel-based SVM to datasets with many features, such as the primary sequence-based protein-protein dataset, is the significant increase in computational time of training stage. This increase in computational time is mainly due to the presence of the kernel in solving the quadratic optimisation problem (QOP) involved in nonlinear SVM. In order to fix this issue, we propose a novel and efficient computational algorithm by approximating the kernel-based SVM using a low-rank truncated Mercer series as well as desired. As a result, the QOP for the approximated kernel-based SVM will be very tractable in the sense that there is a significant reduction in computational time of training and validating stages. We illustrate the novelty of the proposed method by predicting the PPIs of "S. Cerevisiae” where the protein features extracted using the multiscale local descriptor (MLD), and then we compare the predictive performance of the proposed low-rank approximation with the existing methods. Finally, the new method results in significant reduction in computational time for predicting PPIs with almost as accuracy as kernel-based SVM
Original languageEnglish
Pages (from-to)124345 - 124354
Number of pages10
JournalIEEE Access
Volume10
DOIs
Publication statusPublished - 1 Dec 2022

Bibliographical note

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/),
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited..

Funder

The authors extend their appreciation to the Deputyship for Research & Innovation, Ministry of Education in Saudi Arabia for funding this research work through the project number IF-2020-NBU-412.

Keywords

  • Kernel-based SVM
  • protein-protein interactions
  • quadratic optimisation problem
  • Mercer series

Fingerprint

Dive into the research topics of 'Predicting Primary Sequence-Based Protein-Protein Interactions Using a Mercer Series Representation of Nonlinear Support Vector Machine'. Together they form a unique fingerprint.

Cite this