Limitations of Protein Structure Prediction Algorithms in Therapeutic Protein Development

Sarfaraz K. Niazi, Zamara Mariam, Rehan Z. Paracha

Research output: Contribution to journalArticlepeer-review

46 Downloads (Pure)

Abstract

The three-dimensional protein structure is pivotal in comprehending biological phenomena. It directly governs protein function and hence aids in drug discovery. The development of protein prediction algorithms, such as AlphaFold2, ESMFold, and trRosetta, has given much hope in expediting protein-based therapeutic discovery. Though no study has reported a conclusive application of these algorithms, the efforts continue with much optimism. We intended to test the application of these algorithms in rank-ordering therapeutic proteins for their instability during the pre-translational modification stages, as may be predicted according to the confidence of the structure predicted by these algorithms. The selected molecules were based on a harmonized category of licensed therapeutic proteins; out of the 204 licensed products, 188 that were not conjugated were chosen for analysis, resulting in a lack of correlation between the confidence scores and structural or protein properties. It is crucial to note here that the predictive accuracy of these algorithms is contingent upon the presence of the known structure of the protein in the accessible database. Consequently, our conclusion emphasizes that these algorithms primarily replicate information derived from existing structures. While our findings caution against relying on these algorithms for drug discovery purposes, we acknowledge the need for a nuanced interpretation. Considering their limitations and recognizing that their utility may be constrained to scenarios where known structures are available is important. Hence, caution is advised when applying these algorithms to characterize various attributes of therapeutic proteins without the support of adequate structural information. It is worth noting that the two main algorithms, AlfphaFold2 and ESMFold, also showed a 72% correlation in their scores, pointing to similar limitations. While much progress has been made in computational sciences, the Levinthal paradox remains unsolved.
Original languageEnglish
Pages (from-to)98-112
Number of pages15
JournalBioMedInformatics
Volume4
Issue number1
Early online date8 Jan 2024
DOIs
Publication statusPublished - Mar 2024

Bibliographical note

This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Keywords

  • AlphaFold2
  • ESMFold
  • Levinthal paradox
  • biosimilars
  • protein structure prediction

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Health Informatics
  • Health Professions (miscellaneous)
  • Medicine (miscellaneous)

Fingerprint

Dive into the research topics of 'Limitations of Protein Structure Prediction Algorithms in Therapeutic Protein Development'. Together they form a unique fingerprint.

Cite this