A non-invasive machine learning mechanism for early disease recognition on Twitter: The case of anemia

Samer Sarsam, Hosam Al-Samarraie, Ahmed Ibrahim Alzahrani, Abdul Samad Shibghatullah

Research output: Contribution to journalArticlepeer-review

7 Citations (Scopus)
60 Downloads (Pure)


Social media sites, such as Twitter, provide the means for users to share their stories, feelings, and health conditions during the disease course. Anemia, the most common type of blood disorder, is recognized as a major public health problem all over the world. Yet very few studies have explored the potential of recognizing anemia from online posts. This study proposed a novel mechanism for recognizing anemia based on the associations between disease symptoms and patients’ emotions posted on the Twitter platform. We used k-means and Latent Dirichlet Allocation (LDA) algorithms to group similar tweets and to identify hidden disease topics. Both disease emotions and symptoms were mapped using the Apriori algorithm. The proposed approach was evaluated using a number of classifiers. A higher prediction accuracy of 98.96 % was achieved using Sequential Minimal Optimization (SMO). The results revealed that fear and sadness emotions are dominant among anemic patients. The proposed mechanism is the first of its kind to diagnose anemia using textual information posted on social media sites. It can advance the development of intelligent health monitoring systems and clinical decision-support systems.
Original languageEnglish
Article number102428
Number of pages12
JournalArtificial Intelligence in Medicine
Early online date19 Oct 2022
Publication statusPublished - Dec 2022

Bibliographical note

This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).


This work was funded by the Researchers Supporting Project number [RSP-2021/157], King Saud University, Riyadh, Saudi Arabia.


  • Health monitoring systems
  • Anemia recognition
  • Lexicon-based approach
  • Twitter
  • Machine learning


Dive into the research topics of 'A non-invasive machine learning mechanism for early disease recognition on Twitter: The case of anemia'. Together they form a unique fingerprint.

Cite this