Generative AI-Powered Synthetic Data for Enhancing Predictive Analytics in Blood Donation Supply Management: A Comparative Study of Machine Learning Models

Koh Chee Hong, Thong Chee Ling, Shayla Islam, Raenu AL Kolandaisamy , Abdul Samad Shibghatullah, Nazirul Nazrin Shahrol Nidzam , Samer Sarsam, Halimah Badioze Zaman

Research output: Contribution to journalArticlepeer-review

9 Downloads (Pure)

Abstract

Maintaining a sufficient and timely blood supply is an urgent and critical challenge in public health, where even minor miscalculations can lead to life-threatening shortages. This study evaluates the performance of machine learning models to improve blood donation forecasting. Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs) generated synthetic datasets that mirror real-world donation patterns to address data scarcity and variability issues. Leveraging transactional data from the Blood Bank Information System (BBISv2), a blood tracking system used by 22 main blood collection sites under the Ministry of Health (MoH) in Malaysia, 50 synthetic datasets were created and validated to ensure consistency with real data. The synthetic data showed minimal deviations from real data across key metrics, including mean (differences under 10%), variance (1 to 2 units), and skewness and kurtosis (0.03 or less). Among the models, the Random Forest algorithm demonstrated the highest performance, achieving an accuracy of 98.7%, a precision of 0.91, and an Area Under the Receiver Operating Characteristic (AUC-ROC) score of 0.92, making it the most reliable for predicting blood donation rates. Linear Regression also performed well, with an accuracy of 98.6%, while Neural Networks and Support Vector Machines (SVM) showed lower performance. This research provides a valuable tool for optimizing blood donation strategies, particularly in scenarios where real data is limited. Integrating validated synthetic data offers a novel approach for enhancing resource management in healthcare, ensuring reliable blood supply during high-demand periods.
Original languageEnglish
Pages (from-to)9-19
Number of pages11
JournalInternational Journal on Advanced Science, Engineering and Information Technology
Volume15
Issue number1
Early online date8 Feb 2025
DOIs
Publication statusPublished - 28 Feb 2025

Bibliographical note

International Journal on Advanced Science, Engineering and Information Technology (IJASEIT) publishes fully open access journals, which means that all articles are available on the internet to all users immediately upon publication. Non-commercial use and distribution in any medium is permitted, provided the author and the journal are properly credited. https://ijaseit.insightsociety.org/index.php/ijaseit/authorguidelines

Publisher Copyright:
© (2025), (Insight Society). All rights reserved.

Funding

This study benefited from the valuable contributions of several individuals. Special thanks are due to Abdul Samad Bin Shibghatullah from UCSI University Kuala Lumpur for his technical expertise in data analysis and methodology, and to Chloe Thong Chee Ling for her insights on data modeling. Nazirul Nazrin Bin Shahrol Nidzam from Universiti Tenaga Nasional provided critical feedback on data validation, while Samer Muthana Sarsam from Coventry University offered valuable perspectives on strategic direction. Appreciation is also extended to Shayla Islam and Raenu AL Kolandaisamy for their guidance and manuscript review. Additionally, Halimah Badioze Zaman, Chair Holder of the Tan Sri Leo Moggie Distinguished Chair in Energy Informatics at UNITEN, is acknowledged for her advice and support throughout the research process. This study was conducted without external funding from public, commercial, or not-for-profit sectors. The findings, interpretations, and conclusions expressed are solely those of the authors and do not necessarily reflect the views of their affiliated institutions

FundersFunder number
UCSI University
Universiti Tenaga Nasional
Coventry University

    Keywords

    • Blood donation forecasting
    • predictive analytics and visualization
    • generative AI
    • random forest
    • public health

    Fingerprint

    Dive into the research topics of 'Generative AI-Powered Synthetic Data for Enhancing Predictive Analytics in Blood Donation Supply Management: A Comparative Study of Machine Learning Models'. Together they form a unique fingerprint.

    Cite this