Machine Learning for Enhanced Underwriting: Predicting Premiums in Health Insurance

Nada Abdelshafy, Mohamed Abdelshafy, Mustansar Ali Ghazanfar, Rahime Belen Saglam

Research output: Chapter in Book/Report/Conference proceedingConference proceedingpeer-review

Abstract

In the evolving field of Insurance, Machine Learning (ML) has emerged as a transformative tool for enhancing the accuracy and efficiency of insurance underwriting, particularly in health insurance, a sector experiencing increased demand post-pandemic. This research develops an ML model aimed at improving premium prediction within this domain. Utilizing a health insurance database from Kaggle, we conducted a comparative analysis of several ML models - XGBoost, Random Forest, Neural Network Regression, and a baseline Linear Regression - to establish performance benchmarks. Our methodology included rigorous data analysis and application of ML model optimization techniques such as feature engineering and selection, hyperparameter tuning through Grid Search, Random Search, and Bayesian Optimization, as well as overfitting prevention strategies like Pruning and Early Stopping. The XGBoost model demonstrated superior performance, achieving a Mean Absolute Error (MAE) of 2582.3, Root Mean Square Error (RMSE) of 4625.0, and an R2 value of 86.59%. This research not only advances the application of ML in predictive premium pricing but also provides a structured approach for future studies in the insurance industry's underwriting processes.

Original languageEnglish
Title of host publication2024 4th International Conference on Robotics, Automation and Artificial Intelligence (RAAI)
PublisherIEEE
Pages330-339
Number of pages10
ISBN (Electronic)9798331520038, 9798331520021
ISBN (Print)9798331520045
DOIs
Publication statusE-pub ahead of print - 9 Apr 2025
Event4th International Conference on Robotics, Automation and Artificial Intelligence (RAAI) - Singapore, Singapore
Duration: 19 Dec 202421 Dec 2024

Publication series

Name2024 4th International Conference on Robotics, Automation and Artificial Intelligence (RAAI)
PublisherIEEE

Conference

Conference4th International Conference on Robotics, Automation and Artificial Intelligence (RAAI)
Country/TerritorySingapore
CitySingapore
Period19/12/2421/12/24

Keywords

  • Feature Selection and Engineering
  • Health Insurance
  • Hyperparameter Tuning
  • InsurTech
  • Insurance Underwriting
  • Machine Learning
  • Neural Network
  • Premium Prediction
  • Random Forest
  • Regression
  • XGBoost

ASJC Scopus subject areas

  • Computer Science Applications
  • Human-Computer Interaction
  • Mechanical Engineering
  • Control and Optimization
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Machine Learning for Enhanced Underwriting: Predicting Premiums in Health Insurance'. Together they form a unique fingerprint.

Cite this