Quality-Guaranteed and Cost-Effective Population Health Profiling: A Deep Active Learning Approach

Long Chen, Jiangtao Wang, Piyushimita (Vonu) Thakuriah

Research output: Contribution to journal › Article › peer-review



Reliability and cost are the two primary considerations in profiling the population-scale prevalence (PPP) of multiple Non-Communicable Diseases (NCDs). In this paper, we exploit intra-disease and inter-disease correlations across traditionally-sensed areas (TS-A) to reduce the number of profiling tasks that must be allocated without compromising data reliability. Specifically, we propose a novel approach called Compressive Population Health TS-A Selection (CPH-TS), which blends state-of-the-art profile inference, data augmentation and active learning in a unified deep learning framework. In each profiling cycle, it actively selects a minimum number of TS-A regions for task allocation while deducing the missing data of the unprofiled regions with a probabilistic guarantee of reliability. We evaluate our approach on real-world prevalence datasets from London, and the results show the effectiveness of CPH-TS. In general, CPH-TS assigned 11.1-27.3% fewer tasks than the baselines, assigning tasks to only 34.7% of the sub-regions while keeping the profiling error below 5% for 95% of the cycles.
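The per-cycle procedure outlined in the abstract (allocate a profiling task to one more region, infer the unprofiled regions, check a reliability criterion, stop once it is met) can be illustrated with a minimal sketch. This is not the CPH-TS implementation: mean imputation stands in for the deep profile-inference model, a random pick stands in for the active-learning selection policy, and a leave-one-out check is only a crude proxy for the probabilistic reliability guarantee. All function and parameter names here are hypothetical.

```python
import random
from statistics import mean


def infer_missing(observed, n_regions):
    """Placeholder inference: fill unprofiled regions with the observed mean."""
    fill = mean(observed.values())
    return [observed.get(i, fill) for i in range(n_regions)]


def leave_one_out_errors(observed):
    """Hide each profiled region in turn and measure how well the rest predict it."""
    errors = []
    for held_out, truth in observed.items():
        rest = [v for k, v in observed.items() if k != held_out]
        errors.append(abs(mean(rest) - truth))
    return errors


def select_regions(true_prevalence, error_bound=0.05, confidence=0.95, seed=0):
    """Run one profiling cycle: add regions until the reliability proxy is met.

    Returns the indices of the profiled regions and the full inferred profile.
    """
    rng = random.Random(seed)
    n = len(true_prevalence)
    order = list(range(n))
    rng.shuffle(order)  # assumption: random order replaces the active-learning policy

    observed = {}
    for region in order:
        observed[region] = true_prevalence[region]  # allocate one profiling task
        if len(observed) < 3:
            continue  # too few samples for a meaningful leave-one-out check
        errors = leave_one_out_errors(observed)
        share_ok = sum(e <= error_bound for e in errors) / len(errors)
        if share_ok >= confidence:
            break  # reliability proxy satisfied; stop allocating tasks
    return sorted(observed), infer_missing(observed, n)


if __name__ == "__main__":
    # Synthetic prevalence values for ten sub-regions (illustrative only).
    prevalence = [0.10, 0.11, 0.09, 0.10, 0.12, 0.10, 0.11, 0.09, 0.10, 0.11]
    profiled, profile = select_regions(prevalence)
    print(f"profiled {len(profiled)} of {len(prevalence)} regions")
```

The defaults `error_bound=0.05` and `confidence=0.95` mirror the 5% error and 95% figures quoted in the abstract, although there the 95% refers to profiling cycles rather than to regions within a single cycle.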
Original language: English
Article number: 22
Pages (from-to): 1-19
Number of pages: 19
Journal: ACM Transactions on Computing for Healthcare
Issue number: 4
Early online date: 25 Aug 2023
Publication status: Published - 13 Oct 2023

Bibliographical note

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honoured. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org
Copyright © and Moral Rights are retained by the author(s) and/or other copyright owners. A copy can be downloaded for personal non-commercial research or study, without prior permission or charge. This item cannot be reproduced or quoted extensively from without first obtaining permission in writing from the copyright holder(s). The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the copyright holders.

This document is the author’s post-print version, incorporating any revisions agreed during the peer-review process. Some differences between the published version and this version may remain and you are advised to consult the published version if you wish to cite from it.


This work was supported by EPSRC New Investigator Award under Grant No EP/V043544/1.


  • Profiling of Prevalence
  • Spatio-temporal Correlations
  • Computer Science Applications
  • Generative Adversarial Network
  • Convolutional Neural Networks (CNN)

