Abstract
Reliability and cost are two primary considerations for profiling population-scale prevalence ( PPP ) of multiple non-communicable diseases ( NCDs ). In this paper, we exploit intra-disease and inter-disease correlation in different traditionally-sensed-areas ( TS-A ) to reduce the number of profiling tasks required without compromising data reliability. Specifically, we propose a novel approach called Compressive Population Health TS-A Selection ( CPH-TS ), which blends the state-of-the-art profile inference, data augmentation and active learning in a unified deep learning framework. It can actively select the minimum number of TS-A regions for profiling task allocation in each profiling cycle, while deducing the missing data on the unprofiled regions with a probabilistic guarantee of reliability. We evaluate our approach on real-world prevalence datasets of London, which shows the effectiveness of CPH-TS . In general, CPH-TS assigned 11.1-27.3% fewer tasks than baselines, assigning tasks to only 34.7% of the sub-regions while the profiling error was below 5% for 95% of the cycles.
Original language | English |
---|---|
Article number | 22 |
Pages (from-to) | 1-19 |
Number of pages | 19 |
Journal | ACM Transactions on Computing for Healthcare |
Volume | 4 |
Issue number | 4 |
Early online date | 25 Aug 2023 |
DOIs | |
Publication status | Published - 13 Oct 2023 |
Bibliographical note
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honoured. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]Copyright © and Moral Rights are retained by the author(s) and/ or other copyright owners. A copy can be downloaded for personal non-commercial research or study, without prior permission or charge. This item cannot be reproduced or quoted extensively from without first obtaining permission in writing from the copyright holder(s). The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the copyright holders.
This document is the author’s post-print version, incorporating any revisions agreed during the peer-review process. Some differences between the published version and this version may remain and you are advised to consult the published version if you wish to cite from it.
Funder
This work was supported by EPSRC New Investigator Award under Grant No EP/V043544/1.Keywords
- Profiling of Prevalence
- Spatio-temporal Correlations
- Computer Science Applications
- Generative Adversarial Network,
- Convolutional Neural Networks (CNN)