Abstract
Background
Despite the well-documented benefits of physical activity, insufficient activity remains highly prevalent among adolescents with obesity. This study is the first to apply interpretable machine learning methods to identify the barriers, facilitators, and U-shaped determinants of physical activity in this population.
Methods
We analyzed data from 1,041 adolescents with obesity from the China Education Panel Survey. A range of personal, family, and school-level variables were incorporated to construct six machine learning models for predicting physical activity attainment. The Shapley Additive Explanations (SHAP) method, a game-theoretic approach for explainable artificial intelligence, was used to identify key predictive factors and quantify their relative contributions.
Results
The Random Forest model demonstrated the best performance, achieving an accuracy of 85.30% and an AUC of 0.720 on the test set. SHAP analysis revealed several key factors associated with physical activity. Positive facilitators included parents with an education level beyond high school (≥ 3.27 for mothers, ≥ 3.51 for fathers), higher school rankings (≥ 3.77), adequate school sports facilities (≥ 1.44), and a personal interest in sports. Negative barriers included excessive screen time (≥ 5.51 h) and school location in central urban areas (≥ 4.23). Notably, U-shaped relationships were identified for academic workload, sleep problems, and self-perceived appearance. Specifically, moderate levels of these factors were associated with lower physical activity, whereas both low and high extremes promoted activity.
Conclusion
This study demonstrates that physical activity among adolescents with obesity is shaped by a complex interplay of individual, family, and school-level factors, with parental education emerging as the strongest predictor. The identification of specific risk thresholds (e.g., screen time ≥ 5.51 h) and U-shaped relationships offers precise, actionable targets for intervention. These findings underscore the need for multi-level strategies: families should prioritize fostering cultural capital, schools should ensure facility accessibility beyond regular hours, and policymakers must address environmental constraints in urban settings. To facilitate the translation of these insights into practice, a web-based tool has been deployed to help physical education teachers identify at-risk students and design targeted interventions.
Despite the well-documented benefits of physical activity, insufficient activity remains highly prevalent among adolescents with obesity. This study is the first to apply interpretable machine learning methods to identify the barriers, facilitators, and U-shaped determinants of physical activity in this population.
Methods
We analyzed data from 1,041 adolescents with obesity from the China Education Panel Survey. A range of personal, family, and school-level variables were incorporated to construct six machine learning models for predicting physical activity attainment. The Shapley Additive Explanations (SHAP) method, a game-theoretic approach for explainable artificial intelligence, was used to identify key predictive factors and quantify their relative contributions.
Results
The Random Forest model demonstrated the best performance, achieving an accuracy of 85.30% and an AUC of 0.720 on the test set. SHAP analysis revealed several key factors associated with physical activity. Positive facilitators included parents with an education level beyond high school (≥ 3.27 for mothers, ≥ 3.51 for fathers), higher school rankings (≥ 3.77), adequate school sports facilities (≥ 1.44), and a personal interest in sports. Negative barriers included excessive screen time (≥ 5.51 h) and school location in central urban areas (≥ 4.23). Notably, U-shaped relationships were identified for academic workload, sleep problems, and self-perceived appearance. Specifically, moderate levels of these factors were associated with lower physical activity, whereas both low and high extremes promoted activity.
Conclusion
This study demonstrates that physical activity among adolescents with obesity is shaped by a complex interplay of individual, family, and school-level factors, with parental education emerging as the strongest predictor. The identification of specific risk thresholds (e.g., screen time ≥ 5.51 h) and U-shaped relationships offers precise, actionable targets for intervention. These findings underscore the need for multi-level strategies: families should prioritize fostering cultural capital, schools should ensure facility accessibility beyond regular hours, and policymakers must address environmental constraints in urban settings. To facilitate the translation of these insights into practice, a web-based tool has been deployed to help physical education teachers identify at-risk students and design targeted interventions.
| Original language | English |
|---|---|
| Pages (from-to) | (In-Press) |
| Journal | BMC Public Health |
| Volume | 26 |
| Issue number | 1 |
| Early online date | 1 Apr 2026 |
| DOIs | |
| Publication status | E-pub ahead of print - 1 Apr 2026 |
Bibliographical note
This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in thisarticle are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the
article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain
permission directly from the copyright holder.
Funding
This study was supported by the Applied Basic Research Project of Changzhou (CJ20252030)
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Adolescent obesity
- Machine learning
- Weight management
- Academic workload
- Physical activity
Fingerprint
Dive into the research topics of 'Why they do not move: An explainable machine learning analysis of physical activity barriers in obese adolescents and tool translation'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS