Abstract
Heat pump control under dynamic environmental conditions and time-varying electricity pricing poses significant challenges for energy efficiency and cost optimisation. Conventional methods such as PID controllers cannot adapt effectively to such non-stationary conditions, while model-free reinforcement learning approaches require extensive training and struggle with long-horizon constraints. This work investigates planning-based approaches, focusing on Monte Carlo Tree Search (MCTS) and its learning-augmented successor MuZero, as promising alternatives for adaptive and efficient heat pump control. A simplified immersion hot water heater environment is developed as a proof-of-concept testbed, capturing the core challenge of reaching a target temperature at a specified time while minimising energy use. Experimental evaluation shows that MCTS achieved target tracking with a 7% reduction in energy consumption compared to a bang-bang baseline, without requiring any training. In contrast, PPO, used as a baseline learning method, achieved higher efficiency (up to 19% savings) but only after extensive training and hyperparameter tuning. This work establishes planning-based control as a viable research direction for scalable, adaptive, and cost-effective energy management in built environments.
| Original language | English |
|---|---|
| Title of host publication | BUILDSYS '25: Proceedings of the 12th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation |
| Publisher | ACM |
| Pages | 342-343 |
| Number of pages | 2 |
| ISBN (Electronic) | 979-8-4007-1945-5 |
| DOIs | |
| Publication status | Published - 19 Nov 2025 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Fingerprint
Dive into the research topics of 'PhD Forum Abstract: Hybrid MCTS for Heat Pump Control'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS