PhD Forum Abstract: Hybrid MCTS for Heat Pump Control

Research output: Chapter in Book/Report/Conference proceedingConference proceedingpeer-review

Abstract

Heat pump control under dynamic environmental conditions and time-varying electricity pricing poses significant challenges for energy efficiency and cost optimisation. Conventional methods such as PID controllers cannot adapt effectively to such non-stationary conditions, while model-free reinforcement learning approaches require extensive training and struggle with long-horizon constraints. This work investigates planning-based approaches, focusing on Monte Carlo Tree Search (MCTS) and its learning-augmented successor MuZero, as promising alternatives for adaptive and efficient heat pump control. A simplified immersion hot water heater environment is developed as a proof-of-concept testbed, capturing the core challenge of reaching a target temperature at a specified time while minimising energy use. Experimental evaluation shows that MCTS achieved target tracking with a 7% reduction in energy consumption compared to a bang-bang baseline, without requiring any training. In contrast, PPO, used as a baseline learning method, achieved higher efficiency (up to 19% savings) but only after extensive training and hyperparameter tuning. This work establishes planning-based control as a viable research direction for scalable, adaptive, and cost-effective energy management in built environments.
Original languageEnglish
Title of host publicationBUILDSYS '25: Proceedings of the 12th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation
PublisherACM
Pages342-343
Number of pages2
ISBN (Electronic)979-8-4007-1945-5
DOIs
Publication statusPublished - 19 Nov 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Fingerprint

Dive into the research topics of 'PhD Forum Abstract: Hybrid MCTS for Heat Pump Control'. Together they form a unique fingerprint.

Cite this