End-to-end deep reinforcement learning control for HVAC systems in office buildings

Xuyang Zhong; Zhiang Zhang; Ruijun Zhang; Chenlu Zhang

doi:10.3390/designs6030052

Research output: Journal Publication › Article › peer-review

16 Citations (Scopus)

Abstract

The heating, ventilation, and air conditioning (HVAC) system is a major energy consumer in office buildings, and its operation is critical for indoor thermal comfort. While previous studies have indicated that reinforcement learning control can improve HVAC energy efficiency, they did not provide enough information about end-to-end control (i.e., from raw observations to ready-to-implement control signals) for centralized HVAC systems in multizone buildings due to the limitations of reinforcement learning methods or the test buildings being single zones with independent HVAC systems. This study developed a model-free end-to-end dynamic HVAC control method based on a recently proposed deep reinforcement learning framework to control the centralized HVAC system of a multizone office building. By using the deep neural network, the proposed control method could directly take measurable parameters, including weather and indoor environment conditions, as inputs and control indoor temperature setpoints at a supervisory level. In some test cases, the proposed control method could successfully learn a dynamic control policy to reduce HVAC energy consumption by 12.8% compared with the baseline case using conventional control methods, without compromising thermal comfort. However, an over-fitting problem was noted, indicating that future work should first focus on the generalization of deep reinforcement learning.

Original language	English
Article number	52
Number of pages	21
Journal	Designs
Volume	6
Issue number	3
DOIs	https://doi.org/10.3390/designs6030052
Publication status	Published - 4 Jun 2022

Keywords

HVAC control
deep reinforcement learning
thermal comfort
energy efficiency
A3C

Access to Document

10.3390/designs6030052 (Licence: CC BY)

Cite this

@article{3358db14e2cd42f783ad20ba8f250222,

title = "End-to-end deep reinforcement learning control for HVAC systems in office buildings",

abstract = "The heating, ventilation, and air conditioning (HVAC) system is a major energy consumer in office buildings, and its operation is critical for indoor thermal comfort. While previous studies have indicated that reinforcement learning control can improve HVAC energy efficiency, they did not provide enough information about end-to-end control (i.e., from raw observations to ready-to-implement control signals) for centralized HVAC systems in multizone buildings due to the limitations of reinforcement learning methods or the test buildings being single zones with independent HVAC systems. This study developed a model-free end-to-end dynamic HVAC control method based on a recently proposed deep reinforcement learning framework to control the centralized HVAC system of a multizone office building. By using the deep neural network, the proposed control method could directly take measurable parameters, including weather and indoor environment conditions, as inputs and control indoor temperature setpoints at a supervisory level. In some test cases, the proposed control method could successfully learn a dynamic control policy to reduce HVAC energy consumption by 12.8\% compared with the baseline case using conventional control methods, without compromising thermal comfort. However, an over-fitting problem was noted, indicating that future work should first focus on the generalization of deep reinforcement learning.",

keywords = "HVAC control, deep reinforcement learning, thermal comfort, energy efficiency, A3C",

author = "Xuyang Zhong and Zhiang Zhang and Ruijun Zhang and Chenlu Zhang",

year = "2022",

month = jun,

day = "4",

doi = "10.3390/designs6030052",

language = "English",

volume = "6",

journal = "Designs",

issn = "2411-9660",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "3",

}

TY - JOUR

T1 - End-to-end deep reinforcement learning control for HVAC systems in office buildings

AU - Zhong, Xuyang

AU - Zhang, Zhiang

AU - Zhang, Ruijun

AU - Zhang, Chenlu

PY - 2022/6/4

Y1 - 2022/6/4

N2 - The heating, ventilation, and air conditioning (HVAC) system is a major energy consumer in office buildings, and its operation is critical for indoor thermal comfort. While previous studies have indicated that reinforcement learning control can improve HVAC energy efficiency, they did not provide enough information about end-to-end control (i.e., from raw observations to ready-to-implement control signals) for centralized HVAC systems in multizone buildings due to the limitations of reinforcement learning methods or the test buildings being single zones with independent HVAC systems. This study developed a model-free end-to-end dynamic HVAC control method based on a recently proposed deep reinforcement learning framework to control the centralized HVAC system of a multizone office building. By using the deep neural network, the proposed control method could directly take measurable parameters, including weather and indoor environment conditions, as inputs and control indoor temperature setpoints at a supervisory level. In some test cases, the proposed control method could successfully learn a dynamic control policy to reduce HVAC energy consumption by 12.8% compared with the baseline case using conventional control methods, without compromising thermal comfort. However, an over-fitting problem was noted, indicating that future work should first focus on the generalization of deep reinforcement learning.

KW - HVAC control

KW - deep reinforcement learning

KW - thermal comfort

KW - energy efficiency

KW - A3C

U2 - 10.3390/designs6030052

DO - 10.3390/designs6030052

M3 - Article

SN - 2411-9660

VL - 6

JO - Designs

JF - Designs

IS - 3

M1 - 52

ER -
