A practical deep reinforcement learning framework for multivariate occupant-centric control in buildings

Yue Lei; Sicheng Zhan; Eikichi Ono; Yuzhen Peng; Zhiang Zhang; Takamasa Hasama; Adrian Chong

doi:10.1016/j.apenergy.2022.119742

A practical deep reinforcement learning framework for multivariate occupant-centric control in buildings

Yue Lei, Sicheng Zhan, Eikichi Ono, Yuzhen Peng, Zhiang Zhang, Takamasa Hasama, Adrian Chong

Department of Architecture and Built Environment

Research output: Journal Publication › Article › peer-review

74 Citations (Scopus)

Abstract

Reinforcement learning (RL) has been shown to have the potential for optimal control of heating, ventilation, and air conditioning (HVAC) systems. Although research on RL-based building control has received extensive attention in recent years, there is limited real-world implementation to evaluate its performance while keeping occupants in the loop. Additionally, many HVAC systems consist of multiple subsystems, but conventional RL algorithms face significant challenges when dealing with high-dimensional action spaces. This study proposes a practical deep reinforcement learning (DRL) based multivariate occupant-centric control framework that considers personalized thermal comfort and occupant presence. Specifically, Branching Dueling Q-network (BDQ) is leveraged as the learning agent to efficiently solve the multi-dimensional control task, and a tabular-based personal comfort modeling method is applied that is naturally integrated into human-in-the-loop operations. The BDQ agent is pre-trained in a virtual environment, followed by online deployment in a real office space for 5-dimensional action control. Based on the actual deployment and real-time comfort votes, our results showed a 14% reduction in cooling energy and an 11% improvement in total thermal acceptability.

Original language	English
Article number	119742
Number of pages	18
Journal	Applied Energy
Volume	324
DOIs	https://doi.org/10.1016/j.apenergy.2022.119742
Publication status	Published - 15 Oct 2022

Keywords

Occupant-centric control
Deep learning
Reinforcement learning
Thermal comfort
Energy efficiency

ASJC Scopus subject areas

Building and Construction
Mechanical Engineering
General Energy
Management, Monitoring, Policy and Law

Access to Document

10.1016/j.apenergy.2022.119742

https://linkinghub.elsevier.com/retrieve/pii/S0306261922010297

Cite this

@article{11ddb0fcfafb41adb69f1928b3881863,

title = "A practical deep reinforcement learning framework for multivariate occupant-centric control in buildings",

abstract = "Reinforcement learning (RL) has been shown to have the potential for optimal control of heating, ventilation, and air conditioning (HVAC) systems. Although research on RL-based building control has received extensive attention in recent years, there is limited real-world implementation to evaluate its performance while keeping occupants in the loop. Additionally, many HVAC systems consist of multiple subsystems, but conventional RL algorithms face significant challenges when dealing with high-dimensional action spaces. This study proposes a practical deep reinforcement learning (DRL) based multivariate occupant-centric control framework that considers personalized thermal comfort and occupant presence. Specifically, Branching Dueling Q-network (BDQ) is leveraged as the learning agent to efficiently solve the multi-dimensional control task, and a tabular-based personal comfort modeling method is applied that is naturally integrated into human-in-the-loop operations. The BDQ agent is pre-trained in a virtual environment, followed by online deployment in a real office space for 5-dimensional action control. Based on the actual deployment and real-time comfort votes, our results showed a 14% reduction in cooling energy and an 11% improvement in total thermal acceptability.",

keywords = "Occupant-centric control, Deep learning, Reinforcement learning, Thermal comfort, Energy efficiency",

author = "Yue Lei and Sicheng Zhan and Eikichi Ono and Yuzhen Peng and Zhiang Zhang and Takamasa Hasama and Adrian Chong",

note = "Publisher Copyright: {\textcopyright} 2022 Elsevier Ltd",

year = "2022",

month = oct,

day = "15",

doi = "10.1016/j.apenergy.2022.119742",

language = "English",

volume = "324",

journal = "Applied Energy",

issn = "0306-2619",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - A practical deep reinforcement learning framework for multivariate occupant-centric control in buildings

AU - Lei, Yue

AU - Zhan, Sicheng

AU - Ono, Eikichi

AU - Peng, Yuzhen

AU - Zhang, Zhiang

AU - Hasama, Takamasa

AU - Chong, Adrian

PY - 2022/10/15

Y1 - 2022/10/15

N2 - Reinforcement learning (RL) has been shown to have the potential for optimal control of heating, ventilation, and air conditioning (HVAC) systems. Although research on RL-based building control has received extensive attention in recent years, there is limited real-world implementation to evaluate its performance while keeping occupants in the loop. Additionally, many HVAC systems consist of multiple subsystems, but conventional RL algorithms face significant challenges when dealing with high-dimensional action spaces. This study proposes a practical deep reinforcement learning (DRL) based multivariate occupant-centric control framework that considers personalized thermal comfort and occupant presence. Specifically, Branching Dueling Q-network (BDQ) is leveraged as the learning agent to efficiently solve the multi-dimensional control task, and a tabular-based personal comfort modeling method is applied that is naturally integrated into human-in-the-loop operations. The BDQ agent is pre-trained in a virtual environment, followed by online deployment in a real office space for 5-dimensional action control. Based on the actual deployment and real-time comfort votes, our results showed a 14% reduction in cooling energy and an 11% improvement in total thermal acceptability.

AB - Reinforcement learning (RL) has been shown to have the potential for optimal control of heating, ventilation, and air conditioning (HVAC) systems. Although research on RL-based building control has received extensive attention in recent years, there is limited real-world implementation to evaluate its performance while keeping occupants in the loop. Additionally, many HVAC systems consist of multiple subsystems, but conventional RL algorithms face significant challenges when dealing with high-dimensional action spaces. This study proposes a practical deep reinforcement learning (DRL) based multivariate occupant-centric control framework that considers personalized thermal comfort and occupant presence. Specifically, Branching Dueling Q-network (BDQ) is leveraged as the learning agent to efficiently solve the multi-dimensional control task, and a tabular-based personal comfort modeling method is applied that is naturally integrated into human-in-the-loop operations. The BDQ agent is pre-trained in a virtual environment, followed by online deployment in a real office space for 5-dimensional action control. Based on the actual deployment and real-time comfort votes, our results showed a 14% reduction in cooling energy and an 11% improvement in total thermal acceptability.

KW - Occupant-centric control

KW - Deep learning

KW - Reinforcement learning

KW - Thermal comfort

KW - Energy efficiency

UR - http://www.scopus.com/inward/record.url?scp=85135701106&partnerID=8YFLogxK

U2 - 10.1016/j.apenergy.2022.119742

DO - 10.1016/j.apenergy.2022.119742

M3 - Article

SN - 0306-2619

VL - 324

JO - Applied Energy

JF - Applied Energy

M1 - 119742

ER -

A practical deep reinforcement learning framework for multivariate occupant-centric control in buildings

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this