Enhancing HVAC Control Efficiency: A Hybrid Approach Using Imitation and Reinforcement Learning

Kevlyn Kadamala, Des Chambers, Enda Barrett

    Research output: Chapter in Book or Conference Publication/Proceeding; Conference Publication; peer-review

    1 Citation (Scopus)

    Abstract

    This paper explores the application of imitation learning (IL) and reinforcement learning (RL) in HVAC control. IL learns to perform tasks by imitating a demonstrator, utilising a dataset of demonstrations. However, the performance of IL is highly dependent on the quality of the expert demonstration data. On the other hand, RL can adapt control policies based on different objectives, but for larger problems, it can be sample inefficient, requiring significant time and resources for training. To overcome the limitations of both RL and IL, we propose a combined methodology where IL is used for pre-training and RL for fine-tuning. We introduce a fine-tuning methodology to HVAC control inspired by a robot navigation task. Using the 5-Zone residential building environment provided by Sinergym, we collect state-action pairs from interactions with the environment using a rule-based policy to create a dataset of expert demonstrations. Our experiments show that this combined methodology improves the efficiency and performance of the RL agent by 1% to 11.35% compared to existing literature. This study contributes to the ongoing discourse on how imitation learning can enhance the performance of reinforcement learning in building control systems.
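
    The pipeline described in the abstract (collect demonstrations from a rule-based policy, pre-train by imitation, then fine-tune against reward) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: `ToyZone` is a hypothetical one-zone thermal model and not Sinergym's API, the policy is a three-weight linear function, and the fine-tuning step uses simple hill-climbing policy search as a stand-in for whichever RL algorithm the paper employs. It shows the shape of the workflow only and does not reproduce the authors' implementation or results.

```python
import random


class ToyZone:
    """Hypothetical single-zone thermal model (a stand-in, not Sinergym)."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        self.indoor = self.rng.uniform(15.0, 28.0)
        self.outdoor = self.rng.uniform(0.0, 35.0)
        return (self.indoor, self.outdoor)

    def step(self, power):
        power = max(-1.0, min(1.0, power))  # clip to [-1, 1]; negative cools
        self.indoor += 0.1 * (self.outdoor - self.indoor) + 2.0 * power
        self.outdoor += self.rng.uniform(-0.5, 0.5)
        # Reward trades off comfort (distance from 21 C) against energy use.
        reward = -(abs(self.indoor - 21.0) + 0.5 * abs(power))
        return (self.indoor, self.outdoor), reward


def rule_based_policy(state):
    """Thermostat-style expert: heat below 20 C, cool above 23 C."""
    indoor, _ = state
    if indoor < 20.0:
        return 1.0
    if indoor > 23.0:
        return -1.0
    return 0.0


def features(state):
    """Bias term plus roughly normalised indoor and outdoor temperatures."""
    indoor, outdoor = state
    return (1.0, (indoor - 21.0) / 5.0, (outdoor - 17.0) / 10.0)


def linear_policy(weights, state):
    return sum(w * x for w, x in zip(weights, features(state)))


def collect_demonstrations(policy, episodes=20, horizon=50, seed=0):
    """Roll out the expert and record (state, action) pairs."""
    env, data = ToyZone(seed), []
    for _ in range(episodes):
        state = env.reset()
        for _ in range(horizon):
            action = policy(state)
            data.append((state, action))
            state, _ = env.step(action)
    return data


def behaviour_cloning(data, epochs=30, lr=0.01):
    """Pre-training: fit the linear policy to expert actions by SGD on MSE."""
    weights = [0.0, 0.0, 0.0]
    for _ in range(epochs):
        for state, action in data:
            error = linear_policy(weights, state) - action
            x = features(state)
            weights = [w - lr * error * xi for w, xi in zip(weights, x)]
    return weights


def evaluate(weights, episodes=5, horizon=50, seed=123):
    """Mean episodic return on a fixed-seed environment (deterministic)."""
    env, total = ToyZone(seed), 0.0
    for _ in range(episodes):
        state = env.reset()
        for _ in range(horizon):
            state, reward = env.step(linear_policy(weights, state))
            total += reward
    return total / episodes


def fine_tune(weights, iterations=200, sigma=0.05, seed=1):
    """Fine-tuning stand-in: keep random perturbations that raise the return."""
    rng = random.Random(seed)
    best, best_return = list(weights), evaluate(weights)
    for _ in range(iterations):
        candidate = [w + rng.gauss(0.0, sigma) for w in best]
        candidate_return = evaluate(candidate)
        if candidate_return > best_return:
            best, best_return = candidate, candidate_return
    return best, best_return


demos = collect_demonstrations(rule_based_policy)
pretrained = behaviour_cloning(demos)
pretrained_return = evaluate(pretrained)
finetuned, finetuned_return = fine_tune(pretrained)
print(f"pre-trained return: {pretrained_return:.2f}")
print(f"fine-tuned return:  {finetuned_return:.2f}")
```

    Because the fine-tuning loop starts from the cloned weights and accepts only improvements on the same fixed-seed evaluation, the fine-tuned return in this sketch can never fall below the pre-trained one. A real RL fine-tuning stage offers no such guarantee.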

    Original language: English
    Title of host publication: Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track - European Conference, ECML PKDD 2024, Proceedings
    Editors: Albert Bifet, Tomas Krilavičius, Ioanna Miliou, Slawomir Nowaczyk
    Publisher: Springer Science and Business Media Deutschland GmbH
    Pages: 256-270
    Number of pages: 15
    ISBN (Print): 9783031703775
    Publication status: Published - 2024
    Event: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2024 - Vilnius, Lithuania
    Duration: 9 Sep 2024 - 13 Sep 2024

    Publication series

    Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume: 14949 LNAI
    ISSN (Print): 0302-9743
    ISSN (Electronic): 1611-3349

    Conference

    Conference: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2024
    Country/Territory: Lithuania
    City: Vilnius
    Period: 9/09/24 - 13/09/24

    Keywords

    • Continuous HVAC control
    • Imitation learning
    • Reinforcement learning
