Abstract
In this paper, we present our work on deep reinforcement learning (DRL) based intelligent control of Heating, Ventilation, and Air Conditioning (HVAC) with the goal of reducing carbon emission. We performed this task using 1) Marginal Operating Emission Rates (MOER), where the objective was to shift the demand to the low emission period of the day and 2) Time-Of-Use (TOU) demand-response price where the objective was to shift the demand to low price period of the day. This was achieved by learning an optimal pre-cooing strategy. We found the carbon emission reduction in the range of ≈ 6%-16% depending on the opportunity presented by the MOER signal. Similarly, we observed the carbon emission reduction in the range of ≈23%-29% during the peak price period when TOU price was used. The results clearly demonstrated the applicability of our approach in reducing the carbon footprint of the building.