9th IFIP WG 12.6 and 1st IFIP WG 12.11 International Workshop on Artificial Intelligence for Knowledge Management, Energy, and Sustainability (AI4KMES) held at 30th International Joint Conference on Artificial Intelligence (IJCAI), Montreal, Canada, 19 - 26 August 2021, vol.637, pp.169-183
A proximal policy optimization reinforcement learning system is proposed to handle the energy dispatch management of a sample microgrid. The microgrid in question has 3 participants of different classifications, signifying their relative importance and how sensitive they are to energy shortages. The energy within the microgrid is generated by these participants, which are individually equipped with a solar panel and a wind turbine for energy generation, and an energy storage system to store this energy. The environmental conditions, i.e. temperature, wind velocity and irradiation figures of Istanbul are considered to obtain accurate energy generation figures. The microgrid is designed to be grid connected in order to compensate for the uncertainties caused by the weather changes, hence service of the utility is accessed when energy produced & stored cannot respond to the demand. Information security of the participants is respected and to that end, direct energy generation, consumption and storage figures are not supplied to the agent, instead only supply and demand figures are transferred. The agent, using this information, after a period of training, optimizes the system for a reward scheme that rewards energy exports and punishes energy deficits and imports. The results verify the feasibility of proximal policy optimization in managing microgrid energy dispatch.