Abstract: This paper explores the use of Temporal Difference (TD) learning algorithm to optimize the Automatic Generation Control (AGC) of a multi-area thermal power system. The main challenge ...