TY - GEN
T1 - The Application of Reinforcement Learning to the FlipIt Security Game
AU - Yang, Xue
AU - Howley, Enda
AU - Schukat, Michael
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Advanced Persistent Threat is a new, sophisticated type of targeted attack which is continuous, long-living, and stealthy. It has proven to be difficult to detect and defend against in the cloud-based environment by traditional methods, calling for more advanced security technologies. FlipIt is a two-player security game where an attacker and defender compete to control a sensitive resource in advanced scenarios such as APTs. Its robustness against APT attacks is outstanding. We model the FlipIt game as a Markov Decision Process and apply reinforcement learning to the framework. The goal is to find an optimal adaptive strategy for a player to compete against any unknown opponent in a FlipIt game with incomplete information. This means the best result for a player is to maximize the ownership of the resource with minimum cost. We perform experiments on single-Agent and multi-Agent scenarios, respectively. We further extend the model to involve noisy information and consider the openness of the game. Our experimental analysis proves that in a two-player FlipIt game, an adaptive player can automatically learn and find an optimal strategy using only the last move information of the opponent, who moves with a non-Adaptive strategy (i.e. a periodic strategy with random noise). The parameters related to the random noise we considered affect the average benefit for each player. In addition, we consider the openness of the game in which new participants are introduced individually at random time steps with a certain probability. In this case, the model is generalized from two-player to n-player, and the convergence of the optimal strategy learned by each player is confirmed. Moreover, we demonstrate that varying the probability of adding an additional player does not affect the convergence but changes the average benefits for players.
AB - Advanced Persistent Threat is a new, sophisticated type of targeted attack which is continuous, long-living, and stealthy. It has proven to be difficult to detect and defend against in the cloud-based environment by traditional methods, calling for more advanced security technologies. FlipIt is a two-player security game where an attacker and defender compete to control a sensitive resource in advanced scenarios such as APTs. Its robustness against APT attacks is outstanding. We model the FlipIt game as a Markov Decision Process and apply reinforcement learning to the framework. The goal is to find an optimal adaptive strategy for a player to compete against any unknown opponent in a FlipIt game with incomplete information. This means the best result for a player is to maximize the ownership of the resource with minimum cost. We perform experiments on single-Agent and multi-Agent scenarios, respectively. We further extend the model to involve noisy information and consider the openness of the game. Our experimental analysis proves that in a two-player FlipIt game, an adaptive player can automatically learn and find an optimal strategy using only the last move information of the opponent, who moves with a non-Adaptive strategy (i.e. a periodic strategy with random noise). The parameters related to the random noise we considered affect the average benefit for each player. In addition, we consider the openness of the game in which new participants are introduced individually at random time steps with a certain probability. In this case, the model is generalized from two-player to n-player, and the convergence of the optimal strategy learned by each player is confirmed. Moreover, we demonstrate that varying the probability of adding an additional player does not affect the convergence but changes the average benefits for players.
KW - Adaptive Strategy
KW - Advanced Persistent Threats
KW - FlipIt
KW - Game Openness
KW - Random Noise
KW - Reinforcement Learning
KW - Security Games
UR - https://www.scopus.com/pages/publications/85148608338
U2 - 10.1109/Cyber-RCI55324.2022.10032686
DO - 10.1109/Cyber-RCI55324.2022.10032686
M3 - Conference Publication
T3 - 2022 Cyber Research Conference - Ireland, Cyber-RCI 2022
BT - 2022 Cyber Research Conference - Ireland, Cyber-RCI 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 Cyber Research Conference - Ireland, Cyber-RCI 2022
Y2 - 25 April 2022
ER -