What is it about?

Recent advancements in multi-agent reinforcement learning (MARL) have opened up vast application prospects, such as swarm control of drones, collaborative manipulation by robotic arms, and multi-target encirclement. However, potential security threats that arise when MARL systems are deployed demand closer attention and thorough investigation.


Why is it important?

Recent research reveals that attackers can rapidly exploit a victim's vulnerabilities, generating adversarial policies that cause specific tasks to fail: for instance, adversarial policies have reduced the win rate of a superhuman-level Go AI to around 20%. However, existing studies predominantly focus on two-player competitive environments and assume that attackers have complete observation of the global state.

Perspectives

In this study, we show for the first time that attackers can generate adversarial policies even when restricted to partial observations of the victims in multi-agent competitive environments. Specifically, we propose a novel black-box attack (SUB-PLAY) that constructs multiple subgames to mitigate the impact of partial observability and shares transitions among subpolicies to improve the attacker's exploitative ability. Extensive evaluations demonstrate the effectiveness of SUB-PLAY under three typical partial-observability limitations. Visualization results indicate that adversarial policies induce significantly different activations in the victims' policy networks. Furthermore, we evaluate three potential defenses aimed at mitigating the security threats posed by adversarial policies, providing constructive recommendations for deploying MARL in competitive environments.
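The core idea of partitioning by partial observability and sharing transitions can be sketched roughly as follows. This is an illustrative simplification, not the paper's exact algorithm: the `SubgameBuffers` class, the indexing of subgames by the number of currently observed victims, and the neighbor-sharing rule are all assumptions made for the example.

```python
from collections import defaultdict

class SubgameBuffers:
    """Illustrative sketch of the subgame idea: each subgame corresponds to
    how many victims the attacker can currently observe, and each subgame's
    subpolicy trains on its own replay buffer. Transitions are additionally
    shared with adjacent subgames to densify their training data."""

    def __init__(self, num_victims):
        self.num_victims = num_victims
        self.buffers = defaultdict(list)  # subgame id -> list of transitions

    def add(self, obs_mask, transition):
        # obs_mask[i] == 1 if victim i is observable at this step
        k = sum(obs_mask)  # subgame index: number of observed victims
        self.buffers[k].append(transition)
        # share the transition with neighboring subgames (an assumed rule)
        for j in (k - 1, k + 1):
            if 0 <= j <= self.num_victims:
                self.buffers[j].append(transition)

# Usage: a step in which only one of two victims is visible
sb = SubgameBuffers(num_victims=2)
sb.add([1, 0], ("state", "action", 0.0, "next_state"))
```

Here a single experience tuple populates subgame 1 directly and subgames 0 and 2 through sharing, so subpolicies for rarely visited observability levels still receive training data.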

Oubo Ma
Zhejiang University

Read the Original

This page is a summary of: SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent Reinforcement Learning Systems, December 2024, ACM (Association for Computing Machinery). DOI: 10.1145/3658644.3670293. You can read the full text via the DOI.
