Multi-Agent Intention Progression with Reward Machines

Michael Dann; Yuan Yao; Natasha Alechina; Brian Logan; John Thangarajah

doi:10.24963/ijcai.2022/31

Multi-Agent Intention Progression with Reward Machines

Michael Dann, Yuan Yao, Natasha Alechina, Brian Logan, John Thangarajah

School of Computer Science

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

14 Citations (Scopus)

Abstract

Recent work in multi-agent intention scheduling has shown that enabling agents to predict the actions of other agents when choosing their own actions may be beneficial. However existing approaches to 'intention-aware' scheduling assume that the programs of other agents are known, or are “similar” to that of the agent making the prediction. While this assumption is reasonable in some circumstances, it is less plausible when the agents are not co-designed. In this paper, we present a new approach to multi-agent intention scheduling in which agents predict the actions of other agents based on a high-level specification of the tasks performed by an agent in the form of a reward machine (RM) rather than on its (assumed) program. We show how a reward machine can be used to generate tree and rollout policies for an MCTS-based scheduler. We evaluate our approach in a range of multi-agent environments, and show that RM-based scheduling out-performs previous intention-aware scheduling approaches in settings where agents are not co-designed.

Original language	English
Title of host publication	Proceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022
Editors	Luc De Raedt, Luc De Raedt
Publisher	International Joint Conferences on Artificial Intelligence
Pages	215-222
Number of pages	8
ISBN (Electronic)	9781956792003
DOIs	https://doi.org/10.24963/ijcai.2022/31
Publication status	Published - 2022
Event	31st International Joint Conference on Artificial Intelligence, IJCAI 2022 - Vienna, Austria Duration: 23 Jul 2022 → 29 Jul 2022

Publication series

Name	IJCAI International Joint Conference on Artificial Intelligence
ISSN (Print)	1045-0823

Conference

Conference	31st International Joint Conference on Artificial Intelligence, IJCAI 2022
Country/Territory	Austria
City	Vienna
Period	23/07/22 → 29/07/22

ASJC Scopus subject areas

Artificial Intelligence

Access to Document

10.24963/ijcai.2022/31

Cite this

Dann, M., Yao, Y., Alechina, N., Logan, B., & Thangarajah, J. (2022). Multi-Agent Intention Progression with Reward Machines. In L. De Raedt, & L. De Raedt (Eds.), Proceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022 (pp. 215-222). (IJCAI International Joint Conference on Artificial Intelligence). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2022/31

Dann, Michael ; Yao, Yuan ; Alechina, Natasha et al. / Multi-Agent Intention Progression with Reward Machines. Proceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022. editor / Luc De Raedt ; Luc De Raedt. International Joint Conferences on Artificial Intelligence, 2022. pp. 215-222 (IJCAI International Joint Conference on Artificial Intelligence).

@inproceedings{44ab9d6803d24679a65aa37482ed3399,

title = "Multi-Agent Intention Progression with Reward Machines",

abstract = "Recent work in multi-agent intention scheduling has shown that enabling agents to predict the actions of other agents when choosing their own actions may be beneficial. However existing approaches to 'intention-aware' scheduling assume that the programs of other agents are known, or are “similar” to that of the agent making the prediction. While this assumption is reasonable in some circumstances, it is less plausible when the agents are not co-designed. In this paper, we present a new approach to multi-agent intention scheduling in which agents predict the actions of other agents based on a high-level specification of the tasks performed by an agent in the form of a reward machine (RM) rather than on its (assumed) program. We show how a reward machine can be used to generate tree and rollout policies for an MCTS-based scheduler. We evaluate our approach in a range of multi-agent environments, and show that RM-based scheduling out-performs previous intention-aware scheduling approaches in settings where agents are not co-designed.",

author = "Michael Dann and Yuan Yao and Natasha Alechina and Brian Logan and John Thangarajah",

note = "Publisher Copyright: {\textcopyright} 2022 International Joint Conferences on Artificial Intelligence. All rights reserved.; 31st International Joint Conference on Artificial Intelligence, IJCAI 2022 ; Conference date: 23-07-2022 Through 29-07-2022",

year = "2022",

doi = "10.24963/ijcai.2022/31",

language = "English",

series = "IJCAI International Joint Conference on Artificial Intelligence",

publisher = "International Joint Conferences on Artificial Intelligence",

pages = "215--222",

editor = "{De Raedt}, Luc and {De Raedt}, Luc",

booktitle = "Proceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022",

}

Dann, M, Yao, Y, Alechina, N, Logan, B & Thangarajah, J 2022, Multi-Agent Intention Progression with Reward Machines. in L De Raedt & L De Raedt (eds), Proceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022. IJCAI International Joint Conference on Artificial Intelligence, International Joint Conferences on Artificial Intelligence, pp. 215-222, 31st International Joint Conference on Artificial Intelligence, IJCAI 2022, Vienna, Austria, 23/07/22. https://doi.org/10.24963/ijcai.2022/31

Multi-Agent Intention Progression with Reward Machines. / Dann, Michael; Yao, Yuan; Alechina, Natasha et al.
Proceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022. ed. / Luc De Raedt; Luc De Raedt. International Joint Conferences on Artificial Intelligence, 2022. p. 215-222 (IJCAI International Joint Conference on Artificial Intelligence).

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Multi-Agent Intention Progression with Reward Machines

AU - Dann, Michael

AU - Yao, Yuan

AU - Alechina, Natasha

AU - Logan, Brian

AU - Thangarajah, John

PY - 2022

Y1 - 2022

N2 - Recent work in multi-agent intention scheduling has shown that enabling agents to predict the actions of other agents when choosing their own actions may be beneficial. However existing approaches to 'intention-aware' scheduling assume that the programs of other agents are known, or are “similar” to that of the agent making the prediction. While this assumption is reasonable in some circumstances, it is less plausible when the agents are not co-designed. In this paper, we present a new approach to multi-agent intention scheduling in which agents predict the actions of other agents based on a high-level specification of the tasks performed by an agent in the form of a reward machine (RM) rather than on its (assumed) program. We show how a reward machine can be used to generate tree and rollout policies for an MCTS-based scheduler. We evaluate our approach in a range of multi-agent environments, and show that RM-based scheduling out-performs previous intention-aware scheduling approaches in settings where agents are not co-designed.

AB - Recent work in multi-agent intention scheduling has shown that enabling agents to predict the actions of other agents when choosing their own actions may be beneficial. However existing approaches to 'intention-aware' scheduling assume that the programs of other agents are known, or are “similar” to that of the agent making the prediction. While this assumption is reasonable in some circumstances, it is less plausible when the agents are not co-designed. In this paper, we present a new approach to multi-agent intention scheduling in which agents predict the actions of other agents based on a high-level specification of the tasks performed by an agent in the form of a reward machine (RM) rather than on its (assumed) program. We show how a reward machine can be used to generate tree and rollout policies for an MCTS-based scheduler. We evaluate our approach in a range of multi-agent environments, and show that RM-based scheduling out-performs previous intention-aware scheduling approaches in settings where agents are not co-designed.

UR - http://www.scopus.com/inward/record.url?scp=85137905354&partnerID=8YFLogxK

U2 - 10.24963/ijcai.2022/31

DO - 10.24963/ijcai.2022/31

M3 - Conference contribution

AN - SCOPUS:85137905354

T3 - IJCAI International Joint Conference on Artificial Intelligence

SP - 215

EP - 222

BT - Proceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022

A2 - De Raedt, Luc

PB - International Joint Conferences on Artificial Intelligence

T2 - 31st International Joint Conference on Artificial Intelligence, IJCAI 2022

Y2 - 23 July 2022 through 29 July 2022

ER -

Dann M, Yao Y, Alechina N, Logan B, Thangarajah J. Multi-Agent Intention Progression with Reward Machines. In De Raedt L, De Raedt L, editors, Proceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022. International Joint Conferences on Artificial Intelligence. 2022. p. 215-222. (IJCAI International Joint Conference on Artificial Intelligence). doi: 10.24963/ijcai.2022/31

Multi-Agent Intention Progression with Reward Machines

Abstract

Publication series

Conference

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this