Prof. Xiaotie Deng (邓小铁): The complexity of computing Markov perfect equilibrium in general-sum stochastic games
Academy of Mathematics and Systems Science, CAS Colloquia & Seminars
Speaker:
Prof. Xiaotie Deng (邓小铁), Peking University
Inviter:
Title:
The complexity of computing Markov perfect equilibrium in general-sum stochastic games
Language:
Chinese
Time & Venue:
2023.05.11 10:00 N204
Abstract:
Much as Markov decision processes underpin reinforcement learning, Markov games (also known as stochastic games) form the basis for the study of multi-agent reinforcement learning and sequential agent interaction. We formulate the computation of an approximate Markov perfect equilibrium in finite-state, infinite-horizon discounted stochastic games as a computational problem and prove that it is PPAD-complete. This solution concept preserves the Markov perfect property, which opens the possibility of extending successful multi-agent reinforcement learning algorithms to multi-agent dynamic games, and the result adds a new problem to those known to be PPAD-complete.
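As a rough guide to the solution concept named in the abstract, one common way to formalize an approximate Markov perfect equilibrium is sketched below. The notation is an assumption introduced here for illustration and is not taken from the talk: S is the finite state set, \gamma in [0,1) the discount factor, \pi = (\pi_i, \pi_{-i}) a profile of stationary (Markov) strategies, r_i the reward of player i, and V_i^{\pi}(s) the expected discounted return of player i starting from state s.

```latex
% Discounted value of player i under the stationary profile \pi (assumed notation)
V_i^{\pi}(s) = \mathbb{E}\left[ \sum_{t=0}^{\infty} \gamma^{t}\, r_i(s_t, a_t)
  \,\middle|\, s_0 = s,\ a_t \sim \pi(\cdot \mid s_t) \right]

% \pi is an \varepsilon-approximate Markov perfect equilibrium if no player can gain
% more than \varepsilon by a unilateral deviation, starting from any state:
V_i^{\pi}(s) \;\ge\; \max_{\pi_i'} V_i^{(\pi_i', \pi_{-i})}(s) - \varepsilon
\qquad \text{for every player } i \text{ and every state } s \in S
```

Under this reading, setting \varepsilon = 0 gives an exact Markov perfect equilibrium, and requiring the inequality at every state, rather than only at a fixed initial state, is what the "perfect" qualifier refers to.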