论文

Research

Large Sequence Models for Sequential Decision-Making: A Survey.

Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Haifeng Zhang, Weinan Zhang

arXiv PDF

Research

Learning Robust Communication by Adversarial Training in Networked System Control.

Runji Lin, Haifeng Zhang

CCSICC 2023

Research

Boosting Studies of Multi-Agent Reinforcement Learning on Google Research Football Environment: the Past, Present, and Future.

Yan Song, He Jiang, Haifeng Zhang, Zheng Tian, Weinan Zhang, Jun Wang

AAMAS 2024 PDF

Research

Offline Hierarchical Reinforcement Learning: Enable Large-scale Training in HRL.

Yuqiao Wu, Haifeng Zhang, Jun Wang

CCSICC 2023

Research

TaxAI: A Dynamic Economic Simulator and Benchmark for Multi-Agent Reinforcement Learning.

Qirui Mi, Siyu Xia, Yan Song, Haifeng Zhang, Shenghao Zhu, Jun Wang

AAMAS 2024 PDF

Research

Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models.

Xue Yan, Yan Song, Xinyu Cui, Haifeng Zhang, Jun Wang

arXiv PDF

Research

An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination.

Xue Yan, Jiaxian Guo, Xingzhou Lou, Jun Wang, Haifeng Zhang, Yali Du

NeurIPS 2023 PDF

Research

A Generative Model for Game Theory with Flow Equilibrium.

Zhiyu Zhao, Renyuan Xu, Haifeng Zhang, Jun Wang, Yaodong Yang

Open Review PDF

Research

Imitation Learning for Mean Field Games with Correlated Equilibria | Research Square.

Zhiyu Zhao, Renyuan Xu, Haifeng Zhang, Jun Wang, Yaodong Yang

Research Square PDF

Research

An Empirical Study on Google Research Football Multi-agent Scenarios.

Yan Song, He Jiang, Zheng Tian, Haifeng Zhang, Yingping Zhang, Jiangcheng Zhu, Zonghong Dai, Weinan Zhang, Jun Wang

Machine Intelligence Research 2022 PDF

Research

Offline Multi-agent Decision Transformer.

Linghui Meng, Muning Wen, Chenyang Le, Xiyun Li, Dengpeng Xing, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Yaodong Yang, Bo Xu

Machine Intelligence Research 2022 PDF

Research

Contextual Transformer for Offline Meta Reinforcement Learning.

Runji Lin, Ye Li, Xidong Feng, Zhaowei Zhang, Xian Hong Wu Fung, Haifeng Zhang, Jun Wang, Yali Du, Yaodong Yang

NeurIPS2022 workshop PDF

Research

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem.

Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang

NeurIPS 2022 PDF

Research

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning.

Bo Liu, Xidong Feng, Jie Ren, Luo Mai, Rui Zhu, Haifeng Zhang, Jun Wang, Yaodong Yang

NeurIPS 2022 PDF

Research

GCS: Graph-Based Coordination Strategy for Multi-Agent Reinforcement Learning.

Jingqing Ruan, Yali Du, Xuantang Xiong, Dengpeng Xing, Xiyun Li, Linghui Meng, Haifeng Zhang, Jun Wang, Bo Xu

AAMAS 2022 PDF

Research

Learning to Identify Top Elo Ratings as A Dueling Bandits Problem.

Xue Yan, Yali Du, Binxin Ru, Jun Wang, Haifeng Zhang, Xu Chen

AAAI 2022 PDF

Research

A Review: Machine Learning for Combinatorial Optimization Problems in Energy Areas.

Xinyi Yang, Ziyi Wang, Hengxi Zhang, Nan Ma, Ning Yang, Hualin Liu, Haifeng Zhang, Lei Yang

Algorithms 2022 PDF

Research

A Game-theoretic Approach for Improving Generalization Ability of TSP Solvers.

Chenguang Wang, Yaodong Yang, Oliver Slumbers, Congying Han, Tiande Guo, Haifeng Zhang, Jun Wang

arXiv 2021 PDF

Research

Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning.

Liheng Chen, Hongyi Guo, Haifeng Zhang, Fei Fang, Yaoming Zhu, Ming Zhou, Qing Wang, Weinan Zhang, Yong Yu

DAI 2021 PDF

Research

Settling the Variance of Multi-Agent Policy Gradients.

Jakub Grudzien Kuba, Muning Wen, Yaodong Yang, Linghui Meng, Shangding Gu, Haifeng Zhang, David Henry Mguni, Jun Wang

NeurIPS 2021 PDF

Research

Joint caching and transmission in the mobile edge network: A multi-agent learning approach.

Qirui Mi, Ning Yang, Haifeng Zhang, Haijun Zhang, Jun Wang

Globecom 2021 PDF

Research

Learning in Nonzero-Sum Stochastic Games with Potentials.

David Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang

ICML 2021 PDF

Research

Estimating α-Rank from A Few Entries with Low Rank Matrix Completion.

Yali Du, Xue Yan, Xu Chen, Haifeng Zhang, Jun Wang

ICML 2021 PDF

Research

Learning Correlated Communication Topology in Multi-Agent Reinforcement Learning.

Yali Du, Bo Liu, Vincent Moens, Ziqi Liu, Zhicheng Ren, Jun Wang, Xu Chen, Haifeng Zhang

AAMAS 2021 PDF

项目

Research

“及第”多智能体开源开放平台

“及第”复现几十种主流(多智能体)强化学习基线算法,并接入几十种主流博弈环境,为新算法的研究提供方便的在线实验平台。

网址

Research

RLChina强化学习社区

RLChina强化学习社区讨论强化学习、博弈论、多智能体系统相关的学术、产业问题。

网址