论文

Research

ARIES: Stimulating Self-Refinement of Large Language Models by Iterative Preference Optimization.

Yongcheng Zeng, Xinyu Cui, Xuanfa Jin, Guoqing Liu, Zexu Sun, Quan He, Dong Li, Ning Yang, Jianye Hao, Haifeng Zhang, Jun Wang

arXiv PDF

Research

Ask More, Know Better: Reinforced-learned Prompt Questions for Decision Making with Large Language Models.

Xue Yan, Yan Song, Xinyu Cui, Filippos Christianos, Haifeng Zhang, David Mguni, Jun Wang

arXiv PDF

Research

Mean Field Correlated Imitation Learning.

Zhiyu Zhao, Qirui Mi, Ning Yang, Xue Yan, Haifeng Zhang, Jun Wang, Yaodong Yang

AAMAS 2025 PDF

Research

Efficient Reinforcement Learning with Large Language Model Priors.

Xue Yan, Yan Song, Xidong Feng, Mengyue Yang, Haifeng Zhang, Haitham Bou Ammar, Jun Wang

ICLR 2025 PDF

Research

Enhancing efficiency and propulsion in bio-mimetic robotic fish through end-to-end deep reinforcement learning.

Xinyu Cui, Boai Sun, Yi Zhu, Ning Yang, Haifeng Zhang, Weicheng Cui, Dixia Fan, Jun Wang

Physics of Fluids 2024 PDF

Research

Adaptive Command: Real-Time Policy Adjustment via Language Models in StarCraft II.

Weiyu Ma, Dongyu Xu, Shu Lin, Haifeng Zhang, Jun Wang

DAI 2024

Research

Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf.

Xuanfa Jin, Ziyan Wang, Yali Du, Meng Fang, Haifeng Zhang, Jun Wang

NeurIPS 2024 PDF

Research

Large language models play starcraft II: Benchmarks and a chain of summarization approach.

Weiyu Ma, Qirui Mi, Yongcheng Zeng, Xue Yan, Runji Lin, Yuqiao Wu, Jun Wang, Haifeng Zhang

NeurIPS 2024 PDF

Research

AI-Olympics: Exploring the Generalization of Agents through Open Competitions.

Chen Wang, Yan Song, Shuai Wu, Sa Wu, Ruizhi Zhang, Shu Lin, Haifeng Zhang

IJCAI 2024 PDF

Research

Token-level Direct Preference Optimization.

Yongcheng Zeng, Guoqing Liu, Weiyu Ma, Ning Yang, Haifeng Zhang, Jun Wang

ICML 2024 PDF

Research

Large Sequence Models for Sequential Decision-Making: A Survey.

Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Haifeng Zhang, Weinan Zhang

arXiv PDF

Research

TaxAI: A Dynamic Economic Simulator and Benchmark for Multi-Agent Reinforcement Learning.

Qirui Mi, Siyu Xia, Yan Song, Haifeng Zhang, Shenghao Zhu, Jun Wang

AAMAS 2024 PDF

Research

Boosting Studies of Multi-Agent Reinforcement Learning on Google Research Football Environment: the Past, Present, and Future.

Yan Song, He Jiang, Haifeng Zhang, Zheng Tian, Weinan Zhang, Jun Wang

AAMAS 2024 PDF

Research

Variational Stochastic Games.

Zhiyu Zhao, Haifeng Zhang

DAI 2024

Research

Offline Hierarchical Reinforcement Learning: Enable Large-scale Training in HRL.

Yuqiao Wu, Haifeng Zhang, Jun Wang

CCSICC 2023

Research

Learning Robust Communication by Adversarial Training in Networked System Control.

Runji Lin, Haifeng Zhang

CCSICC 2023

Research

An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination.

Xue Yan, Jiaxian Guo, Xingzhou Lou, Jun Wang, Haifeng Zhang, Yali Du

NeurIPS 2023 PDF

Research

A Generative Model for Game Theory with Flow Equilibrium.

Zhiyu Zhao, Renyuan Xu, Haifeng Zhang, Jun Wang, Yaodong Yang

Open Review PDF

Research

Imitation Learning for Mean Field Games with Correlated Equilibria | Research Square.

Zhiyu Zhao, Renyuan Xu, Haifeng Zhang, Jun Wang, Yaodong Yang

Research Square PDF

Research

An Empirical Study on Google Research Football Multi-agent Scenarios.

Yan Song, He Jiang, Zheng Tian, Haifeng Zhang, Yingping Zhang, Jiangcheng Zhu, Zonghong Dai, Weinan Zhang, Jun Wang

Machine Intelligence Research 2022 PDF

Research

Offline Multi-agent Decision Transformer.

Linghui Meng, Muning Wen, Chenyang Le, Xiyun Li, Dengpeng Xing, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Yaodong Yang, Bo Xu

Machine Intelligence Research 2022 PDF

Research

Contextual Transformer for Offline Meta Reinforcement Learning.

Runji Lin, Ye Li, Xidong Feng, Zhaowei Zhang, Xian Hong Wu Fung, Haifeng Zhang, Jun Wang, Yali Du, Yaodong Yang

NeurIPS2022 workshop PDF

Research

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem.

Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang

NeurIPS 2022 PDF

Research

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning.

Bo Liu, Xidong Feng, Jie Ren, Luo Mai, Rui Zhu, Haifeng Zhang, Jun Wang, Yaodong Yang

NeurIPS 2022 PDF

Research

GCS: Graph-Based Coordination Strategy for Multi-Agent Reinforcement Learning.

Jingqing Ruan, Yali Du, Xuantang Xiong, Dengpeng Xing, Xiyun Li, Linghui Meng, Haifeng Zhang, Jun Wang, Bo Xu

AAMAS 2022 PDF

Research

Learning to Identify Top Elo Ratings as A Dueling Bandits Problem.

Xue Yan, Yali Du, Binxin Ru, Jun Wang, Haifeng Zhang, Xu Chen

AAAI 2022 PDF

Research

A Review: Machine Learning for Combinatorial Optimization Problems in Energy Areas.

Xinyi Yang, Ziyi Wang, Hengxi Zhang, Nan Ma, Ning Yang, Hualin Liu, Haifeng Zhang, Lei Yang

Algorithms 2022 PDF

Research

A Game-theoretic Approach for Improving Generalization Ability of TSP Solvers.

Chenguang Wang, Yaodong Yang, Oliver Slumbers, Congying Han, Tiande Guo, Haifeng Zhang, Jun Wang

arXiv 2021 PDF

Research

Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning.

Liheng Chen, Hongyi Guo, Haifeng Zhang, Fei Fang, Yaoming Zhu, Ming Zhou, Qing Wang, Weinan Zhang, Yong Yu

DAI 2021 PDF

Research

Settling the Variance of Multi-Agent Policy Gradients.

Jakub Grudzien Kuba, Muning Wen, Yaodong Yang, Linghui Meng, Shangding Gu, Haifeng Zhang, David Henry Mguni, Jun Wang

NeurIPS 2021 PDF

Research

Joint caching and transmission in the mobile edge network: A multi-agent learning approach.

Qirui Mi, Ning Yang, Haifeng Zhang, Haijun Zhang, Jun Wang

Globecom 2021 PDF

Research

Learning in Nonzero-Sum Stochastic Games with Potentials.

David Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang

ICML 2021 PDF

Research

Estimating α-Rank from A Few Entries with Low Rank Matrix Completion.

Yali Du, Xue Yan, Xu Chen, Haifeng Zhang, Jun Wang

ICML 2021 PDF

Research

Learning Correlated Communication Topology in Multi-Agent Reinforcement Learning.

Yali Du, Bo Liu, Vincent Moens, Ziqi Liu, Zhicheng Ren, Jun Wang, Xu Chen, Haifeng Zhang

AAMAS 2021 PDF

项目

Research

“及第”多智能体开源开放平台

“及第”复现几十种主流(多智能体)强化学习基线算法,并接入几十种主流博弈环境,为新算法的研究提供方便的在线实验平台。

网址

Research

RLChina强化学习社区

RLChina强化学习社区讨论强化学习、博弈论、多智能体系统相关的学术、产业问题。

网址