default search action

combined dblp search
author search
venue search
publication search

ask others

Hangyu Mao

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/tmc/YinXMG25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmc/YinXMG25
Jiangjin Yin, Xin Xie, Hangyu Mao, Song Guo:
Efficient Missing Key Tag Identification in Large-Scale RFID Systems: An Iterative Verification and Selection Method. IEEE Trans. Mob. Comput. 24(3): 2253-2269 (2025)
[j8]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/KongMZ0R0C00T25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/KongMZ0R0C00T25
Yilun Kong, Hangyu Mao, Qi Zhao, Bin Zhang, Jingqing Ruan, Li Shen, Yongzhe Chang, Xueqian Wang, Rui Zhao, Dacheng Tao:
QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning. Trans. Mach. Learn. Res. 2025 (2025)
[c35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/WenLZYML25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WenLZYML25
Yongyan Wen, Siyuan Li, Rongchang Zuo, Lei Yuan, Hangyu Mao, Peng Liu:
SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks. AAAI 2025: 21491-21500
[c34]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/Cheng000DMZWH25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Cheng000DMZWH25
Rong Cheng, Jinyi Liu, Yan Zheng, Fei Ni, Jiazhen Du, Hangyu Mao, Fuzheng Zhang, Bo Wang, Jianye Hao:
DualRAG: A Dual-Process Approach to Integrate Reasoning and Retrieval for Multi-Hop Question Answering. ACL (1) 2025: 31877-31899
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/dasfaa/LiWZYDHZYLMZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dasfaa/LiWZYDHZYLMZ25
Zhishuai Li, Xiang Wang, Jingjing Zhao, Sun Yang, Guoqing Du, Xiaoru Hu, Bin Zhang, Yuxiao Ye, Ziyue Li, Hangyu Mao, Rui Zhao:
PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-Consistency. DASFAA (2) 2025: 193-208
[c32]
- view
- export record
  dblp key:
  - conf/icml/0005H0MSM00Y25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/0005H0MSM00Y25
Zhiwei Xu, Kun Hu, Xin Xin, Weiliang Meng, Yiwei Shi, Hangyu Mao, Bin Zhang, Dapeng Li, Jiangjin Yin:
Reidentify: Context-Aware Identity Generation for Contextual Multi-Agent Reinforcement Learning. ICML 2025
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/www/ChenYZ000HMZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/ChenYZ000HMZ25
Yibin Chen, Yifu Yuan, Zeyu Zhang, Yan Zheng, Jinyi Liu, Fei Ni, Jianye Hao, Hangyu Mao, Fuzheng Zhang:
SheetAgent: Towards a Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models. WWW 2025: 158-177
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-15944
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-15944
Jinyi Liu, Yan Zheng, Rong Cheng, Qiyu Wu, Wei Guo, Fei Ni, Hebin Liang, Yifu Yuan, Hangyu Mao, Fuzheng Zhang, Jianye Hao:
From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models. CoRR abs/2503.15944 (2025)
[i38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-18243
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-18243
Rong Cheng, Jinyi Liu, Yan Zheng, Fei Ni, Jiazhen Du, Hangyu Mao, Fuzheng Zhang, Bo Wang, Jianye Hao:
DualRAG: A Dual-Process Approach to Integrate Reasoning and Retrieval for Multi-Hop Question Answering. CoRR abs/2504.18243 (2025)
[i37]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-21433
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-21433
Zhicong Li, Hangyu Mao, Jiangjin Yin, Mingzhe Xing, Zhiwei Xu, Yuanxing Zhang, Yang Xiao:
NGENT: Next-Generation AI Agents Must Integrate Multi-Domain Abilities to Achieve Artificial General Intelligence. CoRR abs/2504.21433 (2025)
[i36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-16410
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-16410
Guanting Dong, Yifei Chen, Xiaoxi Li, Jiajie Jin, Hongjin Qian, Yutao Zhu, Hangyu Mao, Guorui Zhou, Zhicheng Dou, Ji-Rong Wen:
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning. CoRR abs/2505.16410 (2025)
[i35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-19849
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-19849
Guanting Dong, Hangyu Mao, Kai Ma, Licheng Bao, Yifei Chen, Zhongyuan Wang, Zhongxia Chen, Jiazhen Du, Huiyang Wang, Fuzheng Zhang, Guorui Zhou, Yutao Zhu, Ji-Rong Wen, Zhicheng Dou:
Agentic Reinforced Policy Optimization. CoRR abs/2507.19849 (2025)
[i34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-14545
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-14545
Guanting Dong, Licheng Bao, Zhongyuan Wang, Kangzhi Zhao, Xiaoxi Li, Jiajie Jin, Jinghan Yang, Hangyu Mao, Fuzheng Zhang, Kun Gai, Guorui Zhou, Yutao Zhu, Ji-Rong Wen, Zhicheng Dou:
Agentic Entropy-Balanced Policy Optimization. CoRR abs/2510.14545 (2025)
[i33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-10365
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-10365
Hangyu Mao, Guangting Dong, Zhicheng Dou:
GPG: Generalized Policy Gradient Theorem for Transformer-based Policies. CoRR abs/2512.10365 (2025)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-24138
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-24138
Haoran He, Yuxiao Ye, Jie Liu, Jiajun Liang, Zhiyong Wang, Ziyang Yuan, Xintao Wang, Hangyu Mao, Pengfei Wan, Ling Pan:
GARDO: Reinforcing Diffusion Models without Reward Hacking. CoRR abs/2512.24138 (2025)
2024
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/tcyb/DongMYZLHG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcyb/DongMYZLHG24
Shaokang Dong, Hangyu Mao, Shangdong Yang, Shengyu Zhu, Wenbin Li, Jianye Hao, Yang Gao:
WToE: Learning When to Explore in Multiagent Reinforcement Learning. IEEE Trans. Cybern. 54(8): 4789-4801 (2024)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/tits/JiangLLBMKZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tits/JiangLLBMKZ24
Haoyuan Jiang, Ziyue Li, Zhishuai Li, Lei Bai, Hangyu Mao, Wolfgang Ketter, Rui Zhao:
A General Scenario-Agnostic Reinforcement Learning for Traffic Signal Control. IEEE Trans. Intell. Transp. Syst. 25(9): 11330-11344 (2024)
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/LuRJ0MZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/LuRJ0MZ24
Jiaming Lu, Jingqing Ruan, Haoyuan Jiang, Ziyue Li, Hangyu Mao, Rui Zhao:
DuaLight: Enhancing Traffic Signal Control by Leveraging Scenario-Specific and Scenario-Shared Knowledge. AAMAS 2024: 1283-1291
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/MaoZ00CCZXZY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/MaoZ00CCZXZY24
Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, Jiangjin Yin:
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. AAMAS 2024: 1363-1371
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/dexa/YangSLLMLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dexa/YangSLLMLZ24
Sun Yang, Qiong Su, Zhishuai Li, Ziyue Li, Hangyu Mao, Chenxi Liu, Rui Zhao:
SQL-to-Schema Enhances Schema Linking in Text-to-SQL. DEXA (1) 2024: 139-145
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/KongRC0BSQHM0ZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/KongRC0BSQHM0ZZ24
Yilun Kong, Jingqing Ruan, Yihong Chen, Bin Zhang, Tianpeng Bao, Shiwei Shi, Du Qing, Xiaoru Hu, Hangyu Mao, Ziyue Li, Xingyu Zeng, Rui Zhao, Xueqian Wang:
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Industry Systems. EMNLP (Industry Track) 2024: 371-385
[c26]
- view
- export record
  dblp key:
  - conf/icml/0052M0000F24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/0052M0000F24
Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan:
Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach. ICML 2024: 59559-59575
[c25]
- view
  - electronic edition @ ijcai.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcai/ChenMM0Z0YC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/ChenMM0Z0YC24
Yiqun Chen, Hangyu Mao, Jiaxin Mao, Shiguang Wu, Tianle Zhang, Bin Zhang, Wei Yang, Hongxing Chang:
PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning. IJCAI 2024: 31-39
[c24]
- view
  - electronic edition @ ijcai.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcai/Jiang00XRLM024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/Jiang00XRLM024
Haoyuan Jiang, Ziyue Li, Hua Wei, Xuantang Xiong, Jingqing Ruan, Jiaming Lu, Hangyu Mao, Rui Zhao:
X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner. IJCAI 2024: 94-102
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/kdd/Ruan0WJLXM024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/kdd/Ruan0WJLXM024
Jingqing Ruan, Ziyue Li, Hua Wei, Haoyuan Jiang, Jiaming Lu, Xuantang Xiong, Hangyu Mao, Rui Zhao:
CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal Control. KDD 2024: 2500-2511
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/secon/YinMZX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/secon/YinMZX24
Jiangjin Yin, Hangyu Mao, Rongbo Zhu, Shiwei Xu:
Parallel Missing Tag Identification for Anonymous Multiple Users RFID Systems. SECON 2024: 1-9
[c21]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/strl/Jiang0L0MK024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/strl/Jiang0L0MK024
Haoyuan Jiang, Ziyue Li, Zhishuai Li, Lei Bai, Hangyu Mao, Wolfgang Ketter, Rui Zhao:
GESA: A GEneral Scenario-Agnostic Reinforcement Learning for Traffic Signal Control. STRL@IJCAI 2024
[i31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-02951
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-02951
Bin Zhang, Yuxiao Ye, Guoqing Du, Xiaoru Hu, Zhishuai Li, Sun Yang, Chi Harold Liu, Rui Zhao, Ziyue Li, Hangyu Mao:
Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation. CoRR abs/2403.02951 (2024)
[i30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-09732
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-09732
Zhishuai Li, Xiang Wang, Jingjing Zhao, Sun Yang, Guoqing Du, Xiaoru Hu, Bin Zhang, Yuxiao Ye, Ziyue Li, Rui Zhao, Hangyu Mao:
PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency. CoRR abs/2403.09732 (2024)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-12090
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-12090
Haoyuan Jiang, Ziyue Li, Hua Wei, Xuantang Xiong, Jingqing Ruan, Jiaming Lu, Hangyu Mao, Rui Zhao:
X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner. CoRR abs/2404.12090 (2024)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-09593
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-09593
Sun Yang, Qiong Su, Zhishuai Li, Ziyue Li, Hangyu Mao, Chenxi Liu, Rui Zhao:
SQL-to-Schema Enhances Schema Linking in Text-to-SQL. CoRR abs/2405.09593 (2024)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-17152
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17152
Jingqing Ruan, Ziyue Li, Hua Wei, Haoyuan Jiang, Jiaming Lu, Xuantang Xiong, Hangyu Mao, Rui Zhao:
CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal Control. CoRR abs/2405.17152 (2024)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-10811
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-10811
Haoyuan Jiang, Xuantang Xiong, Ziyue Li, Hangyu Mao, Guanghu Sui, Jingqing Ruan, Yuheng Cheng, Hua Wei, Wolfgang Ketter, Rui Zhao:
GuideLight: "Industrial Solution" Guidance for More Practical Traffic Signal Control Agents. CoRR abs/2407.10811 (2024)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-09501
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-09501
Zhiwei Xu, Hangyu Mao, Nianmin Zhang, Xin Xin, Pengjie Ren, Dapeng Li, Bin Zhang, Guoliang Fan, Zhumin Chen, Changwei Wang, Jiangjin Yin:
Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2408.09501 (2024)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-10504
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-10504
Yilun Kong, Hangyu Mao, Qi Zhao, Bin Zhang, Jingqing Ruan, Li Shen, Yongzhe Chang, Xueqian Wang, Rui Zhao, Dacheng Tao:
QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning. CoRR abs/2408.10504 (2024)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-01739
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-01739
Xingrui Gu, Guanren Qiao, Chuyi Jiang, Tianqing Xia, Hangyu Mao:
Mimicking Human Intuition: Cognitive Belief-Driven Q-Learning. CoRR abs/2410.01739 (2024)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-12173
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-12173
Yongyan Wen, Siyuan Li, Rongchang Zuo, Lei Yuan, Hangyu Mao, Peng Liu:
SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks. CoRR abs/2411.12173 (2024)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-13154
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-13154
Zhicong Li, Jiahao Wang, Zhishu Jiang, Hangyu Mao, Zhongxia Chen, Jiazhen Du, Yuanxing Zhang, Fuzheng Zhang, Di Zhang, Yong Liu:
DMQR-RAG: Diverse Multi-Query Rewriting for RAG. CoRR abs/2411.13154 (2024)
2023
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/candie/HeMLX23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/candie/HeMLX23
Bin He, Hangyu Mao, Tengyu Li, Jing-Long Xiao:
A closed-loop digital twin modeling method integrated with carbon footprint analysis. Comput. Ind. Eng. 182: 109389 (2023)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/jcise/HeM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jcise/HeM23
Bin He, Hangyu Mao:
Digital Twin-Driven Product Sustainable Design for Low Carbon Footprint. J. Comput. Inf. Sci. Eng. 23(6) (2023)
[c20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ecai/RuanH0M23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ecai/RuanH0M23
Jingqing Ruan, Xiaotian Hao, Dong Li, Hangyu Mao:
Learning to Collaborate by Grouping: A Consensus-Oriented Strategy for Multi-Agent Reinforcement Learning. ECAI 2023: 2010-2017
[c19]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/HaoHMW00ZW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HaoHMW00ZW23
Jianye Hao, Xiaotian Hao, Hangyu Mao, Weixun Wang, Yaodong Yang, Dong Li, Yan Zheng, Zhen Wang:
Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks. ICLR 2023
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icse/YanCMJHLTCLXGLWSC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icse/YanCMJHLTCLXGLWSC23
Ming Yan, Junjie Chen, Hangyu Mao, Jiajun Jiang, Jianye Hao, Xingjian Li, Zhao Tian, Zhichao Chen, Dong Li, Zhangkong Xian, Yanwei Guo, Wulong Liu, Bin Wang, Yuefeng Sun, Yongshun Cui:
Achieving Last-Mile Functional Coverage in Testing Chip Design Software Implementations. ICSE-SEIP 2023: 343-354
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/kdd/XingMYPZXL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/kdd/XingMYPZXL23
Mingzhe Xing, Hangyu Mao, Shenglin Yin, Lichen Pan, Zhengchao Zhang, Zhen Xiao, Jieyi Long:
A Dual-Agent Scheduler for Distributed Deep Learning Jobs on Public Cloud via Reinforcement Learning. KDD 2023: 2776-2788
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-07856
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-07856
Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan:
Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems. CoRR abs/2305.07856 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-15530
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-15530
Jingqing Ruan, Xiaotian Hao, Dong Li, Hangyu Mao:
Learning to Collaborate by Grouping: a Consensus-oriented Strategy for Multi-agent Reinforcement Learning. CoRR abs/2307.15530 (2023)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-03427
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-03427
Jingqing Ruan, Yihong Chen, Bin Zhang, Zhiwei Xu, Tianpeng Bao, Guoqing Du, Shiwei Shi, Hangyu Mao, Xingyu Zeng, Rui Zhao:
TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents. CoRR abs/2308.03427 (2023)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-18752
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-18752
Guanghu Sui, Zhishuai Li, Ziyue Li, Sun Yang, Jingqing Ruan, Hangyu Mao, Rui Zhao:
Reboost Large Language Model-based Text-to-SQL, Text-to-Python, and Text-to-Function - with Real Applications in Traffic Domain. CoRR abs/2310.18752 (2023)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-11315
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-11315
Yilun Kong, Jingqing Ruan, Yihong Chen, Bin Zhang, Tianpeng Bao, Shiwei Shi, Guoqing Du, Xiaoru Hu, Hangyu Mao, Ziyue Li, Xingyu Zeng, Rui Zhao:
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems. CoRR abs/2311.11315 (2023)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-13884
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-13884
Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Lijuan Li, Guoliang Fan:
Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach. CoRR abs/2311.13884 (2023)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-14532
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-14532
Jiaming Lu, Jingqing Ruan, Haoyuan Jiang, Ziyue Li, Hangyu Mao, Rui Zhao:
DuaLight: Enhancing Traffic Signal Control by Leveraging Scenario-Specific and Scenario-Shared Knowledge. CoRR abs/2312.14532 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-15863
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-15863
Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, Jiangjin Yin:
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. CoRR abs/2312.15863 (2023)
2022
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/ijon/ZhangLMY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijon/ZhangLMY22
Xianjie Zhang, Yu Liu, Hangyu Mao, Chao Yu:
Common belief multi-agent reinforcement learning based on variational recurrent models. Neurocomputing 513: 341-350 (2022)
[c16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/TangMHCGLYML0TW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/TangMHCGLYML0TW22
Hongyao Tang, Zhaopeng Meng, Jianye Hao, Chen Chen, Daniel Graves, Dong Li, Changmin Yu, Hangyu Mao, Wulong Liu, Yaodong Yang, Wenyuan Tao, Li Wang:
What about Inputting Policy in Value Function: Policy Representation and Policy-Extended Value Function Approximator. AAAI 2022: 8441-8449
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/HuangLSZLWMHWD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/HuangLSZLWMHWD22
Wenhan Huang, Kai Li, Kun Shao, Tianze Zhou, Jun Luo, Dongge Wang, Hangyu Mao, Jianye Hao, Jun Wang, Xiaotie Deng:
Multiagent Q-learning with Sub-Team Coordination. AAMAS 2022: 1630-1632
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icmre/LiuHZM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmre/LiuHZM22
Long Liu, Bin He, Dong Zhang, Hangyu Mao:
Deep Belief Network-based Prediction for Gear Noise. ICMRE 2022: 50-54
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/XingMX22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/XingMX22
Mingzhe Xing, Hangyu Mao, Zhen Xiao:
Fast and Fine-grained Autoscaler for Streaming Jobs with Reinforcement Learning. IJCAI 2022: 564-570
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/miccai/LiCMDLHDH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/miccai/LiCMDLHDH22
Jinpeng Li, Guangyong Chen, Hangyu Mao, Danruo Deng, Dong Li, Jianye Hao, Qi Dou, Pheng-Ann Heng:
Flat-Aware Cross-Stage Distilled Framework for Imbalanced Medical Image Classification. MICCAI (3) 2022: 217-226
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/middleware/PanQXMYLX22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/middleware/PanQXMYLX22
Lichen Pan, Jun Qian, Wei Xia, Hangyu Mao, Jun Yao, Pengze Li, Zhen Xiao:
Optimizing communication in deep reinforcement learning with XingTian. Middleware 2022: 255-268
[c10]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/HuangLSZT0WMH0D22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HuangLSZT0WMH0D22
Wenhan Huang, Kai Li, Kun Shao, Tianze Zhou, Matthew E. Taylor, Jun Luo, Dongge Wang, Hangyu Mao, Jianye Hao, Jun Wang, Xiaotie Deng:
Multiagent Q-learning with Sub-Team Coordination. NeurIPS 2022
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-05285
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-05285
Xiaotian Hao, Weixun Wang, Hangyu Mao, Yaodong Yang, Dong Li, Yan Zheng, Zhen Wang, Jianye Hao:
API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks. CoRR abs/2203.05285 (2022)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-08872
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-08872
Yiqun Chen, Hangyu Mao, Tianle Zhang, Shiguang Wu, Bin Zhang, Jianye Hao, Dong Li, Bin Wang, Hongxing Chang:
PTDE: Personalized Training with Distillated Execution for Multi-Agent Reinforcement Learning. CoRR abs/2210.08872 (2022)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-14538
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-14538
Hangyu Mao, Rui Zhao, Hao Chen, Jianye Hao, Yiqun Chen, Dong Li, Junge Zhang, Zhen Xiao:
Transformer in Transformer as Backbone for Deep Reinforcement Learning. CoRR abs/2212.14538 (2022)
2021
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/ijon/ZhangLXHMC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijon/ZhangLXHMC21
Xianjie Zhang, Yu Liu, Xiujuan Xu, Qiong Huang, Hangyu Mao, Anil Carie:
Structural relational inference actor-critic for multi-agent reinforcement learning. Neurocomputing 459: 383-394 (2021)
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/dai2/MaoWHMLWHLT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dai2/MaoWHMLWHLT21
Hangyu Mao, Chao Wang, Xiaotian Hao, Yihuan Mao, Yiming Lu, Chengjie Wu, Jianye Hao, Dong Li, Pingzhong Tang:
SEIHAI: A Sample-Efficient Hierarchical AI for the MineRL Competition. DAI 2021: 38-51
[c8]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YangWTHMMLLCHFZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangWTHMMLLCHFZ21
Tianpei Yang, Weixun Wang, Hongyao Tang, Jianye Hao, Zhaopeng Meng, Hangyu Mao, Dong Li, Wulong Liu, Yingfeng Chen, Yujing Hu, Changjie Fan, Chengwei Zhang:
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning. NeurIPS 2021: 17037-17048
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-00517
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-00517
Tianze Zhou, Fubiao Zhang, Kun Shao, Kai Li, Wenhan Huang, Jun Luo, Weixun Wang, Yaodong Yang, Hangyu Mao, Bin Wang, Dong Li, Wulong Liu, Jianye Hao:
Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment. CoRR abs/2106.00517 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-03748
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-03748
William Hebgen Guss, Stephanie Milani, Nicholay Topin, Brandon Houghton, Sharada P. Mohanty, Andrew Melnik, Augustin Harter, Benoit Buschmaas, Bjarne Jaster, Christoph Berganski, Dennis Heitkamp, Marko Henning, Helge J. Ritter, Chengjie Wu, Xiaotian Hao, Yiming Lu, Hangyu Mao, Yihuan Mao, Chao Wang, Michal Opanowicz, Anssi Kanervisto, Yanick Schraner, Christian Scheller, Xiren Zhou, Lu Liu, Daichi Nishio, Toi Tsuneda, Karolis Ramanauskas, Gabija Juceviciute:
Towards robust and domain agnostic reinforcement learning competitions. CoRR abs/2106.03748 (2021)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-08857
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-08857
Hangyu Mao, Chao Wang, Xiaotian Hao, Yihuan Mao, Yiming Lu, Chengjie Wu, Jianye Hao, Dong Li, Pingzhong Tang:
SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition. CoRR abs/2111.08857 (2021)
2020
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/aamas/MaoZXGN20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/aamas/MaoZXGN20
Hangyu Mao, Zhengchao Zhang, Zhen Xiao, Zhibo Gong, Yan Ni:
Learning multi-agent communication with double attentional deep reinforcement learning. Auton. Agents Multi Agent Syst. 34(1): 32 (2020)
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/MaoZXGN20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/MaoZXGN20
Hangyu Mao, Zhengchao Zhang, Zhen Xiao, Zhibo Gong, Yan Ni:
Learning Agent Communication under Limited Bandwidth by Message Pruning. AAAI 2020: 5142-5149
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/MaoLHLLZWX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/MaoLHLLZWX20
Hangyu Mao, Wulong Liu, Jianye Hao, Jun Luo, Dong Li, Zhengchao Zhang, Jun Wang, Zhen Xiao:
Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning. AAAI 2020: 7219-7226
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/GussMTHMMHBJBHH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/GussMTHMMHBJBHH20
William Hebgen Guss, Stephanie Milani, Nicholay Topin, Brandon Houghton, Sharada P. Mohanty, Andrew Melnik, Augustin Harter, Benoit Buschmaas, Bjarne Jaster, Christoph Berganski, Dennis Heitkamp, Marko Henning, Helge J. Ritter, Chengjie Wu, Xiaotian Hao, Yiming Lu, Hangyu Mao, Yihuan Mao, Chao Wang, Michal Opanowicz, Anssi Kanervisto, Yanick Schraner, Christian Scheller, Xiren Zhou, Lu Liu, Daichi Nishio, Toi Tsuneda, Karolis Ramanauskas, Gabija Juceviciute:
Towards robust and domain agnostic reinforcement learning competitions: MineRL 2020. NeurIPS (Competition and Demos) 2020: 233-252
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-03433
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-03433
Hangyu Mao, Zhibo Gong, Zhen Xiao:
Reward Design in Cooperative Multi-agent Reinforcement Learning for Packet Routing. CoRR abs/2003.03433 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c4]
- view
  - electronic edition @ acm.org
  - details & citations
- export record
  dblp key:
  - conf/atal/MaoZXG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/MaoZXG19
Hangyu Mao, Zhengchao Zhang, Zhen Xiao, Zhibo Gong:
Modelling the Dynamic Joint Policy of Teammates with Attention Multi-agent DDPG. AAMAS 2019: 1108-1116
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-05561
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-05561
Hangyu Mao, Zhibo Gong, Zhengchao Zhang, Zhen Xiao, Yan Ni:
Learning Multi-agent Communication under Limited-bandwidth Restriction for Internet Packet Routing. CoRR abs/1903.05561 (2019)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-01160
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-01160
Hangyu Mao, Wulong Liu, Jianye Hao, Jun Luo, Dong Li, Zhengchao Zhang, Jun Wang, Zhen Xiao:
Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning. CoRR abs/1912.01160 (2019)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-05304
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-05304
Hangyu Mao, Zhengchao Zhang, Zhen Xiao, Zhibo Gong, Yan Ni:
Learning Agent Communication under Limited Bandwidth by Message Pruning. CoRR abs/1912.05304 (2019)
2018
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/pakdd/MaoXWWX18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/pakdd/MaoXWWX18
Hangyu Mao, Yang Xiao, Yuan Wang, Jiakang Wang, Zhen Xiao:
Topic-Specific Retweet Count Ranking for Weibo. PAKDD (3) 2018: 625-637
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-07029
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-07029
Hangyu Mao, Zhengchao Zhang, Zhen Xiao, Zhibo Gong:
Modelling the Dynamic Joint Policy of Teammates with Attention Multi-agent DDPG. CoRR abs/1811.07029 (2018)
2017
[c2]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcai/WangMX17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/WangMX17
Yuan Wang, Hangyu Mao, Zhen Xiao:
Identifying Influential Users' Professions via the Microblogs They Forward. SocInf@IJCAI 2017: 33-44
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/MaoGNLWKMSX17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/MaoGNLWKMSX17
Hangyu Mao, Zhibo Gong, Yan Ni, Xiangyu Liu, Quanbin Wang, Weichen Ke, Chao Ma, Yiping Song, Zhen Xiao:
ACCNet: Actor-Coordinator-Critic Net for "Learning-to-Communicate" with Deep Multi-agent Reinforcement Learning. CoRR abs/1706.03235 (2017)
2016
[c1]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/XiaoWMX16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/XiaoWMX16
Yang Xiao, Yuan Wang, Hangyu Mao, Zhen Xiao:
Predicting Restaurant Consumption Level through Social Media Footprints. COLING 2016: 3328-3338

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.