Yang Yu 0001

Person information

affiliation (PhD 2011): Nanjing University, State Key Laboratory for Novel Software Technology, China
affiliation: Pazhou Lab, Guangzhou, China

Other persons with the same name

Yang Yu — disambiguation page
Yang Yu 0002 — University of Technology Sydney, Faculty of Engineering and Information Technology, NSW, Australia (and 1 more)
Yang Yu 0003 — North China Electric Power University, State Key Laboratory of Alternate Electrical Power System with Renewable Energy Sources, Baoding, China
Yang Yu 0004 — Rochester Institute of Technology, Saunders College of Business, Rochester, NY, USA (and 1 more)
Yang Yu 0005 — Jiangsu University of Technology, School of Electric Information Engineering, Changzhou, China
Yang Yu 0006 — National University of Defense Technology, College of Electrical Science and Engineering, National Key Laboratory of Science and Technology on ATR, Changsha, China
Yang Yu 0007 — National University of Defense Technology, College of Computer, Changsha, China
Yang Yu 0008 — Tsinghua University, Department of Computer Science and Technology, Beijing, China (and 1 more)
Yang Yu 0009 — Motorola Labs, Schaumburg, IL, USA (and 1 more)
Yang Yu 0010 — Rutgers University, Department of Computer Science, Piscataway, NJ, USA

Yang Yu 0011 — Tsinghua University, Institute for Interdisciplinary Information Sciences, Beijing, China (and 1 more)
Yang Yu 0012 — University of Sheffield, UK
Yang Yu 0013 — Nanjing University of Posts and Telecommunications, College of Automation / College of Artificial Intelligence, China (and 1 more)
Yang Yu 0014 — National University of Defense Technology, College of Intelligence Science and Technology, Changsha, China (and 1 more)
Yang Yu 0015 — Harbin Institute of Technology, Department of Automatic Test and Control, Harbin, China
Yang Yu 0016 — Northeastern University, College of Information Science and Engineering, Shenyang, China
Yang Yu 0017 — Changchun University of Technology, School of Mechatronic Engineering, Changchun, China
Yang Yu 0018 — Harbin Jiancheng Group Company, Harbin, China
Yang Yu 0019 — Shanghai Jiao Tong University, School of Mechanical Engineering, State Key Laboratory of Mechanical System and Vibration, Shanghai, China
Yang Yu 0020 — Tongji University, State Key Laboratory of Marine Geology, Shanghai, China
Yang Yu 0021 — University of Technology Sydney, School of Civil and Environmental Engineering, Sydney, Australia
Yang Yu 0022 — Hebei University of Technology, School of Computer Science and Engineering, Tianjin, China
Yang Yu 0023 — Wuhan University, School of Urban Design, Department of Urban Planning, Wuhan, China
Yang Yu 0024 — Tongji University, Department of Control Science and Engineering, Shanghai, China
Yang Yu 0025 — Rutgers University, Department of Mathematics, Piscataway, NJ, USA
Yang Yu 0026 — China Agricultural University, College of Engineering, Beijing, China
Yang Yu 0027 — Sun Yat-sen University, School of Data and Computer Science, Guangzhou, China
Yang Yu 0028 — Hong Kong University of Science and Technology, Department of Electronic and Computer Engineering, Robotics and Multi-Perception Laboratory, Hong Kong
Yang Yu 0029 — Google, Mountain View, CA, USA (and 3 more)
Yang Yu 0030 — Tianjin University, College of Intelligence and Computing, China
Yang Yu 0031 — Southwest Forestry University, School of Machinery and Transportation, Kunming, China (and 1 more)
Yang Yu 0032 — National University of Defense Technology, Center of Material Science, College of Liberal Arts and Sciences, College of Advanced Interdisciplinary Studies, College of Sciences, Changsha, China
Yang Yu 0033 — Guizhou Medical University, School of Biology and Engineering, Guiyang, China (and 1 more)
Yang Yu 0034 — Nanjing University of Posts and Telecommunications, College of Communication & Information Engineering, China
Yang Yu 0035 — Royal Institute of Technology, Stockholm, Sweden
Yang Yu 0036 — University of Duisburg-Essen, Germany
Yang Yu 0037 — Auckland University of Technology, Institute of Biomedical Technologies, New Zealand
Yang Yu 0038 — University of Science and Technology of China, State Key Laboratory of Cognitive Intelligence, Hefei, China
Yang Yu 0039 — Beijing Jiaotong University, Institute of Information Science, Beijing Key Laboratory of Advanced Information Science and Network Technology, Beijing, China
Yang Yu 0040 — Northwestern Polytechnical University, School of Marine Science and Technology, Xi'an, China
Yang Yu 0041 — Shanghai Jiao Tong University, Department of Electronic Engineering, Network Coding and Transmission Laboratory, Shanghai, China (and 1 more)
Yang Yu 0042 — Kookmin University, Department of Computer Science, Seoul, South Korea
Yang Yu 0043 — Qingdao University, School of Automation, Shandong Key Laboratory of Industrial Control Technology, Qingdao, China
Yang Yu 0044 — Zhengzhou University of Light Industry, Software Engineering College, Zhengzhou, China
Yang Yu 0045 — Chinese Academy of Sciences, Shanghai Institute of Technical Physics, Key Laboratory of Infrared System Detecting and Imaging Technology, Shanghai, China
Yang Yu 0046 — Japan Advanced Institute of Science and Technology (JAIST), School of Knowledge Science, Nomi, Japan
Yang Yu 0047 — Hubei Three Gorges Polytechnic, Electronic Information School, Yichang, China
Yang Yu 0048 — Northwestern Polytechnical University, School of Electronics and Information, Xi'an, China
Yang Yu 0049 — Lanzhou Jiaotong University, School of Traffic and Transportation, Lanzhou, China
Yang Yu 0050 — Shanghai Jiao Tong University, Antai College of Economics and Management, Shanghai, China
Yang Yu 0051 — Tianjin University, Tianjin Key Laboratory of Port and Ocean Engineering, State Key Laboratory of Hydraulic Engineering Simulation and Safety, Tianjin, China
Yang Yu 0052 — Purdue University, Department of Statistics, West Lafayette, IN, USA
Yang Yu 0053 — University of North Carolina at Chapel Hill, Department of Statistics and Operations Research, Chapel Hill, NC, USA
Yang Yu 0054 — Halliburton Ltd, Singapore (and 1 more)
Yang Yu 0055 — Taylor Hobson Ltd. AMETEK Ultra Precision Technologies, Leicester, UK (and 1 more)
Yang Yu 0056 — University of Chinese Academy of Sciences, School of Artificial Intelligence, Beijing, China (and 1 more)
Yang Yu 0057 — Qilu University of Technology, Shandong Computer Science Center, Shandong Provincial Key Laboratory of Computer Networks, Jinan, China
Yang Yu 0058 — Beijing Jiaotong University, Institute of Data Science and Intelligent Decision Support, Beijing, China (and 2 more)
Yang Yu 0059 — Liaoning Institute of Science and Engineering, School of Management Engineering, Jinzhou, China
Yang Yu 0060 — Wuhan University of Technology, School of Information Engineering, Wuhan, China
Yang Yu 0061 — Nanjing University of Posts and Telecommunications, Institute of Signal Processing Transmission, Nanjing, China
Yang Yu 0062 — University of California Davis, Department of Land Air and Water Resources, Davis, CA, USA (and 1 more)
Yang Yu 0063 — Pennsylvania State University, Department of Architectural Engineering, University Park, PA, USA
Yang Yu 0064 — Beihang University, School of Aeronautic Science and Engineering, Beijing, China
Yang Yu 0065 — Shanghai Conservatory of Music, Shanghai Key Laboratory for Music Acoustic, Shanghai, China
Yang Yu 0066 — Semiconductor Manufacturing International Corporation, R&D Department, Shanghai, China
Yang Yu 0067 — Tianjin University of Science and Technology, College of Artificial Intelligence, Tianjin, China
Yang Yu 0068 — Chinese Academy of Sciences, National Space Science Center, Key Laboratory of Microwave Remote Sensing, Beijing, China (and 3 more)
Yang Yu 0069 — Northwestern Polytechnical University, School of Astronautics, National Key Laboratory of Aerospace Flight Dynamics, Xi'an, China
Yang Yu 0070 — Chinese University of Hong Kong, Department of Computer Science and Engineering, Hong Kong
Yang Yu 0071 — Weifang People's Hospital, Department of Stomatology, Weifang, China
Yang Yu 0072 — Shandong University of Technology, School of Transportation and Vehicle Engineering, Zibo, China
Yang Yu 0073 — Wuhan University, School of Cyber Science and Engineering, Key Laboratory of Aerospace Information Security and Trusted Computing, Wuhan, China
Yang Yu 0074 — Victoria University of Wellington, School of Marketing and International Business, Wellington, New Zealand
Yang Yu 0075 — Victoria University of Wellington, School of Engineering and Computer Science, Wellington, New Zealand
Yang Yu 0076 — Jilin Communications Polytechnic, Department of Physical Education, Changchun, China
Yang Yu 0077 — Shenyang Aerospace University, School of Automation, Shenyang, China
Yang Yu 0078 — Northwestern University, Department of Statistics, Evanston, IL, USA
Yang Yu 0079 — Agency for Science, Technology and Research (A*STAR), Machine Intellection Department, Institute for Infocomm Research (I2R), Singapore (and 1 more)

2020 – today

2025
[j57]
- dblp key: journals/fcsc/GuanXFCZYQY25
- persistent URL: https://dblp.org/rec/journals/fcsc/GuanXFCZYQY25
Cong Guan, Ke Xue, Chunpeng Fan, Feng Chen, Lichao Zhang, Lei Yuan, Chao Qian, Yang Yu:
Open and real-world human-AI coordination by heterogeneous training with communication. Frontiers Comput. Sci. 19(4): 194314 (2025)
[j56]
- dblp key: journals/fcsc/ZhuTCZY25
- persistent URL: https://dblp.org/rec/journals/fcsc/ZhuTCZY25
Zhengmao Zhu, Hong-Long Tian, Xionghui Chen, Kun Zhang, Yang Yu:
Offline model-based reinforcement learning with causal structured world models. Frontiers Comput. Sci. 19(4): 194347 (2025)
[j55]
- dblp key: journals/nn/GuanJLZYY25
- persistent URL: https://dblp.org/rec/journals/nn/GuanJLZYY25
Cong Guan, Tao Jiang, Yi-Chen Li, Zongzhang Zhang, Lei Yuan, Yang Yu:
Constraining an Unconstrained Multi-agent Policy with offline data. Neural Networks 186: 107253 (2025)
[j54]
- dblp key: journals/pami/LiCZZYY25
- persistent URL: https://dblp.org/rec/journals/pami/LiCZZYY25
Yi-Chen Li, Ningjing Chao, Zongzhang Zhang, Fuxiang Zhang, Lei Yuan, Yang Yu:
Generalizable Multi-Modal Adversarial Imitation Learning for Non-Stationary Dynamics. IEEE Trans. Pattern Anal. Mach. Intell. 47(7): 5600-5612 (2025)
[j53]
- dblp key: journals/tmlr/0042CQG0Z025
- persistent URL: https://dblp.org/rec/journals/tmlr/0042CQG0Z025
Feng Chen, Xinwei Chen, Rong-Jun Qin, Cong Guan, Lei Yuan, Zongzhang Zhang, Yang Yu:
Efficient Multi-Agent Cooperation Learning through Teammate Lookahead. Trans. Mach. Learn. Res. 2025 (2025)
[j52]
- dblp key: journals/tmlr/PangFWXTYJXCHY25
- persistent URL: https://dblp.org/rec/journals/tmlr/PangFWXTYJXCHY25
Jing-Cheng Pang, Heng-Bo Fan, Pengyuan Wang, Jiahao Xiao, Nan Tang, Si-Hang Yang, Chengxing Jia, Ming-Kun Xie, Xiang Chen, Sheng-Jun Huang, Yang Yu:
Interactive Large Language Models for Reliable Answering under Incomplete Context. Trans. Mach. Learn. Res. 2025 (2025)
[j51]
- dblp key: journals/tnn/YuanLZZGY25
- persistent URL: https://dblp.org/rec/journals/tnn/YuanLZZGY25
Lei Yuan, Lihe Li, Ziqian Zhang, Fuxiang Zhang, Cong Guan, Yang Yu:
Multiagent Continual Coordination via Progressive Task Contextualization. IEEE Trans. Neural Networks Learn. Syst. 36(4): 6326-6340 (2025)
[j50]
- dblp key: journals/tnn/GuanCYZY25
- persistent URL: https://dblp.org/rec/journals/tnn/GuanCYZY25
Cong Guan, Feng Chen, Lei Yuan, Zongzhang Zhang, Yang Yu:
Efficient Communication via Self-Supervised Information Aggregation for Online and Offline Multiagent Reinforcement Learning. IEEE Trans. Neural Networks Learn. Syst. 36(5): 9044-9056 (2025)
[j49]
- dblp key: journals/tnn/DingJZGCYY25
- persistent URL: https://dblp.org/rec/journals/tnn/DingJZGCYY25
Hao Ding, Chengxing Jia, Zongzhang Zhang, Cong Guan, Feng Chen, Lei Yuan, Yang Yu:
Learning to Coordinate With Different Teammates via Team Probing. IEEE Trans. Neural Networks Learn. Syst. 36(9): 15807-15821 (2025)
[j48]
- dblp key: journals/tnn/PangXJLY25
- persistent URL: https://dblp.org/rec/journals/tnn/PangXJLY25
Jing-Cheng Pang, Tian Xu, Shengyi Jiang, Yu-Ren Liu, Yang Yu:
Reinforcement Learning With Sparse-Executing Action via Sparsity Regularization. IEEE Trans. Neural Networks Learn. Syst. 36(9): 16072-16084 (2025)
[c151]
- dblp key: conf/iclr/0001BLZG025
- persistent URL: https://dblp.org/rec/conf/iclr/0001BLZG025
Lei Yuan, Yuqi Bian, Lihe Li, Ziqian Zhang, Cong Guan, Yang Yu:
Efficient Multi-agent Offline Coordination via Diffusion-based Trajectory Stitching. ICLR 2025
[c150]
- dblp key: conf/iclr/0001Z00JZ0A25
- persistent URL: https://dblp.org/rec/conf/iclr/0001Z00JZ0A25
Yi-Chen Li, Fuxiang Zhang, Wenjie Qiu, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu, Bo An:
Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation. ICLR 2025
[c149]
- dblp key: conf/iclr/LinXSZ0JYZ025
- persistent URL: https://dblp.org/rec/conf/iclr/LinXSZ0JYZ025
Haoxin Lin, Yu-Yan Xu, Yihao Sun, Zhilong Zhang, Yi-Chen Li, Chengxing Jia, Junyin Ye, Jiaji Zhang, Yang Yu:
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning. ICLR 2025
[c148]
- dblp key: conf/iclr/Liu00025
- persistent URL: https://dblp.org/rec/conf/iclr/Liu00025
Xu-Hui Liu, Yali Du, Jun Wang, Yang Yu:
On the Optimization Landscape of Low Rank Adaptation Methods for Large Language Models. ICLR 2025
[c147]
- dblp key: conf/iclr/LiuL0JWZ025
- persistent URL: https://dblp.org/rec/conf/iclr/LiuL0JWZ025
Tian-Shuo Liu, Xu-Hui Liu, Ruifeng Chen, Lixuan Jin, Pengyuan Wang, Zhilong Zhang, Yang Yu:
Semantic Temporal Abstraction via Vision-Language Model Guidance for Efficient Reinforcement Learning. ICLR 2025
[c146]
- dblp key: conf/iclr/PangTLTCZ0S025
- persistent URL: https://dblp.org/rec/conf/iclr/PangTLTCZ0S025
Jing-Cheng Pang, Nan Tang, Kaiyuan Li, Yuting Tang, Xin-Qiang Cai, Zhen-Yu Zhang, Gang Niu, Masashi Sugiyama, Yang Yu:
Learning View-invariant World Models for Visual Robotic Manipulation. ICLR 2025
[c145]
- dblp key: conf/iclr/QianZSLWALZ0Y25
- persistent URL: https://dblp.org/rec/conf/iclr/QianZSLWALZ0Y25
Hong Qian, Yiyi Zhu, Xiang Shu, Shuo Liu, Yaolin Wen, Xin An, Huakang Lu, Aimin Zhou, Ke Tang, Yang Yu:
SOO-Bench: Benchmarks for Evaluating the Stability of Offline Black-Box Optimization. ICLR 2025
[i105]
- dblp key: journals/corr/abs-2502-04778
- persistent URL: https://dblp.org/rec/journals/corr/abs-2502-04778
Chen-Xiao Gao, Chenyang Wu, Mingjun Cao, Chenjun Xiao, Yang Yu, Zongzhang Zhang:
Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning. CoRR abs/2502.04778 (2025)
[i104]
- dblp key: journals/corr/abs-2503-04793
- persistent URL: https://dblp.org/rec/journals/corr/abs-2503-04793
Wenjie Qiu, Yi-Chen Li, Xuqin Zhang, Tianyi Zhang, Yihang Zhang, Zongzhang Zhang, Yang Yu:
Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference. CoRR abs/2503.04793 (2025)
[i103]
- dblp key: journals/corr/abs-2503-19267
- persistent URL: https://dblp.org/rec/journals/corr/abs-2503-19267
Songyi Gao, Zuolin Tu, Rong-Jun Qin, Yi-Hao Sun, Xiong-Hui Chen, Yang Yu:
NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios. CoRR abs/2503.19267 (2025)
[i102]
- dblp key: journals/corr/abs-2503-21383
- persistent URL: https://dblp.org/rec/journals/corr/abs-2503-21383
Chengxing Jia, Ziniu Li, Pengyuan Wang, Yi-Chen Li, Zhenyu Hou, Yuxiao Dong, Yang Yu:
Controlling Large Language Model with Latent Actions. CoRR abs/2503.21383 (2025)
[i101]
- dblp key: journals/corr/abs-2505-10010
- persistent URL: https://dblp.org/rec/journals/corr/abs-2505-10010
Jing-Cheng Pang, Kaiyuan Li, Yidi Wang, Si-Hang Yang, Shengyi Jiang, Yang Yu:
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts. CoRR abs/2505.10010 (2025)
[i100]
- dblp key: journals/corr/abs-2505-13425
- persistent URL: https://dblp.org/rec/journals/corr/abs-2505-13425
Zhi-Hao Tan, Zi-Chen Zhao, Hao-Yu Shi, Xin-Yu Zhang, Peng Tan, Yang Yu, Zhi-Hua Zhou:
Learnware of Language Models: Specialized Small Language Models Can Do Big. CoRR abs/2505.13425 (2025)
[i99]
- dblp key: journals/corr/abs-2506-23235
- persistent URL: https://dblp.org/rec/journals/corr/abs-2506-23235
Yi-Chen Li, Tian Xu, Yang Yu, Xuqin Zhang, Xiong-Hui Chen, Zhongxiang Ling, Ningjing Chao, Lei Yuan, Zhi-Hua Zhou:
Generalist Reward Models: Found Inside Large Language Models. CoRR abs/2506.23235 (2025)
[i98]
- dblp key: journals/corr/abs-2509-22402
- persistent URL: https://dblp.org/rec/journals/corr/abs-2509-22402
Nan Tang, Jing-Cheng Pang, Guanlin Li, Chao Qian, Yang Yu:
ReLAM: Learning Anticipation Model for Rewarding Visual Robotic Manipulation. CoRR abs/2509.22402 (2025)
2024
[j47]
- dblp key: journals/chinaf/YuanJLCZY24
- persistent URL: https://dblp.org/rec/journals/chinaf/YuanJLCZY24
Lei Yuan, Tao Jiang, Lihe Li, Feng Chen, Zongzhang Zhang, Yang Yu:
Robust cooperative multi-agent reinforcement learning via multi-view message certification. Sci. China Inf. Sci. 67(1) (2024)
[j46]
- dblp key: journals/chinaf/LuoXLCZY24
- persistent URL: https://dblp.org/rec/journals/chinaf/LuoXLCZY24
Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu:
A survey on model-based reinforcement learning. Sci. China Inf. Sci. 67(2) (2024)
[j45]
- dblp key: journals/chinaf/QinCWYWKZZY24
- persistent URL: https://dblp.org/rec/journals/chinaf/QinCWYWKZZY24
Rongjun Qin, Feng Chen, Tonghan Wang, Lei Yuan, Xiaoran Wu, Yipeng Kang, Zongzhang Zhang, Chongjie Zhang, Yang Yu:
Multi-agent policy transfer via task relationship modeling. Sci. China Inf. Sci. 67(8) (2024)
[j44]
- dblp key: journals/fcsc/JiaZXPZY24
- persistent URL: https://dblp.org/rec/journals/fcsc/JiaZXPZY24
Chengxing Jia, Fuxiang Zhang, Tian Xu, Jing-Cheng Pang, Zongzhang Zhang, Yang Yu:
Model gradient: unified model and policy learning in model-based reinforcement learning. Frontiers Comput. Sci. 18(4): 184339 (2024)
[j43]
- dblp key: journals/fcsc/YuanCZY24
- persistent URL: https://dblp.org/rec/journals/fcsc/YuanCZY24
Lei Yuan, Feng Chen, Zongzhang Zhang, Yang Yu:
Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation. Frontiers Comput. Sci. 18(6) (2024)
[j42]
- dblp key: journals/tciaig/LiuSYL24
- persistent URL: https://dblp.org/rec/journals/tciaig/LiuSYL24
Ruo-Ze Liu, Yanjie Shen, Yang Yu, Tong Lu:
Revisiting of AlphaStar. IEEE Trans. Games 16(2): 317-330 (2024)
[j41]
- dblp key: journals/tdsc/ZhangLLAYYZLLK24
- persistent URL: https://dblp.org/rec/journals/tdsc/ZhangLLAYYZLLK24
Zijian Zhang, Xin Lu, Meng Li, Jincheng An, Yang Yu, Hao Yin, Liehuang Zhu, Yong Liu, Jiamou Liu, Bakh Khoussainov:
A Blockchain-Based Privacy-Preserving Scheme for Sealed-Bid Auction. IEEE Trans. Dependable Secur. Comput. 21(5): 4668-4683 (2024)
[j40]
- dblp key: journals/tii/YangWYZU24
- persistent URL: https://dblp.org/rec/journals/tii/YangWYZU24
Ming Yang, Yiming Wang, Yang Yu, Mingliang Zhou, Leong Hou U:
MixLight: Mixed-Agent Cooperative Reinforcement Learning for Traffic Light Control. IEEE Trans. Ind. Informatics 20(2): 2653-2661 (2024)
[j39]
- dblp key: journals/tmlr/Guan00FZZZZ00024
- persistent URL: https://dblp.org/rec/journals/tmlr/Guan00FZZZZ00024
Cong Guan, Feng Chen, Ke Xue, Chunpeng Fan, Lichao Zhang, Ziqian Zhang, Pengyao Zhao, Zongzhang Zhang, Chao Qian, Lei Yuan, Yang Yu:
One by One, Continual Coordinating with Humans via Hyper-Teammate Identification. Trans. Mach. Learn. Res. 2024 (2024)
[j38]
- dblp key: journals/tois/ZhuQHDYYZ24
- persistent URL: https://dblp.org/rec/journals/tois/ZhuQHDYYZ24
Zhengbang Zhu, Rongjun Qin, Junjie Huang, Xinyi Dai, Yang Yu, Yong Yu, Weinan Zhang:
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems. ACM Trans. Inf. Syst. 42(4): 90:1-90:32 (2024)
[c144]
- dblp key: conf/aaai/Chen0LDZYZ24
- persistent URL: https://dblp.org/rec/conf/aaai/Chen0LDZYZ24
Chao Chen, Jiacheng Xu, Weijian Liao, Hao Ding, Zongzhang Zhang, Yang Yu, Rui Zhao:
Focus-Then-Decide: Segmentation-Assisted Reinforcement Learning. AAAI 2024: 11240-11248
[c143]
- dblp key: conf/aaai/GaoWCKZ024
- persistent URL: https://dblp.org/rec/conf/aaai/GaoWCKZ024
Chenxiao Gao, Chenyang Wu, Mingjun Cao, Rui Kong, Zongzhang Zhang, Yang Yu:
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning. AAAI 2024: 12127-12135
[c142]
- dblp key: conf/aaai/LinWZSYY24
- persistent URL: https://dblp.org/rec/conf/aaai/LinWZSYY24
Haoxin Lin, Hongqiu Wu, Jiaji Zhang, Yihao Sun, Junyin Ye, Yang Yu:
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward. AAAI 2024: 13808-13816
[c141]
- dblp key: conf/aaai/ZhouGZ024
- persistent URL: https://dblp.org/rec/conf/aaai/ZhouGZ024
Renzhe Zhou, Chenxiao Gao, Zongzhang Zhang, Yang Yu:
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations. AAAI 2024: 17132-17140
[c140]
- dblp key: conf/atal/ChenWM0Z024
- persistent URL: https://dblp.org/rec/conf/atal/ChenWM0Z024
Chao Chen, Dawei Wang, Feng Mao, Jiacheng Xu, Zongzhang Zhang, Yang Yu:
Deep Anomaly Detection via Active Anomaly Search. AAMAS 2024: 308-316
[c139]
- dblp key: conf/atal/0003LLJX024
- persistent URL: https://dblp.org/rec/conf/atal/0003LLJX024
Ruifeng Chen, Xu-Hui Liu, Tian-Shuo Liu, Shengyi Jiang, Feng Xu, Yang Yu:
Foresight Distribution Adjustment for Off-policy Reinforcement Learning. AAMAS 2024: 317-325
[c138]
- dblp key: conf/atal/GuanXZLLY024
- persistent URL: https://dblp.org/rec/conf/atal/GuanXZLLY024
Cong Guan, Ruiqi Xue, Ziqian Zhang, Lihe Li, Yi-Chen Li, Lei Yuan, Yang Yu:
Cost-aware Offline Safe Meta Reinforcement Learning with Robust In-Distribution Online Task Adaptation. AAMAS 2024: 743-751
[c137]
- dblp key: conf/atal/JiaZ0GLYZ024
- persistent URL: https://dblp.org/rec/conf/atal/JiaZ0GLYZ024
Chengxing Jia, Fuxiang Zhang, Yi-Chen Li, Chenxiao Gao, Xu-Hui Liu, Lei Yuan, Zongzhang Zhang, Yang Yu:
Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation. AAMAS 2024: 944-953
[c136]
- dblp key: conf/iclr/JiaGYZCXYZZ024
- persistent URL: https://dblp.org/rec/conf/iclr/JiaGYZCXYZZ024
Chengxing Jia, Chenxiao Gao, Hao Yin, Fuxiang Zhang, Xiong-Hui Chen, Tian Xu, Lei Yuan, Zongzhang Zhang, Zhi-Hua Zhou, Yang Yu:
Policy Rehearsing: Training Generalizable Policies for Reinforcement Learning. ICLR 2024
[c135]
- dblp key: conf/iclr/LiX024
- persistent URL: https://dblp.org/rec/conf/iclr/LiX024
Ziniu Li, Tian Xu, Yang Yu:
When is RL better than DPO in RLHF? A Representation and Optimization Perspective. Tiny Papers @ ICLR 2024
[c134]
- dblp key: conf/iclr/LuoXC024
- persistent URL: https://dblp.org/rec/conf/iclr/LuoXC024
Fan-Ming Luo, Tian Xu, Xingchen Cao, Yang Yu:
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning. ICLR 2024
[c133]
- dblp key: conf/iclr/PangWLC0Z024
- persistent URL: https://dblp.org/rec/conf/iclr/PangWLC0Z024
Jing-Cheng Pang, Pengyuan Wang, Kaiyuan Li, Xiong-Hui Chen, Jiacheng Xu, Zongzhang Zhang, Yang Yu:
Language Model Self-improvement by Reinforcement Learning Contemplation. ICLR 2024
[c132]
- dblp key: conf/iclr/ZhangSYLZ024
- persistent URL: https://dblp.org/rec/conf/iclr/ZhangSYLZ024
Zhilong Zhang, Yihao Sun, Junyin Ye, Tian-Shuo Liu, Jiaji Zhang, Yang Yu:
Flow to Better: Offline Preference-based Reinforcement Learning via Preferred Trajectory Generation. ICLR 2024
[c131]
- dblp key: conf/icml/0003CSXL024
- persistent URL: https://dblp.org/rec/conf/icml/0003CSXL024
Ruifeng Chen, Xiong-Hui Chen, Yihao Sun, Siyuan Xiao, Minhui Li, Yang Yu:
Policy-conditioned Environment Models are More Generalizable. ICML 2024
[c130]
- dblp key: conf/icml/0003JHLL024
- persistent URL: https://dblp.org/rec/conf/icml/0003JHLL024
Ruifeng Chen, Chengxing Jia, Zefang Huang, Tian-Shuo Liu, Xu-Hui Liu, Yang Yu:
Offline Transition Modeling via Contrastive Energy Learning. ICML 2024
[c129]
- dblp key: conf/icml/CaoLYXZ024
- persistent URL: https://dblp.org/rec/conf/icml/CaoLYXZ024
Xingchen Cao, Fan-Ming Luo, Junyin Ye, Tian Xu, Zhilong Zhang, Yang Yu:
Limited Preference Aided Imitation Learning from Imperfect Demonstrations. ICML 2024
[c128]
- dblp key: conf/icml/ChenYZ0LSXYY0H024
- persistent URL: https://dblp.org/rec/conf/icml/ChenYZ0LSXYY0H024
Xiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li, XuHui Liu, Haoran Shi, Yu-Yan Xu, Zhihao Ye, Si-Hang Yang, Yang Yu, Anqi Huang, Kai Xu, Zongzhang Zhang:
Deep Demonstration Tracing: Learning Generalizable Imitator Policy for Runtime Imitation from a Single Demonstration. ICML 2024
[c127]
- dblp key: conf/icml/LiXZL00L24
- persistent URL: https://dblp.org/rec/conf/icml/LiXZL00L24
Ziniu Li, Tian Xu, Yushun Zhang, Zhihang Lin, Yang Yu, Ruoyu Sun, Zhi-Quan Luo:
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models. ICML 2024
[c126]
- dblp key: conf/icml/LiuLJ0ZC024
- persistent URL: https://dblp.org/rec/conf/icml/LiuLJ0ZC024
Xu-Hui Liu, Tian-Shuo Liu, Shengyi Jiang, Ruifeng Chen, Zhilong Zhang, Xinwei Chen, Yang Yu:
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning. ICML 2024
[c125]
  dblp key:
  - conf/icml/Zhang000JZ024
  persistent URL:
  - https://dblp.org/rec/conf/icml/Zhang000JZ024
Xinyu Zhang, Wenjie Qiu, Yi-Chen Li, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu:
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics. ICML 2024
[c124]
  dblp key:
  - conf/ijcai/LiCZW0G0024
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/LiCZW0G0024
Lihe Li, Ruotong Chen, Ziqian Zhang, Zhichao Wu, Yi-Chen Li, Cong Guan, Yang Yu, Lei Yuan:
Continual Multi-Objective Reinforcement Learning via Reward Model Rehearsal. IJCAI 2024: 4434-4442
[c123]
  dblp key:
  - conf/kdd/TanLBTZLXZYZ24
  persistent URL:
  - https://dblp.org/rec/conf/kdd/TanLBTZLXZYZ24
Zhi-Hao Tan, Jian-Dong Liu, Xiao-Dong Bi, Peng Tan, Qin-Cheng Zheng, Hai-Tian Liu, Yi Xie, Xiao-Chuan Zou, Yang Yu, Zhi-Hua Zhou:
Beimingwu: A Learnware Dock System. KDD 2024: 5773-5782
[c122]
  dblp key:
  - conf/nips/ChenW0JFYW24
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChenW0JFYW24
Xiong-Hui Chen, Ziyan Wang, Yali Du, Shengyi Jiang, Meng Fang, Yang Yu, Jun Wang:
Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting. NeurIPS 2024
[c121]
  dblp key:
  - conf/nips/Jiang0LGZ024
  persistent URL:
  - https://dblp.org/rec/conf/nips/Jiang0LGZ024
Tao Jiang, Lei Yuan, Lihe Li, Cong Guan, Zongzhang Zhang, Yang Yu:
Multi-Agent Domain Calibration with a Handful of Offline Data. NeurIPS 2024
[c120]
  dblp key:
  - conf/nips/LuoTH024
  persistent URL:
  - https://dblp.org/rec/conf/nips/LuoTH024
Fan-Ming Luo, Zuolin Tu, Zefang Huang, Yang Yu:
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate. NeurIPS 2024
[c119]
  dblp key:
  - conf/nips/PangYLZCT024
  persistent URL:
  - https://dblp.org/rec/conf/nips/PangYLZCT024
Jing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Jiaji Zhang, Xiong-Hui Chen, Nan Tang, Yang Yu:
KALM: Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts. NeurIPS 2024
[c118]
  dblp key:
  - conf/pkdd/XueZLCLYY24
  persistent URL:
  - https://dblp.org/rec/conf/pkdd/XueZLCLYY24
Ruiqi Xue, Ziqian Zhang, Lihe Li, Feng Chen, Yi-Chen Li, Yang Yu, Lei Yuan:
Dynamics Adaptive Safe Reinforcement Learning with a Misspecified Simulator. ECML/PKDD (7) 2024: 74-91
[i97]
  dblp key:
  - journals/corr/abs-2401-14427
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-14427
Zhi-Hao Tan, Jian-Dong Liu, Xiao-Dong Bi, Peng Tan, Qin-Cheng Zheng, Hai-Tian Liu, Yi Xie, Xiao-Chuan Zou, Yang Yu, Zhi-Hua Zhou:
Beimingwu: A Learnware Dock System. CoRR abs/2401.14427 (2024)
[i96]
  dblp key:
  - journals/corr/abs-2402-03719
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-03719
Jing-Cheng Pang, Heng-Bo Fan, Pengyuan Wang, Jiahao Xiao, Nan Tang, Si-Hang Yang, Chengxing Jia, Sheng-Jun Huang, Yang Yu:
Empowering Language Models with Active Inquiry for Deeper Understanding. CoRR abs/2402.03719 (2024)
[i95]
  dblp key:
  - journals/corr/abs-2402-11317
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-11317
Xinyu Zhang, Wenjie Qiu, Yi-Chen Li, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu:
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics. CoRR abs/2402.11317 (2024)
[i94]
  dblp key:
  - journals/corr/abs-2403-07261
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-07261
Chengxing Jia, Fuxiang Zhang, Yi-Chen Li, Chenxiao Gao, Xu-Hui Liu, Lei Yuan, Zongzhang Zhang, Yang Yu:
Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation. CoRR abs/2403.07261 (2024)
[i93]
  dblp key:
  - journals/corr/abs-2404-09248
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-09248
Jing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Jiaji Zhang, Xiong-Hui Chen, Nan Tang, Yang Yu:
Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts. CoRR abs/2404.09248 (2024)
[i92]
  dblp key:
  - journals/corr/abs-2405-15384
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-15384
Fan-Ming Luo, Zuolin Tu, Zefang Huang, Yang Yu:
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate. CoRR abs/2405.15384 (2024)
[i91]
  dblp key:
  - journals/corr/abs-2405-17031
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17031
Haoxin Lin, Yu-Yan Xu, Yihao Sun, Zhilong Zhang, Yi-Chen Li, Chengxing Jia, Junyin Ye, Jiaji Zhang, Yang Yu:
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning. CoRR abs/2405.17031 (2024)
[i90]
  dblp key:
  - journals/corr/abs-2405-17039
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17039
Chengxing Jia, Pengyuan Wang, Ziniu Li, Yi-Chen Li, Zhilong Zhang, Nan Tang, Yang Yu:
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation. CoRR abs/2405.17039 (2024)
[i89]
  dblp key:
  - journals/corr/abs-2407-03856
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-03856
Yi-Chen Li, Fuxiang Zhang, Wenjie Qiu, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu:
Q-Adapter: Training Your LLM Adapter as a Residual Q-Function. CoRR abs/2407.03856 (2024)
[i88]
  dblp key:
  - journals/corr/abs-2407-03964
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-03964
Fuxiang Zhang, Junyou Li, Yi-Chen Li, Zongzhang Zhang, Yang Yu, Deheng Ye:
Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models. CoRR abs/2407.03964 (2024)
[i87]
  dblp key:
  - journals/corr/abs-2407-04451
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-04451
Chen-Xiao Gao, Shengjun Fang, Chenjun Xiao, Yang Yu, Zongzhang Zhang:
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning. CoRR abs/2407.04451 (2024)
[i86]
  dblp key:
  - journals/corr/abs-2407-12448
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-12448
Xu-Hui Liu, Tian-Shuo Liu, Shengyi Jiang, Ruifeng Chen, Zhilong Zhang, Xinwei Chen, Yang Yu:
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning. CoRR abs/2407.12448 (2024)
[i85]
  dblp key:
  - journals/corr/abs-2411-05619
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-05619
Zhilong Zhang, Ruifeng Chen, Junyin Ye, Yihao Sun, Pengyuan Wang, Jingcheng Pang, Kaiyuan Li, Tianshuo Liu, Haoxin Lin, Yang Yu, Zhi-Hua Zhou:
WHALE: Towards Generalizable and Scalable World Models for Embodied Decision-making. CoRR abs/2411.05619 (2024)
[i84]
  dblp key:
  - journals/corr/abs-2411-10809
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-10809
Feng Chen, Fuguang Han, Cong Guan, Lei Yuan, Zhilong Zhang, Yang Yu, Zongzhang Zhang:
Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay. CoRR abs/2411.10809 (2024)
2023
[j37]
  dblp key:
  - journals/nn/YangZYYLG23
  persistent URL:
  - https://dblp.org/rec/journals/nn/YangZYYLG23
Hua Yang, Minghao Zhao, Lei Yuan, Yang Yu, Zhenhua Li, Ming Gu:
Memory-efficient Transformer-based network model for Traveling Salesman Problem. Neural Networks 161: 589-597 (2023)
[j36]
  dblp key:
  - journals/pami/ChenLYLQSY23
  persistent URL:
  - https://dblp.org/rec/journals/pami/ChenLYLQSY23
Xiong-Hui Chen, Fan-Ming Luo, Yang Yu, Qingyang Li, Zhiwei Qin, Wenjie Shang, Jieping Ye:
Offline Model-Based Adaptable Policy Learning for Decision-Making in Out-of-Support Regions. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15260-15274 (2023)
[j35]
  dblp key:
  - journals/tkde/HuzhangPGLSZLDZ23
  persistent URL:
  - https://dblp.org/rec/journals/tkde/HuzhangPGLSZLDZ23
Guangda Huzhang, Zhen-Jia Pang, Yongqing Gao, Yawen Liu, Weijie Shen, Wen-Ji Zhou, Qianying Lin, Qing Da, Anxiang Zeng, Han Yu, Yang Yu, Zhi-Hua Zhou:
AliExpress Learning-to-Rank: Maximizing Online Model Performance Without Going Online. IEEE Trans. Knowl. Data Eng. 35(2): 1214-1226 (2023)
[j34]
  dblp key:
  - journals/tnn/WangYJ23
  persistent URL:
  - https://dblp.org/rec/journals/tnn/WangYJ23
Han Wang, Yang Yu, Yuan Jiang:
Fully Decentralized Multiagent Communication via Causal Inference. IEEE Trans. Neural Networks Learn. Syst. 34(12): 10193-10202 (2023)
[j33]
  dblp key:
  - journals/tog/ZhaoPYX23
  persistent URL:
  - https://dblp.org/rec/journals/tog/ZhaoPYX23
Hang Zhao, Zherong Pan, Yang Yu, Kai Xu:
Learning Physically Realizable Skills for Online Packing of General 3D Shapes. ACM Trans. Graph. 42(5): 165:1-165:21 (2023)
[c117]
  dblp key:
  - conf/aaai/00010WYYZ23
  persistent URL:
  - https://dblp.org/rec/conf/aaai/00010WYYZ23
Yang Yu, Qi Liu, Likang Wu, Runlong Yu, Sanshi Lei Yu, Zaixi Zhang:
Untargeted Attack against Federated Recommendation Systems via Poisonous Item Embeddings and the Defense. AAAI 2023: 4854-4863
[c116]
  dblp key:
  - conf/aaai/LiaoZ023
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LiaoZ023
Weijian Liao, Zongzhang Zhang, Yang Yu:
Policy-Independent Behavioral Metric-Based Representation for Deep Reinforcement Learning. AAAI 2023: 8746-8754
[c115]
  dblp key:
  - conf/aaai/YuanZ0YCGL0023
  persistent URL:
  - https://dblp.org/rec/conf/aaai/YuanZ0YCGL0023
Lei Yuan, Ziqian Zhang, Ke Xue, Hao Yin, Feng Chen, Cong Guan, Lihe Li, Chao Qian, Yang Yu:
Robust Multi-Agent Coordination via Evolutionary Generation of Auxiliary Adversarial Attackers. AAAI 2023: 11753-11762
[c114]
  dblp key:
  - conf/aaai/ChenWMZ023
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ChenWMZ023
Chao Chen, Dawei Wang, Feng Mao, Zongzhang Zhang, Yang Yu:
Deep Anomaly Detection and Search via Reinforcement Learning (Student Abstract). AAAI 2023: 16180-16181
[c113]
  dblp key:
  - conf/aaai/LiSZMZ023
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LiSZMZ023
Yi-Chen Li, Wen-Jie Shen, Boyu Zhang, Feng Mao, Zongzhang Zhang, Yang Yu:
Learning Generalizable Batch Active Learning Strategies via Deep Q-networks (Student Abstract). AAAI 2023: 16258-16259
[c112]
  dblp key:
  - conf/aaai/WangYMZ0L23
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WangYMZ0L23
Aoran Wang, Hongyang Yang, Feng Mao, Zongzhang Zhang, Yang Yu, Xiaoyang Liu:
Anti-drifting Feature Selection via Deep Reinforcement Learning (Student Abstract). AAAI 2023: 16356-16357
[c111]
  dblp key:
  - conf/aaai/ZhouZ023
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhouZ023
Renzhe Zhou, Zongzhang Zhang, Yang Yu:
Model-Based Offline Weighted Policy Optimization (Student Abstract). AAAI 2023: 16392-16393
[c110]
  dblp key:
  - conf/atal/ZhangCY0Z23
  persistent URL:
  - https://dblp.org/rec/conf/atal/ZhangCY0Z23
Shaowei Zhang, Jiahan Cao, Lei Yuan, Yang Yu, De-Chuan Zhan:
Self-Motivated Multi-Agent Exploration. AAMAS 2023: 476-484
[c109]
  dblp key:
  - conf/atal/LiuXZLJ0Z023
  persistent URL:
  - https://dblp.org/rec/conf/atal/LiuXZLJ0Z023
Xu-Hui Liu, Feng Xu, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang, Ruifeng Chen, Zongzhang Zhang, Yang Yu:
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement. AAMAS 2023: 1276-1284
[c108]
  dblp key:
  - conf/dai2/YuanLZCZG0Z23
  persistent URL:
  - https://dblp.org/rec/conf/dai2/YuanLZCZG0Z23
Lei Yuan, Lihe Li, Ziqian Zhang, Feng Chen, Tianyi Zhang, Cong Guan, Yang Yu, Zhi-Hua Zhou:
Learning to Coordinate with Anyone. DAI 2023: 4:1-4:9
[c107]
  dblp key:
  - conf/ecai/LinSZY23
  persistent URL:
  - https://dblp.org/rec/conf/ecai/LinSZY23
Haoxin Lin, Yihao Sun, Jiaji Zhang, Yang Yu:
Model-Based Reinforcement Learning with Multi-Step Plan Value Estimation. ECAI 2023: 1481-1488
[c106]
  dblp key:
  - conf/ecai/LuQWLZZ023
  persistent URL:
  - https://dblp.org/rec/conf/ecai/LuQWLZZ023
Huakang Lu, Hong Qian, Yupeng Wu, Ziqi Liu, Ya-Lin Zhang, Aimin Zhou, Yang Yu:
Degradation-Resistant Offline Optimization via Accumulative Risk Control. ECAI 2023: 1609-1616
[c105]
  dblp key:
  - conf/icde/ChenH0LQSYM23
  persistent URL:
  - https://dblp.org/rec/conf/icde/ChenH0LQSYM23
Xiong-Hui Chen, Bowei He, Yang Yu, Qingyang Li, Zhiwei Tony Qin, Wenjie Shang, Jieping Ye, Chen Ma:
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems. ICDE 2023: 3389-3402
[c104]
  dblp key:
  - conf/iclr/ZhangJLY0Z23
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZhangJLY0Z23
Fuxiang Zhang, Chengxing Jia, Yi-Chen Li, Lei Yuan, Yang Yu, Zongzhang Zhang:
Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data. ICLR 2023
[c103]
  dblp key:
  - conf/icml/RanLZZ023
  persistent URL:
  - https://dblp.org/rec/conf/icml/RanLZZ023
Yuhang Ran, Yi-Chen Li, Fuxiang Zhang, Zongzhang Zhang, Yang Yu:
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning. ICML 2023: 28701-28717
[c102]
  dblp key:
  - conf/icml/SunZJLYY23
  persistent URL:
  - https://dblp.org/rec/conf/icml/SunZJLYY23
Yihao Sun, Jiaji Zhang, Chengxing Jia, Haoxin Lin, Junyin Ye, Yang Yu:
Model-Bellman Inconsistency for Model-based Offline Reinforcement Learning. ICML 2023: 33177-33194
[c101]
  dblp key:
  - conf/iros/PangYCY0MGYH23
  persistent URL:
  - https://dblp.org/rec/conf/iros/PangYCY0MGYH23
Jing-Cheng Pang, Si-Hang Yang, Xiong-Hui Chen, Xinyu Yang, Yang Yu, Mas Ma, Ziqi Guo, Howard Yang, Bill Huang:
Object-Oriented Option Framework for Robotics Manipulation in Clutter. IROS 2023: 1230-1237
[c100]
  dblp key:
  - conf/kdd/0003CZYZ023
  persistent URL:
  - https://dblp.org/rec/conf/kdd/0003CZYZ023
Jiacheng Xu, Chao Chen, Fuxiang Zhang, Lei Yuan, Zongzhang Zhang, Yang Yu:
Internal Logical Induction for Pixel-Symbolic Reinforcement Learning. KDD 2023: 2825-2837
[c99]
  dblp key:
  - conf/nips/Chen0ZYCWWQWDH23
  persistent URL:
  - https://dblp.org/rec/conf/nips/Chen0ZYCWWQWDH23
Xiong-Hui Chen, Yang Yu, Zhengmao Zhu, Zhihua Yu, Zhenjun Chen, Chenghe Wang, Yinan Wu, Rong-Jun Qin, Hongqiu Wu, Ruijin Ding, Fangsheng Huang:
Adversarial Counterfactual Environment Model Learning. NeurIPS 2023
[c98]
  dblp key:
  - conf/nips/LiXQ0L23
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiXQ0L23
Ziniu Li, Tian Xu, Zeyu Qin, Yang Yu, Zhi-Quan Luo:
Imitation Learning from Imperfection: Theoretical Justifications and Algorithms. NeurIPS 2023
[c97]
  dblp key:
  - conf/nips/LiuHZTG0023
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuHZTG0023
Yuren Liu, Biwei Huang, Zhengmao Zhu, Hong-Long Tian, Mingming Gong, Yang Yu, Kun Zhang:
Learning World Models with Identifiable Factorization. NeurIPS 2023
[c96]
  dblp key:
  - conf/nips/PangYYC023
  persistent URL:
  - https://dblp.org/rec/conf/nips/PangYYC023
Jing-Cheng Pang, Xinyu Yang, Si-Hang Yang, Xiong-Hui Chen, Yang Yu:
Natural Language Instruction-following with Task-related Language Development and Translation. NeurIPS 2023
[c95]
  dblp key:
  - conf/uai/XuL0L23
  persistent URL:
  - https://dblp.org/rec/conf/uai/XuL0L23
Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo:
Provably Efficient Adversarial Imitation Learning with Unknown Transitions. UAI 2023: 2367-2378
[c94]
  dblp key:
  - conf/uai/ZhangYL0JG0023
  persistent URL:
  - https://dblp.org/rec/conf/uai/ZhangYL0JG0023
Ziqian Zhang, Lei Yuan, Lihe Li, Ke Xue, Chengxing Jia, Cong Guan, Chao Qian, Yang Yu:
Fast Teammate Adaptation in the Presence of Sudden Policy Change. UAI 2023: 2465-2476
[i83]
  dblp key:
  - journals/corr/abs-2301-02083
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-02083
Shaowei Zhang, Jiahan Cao, Lei Yuan, Yang Yu, De-Chuan Zhan:
Self-Motivated Multi-Agent Exploration. CoRR abs/2301.02083 (2023)
[i82]
  dblp key:
  - journals/corr/abs-2301-11687
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-11687
Ziniu Li, Tian Xu, Yang Yu, Zhi-Quan Luo:
Theoretical Analysis of Offline Imitation With Supplementary Dataset. CoRR abs/2301.11687 (2023)
[i81]
  dblp key:
  - journals/corr/abs-2302-09368
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-09368
Jing-Cheng Pang, Xinyu Yang, Si-Hang Yang, Yang Yu:
Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation. CoRR abs/2302.09368 (2023)
[i80]
  dblp key:
  - journals/corr/abs-2302-09605
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-09605
Cong Guan, Feng Chen, Lei Yuan, Zongzhang Zhang, Yang Yu:
Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning. CoRR abs/2302.09605 (2023)
[i79]
  dblp key:
  - journals/corr/abs-2303-02073
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-02073
Xu-Hui Liu, Feng Xu, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang, Ruifeng Chen, Zongzhang Zhang, Yang Yu:
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement. CoRR abs/2303.02073 (2023)
[i78]
  dblp key:
  - journals/corr/abs-2303-05458
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-05458
Zheng-Mao Zhu, Yu-Ren Liu, Hong-Long Tian, Yang Yu, Kun Zhang:
Beware of Instantaneous Dependence in Reinforcement Learning. CoRR abs/2303.05458 (2023)
[i77]
  dblp key:
  - journals/corr/abs-2305-04832
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-04832
Xiong-Hui Chen, Bowei He, Yang Yu, Qingyang Li, Zhiwei Tony Qin, Wenjie Shang, Jieping Ye, Chen Ma:
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems. CoRR abs/2305.04832 (2023)
[i76]
  dblp key:
  - journals/corr/abs-2305-05116
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-05116
Lei Yuan, Feng Chen, Zongzhang Zhang, Yang Yu:
Communication-Robust Multi-Agent Learning by Adaptable Auxiliary Multi-Agent Adversary Generation. CoRR abs/2305.05116 (2023)
[i75]
  dblp key:
  - journals/corr/abs-2305-05909
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-05909
Lei Yuan, Ziqian Zhang, Ke Xue, Hao Yin, Feng Chen, Cong Guan, Lihe Li, Chao Qian, Yang Yu:
Robust Multi-Agent Coordination via Evolutionary Generation of Auxiliary Adversarial Attackers. CoRR abs/2305.05909 (2023)
[i74]
  dblp key:
  - journals/corr/abs-2305-05911
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-05911
Ziqian Zhang, Lei Yuan, Lihe Li, Ke Xue, Chengxing Jia, Cong Guan, Chao Qian, Yang Yu:
Fast Teammate Adaptation in the Presence of Sudden Policy Change. CoRR abs/2305.05911 (2023)
[i73]
  dblp key:
  - journals/corr/abs-2305-13936
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13936
Lei Yuan, Tao Jiang, Lihe Li, Feng Chen, Zongzhang Zhang, Yang Yu:
Robust Multi-agent Communication via Multi-view Message Certification. CoRR abs/2305.13936 (2023)
[i72]
  dblp key:
  - journals/corr/abs-2305-13937
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13937
Lei Yuan, Lihe Li, Ziqian Zhang, Fuxiang Zhang, Cong Guan, Yang Yu:
Multi-agent Continual Coordination via Progressive Task Contextualization. CoRR abs/2305.13937 (2023)
[i71]
  dblp key:
  - journals/corr/abs-2305-14483
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-14483
Jing-Cheng Pang, Pengyuan Wang, Kaiyuan Li, Xiong-Hui Chen, Jiacheng Xu, Zongzhang Zhang, Yang Yu:
Language Model Self-improvement by Reinforcement Learning Contemplation. CoRR abs/2305.14483 (2023)
[i70]
  dblp key:
  - journals/corr/abs-2306-06561
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-06561
Yu-Ren Liu, Biwei Huang, Zheng-Mao Zhu, Hong-Long Tian, Mingming Gong, Yang Yu, Kun Zhang:
Learning World Models with Identifiable Factorization. CoRR abs/2306.06561 (2023)
[i69]
  dblp key:
  - journals/corr/abs-2306-06563
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-06563
Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo:
Provably Efficient Adversarial Imitation Learning with Unknown Transitions. CoRR abs/2306.06563 (2023)
[i68]
  dblp key:
  - journals/corr/abs-2306-06569
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-06569
Yuhang Ran, Yi-Chen Li, Fuxiang Zhang, Zongzhang Zhang, Yang Yu:
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning. CoRR abs/2306.06569 (2023)
[i67]
  dblp key:
  - journals/corr/abs-2309-05915
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-05915
Chenxiao Gao, Chenyang Wu, Mingjun Cao, Rui Kong, Zongzhang Zhang, Yang Yu:
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning. CoRR abs/2309.05915 (2023)
[i66]
  dblp key:
  - journals/corr/abs-2309-12633
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-12633
Lei Yuan, Lihe Li, Ziqian Zhang, Feng Chen, Tianyi Zhang, Cong Guan, Yang Yu, Zhi-Hua Zhou:
Learning to Coordinate with Anyone. CoRR abs/2309.12633 (2023)
[i65]
  dblp key:
  - journals/corr/abs-2310-05422
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-05422
Fan-Ming Luo, Tian Xu, Xingchen Cao, Yang Yu:
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning. CoRR abs/2310.05422 (2023)
[i64]
  dblp key:
  - journals/corr/abs-2310-05712
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-05712
Xiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li, Haoran Shi, Yu-Yan Xu, Zhihao Ye, Si-Hang Yang, Anqi Huang, Kai Xu, Zongzhang Zhang, Yang Yu:
Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments. CoRR abs/2310.05712 (2023)
[i63]
  dblp key:
  - journals/corr/abs-2310-10505
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-10505
Ziniu Li, Tian Xu, Yushun Zhang, Yang Yu, Ruoyu Sun, Zhi-Quan Luo:
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models. CoRR abs/2310.10505 (2023)
[i62]
  dblp key:
  - journals/corr/abs-2311-00416
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-00416
Cong Guan, Lichao Zhang, Chunpeng Fan, Yichen Li, Feng Chen, Lihe Li, Yunjia Tian, Lei Yuan, Yang Yu:
Efficient Human-AI Coordination via Preparatory Language-based Convention. CoRR abs/2311.00416 (2023)
[i61]
  dblp key:
  - journals/corr/abs-2312-01058
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-01058
Lei Yuan, Ziqian Zhang, Lihe Li, Cong Guan, Yang Yu:
A Survey of Progress on Cooperative Multi-agent Reinforcement Learning in Open Environment. CoRR abs/2312.01058 (2023)
[i60]
  dblp key:
  - journals/corr/abs-2312-10584
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-10584
Ziniu Li, Tian Xu, Yang Yu:
Policy Optimization in RLHF: The Impact of Out-of-preference Data. CoRR abs/2312.10584 (2023)
[i59]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-10642
Haoxin Lin, Hongqiu Wu, Jiaji Zhang, Yihao Sun, Junyin Ye, Yang Yu:
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward. CoRR abs/2312.10642 (2023)
[i58]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-15909
Renzhe Zhou, Chenxiao Gao, Zongzhang Zhang, Yang Yu:
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations. CoRR abs/2312.15909 (2023)
2022
[j32]
  persistent URL:
  - https://dblp.org/rec/journals/chinaf/LiuHQQY22
Yu-Ren Liu, Yi-Qi Hu, Hong Qian, Chao Qian, Yang Yu:
ZOOpt: a toolbox for derivative-free optimization. Sci. China Inf. Sci. 65(10) (2022)
[j31]
  persistent URL:
  - https://dblp.org/rec/journals/jair/LiuPMWYL22
Ruo-Ze Liu, Zhen-Jia Pang, Zhou-Yu Meng, Wenhai Wang, Yang Yu, Tong Lu:
On Efficient Reinforcement Learning for Full-length Game of StarCraft II. J. Artif. Intell. Res. 75: 213-260 (2022)
[j30]
  persistent URL:
  - https://dblp.org/rec/journals/ml/ZhangLY22
Yi-Feng Zhang, Fan-Ming Luo, Yang Yu:
Improve generated adversarial imitation learning with reward variance regularization. Mach. Learn. 111(3): 977-995 (2022)
[j29]
  persistent URL:
  - https://dblp.org/rec/journals/pami/HuLLY22
Yi-Qi Hu, Xu-Hui Liu, Shu-Qiao Li, Yang Yu:
Cascaded Algorithm Selection With Extreme-Region UCB Bandit. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6782-6794 (2022)
[j28]
  persistent URL:
  - https://dblp.org/rec/journals/pami/XuLY22
Tian Xu, Ziniu Li, Yang Yu:
Error Bounds of Imitating Policies and Environments for Reinforcement Learning. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6968-6980 (2022)
[j27]
  persistent URL:
  - https://dblp.org/rec/journals/tciaig/LiuGJYPXWL22
Ruo-Ze Liu, Haifeng Guo, Xiaozhong Ji, Yang Yu, Zhen-Jia Pang, Zitai Xiao, Yuzhou Wu, Tong Lu:
Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning. IEEE Trans. Games 14(2): 294-307 (2022)
[j26]
  persistent URL:
  - https://dblp.org/rec/journals/tnn/JinXWZZTY22
Xin Jin, Yanping Xie, Xiu-Shen Wei, Borui Zhao, Yongshun Zhang, Xiaoyang Tan, Yang Yu:
A Lightweight Encoder-Decoder Path for Deep Residual Networks. IEEE Trans. Neural Networks Learn. Syst. 33(2): 866-878 (2022)
[j25]
  persistent URL:
  - https://dblp.org/rec/journals/vc/YuNLX22
Yang Yu, Chengjie Niu, Jun Li, Kai Xu:
Multi-view 2D-3D alignment with hybrid bundle adjustment for visual metrology. Vis. Comput. 38(4): 1483-1494 (2022)
[c93]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LuoJ0ZZ22
Fan-Ming Luo, Shengyi Jiang, Yang Yu, Zongzhang Zhang, Yi-Feng Zhang:
Adapt to Environment Sudden Changes by Learning a Context Sensitive Policy. AAAI 2022: 7637-7646
[c92]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhuJL0Z22
Zheng-Mao Zhu, Shengyi Jiang, Yu-Ren Liu, Yang Yu, Kun Zhang:
Invariant Action Effect Model for Reinforcement Learning. AAAI 2022: 9260-9268
[c91]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/YuanWZWZ0Z22
Lei Yuan, Jianhao Wang, Fuxiang Zhang, Chenghe Wang, Zongzhang Zhang, Yang Yu, Chongjie Zhang:
Multi-Agent Incentive Communication via Decentralized Teammate Modeling. AAAI 2022: 9466-9474
[c90]
  persistent URL:
  - https://dblp.org/rec/conf/cscloud/YuJYGZ22
Yang Yu, Rui Jin, Hao Yin, Keke Gai, Zijian Zhang:
A Searchable Re-encryption-based Scheme for Massive Data Transactions. CSCloud/EdgeCom 2022: 135-140
[c89]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/00010DY0Z22
Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang:
Context-Aware Sparse Deep Coordination Graphs. ICLR 2022
[c88]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LiZWYZ22
Siyuan Li, Jin Zhang, Jianhao Wang, Yang Yu, Chongjie Zhang:
Active Hierarchical Exploration with Stable Subgoal Representation Learning. ICLR 2022
[c87]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZhaoY022
Hang Zhao, Yang Yu, Kai Xu:
Learning Efficient Online 3D Bin Packing on Packing Configuration Trees. ICLR 2022
[c86]
  persistent URL:
  - https://dblp.org/rec/conf/icml/QianLSZ022
Hong Qian, Xu-Hui Liu, Chen-Xi Su, Aimin Zhou, Yang Yu:
The Teaching Dimension of Regularized Kernel Learners. ICML 2022: 17984-18002
[c85]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/XueYZ022
Di Xue, Lei Yuan, Zongzhang Zhang, Yang Yu:
Efficient Multi-Agent Communication via Shapley Message Value. IJCAI 2022: 578-584
[c84]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/YuanWWZCGZZY22
Lei Yuan, Chenghe Wang, Jianhao Wang, Fuxiang Zhang, Feng Chen, Cong Guan, Zongzhang Zhang, Chongjie Zhang, Yang Yu:
Multi-Agent Concentrative Coordination with Decentralized Task Representation. IJCAI 2022: 599-605
[c83]
  persistent URL:
  - https://dblp.org/rec/conf/nips/0001XYL0Z022
Ke Xue, Jiacheng Xu, Lei Yuan, Miqing Li, Chao Qian, Zongzhang Zhang, Yang Yu:
Multi-agent Dynamic Algorithm Configuration. NeurIPS 2022
[c82]
  persistent URL:
  - https://dblp.org/rec/conf/nips/GuanCYWYZ022
Cong Guan, Feng Chen, Lei Yuan, Chenghe Wang, Hao Yin, Zongzhang Zhang, Yang Yu:
Efficient Multi-agent Communication via Self-supervised Information Aggregation. NeurIPS 2022
[c81]
  persistent URL:
  - https://dblp.org/rec/conf/nips/QinZGCL0022
Rongjun Qin, Xingyuan Zhang, Songyi Gao, Xiong-Hui Chen, Zewen Li, Weinan Zhang, Yang Yu:
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning. NeurIPS 2022
[c80]
  persistent URL:
  - https://dblp.org/rec/conf/nips/WuLZY22
Chenyang Wu, Tianci Li, Zongzhang Zhang, Yang Yu:
Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning. NeurIPS 2022
[e4]
  persistent URL:
  - https://dblp.org/rec/conf/pakdd/2022-1
João Gama, Tianrui Li, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng:
Advances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Chengdu, China, May 16-19, 2022, Proceedings, Part I. Lecture Notes in Computer Science 13280, Springer 2022, ISBN 978-3-031-05932-2
[e3]
  persistent URL:
  - https://dblp.org/rec/conf/pakdd/2022-2
João Gama, Tianrui Li, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng:
Advances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Chengdu, China, May 16-19, 2022, Proceedings, Part II. Lecture Notes in Computer Science 13281, Springer 2022, ISBN 978-3-031-05935-3
[e2]
  persistent URL:
  - https://dblp.org/rec/conf/pakdd/2022-3
João Gama, Tianrui Li, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng:
Advances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Chengdu, China, May 16-19, 2022, Proceedings, Part III. Lecture Notes in Computer Science 13282, Springer 2022, ISBN 978-3-031-05980-3
[i57]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-02468
Ziniu Li, Tian Xu, Yang Yu, Zhi-Quan Luo:
Rethinking ValueDice: Does It Really Improve Performance? CoRR abs/2202.02468 (2022)
[i56]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-04482
Rongjun Qin, Feng Chen, Tonghan Wang, Lei Yuan, Xiaoran Wu, Zongzhang Zhang, Chongjie Zhang, Yang Yu:
Multi-Agent Policy Transfer via Task Relationship Modeling. CoRR abs/2203.04482 (2022)
[i55]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-11489
Ziniu Li, Tian Xu, Yang Yu:
A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle. CoRR abs/2203.11489 (2022)
[i54]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-00238
Fan-Ming Luo, Xingchen Cao, Yang Yu:
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble. CoRR abs/2206.00238 (2022)
[i53]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-01474
Zheng-Mao Zhu, Xiong-Hui Chen, Hong-Long Tian, Kun Zhang, Yang Yu:
Offline Reinforcement Learning with Causal Structured World Models. CoRR abs/2206.01474 (2022)
[i52]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-02000
Xue-Kun Jin, Xu-Hui Liu, Shengyi Jiang, Yang Yu:
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning. CoRR abs/2206.02000 (2022)
[i51]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-04890
Xiong-Hui Chen, Yang Yu, Zheng-Mao Zhu, Zhihua Yu, Zhenjun Chen, Chenghe Wang, Yinan Wu, Hongqiu Wu, Rong-Jun Qin, Ruijin Ding, Fangsheng Huang:
Adversarial Counterfactual Environment Model Learning. CoRR abs/2206.04890 (2022)
[i50]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-09328
Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu:
A Survey on Model-based Reinforcement Learning. CoRR abs/2206.09328 (2022)
[i49]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-01899
Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo:
Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis. CoRR abs/2208.01899 (2022)
[i48]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-04957
Ke Xue, Yutong Wang, Lei Yuan, Cong Guan, Chao Qian, Yang Yu:
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution. CoRR abs/2208.04957 (2022)
[i47]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-09452
Rong-Jun Qin, Fan-Ming Luo, Hong Qian, Yang Yu:
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games. CoRR abs/2208.09452 (2022)
[i46]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-05530
Haoxin Lin, Yihao Sun, Jiaji Zhang, Yang Yu:
Model-based Reinforcement Learning with Multi-step Plan Value Estimation. CoRR abs/2209.05530 (2022)
[i45]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-11553
Ruo-Ze Liu, Zhen-Jia Pang, Zhou-Yu Meng, Wenhai Wang, Yang Yu, Tong Lu:
On Efficient Reinforcement Learning for Full-length Game of StarCraft II. CoRR abs/2209.11553 (2022)
[i44]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-05662
Zhengbang Zhu, Rongjun Qin, Junjie Huang, Xinyi Dai, Yang Yu, Yong Yu, Weinan Zhang:
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems. CoRR abs/2210.05662 (2022)
[i43]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-06835
Ke Xue, Jiacheng Xu, Lei Yuan, Miqing Li, Chao Qian, Zongzhang Zhang, Yang Yu:
Multi-agent Dynamic Algorithm Configuration. CoRR abs/2210.06835 (2022)
[i42]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-02094
Hang Zhao, Zherong Pan, Yang Yu, Kai Xu:
Learning Physically Realizable Skills for Online Packing of General 3D Shapes. CoRR abs/2212.02094 (2022)
[i41]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-05399
Yang Yu, Qi Liu, Likang Wu, Runlong Yu, Sanshi Lei Yu, Zaixi Zhang:
Untargeted Attack against Federated Recommendation Systems via Poisonous Item Embeddings and the Defense. CoRR abs/2212.05399 (2022)
2021
[j24]
  persistent URL:
  - https://dblp.org/rec/journals/aim/ZengYDZYZM21
Anxiang Zeng, Han Yu, Qing Da, Yusen Zhan, Yang Yu, Jingren Zhou, Chunyan Miao:
Improving Search Engine Efficiency through Contextual Factor Selection. AI Mag. 42(2): 50-58 (2021)
[j23]
  persistent URL:
  - https://dblp.org/rec/journals/algorithmica/QianBYTY21
Chao Qian, Chao Bian, Yang Yu, Ke Tang, Xin Yao:
Analysis of Noisy Evolutionary Optimization When Sampling Fails. Algorithmica 83(4): 940-975 (2021)
[j22]
  persistent URL:
  - https://dblp.org/rec/journals/chinaf/Bian00T21
Chao Bian, Chao Qian, Yang Yu, Ke Tang:
On the robustness of median sampling in noisy evolutionary optimization. Sci. China Inf. Sci. 64(5) (2021)
[j21]
  persistent URL:
  - https://dblp.org/rec/journals/fac/BuLXQHYCL21
Lei Bu, Yongjuan Liang, Zhunyi Xie, Hong Qian, Yi-Qi Hu, Yang Yu, Xin Chen, Xuandong Li:
Machine learning steered symbolic execution framework for complex software code. Formal Aspects Comput. 33(3): 301-323 (2021)
[j20]
  persistent URL:
  - https://dblp.org/rec/journals/ml/ShangLQ0MY21
Wenjie Shang, Qingyang Li, Zhiwei (Tony) Qin, Yang Yu, Yiping Meng, Jieping Ye:
Partially observable environment estimation with uplift inference for reinforcement learning based recommendation. Mach. Learn. 110(9): 2603-2640 (2021)
[j19]
  persistent URL:
  - https://dblp.org/rec/journals/pami/EscalanteYTPQYH21
Hugo Jair Escalante, Quanming Yao, Wei-Wei Tu, Nelishia Pillay, Rong Qu, Yang Yu, Neil Houlsby:
Guest Editorial: Automated Machine Learning. IEEE Trans. Pattern Anal. Mach. Intell. 43(9): 2887-2890 (2021)
[c79]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WuKYKZ00L21
Chenyang Wu, Rui Kong, Guoyu Yang, Xianghan Kong, Zongzhang Zhang, Yang Yu, Dong Li, Wulong Liu:
LB-DESPOT: Efficient Online POMDP Planning Considering Lower Bound in Action Selection (Student Abstract). AAAI 2021: 15927-15928
[c78]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/XuJYZYLLL21
Feng Xu, Shengyi Jiang, Hao Yin, Zongzhang Zhang, Yang Yu, Ming Li, Dong Li, Wulong Liu:
Enhancing Context-Based Meta-Reinforcement Learning Algorithms via An Efficient Task Encoder (Student Abstract). AAAI 2021: 15937-15938
[c77]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WangRLYZ21
Jianhao Wang, Zhizhou Ren, Terry Liu, Yang Yu, Chongjie Zhang:
QPLEX: Duplex Dueling Multi-Agent Q-Learning. ICLR 2021
[c76]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/0001Q0021
Chao Bian, Chao Qian, Frank Neumann, Yang Yu:
Fast Pareto Optimization for Subset Selection with Dynamic Cost Constraints. IJCAI 2021: 2191-2197
[c75]
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/ShenYHGHY21
Weijie Shen, Lei Yuan, Junfu Huang, Songyi Gao, Yuyang Huang, Yang Yu:
Sequential and Dynamic constraint Contrastive Learning for Reinforcement Learning. IJCNN 2021: 1-9
[c74]
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChenYLLQSY21
Xiong-Hui Chen, Yang Yu, Qingyang Li, Fan-Ming Luo, Zhiwei (Tony) Qin, Wenjie Shang, Jieping Ye:
Offline Model-based Adaptable Policy Learning. NeurIPS 2021: 8432-8443
[c73]
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChenJXZY21
Xiong-Hui Chen, Shengyi Jiang, Feng Xu, Zongzhang Zhang, Yang Yu:
Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning. NeurIPS 2021: 12520-12532
[c72]
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuXPJXY21
Xu-Hui Liu, Zhenghai Xue, Jing-Cheng Pang, Shengyi Jiang, Feng Xu, Yang Yu:
Regret Minimization Experience Replay in Off-Policy Reinforcement Learning. NeurIPS 2021: 17604-17615
[c71]
  persistent URL:
  - https://dblp.org/rec/conf/nips/WuYZYLLH21
Chenyang Wu, Guoyu Yang, Zongzhang Zhang, Yang Yu, Dong Li, Wulong Liu, Jianye Hao:
Adaptive Online Packing-guided Search for POMDPs. NeurIPS 2021: 28419-28430
[i40]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-00714
Rongjun Qin, Songyi Gao, Xingyuan Zhang, Zhen Xu, Shengkai Huang, Zewen Li, Weinan Zhang, Yang Yu:
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning. CoRR abs/2102.00714 (2021)
[i39]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-05710
Hong Qian, Yang Yu:
Derivative-Free Reinforcement Learning: A Review. CoRR abs/2102.05710 (2021)
[i38]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-06890
Ruo-Ze Liu, Wenhai Wang, Yanjie Shen, Zhiqi Li, Yang Yu, Tong Lu:
An Introduction of mini-AlphaStar. CoRR abs/2104.06890 (2021)
[i37]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-07253
Zhenghai Xue, Xu-Hui Liu, Jing-Cheng Pang, Shengyi Jiang, Feng Xu, Yang Yu:
Regret Minimization Experience Replay. CoRR abs/2105.07253 (2021)
[i36]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-08666
Jing-Cheng Pang, Tian Xu, Shengyi Jiang, Yu-Ren Liu, Yang Yu:
Sparsity Prior Regularized Q-learning for Sparse Action Tasks. CoRR abs/2105.08666 (2021)
[i35]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-02886
Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang:
Context-Aware Sparse Deep Coordination Graphs. CoRR abs/2106.02886 (2021)
[i34]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-10424
Tian Xu, Ziniu Li, Yang Yu:
Nearly Minimax Optimal Adversarial Imitation Learning with Known and Unknown Transitions. CoRR abs/2106.10424 (2021)
[i33]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-07693
Yongqing Gao, Guangda Huzhang, Weijie Shen, Yawen Liu, Wen-Ji Zhou, Qing Da, Dan Shen, Yang Yu:
Imitate TheWorld: A Search Engine Simulation Platform. CoRR abs/2107.07693 (2021)
[i32]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-06898
Zhao-Hua Li, Yang Yu, Yingfeng Chen, Ke Chen, Zhipeng Hu, Changjie Fan:
Neural-to-Tree Policy Distillation with Policy Improvement Criterion. CoRR abs/2108.06898 (2021)
[i31]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-12508
Jiahan Cao, Lei Yuan, Jianhao Wang, Shaowei Zhang, Chongjie Zhang, Yang Yu, De-Chuan Zhan:
LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates. CoRR abs/2109.12508 (2021)
[i30]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-13964
Qixin Zhang, Wenbing Ye, Zaiyi Chen, Haoyuan Hu, Enhong Chen, Yang Yu:
Online Allocation with Two-sided Resource Constraints. CoRR abs/2112.13964 (2021)
2020
[j18]
  persistent URL:
  - https://dblp.org/rec/journals/cgf/NiuYBLX20
Chengjie Niu, Yang Yu, Zhenwei Bian, Jun Li, Kai Xu:
Weakly Supervised Part-wise 3D Shape Reconstruction from Single-View RGB Images. Comput. Graph. Forum 39(7): 447-457 (2020)
[j17]
  persistent URL:
  - https://dblp.org/rec/journals/mlc/HuY20
Yi-Qi Hu, Yang Yu:
A technical view on neural architecture search. Int. J. Mach. Learn. Cybern. 11(4): 795-811 (2020)
[j16]
  persistent URL:
  - https://dblp.org/rec/journals/tcs/BianQTY20
Chao Bian, Chao Qian, Ke Tang, Yang Yu:
Running time analysis of the (1+1)-EA for robust linear optimization. Theor. Comput. Sci. 843: 57-72 (2020)
[c70]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/BianF0020
Chao Bian, Chao Feng, Chao Qian, Yang Yu:
An Efficient Evolutionary Algorithm for Subset Selection with General Cost Constraints. AAAI 2020: 3267-3274
[c69]
  persistent URL:
  - https://dblp.org/rec/conf/cig/0016CLSGF020
Meng Wang, Yingfeng Chen, Tangjie Lv, Yan Song, Kai Guan, Changjie Fan, Yang Yu:
Reinforcement Learning with Action-Specific Focuses in Video Games. CoG 2020: 9-16
[c68]
  persistent URL:
  - https://dblp.org/rec/conf/ecai/HuLYYL20
Yi-Qi Hu, Zelin Liu, Hua Yang, Yang Yu, Yunfeng Liu:
Derivative-Free Optimization with Adaptive Experience for Efficient Hyper-Parameter Tuning. ECAI 2020: 1207-1214
[c67]
  persistent URL:
  - https://dblp.org/rec/conf/nips/JiangP020
Shengyi Jiang, Jing-Cheng Pang, Yang Yu:
Offline Imitation Learning with a Misspecified Simulator. NeurIPS 2020
[c66]
  persistent URL:
  - https://dblp.org/rec/conf/nips/XuLY20
Tian Xu, Ziniu Li, Yang Yu:
Error Bounds of Imitating Policies and Environments. NeurIPS 2020
[e1]
  persistent URL:
  - https://dblp.org/rec/conf/dai2/2020
Matthew E. Taylor, Yang Yu, Edith Elkind, Yang Gao:
Distributed Artificial Intelligence - Second International Conference, DAI 2020, Nanjing, China, October 24-27, 2020, Proceedings. Lecture Notes in Computer Science 12547, Springer 2020, ISBN 978-3-030-64095-8
[i29]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-02080
Wen-Ji Zhou, Yang Yu:
Temporal-adaptive Hierarchical Reinforcement Learning. CoRR abs/2002.02080 (2020)
[i28]
  dblp key:
  - journals/corr/abs-2003-00497
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-00497
Chao Wang, Ruo-Ze Liu, Han-Jia Ye, Yang Yu:
Novelty-Prepared Few-Shot Classification. CoRR abs/2003.00497 (2020)
[i27]
  dblp key:
  - journals/corr/abs-2003-11941
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-11941
Guangda Huzhang, Zhen-Jia Pang, Yongqing Gao, Wen-Ji Zhou, Qing Da, Anxiang Zeng, Yang Yu:
Validation Set Evaluation can be Wrong: An Evaluator-Generator Approach for Maximizing Online Performance of Ranking in E-commerce. CoRR abs/2003.11941 (2020)
[i26]
  dblp key:
  - journals/corr/abs-2008-01062
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-01062
Jianhao Wang, Zhizhou Ren, Terry Liu, Yang Yu, Chongjie Zhang:
QPLEX: Duplex Dueling Multi-Agent Q-Learning. CoRR abs/2008.01062 (2020)
[i25]
  dblp key:
  - journals/corr/abs-2010-11876
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11876
Tian Xu, Ziniu Li, Yang Yu:
Error Bounds of Imitating Policies and Environments. CoRR abs/2010.11876 (2020)

2010 – 2019

2019
[b1]
  dblp key:
  - books/sp/ZhouYQ19
  persistent URL:
  - https://dblp.org/rec/books/sp/ZhouYQ19
Zhi-Hua Zhou, Yang Yu, Chao Qian:
Evolutionary Learning: Advances in Theories and Algorithms. Springer 2019, ISBN 978-981-13-5955-2, pp. 3-293
[j15]
  dblp key:
  - journals/ai/QianYTYZ19
  persistent URL:
  - https://dblp.org/rec/journals/ai/QianYTYZ19
Chao Qian, Yang Yu, Ke Tang, Xin Yao, Zhi-Hua Zhou:
Maximizing submodular or monotone approximately submodular functions by multi-objective evolutionary algorithms. Artif. Intell. 275: 279-294 (2019)
[c65]
  dblp key:
  - conf/aaai/Hu0TYCD19
  persistent URL:
  - https://dblp.org/rec/conf/aaai/Hu0TYCD19
Yi-Qi Hu, Yang Yu, Wei-Wei Tu, Qiang Yang, Yuqiang Chen, Wenyuan Dai:
Multi-Fidelity Automatic Hyper-Parameter Tuning via Transfer Series Expansion. AAAI 2019: 3846-3853
[c64]
  dblp key:
  - conf/aaai/PangLMZYL19
  persistent URL:
  - https://dblp.org/rec/conf/aaai/PangLMZYL19
Zhen-Jia Pang, Ruo-Ze Liu, Zhou-Yu Meng, Yi Zhang, Yang Yu, Tong Lu:
On Reinforcement Learning for Full-Length Game of StarCraft. AAAI 2019: 4691-4698
[c63]
  dblp key:
  - conf/aaai/Shi0DCZ19
  persistent URL:
  - https://dblp.org/rec/conf/aaai/Shi0DCZ19
Jing-Cheng Shi, Yang Yu, Qing Da, Shi-Yong Chen, Anxiang Zeng:
Virtual-Taobao: Virtualizing Real-World Online Retail Environment for Reinforcement Learning. AAAI 2019: 4902-4909
[c62]
  dblp key:
  - conf/atal/ChenY19
  persistent URL:
  - https://dblp.org/rec/conf/atal/ChenY19
Xiong-Hui Chen, Yang Yu:
Reinforcement Learning with Derivative-Free Exploration. AAMAS 2019: 1880-1882
[c61]
  dblp key:
  - conf/dai2/LiuHQ019
  persistent URL:
  - https://dblp.org/rec/conf/dai2/LiuHQ019
Yu-Ren Liu, Yi-Qi Hu, Hong Qian, Yang Yu:
Asynchronous classification-based optimization. DAI 2019: 9:1-9:8
[c60]
  dblp key:
  - conf/iconip/GaoSLZY19
  persistent URL:
  - https://dblp.org/rec/conf/iconip/GaoSLZY19
Songyi Gao, Weijie Shen, Zelin Liu, An Zhu, Yang Yu:
Only Image Cosine Embedding for Few-Shot Learning. ICONIP (2) 2019: 83-94
[c59]
  dblp key:
  - conf/ijcai/Hu0L19
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/Hu0L19
Yi-Qi Hu, Yang Yu, Jun-Da Liao:
Cascaded Algorithm-Selection and Hyper-Parameter Optimization with Extreme-Region Upper Confidence Bound Bandit. IJCAI 2019: 2528-2534
[c58]
  dblp key:
  - conf/ijcai/Zhou0CGLFZ19
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/Zhou0CGLFZ19
Wen-Ji Zhou, Yang Yu, Yingfeng Chen, Kai Guan, Tangjie Lv, Changjie Fan, Zhi-Hua Zhou:
Reinforcement Learning Experience Reuse with Policy Residual Representation. IJCAI 2019: 4447-4453
[c57]
  dblp key:
  - conf/kdd/ShangYLQMY19
  persistent URL:
  - https://dblp.org/rec/conf/kdd/ShangYLQMY19
Wenjie Shang, Yang Yu, Qingyang Li, Zhiwei (Tony) Qin, Yiping Meng, Jieping Ye:
Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation. KDD 2019: 566-576
[c56]
  dblp key:
  - conf/nips/DaiX0Z19
  persistent URL:
  - https://dblp.org/rec/conf/nips/DaiX0Z19
Wang-Zhou Dai, Qiu-Ling Xu, Yang Yu, Zhi-Hua Zhou:
Bridging Machine Learning and Logical Reasoning by Abductive Learning. NeurIPS 2019: 2811-2822
[i24]
  dblp key:
  - journals/corr/abs-1903-00715
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-00715
Ruo-Ze Liu, Haifeng Guo, Xiaozhong Ji, Yang Yu, Zitai Xiao, Yuzhou Wu, Zhen-Jia Pang, Tong Lu:
Efficient Reinforcement Learning with a Mind-Game for Full-Length StarCraft II. CoRR abs/1903.00715 (2019)
[i23]
  dblp key:
  - journals/corr/abs-1905-13703
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-13703
Yi-Qi Hu, Yang Yu, Jun-Da Liao:
Cascaded Algorithm-Selection and Hyper-Parameter Optimization with Extreme-Region Upper Confidence Bound Bandit. CoRR abs/1905.13703 (2019)
[i22]
  dblp key:
  - journals/corr/abs-1905-13719
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-13719
Wen-Ji Zhou, Yang Yu, Yingfeng Chen, Kai Guan, Tangjie Lv, Changjie Fan, Zhi-Hua Zhou:
Reinforcement Learning Experience Reuse with Policy Residual Representation. CoRR abs/1905.13719 (2019)
[i21]
  dblp key:
  - journals/corr/abs-1907-06584
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-06584
Wenjie Shang, Yang Yu, Qingyang Li, Zhiwei (Tony) Qin, Yiping Meng, Jieping Ye:
Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation. CoRR abs/1907.06584 (2019)
[i20]
  dblp key:
  - journals/corr/abs-1907-10772
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-10772
Jorge G. Madrid, Hugo Jair Escalante, Eduardo F. Morales, Wei-Wei Tu, Yang Yu, Lisheng Sun-Hosoya, Isabelle Guyon, Michèle Sebag:
Towards AutoML in the presence of Drift: first results. CoRR abs/1907.10772 (2019)
[i19]
  dblp key:
  - journals/corr/abs-1907-13100
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-13100
Chao Bian, Chao Qian, Yang Yu:
On the Robustness of Median Sampling in Noisy Evolutionary Optimization. CoRR abs/1907.13100 (2019)
[i18]
  dblp key:
  - journals/corr/abs-1911-07027
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-07027
Tian Xu, Ziniu Li, Yang Yu:
On Value Discrepancy of Imitation Learning. CoRR abs/1911.07027 (2019)
[i17]
  dblp key:
  - journals/corr/abs-1911-11928
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-11928
Rong-Jun Qin, Jing-Cheng Pang, Yang Yu:
Improving Fictitious Play Reinforcement Learning with Expanding Models. CoRR abs/1911.11928 (2019)
2018
[j14]
  dblp key:
  - journals/ec/QianYZ18
  persistent URL:
  - https://dblp.org/rec/journals/ec/QianYZ18
Chao Qian, Yang Yu, Zhi-Hua Zhou:
Analyzing Evolutionary Optimization in Noisy Environments. Evol. Comput. 26(1) (2018)
[j13]
  dblp key:
  - journals/ec/QianYTJYZ18
  persistent URL:
  - https://dblp.org/rec/journals/ec/QianYTJYZ18
Chao Qian, Yang Yu, Ke Tang, Yaochu Jin, Xin Yao, Zhi-Hua Zhou:
On the Effectiveness of Sampling for Evolutionary Optimization in Noisy Environments. Evol. Comput. 26(2) (2018)
[j12]
  dblp key:
  - journals/tnn/YuCDZ18
  persistent URL:
  - https://dblp.org/rec/journals/tnn/YuCDZ18
Yang Yu, Shi-Yong Chen, Qing Da, Zhi-Hua Zhou:
Reusable Reinforcement Learning via Shallow Trails. IEEE Trans. Neural Networks Learn. Syst. 29(6): 2204-2215 (2018)
[c55]
  dblp key:
  - conf/aaai/WangQY18
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WangQY18
Hong Wang, Hong Qian, Yang Yu:
Noisy Derivative-Free Optimization With Value Suppression. AAAI 2018: 1447-1454
[c54]
  dblp key:
  - conf/gecco/QianB0TY18
  persistent URL:
  - https://dblp.org/rec/conf/gecco/QianB0TY18
Chao Qian, Chao Bian, Yang Yu, Ke Tang, Xin Yao:
Analysis of noisy evolutionary optimization when sampling fails. GECCO 2018: 1507-1514
[c53]
  dblp key:
  - conf/ieeesam/PuYYL18
  persistent URL:
  - https://dblp.org/rec/conf/ieeesam/PuYYL18
Wenqiang Pu, Yang Yu, Shuhua Yu, Zhi-Quan Luo:
An Alternating Minimization Approach to Optimizing Subarray Configuration for a Large Phased Array. SAM 2018: 361-365
[c52]
  dblp key:
  - conf/ijcai/Qian0T18
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/Qian0T18
Chao Qian, Yang Yu, Ke Tang:
Approximation Guarantees of Stochastic Greedy Algorithms for Subset Selection. IJCAI 2018: 1478-1484
[c51]
  dblp key:
  - conf/ijcai/HuYZ18
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/HuYZ18
Yi-Qi Hu, Yang Yu, Zhi-Hua Zhou:
Experienced Optimization with Reusable Directional Model for Hyper-Parameter Search. IJCAI 2018: 2276-2282
[c50]
  dblp key:
  - conf/ijcai/YuZ18
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/YuZ18
Yang Yu, Wen-Ji Zhou:
Mixture of GANs for Clustering. IJCAI 2018: 3047-3053
[c49]
  dblp key:
  - conf/ijcai/ZhangYZ18
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/ZhangYZ18
Chao Zhang, Yang Yu, Zhi-Hua Zhou:
Learning Environmental Calibration Actions for Policy Self-Evolution. IJCAI 2018: 3061-3067
[c48]
  dblp key:
  - conf/ijcai/Yu18
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/Yu18
Yang Yu:
Towards Sample Efficient Reinforcement Learning. IJCAI 2018: 5739-5743
[c47]
  dblp key:
  - conf/kdd/HuDZ0X18
  persistent URL:
  - https://dblp.org/rec/conf/kdd/HuDZ0X18
Yujing Hu, Qing Da, Anxiang Zeng, Yang Yu, Yinghui Xu:
Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application. KDD 2018: 368-377
[c46]
  dblp key:
  - conf/kdd/Chen0DTHT18
  persistent URL:
  - https://dblp.org/rec/conf/kdd/Chen0DTHT18
Shi-Yong Chen, Yang Yu, Qing Da, Jun Tan, Hai-Kuan Huang, Hai-Hong Tang:
Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation. KDD 2018: 1187-1196
[c45]
  dblp key:
  - conf/nips/Feng0Z18
  persistent URL:
  - https://dblp.org/rec/conf/nips/Feng0Z18
Ji Feng, Yang Yu, Zhi-Hua Zhou:
Multi-Layered Gradient Boosting Decision Trees. NeurIPS 2018: 3555-3565
[i16]
  dblp key:
  - journals/corr/abs-1801-00329
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1801-00329
Yu-Ren Liu, Yi-Qi Hu, Hong Qian, Yang Yu, Chao Qian:
ZOOpt/ZOOjl: Toolbox for Derivative-Free Optimization. CoRR abs/1801.00329 (2018)
[i15]
  dblp key:
  - journals/corr/abs-1802-01173
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-01173
Wang-Zhou Dai, Qiu-Ling Xu, Yang Yu, Zhi-Hua Zhou:
Tunneling Neural Perception and Logic Reasoning through Abductive Learning. CoRR abs/1802.01173 (2018)
[i14]
  dblp key:
  - journals/corr/abs-1803-00693
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-00693
Yusen Zhan, Qing Da, Fei Xiao, Anxiang Zeng, Yang Yu:
Accelerating E-Commerce Search Engine Ranking by Contextual Factor Selection. CoRR abs/1803.00693 (2018)
[i13]
  dblp key:
  - journals/corr/abs-1803-00710
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-00710
Yujing Hu, Qing Da, Anxiang Zeng, Yang Yu, Yinghui Xu:
Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application. CoRR abs/1803.00710 (2018)
[i12]
  dblp key:
  - journals/corr/abs-1805-10000
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-10000
Jing-Cheng Shi, Yang Yu, Qing Da, Shi-Yong Chen, Anxiang Zeng:
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning. CoRR abs/1805.10000 (2018)
[i11]
  dblp key:
  - journals/corr/abs-1806-00007
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-00007
Ji Feng, Yang Yu, Zhi-Hua Zhou:
Multi-Layered Gradient Boosting Decision Trees. CoRR abs/1806.00007 (2018)
[i10]
  dblp key:
  - journals/corr/abs-1809-09095
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-09095
Zhen-Jia Pang, Ruo-Ze Liu, Zhou-Yu Meng, Yi Zhang, Yang Yu, Tong Lu:
On Reinforcement Learning for Full-length Game of StarCraft. CoRR abs/1809.09095 (2018)
[i9]
  dblp key:
  - journals/corr/abs-1810-05045
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-05045
Chao Qian, Chao Bian, Yang Yu, Ke Tang, Xin Yao:
Analysis of Noisy Evolutionary Optimization When Sampling Fails. CoRR abs/1810.05045 (2018)
[i8]
  dblp key:
  - journals/corr/abs-1810-13306
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-13306
Quanming Yao, Mengshuo Wang, Hugo Jair Escalante, Isabelle Guyon, Yi-Qi Hu, Yufeng Li, Wei-Wei Tu, Qiang Yang, Yang Yu:
Taking Human out of Learning Applications: A Survey on Automated Machine Learning. CoRR abs/1810.13306 (2018)
2017
[c44]
  dblp key:
  - conf/aaai/QianY17
  persistent URL:
  - https://dblp.org/rec/conf/aaai/QianY17
Hong Qian, Yang Yu:
Solving High-Dimensional Multi-Objective Optimization Problems with Low Effective Dimensions. AAAI 2017: 875-881
[c43]
  dblp key:
  - conf/aaai/HuQY17
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HuQY17
Yi-Qi Hu, Hong Qian, Yang Yu:
Sequential Classification-Based Optimization for Direct Policy Search. AAAI 2017: 2029-2035
[c42]
  dblp key:
  - conf/cec/ShiQY17
  persistent URL:
  - https://dblp.org/rec/conf/cec/ShiQY17
Jing-Cheng Shi, Chao Qian, Yang Yu:
Evolutionary multi-objective optimization made faster by sequential decomposition. CEC 2017: 2488-2493
[c41]
  dblp key:
  - conf/ijcai/QianSYTZ17
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/QianSYTZ17
Chao Qian, Jing-Cheng Shi, Yang Yu, Ke Tang, Zhi-Hua Zhou:
Optimizing Ratio of Monotone Set Functions. IJCAI 2017: 2606-2612
[c40]
  dblp key:
  - conf/ijcai/QianSYT17
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/QianSYT17
Chao Qian, Jing-Cheng Shi, Yang Yu, Ke Tang:
On Subset Selection with General Cost Constraints. IJCAI 2017: 2613-2619
[c39]
  dblp key:
  - conf/ijcai/YangYZ17
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/YangYZ17
Jing-Wen Yang, Yang Yu, Xiao-Peng Zhang:
Life-Stage Modeling by Customer-Manifold Embedding. IJCAI 2017: 3259-3265
[c38]
  dblp key:
  - conf/ijcai/YuQLG17
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/YuQLG17
Yang Yu, Wei-Yang Qu, Nan Li, Zimin Guo:
Open Category Classification by Adversarial Sample Generation. IJCAI 2017: 3357-3363
[c37]
  dblp key:
  - conf/ijcai/ZhouYZ17
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/ZhouYZ17
Wen-Ji Zhou, Yang Yu, Min-Ling Zhang:
Binary Linear Compression for Multi-label Classification. IJCAI 2017: 3546-3552
[c36]
  dblp key:
  - conf/ijcai/ZhangSHNWDCY17
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/ZhangSHNWDCY17
Jianbing Zhang, Yixin Sun, Shujian Huang, Cam-Tu Nguyen, Xiaoliang Wang, Xinyu Dai, Jiajun Chen, Yang Yu:
AGRA: An Analysis-Generation-Ranking Framework for Automatic Abbreviation from Paper Titles. IJCAI 2017: 4221-4227
[c35]
  dblp key:
  - conf/nips/QianS0TZ17
  persistent URL:
  - https://dblp.org/rec/conf/nips/QianS0TZ17
Chao Qian, Jing-Cheng Shi, Yang Yu, Ke Tang, Zhi-Hua Zhou:
Subset Selection under Noise. NIPS 2017: 3560-3570
[i7]
  dblp key:
  - journals/corr/YuQLG17
  persistent URL:
  - https://dblp.org/rec/journals/corr/YuQLG17
Yang Yu, Wei-Yang Qu, Nan Li, Zimin Guo:
Open-Category Classification by Adversarial Sample Generation. CoRR abs/1705.08722 (2017)
[i6]
  dblp key:
  - journals/corr/abs-1711-07214
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-07214
Chao Qian, Yang Yu, Ke Tang, Xin Yao, Zhi-Hua Zhou:
Maximizing Non-monotone/Non-submodular Functions by Multi-objective Evolutionary Algorithms. CoRR abs/1711.07214 (2017)
2016
[c34]
  dblp key:
  - conf/aaai/QianY16
  persistent URL:
  - https://dblp.org/rec/conf/aaai/QianY16
Hong Qian, Yang Yu:
Scaling Simultaneous Optimistic Optimization for High-Dimensional Non-Convex Functions with Low Effective Dimensions. AAAI 2016: 2000-2006
[c33]
  dblp key:
  - conf/aaai/YuQH16
  persistent URL:
  - https://dblp.org/rec/conf/aaai/YuQH16
Yang Yu, Hong Qian, Yi-Qi Hu:
Derivative-Free Optimization via Classification. AAAI 2016: 2286-2292
[c32]
  dblp key:
  - conf/atal/YuHDQ16
  persistent URL:
  - https://dblp.org/rec/conf/atal/YuHDQ16
Yang Yu, Peng-Fei Hou, Qing Da, Yu Qian:
Boosting Nonparametric Policies. AAMAS 2016: 477-484
[c31]
  dblp key:
  - conf/bic-ta/HuY16
  persistent URL:
  - https://dblp.org/rec/conf/bic-ta/HuY16
Yi-Qi Hu, Yang Yu:
A Multi-task Learning Approach by Combining Derivative-Free and Gradient Methods. BIC-TA (1) 2016: 456-465
[c30]
  dblp key:
  - conf/cec/QianY16
  persistent URL:
  - https://dblp.org/rec/conf/cec/QianY16
Hong Qian, Yang Yu:
On sampling-and-classification optimization in discrete domains. CEC 2016: 4374-4381
[c29]
  dblp key:
  - conf/ideal/QianYZ16
  persistent URL:
  - https://dblp.org/rec/conf/ideal/QianYZ16
Chao Qian, Yang Yu, Zhi-Hua Zhou:
A Lower Bound Analysis of Population-Based Evolutionary Algorithms for Pseudo-Boolean Functions. IDEAL 2016: 457-467
[c28]
  dblp key:
  - conf/ijcai/QianSYTZ16
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/QianSYTZ16
Chao Qian, Jing-Cheng Shi, Yang Yu, Ke Tang, Zhi-Hua Zhou:
Parallel Pareto Optimization for Subset Selection. IJCAI 2016: 1939-1945
[c27]
  dblp key:
  - conf/ijcai/QianHY16
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/QianHY16
Hong Qian, Yi-Qi Hu, Yang Yu:
Derivative-Free Optimization of High-Dimensional Non-Convex Functions by Sequential Random Embeddings. IJCAI 2016: 1946-1952
[c26]
  dblp key:
  - conf/kbse/LiLQHBYCL16
  persistent URL:
  - https://dblp.org/rec/conf/kbse/LiLQHBYCL16
Xin Li, Yongjuan Liang, Hong Qian, Yi-Qi Hu, Lei Bu, Yang Yu, Xin Chen, Xuandong Li:
Symbolic execution of complex program driven by machine learning based constraint solving. ASE 2016: 554-559
[c25]
  dblp key:
  - conf/pricai/WangY16
  persistent URL:
  - https://dblp.org/rec/conf/pricai/WangY16
Han Wang, Yang Yu:
Exploring Multi-action Relationship in Reinforcement Learning. PRICAI 2016: 574-587
[i5]
  dblp key:
  - journals/corr/QianYZ16
  persistent URL:
  - https://dblp.org/rec/journals/corr/QianYZ16
Chao Qian, Yang Yu, Zhi-Hua Zhou:
A Lower Bound Analysis of Population-based Evolutionary Algorithms for Pseudo-Boolean Functions. CoRR abs/1606.03326 (2016)
2015
[j11]
  dblp key:
  - journals/chinaf/QianYZ15
  persistent URL:
  - https://dblp.org/rec/journals/chinaf/QianYZ15
Chao Qian, Yang Yu, Zhi-Hua Zhou:
Variable solution structure can be helpful in evolutionary optimization. Sci. China Inf. Sci. 58(11): 1-17 (2015)
[j10]
  dblp key:
  - journals/soco/SunJZY15
  persistent URL:
  - https://dblp.org/rec/journals/soco/SunJZY15
Chaoli Sun, Yaochu Jin, Jianchao Zeng, Yang Yu:
A two-layer surrogate-assisted particle swarm optimization algorithm. Soft Comput. 19(6): 1461-1475 (2015)
[j9]
  dblp key:
  - journals/tec/YuQZ15
  persistent URL:
  - https://dblp.org/rec/journals/tec/YuQZ15
Yang Yu, Chao Qian, Zhi-Hua Zhou:
Switch Analysis for Running Time Analysis of Evolutionary Algorithms. IEEE Trans. Evol. Comput. 19(6): 777-792 (2015)
[c24]
  dblp key:
  - conf/aaai/QianYZ15
  persistent URL:
  - https://dblp.org/rec/conf/aaai/QianYZ15
Chao Qian, Yang Yu, Zhi-Hua Zhou:
Pareto Ensemble Pruning. AAAI 2015: 2935-2941
[c23]
  dblp key:
  - conf/cec/YuQ15
  persistent URL:
  - https://dblp.org/rec/conf/cec/YuQ15
Yang Yu, Chao Qian:
Running time analysis: Convergence-based analysis reduces to switch analysis. CEC 2015: 2603-2610
[c22]
  dblp key:
  - conf/ijcai/QianYZ15
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/QianYZ15
Chao Qian, Yang Yu, Zhi-Hua Zhou:
On Constrained Boolean Pareto Optimization. IJCAI 2015: 389-395
[c21]
  dblp key:
  - conf/nips/QianYZ15
  persistent URL:
  - https://dblp.org/rec/conf/nips/QianYZ15
Chao Qian, Yang Yu, Zhi-Hua Zhou:
Subset Selection by Pareto Optimization. NIPS 2015: 1774-1782
2014
[c20]
  dblp key:
  - conf/aaai/DaYZ14
  persistent URL:
  - https://dblp.org/rec/conf/aaai/DaYZ14
Qing Da, Yang Yu, Zhi-Hua Zhou:
Learning with Augmented Class by Exploiting Unlabeled Data. AAAI 2014: 1760-1766
[c19]
  dblp key:
  - conf/atal/DaYZ14
  persistent URL:
  - https://dblp.org/rec/conf/atal/DaYZ14
Qing Da, Yang Yu, Zhi-Hua Zhou:
Napping for functional representation of policy. AAMAS 2014: 189-196
[c18]
  dblp key:
  - conf/cec/YuQ14
  persistent URL:
  - https://dblp.org/rec/conf/cec/YuQ14
Yang Yu, Hong Qian:
The sampling-and-learning framework: A statistical view of evolutionary algorithms. CEC 2014: 149-158
[c17]
  dblp key:
  - conf/ppsn/QianYJZ14
  persistent URL:
  - https://dblp.org/rec/conf/ppsn/QianYJZ14
Chao Qian, Yang Yu, Yaochu Jin, Zhi-Hua Zhou:
On the Effectiveness of Sampling for Evolutionary Optimization in Noisy Environments. PPSN 2014: 302-311
[i4]
  dblp key:
  - journals/corr/YuQ14
  persistent URL:
  - https://dblp.org/rec/journals/corr/YuQ14
Yang Yu, Hong Qian:
The Sampling-and-Learning Framework: A Statistical View of Evolutionary Algorithms. CoRR abs/1401.6333 (2014)
2013
[j8]
  dblp key:
  - journals/ai/QianYZ13
  persistent URL:
  - https://dblp.org/rec/journals/ai/QianYZ13
Chao Qian, Yang Yu, Zhi-Hua Zhou:
An analysis on recombination in multi-objective evolutionary optimization. Artif. Intell. 204: 99-119 (2013)
[c16]
  dblp key:
  - conf/ijcai/YuYZ13
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/YuYZ13
Yang Yu, Xin Yao, Zhi-Hua Zhou:
On the Approximation Ability of Evolutionary Optimization with Application to Minimum Set Cover: Extended Abstract. IJCAI 2013: 3190-3194
[c15]
  dblp key:
  - conf/psl/DaYZ13
  persistent URL:
  - https://dblp.org/rec/conf/psl/DaYZ13
Qing Da, Yang Yu, Zhi-Hua Zhou:
Self-Practice Imitation Learning from Weak Policy. PSL 2013: 9-20
[i3]
  dblp key:
  - journals/corr/QianYZ13
  persistent URL:
  - https://dblp.org/rec/journals/corr/QianYZ13
Chao Qian, Yang Yu, Zhi-Hua Zhou:
Analyzing Evolutionary Optimization in Noisy Environments. CoRR abs/1311.4987 (2013)
2012
[j7]
  dblp key:
  - journals/ai/YuYZ12
  persistent URL:
  - https://dblp.org/rec/journals/ai/YuYZ12
Yang Yu, Xin Yao, Zhi-Hua Zhou:
On the approximation ability of evolutionary optimization with application to minimum set cover. Artif. Intell. 180-181: 20-33 (2012)
[c14]
  dblp key:
  - conf/kdd/HuangYZ12
  persistent URL:
  - https://dblp.org/rec/conf/kdd/HuangYZ12
Sheng-Jun Huang, Yang Yu, Zhi-Hua Zhou:
Multi-label hypothesis reuse. KDD 2012: 525-533
[c13]
  dblp key:
  - conf/pkdd/LiYZ12
  persistent URL:
  - https://dblp.org/rec/conf/pkdd/LiYZ12
Nan Li, Yang Yu, Zhi-Hua Zhou:
Diversity Regularized Ensemble Pruning. ECML/PKDD (1) 2012: 330-345
[c12]
  dblp key:
  - conf/ppsn/QianYZ12
  persistent URL:
  - https://dblp.org/rec/conf/ppsn/QianYZ12
Chao Qian, Yang Yu, Zhi-Hua Zhou:
On Algorithm-Dependent Boundary Case Identification for Problem Classes. PPSN (1) 2012: 62-71
2011
[c11]
  dblp key:
  - conf/gecco/QianYZ11a
  persistent URL:
  - https://dblp.org/rec/conf/gecco/QianYZ11a
Chao Qian, Yang Yu, Zhi-Hua Zhou:
Collisions are helpful for computing unique input-output sequences. GECCO (Companion) 2011: 265-266
[c10]
  dblp key:
  - conf/gecco/QianYZ11
  persistent URL:
  - https://dblp.org/rec/conf/gecco/QianYZ11
Chao Qian, Yang Yu, Zhi-Hua Zhou:
An analysis on recombination in multi-objective evolutionary optimization. GECCO 2011: 2051-2058
[c9]
  dblp key:
  - conf/icdm/DaiYZ11
  persistent URL:
  - https://dblp.org/rec/conf/icdm/DaiYZ11
Wang-Zhou Dai, Yang Yu, Zhi-Hua Zhou:
Lifted-Rollout for Approximate Policy Iteration of Markov Decision Process. ICDM Workshops 2011: 689-696
[c8]
  dblp key:
  - conf/ijcai/YuLZ11
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/YuLZ11
Yang Yu, Yufeng Li, Zhi-Hua Zhou:
Diversity Regularized Machine. IJCAI 2011: 1603-1608
[i2]
  dblp key:
  - journals/corr/abs-1111-0907
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1111-0907
Yang Yu, Chao Qian, Zhi-Hua Zhou:
Towards Analyzing Crossover Operators in Evolutionary Search via General Markov Chain Switching Theorem. CoRR abs/1111.0907 (2011)
2010
[j6]
  dblp key:
  - journals/kais/YuZ10
  persistent URL:
  - https://dblp.org/rec/journals/kais/YuZ10
Yang Yu, Zhi-Hua Zhou:
A framework for modeling positive class expansion with single snapshot. Knowl. Inf. Syst. 25(2): 211-227 (2010)
[c7]
  dblp key:
  - conf/ppsn/YuQZ10
  persistent URL:
  - https://dblp.org/rec/conf/ppsn/YuQZ10
Yang Yu, Chao Qian, Zhi-Hua Zhou:
Towards Analyzing Recombination Operators in Evolutionary Search. PPSN (1) 2010: 144-153
[i1]
  dblp key:
  - journals/corr/abs-1011-4028
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1011-4028
Yang Yu, Xin Yao, Zhi-Hua Zhou:
Evolutionary Algorithms as Guaranteed Approximation Optimizers. CoRR abs/1011.4028 (2010)

2000 – 2009

2009
[c6]
  dblp key:
  - conf/icdm/LiYZ09
  persistent URL:
  - https://dblp.org/rec/conf/icdm/LiYZ09
Nan Li, Yang Yu, Zhi-Hua Zhou:
Semi-naive Exploitation of One-Dependence Estimators. ICDM 2009: 278-287
2008
[j5]
  dblp key:
  - journals/ai/YuZ08
  persistent URL:
  - https://dblp.org/rec/journals/ai/YuZ08
Yang Yu, Zhi-Hua Zhou:
A new approach to estimating the expected first hitting time of evolutionary algorithms. Artif. Intell. 172(15): 1809-1832 (2008)
[j4]
  dblp key:
  - journals/jair/LiuTYZ08
  persistent URL:
  - https://dblp.org/rec/journals/jair/LiuTYZ08
Fei Tony Liu, Kai Ming Ting, Yang Yu, Zhi-Hua Zhou:
Spectrum of Variable-Random Trees. J. Artif. Intell. Res. 32: 355-384 (2008)
[c5]
  dblp key:
  - conf/cec/YuZ08
  persistent URL:
  - https://dblp.org/rec/conf/cec/YuZ08
Yang Yu, Zhi-Hua Zhou:
On the usefulness of infeasible solutions in evolutionary search: A theoretical study. IEEE Congress on Evolutionary Computation 2008: 835-840
[c4]
  dblp key:
  - conf/icdm/LiuYJZ08
  persistent URL:
  - https://dblp.org/rec/conf/icdm/LiuYJZ08
Li-Ping Liu, Yang Yu, Yuan Jiang, Zhi-Hua Zhou:
TEFE: A Time-Efficient Approach to Feature Extraction. ICDM 2008: 423-432
[c3]
  dblp key:
  - conf/pakdd/YuZ08
  persistent URL:
  - https://dblp.org/rec/conf/pakdd/YuZ08
Yang Yu, Zhi-Hua Zhou:
A Framework for Modeling Positive Class Expansion with Single Snapshot. PAKDD 2008: 429-440
2007
[j3]
  dblp key:
  - journals/jdwm/YuZLLZ07
  persistent URL:
  - https://dblp.org/rec/journals/jdwm/YuZLLZ07
Yang Yu, De-Chuan Zhan, Xu-Ying Liu, Ming Li, Zhi-Hua Zhou:
Predicting Future Customers via Ensembling Gradually Expanded Trees. Int. J. Data Warehous. Min. 3(2): 12-21 (2007)
[c2]
  dblp key:
  - conf/icdm/YuZT07
  persistent URL:
  - https://dblp.org/rec/conf/icdm/YuZT07
Yang Yu, Zhi-Hua Zhou, Kai Ming Ting:
Cocktail Ensemble for Regression. ICDM 2007: 721-726
2006
[c1]
  dblp key:
  - conf/aaai/YuZ06
  persistent URL:
  - https://dblp.org/rec/conf/aaai/YuZ06
Yang Yu, Zhi-Hua Zhou:
A New Approach to Estimating the Expected First Hitting Time of Evolutionary Algorithms. AAAI 2006: 555-560
2005
[j2]
  dblp key:
  - journals/jcst/ZhouY05
  persistent URL:
  - https://dblp.org/rec/journals/jcst/ZhouY05
Zhi-Hua Zhou, Yang Yu:
Adapt Bagging to Nearest Neighbor Classifiers. J. Comput. Sci. Technol. 20(1): 48-54 (2005)
[j1]
  dblp key:
  - journals/tsmc/ZhouY05
  persistent URL:
  - https://dblp.org/rec/journals/tsmc/ZhouY05
Zhi-Hua Zhou, Yang Yu:
Ensembling local learners through multimodal perturbation. IEEE Trans. Syst. Man Cybern. Part B 35(4): 725-735 (2005)
