


Остановите войну!
for scientists:


default search action
Yang Yu 0001
Person information

- affiliation (PhD 2011): Nanjing University, State Key Laboratory for Novel Software Technology, China
- affiliation: Pazhou Lab, Guangzhou, China
Other persons with the same name
- Yang Yu — disambiguation page
- Yang Yu 0002
— University of Technology Sydney, Faculty of Engineering and Information Technology, NSW, Australia (and 1 more)
- Yang Yu 0003
— North China Electric Power University, State Key Laboratory of Alternate Electrical Power System with Renewable Energy Sources, Baoding, China
- Yang Yu 0004
— Rochester Institute of Technology, Saunders College of Business, Rochester, NY, USA (and 1 more)
- Yang Yu 0005
— Jiangsu University of Technology, School of Electric Information Engineering, Changzhou, China
- Yang Yu 0006
— National University of Defense Technology, College of Electrical Science and Engineering, National Key Laboratory of Science and Technology on ATR, Changsha, China
- Yang Yu 0007
— National University of Defense Technology, College of Computer, Changsha, China
- Yang Yu 0008
— Tsinghua University, Department of Computer Science and Technology, Beijing, China (and 1 more)
- Yang Yu 0009 — Motorola Labs, Schaumburg, IL, USA (and 1 more)
- Yang Yu 0010 — Rutgers University, Department of Computer Science, Piscataway, NJ, USA
- Yang Yu 0011
— Tsinghua University, Institute for Interdisciplinary Information Sciences, Beijing, China (and 1 more)
- Yang Yu 0012 — University of Sheffield, UK
- Yang Yu 0013
— Nanjing University of Posts and Telecommunications, College of Automation / College of Artificial Intelligence, China (and 1 more)
- Yang Yu 0014
— National University of Defense Technology, College of Intelligence Science and Technology, Changsha, China (and 1 more)
- Yang Yu 0015
— Harbin Institute of Technology, Department of Automatic Test and Control, Harbin, China
- Yang Yu 0016
— Northeastern University, College of Information Science and Engineering, Shenyang, China
- Yang Yu 0017
— Changchun University of Technology, School of Mechatronic Engineering, Changchun, China
- Yang Yu 0018
— Harbin Jiancheng Group Company, Harbin, China
- Yang Yu 0019
— Shanghai Jiao Tong University, School of Mechanical Engineering, State Key Laboratory of Mechanical System and Vibration, Shanghai, China
- Yang Yu 0020
— Tongji University, State Key Laboratory of Marine Geology, Shanghai, China
- Yang Yu 0021
— University of Technology Sydney, School of Civil and Environmental Engineering, Sydney, Australia
- Yang Yu 0022
— Hebei University of Technology, School of Computer Science and Engineering, Tianjin, China
- Yang Yu 0023
— Wuhan University, School of Urban Design, Department of Urban Planning, Wuha, China
- Yang Yu 0024
— Tongji University, Department of Control Science and Engineering, Shanghai, China
- Yang Yu 0025 — Rutgers University, Department of Mathematics, Piscataway, NJ, USA
- Yang Yu 0026
— China Agricultural University, College of Engineering, Beijing, China
- Yang Yu 0027
— Sun Yat-sen University, School of Data and Computer Science, Guangzhou, China
- Yang Yu 0028
— Hong Kong University of Science and Technology, Department of Electronic and Computer Engineering, Robotics and Multi-Perception Laborotary, Hong Kong
- Yang Yu 0029 — Google, Mountain View, CA, USA (and 3 more)
- Yang Yu 0030
— Tianjin University, College of Intelligence and Computing, China
- Yang Yu 0031
— Southwest Forestry University, School of Machinery and Transportation, Kunming, China (and 1 more)
- Yang Yu 0032 — National University of Defense Technology, Center of Material Science, College of Liberal Arts and Sciences, College of Advanced Interdisciplinary Studies, College of Sciences, Changsha, China
- Yang Yu 0033
— Guizhou Medical University, School of Biology and Engineering, Guiyang, China (and 1 more)
- Yang Yu 0034
— Nanjing University of Posts and Telecommunications, College of Communication & Information Engineering, China
- Yang Yu 0035
— Royal Institute of Technology, Stockholm, Sweden
- Yang Yu 0036
— University of Duisburg-Essen, Germany
- Yang Yu 0037
— Auckland University of Technology, Institute of Biomedical Technologies, New Zealand
- Yang Yu 0038
— University of Science and Technology of China, State Key Laboratory of Cognitive Intelligence, Hefei, China
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j34]Hua Yang
, Minghao Zhao, Lei Yuan, Yang Yu, Zhenhua Li
, Ming Gu:
Memory-efficient Transformer-based network model for Traveling Salesman Problem. Neural Networks 161: 589-597 (2023) - [j33]Xiong-Hui Chen
, Fan-Ming Luo
, Yang Yu
, Qingyang Li
, Zhiwei Qin
, Wenjie Shang
, Jieping Ye
:
Offline Model-Based Adaptable Policy Learning for Decision-Making in Out-of-Support Regions. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15260-15274 (2023) - [j32]Guangda Huzhang
, Zhen-Jia Pang, Yongqing Gao, Yawen Liu, Weijie Shen, Wen-Ji Zhou
, Qianying Lin, Qing Da, Anxiang Zeng
, Han Yu
, Yang Yu
, Zhi-Hua Zhou
:
AliExpress Learning-to-Rank: Maximizing Online Model Performance Without Going Online. IEEE Trans. Knowl. Data Eng. 35(2): 1214-1226 (2023) - [j31]Hang Zhao
, Zherong Pan
, Yang Yu
, Kai Xu
:
Learning Physically Realizable Skills for Online Packing of General 3D Shapes. ACM Trans. Graph. 42(5): 165:1-165:21 (2023) - [c108]Yang Yu, Qi Liu, Likang Wu, Runlong Yu
, Sanshi Lei Yu, Zaixi Zhang:
Untargeted Attack against Federated Recommendation Systems via Poisonous Item Embeddings and the Defense. AAAI 2023: 4854-4863 - [c107]Weijian Liao, Zongzhang Zhang, Yang Yu:
Policy-Independent Behavioral Metric-Based Representation for Deep Reinforcement Learning. AAAI 2023: 8746-8754 - [c106]Lei Yuan, Ziqian Zhang, Ke Xue, Hao Yin, Feng Chen, Cong Guan, Lihe Li, Chao Qian, Yang Yu:
Robust Multi-Agent Coordination via Evolutionary Generation of Auxiliary Adversarial Attackers. AAAI 2023: 11753-11762 - [c105]Chao Chen, Dawei Wang, Feng Mao, Zongzhang Zhang, Yang Yu:
Deep Anomaly Detection and Search via Reinforcement Learning (Student Abstract). AAAI 2023: 16180-16181 - [c104]Yi-Chen Li, Wen-Jie Shen, Boyu Zhang, Feng Mao, Zongzhang Zhang, Yang Yu:
Learning Generalizable Batch Active Learning Strategies via Deep Q-networks (Student Abstract). AAAI 2023: 16258-16259 - [c103]Aoran Wang, Hongyang Yang, Feng Mao, Zongzhang Zhang, Yang Yu, Xiaoyang Liu:
Anti-drifting Feature Selection via Deep Reinforcement Learning (Student Abstract). AAAI 2023: 16356-16357 - [c102]Renzhe Zhou, Zongzhang Zhang, Yang Yu:
Model-Based Offline Weighted Policy Optimization (Student Abstract). AAAI 2023: 16392-16393 - [c101]Shaowei Zhang, Jiahan Cao, Lei Yuan, Yang Yu, De-Chuan Zhan:
Self-Motivated Multi-Agent Exploration. AAMAS 2023: 476-484 - [c100]Xu-Hui Liu, Feng Xu, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang, Ruifeng Chen, Zongzhang Zhang, Yang Yu:
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement. AAMAS 2023: 1276-1284 - [c99]Huakang Lu, Hong Qian, Yupeng Wu, Ziqi Liu, Ya-Lin Zhang, Aimin Zhou, Yang Yu:
Degradation-Resistant Offline Optimization via Accumulative Risk Control. ECAI 2023: 1609-1616 - [c98]Xiong-Hui Chen, Bowei He
, Yang Yu, Qingyang Li, Zhiwei Tony Qin, Wenjie Shang, Jieping Ye, Chen Ma
:
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems. ICDE 2023: 3389-3402 - [c97]Fuxiang Zhang, Chengxing Jia, Yi-Chen Li, Lei Yuan, Yang Yu, Zongzhang Zhang:
Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data. ICLR 2023 - [c96]Yuhang Ran, Yi-Chen Li, Fuxiang Zhang, Zongzhang Zhang, Yang Yu:
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning. ICML 2023: 28701-28717 - [c95]Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo:
Provably Efficient Adversarial Imitation Learning with Unknown Transitions. UAI 2023: 2367-2378 - [c94]Ziqian Zhang, Lei Yuan, Lihe Li, Ke Xue, Chengxing Jia, Cong Guan, Chao Qian, Yang Yu:
Fast Teammate Adaptation in the Presence of Sudden Policy Change. UAI 2023: 2465-2476 - [i77]Shaowei Zhang, Jiahan Cao, Lei Yuan, Yang Yu, De-Chuan Zhan:
Self-Motivated Multi-Agent Exploration. CoRR abs/2301.02083 (2023) - [i76]Ziniu Li, Tian Xu, Yang Yu, Zhi-Quan Luo:
Theoretical Analysis of Offline Imitation With Supplementary Dataset. CoRR abs/2301.11687 (2023) - [i75]Jing-Cheng Pang, Xin-Yu Yang, Si-Hang Yang, Yang Yu:
Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation. CoRR abs/2302.09368 (2023) - [i74]Cong Guan, Feng Chen, Lei Yuan, Zongzhang Zhang, Yang Yu:
Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning. CoRR abs/2302.09605 (2023) - [i73]Xu-Hui Liu, Feng Xu, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang, Ruifeng Chen, Zongzhang Zhang, Yang Yu:
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement. CoRR abs/2303.02073 (2023) - [i72]Zheng-Mao Zhu, Yu-Ren Liu, Hong-Long Tian, Yang Yu, Kun Zhang:
Beware of Instantaneous Dependence in Reinforcement Learning. CoRR abs/2303.05458 (2023) - [i71]Xiong-Hui Chen, Bowei He, Yang Yu, Qingyang Li, Zhiwei Tony Qin, Wenjie Shang, Jieping Ye, Chen Ma:
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems. CoRR abs/2305.04832 (2023) - [i70]Lei Yuan, Feng Chen, Zongzhang Zhang, Yang Yu:
Communication-Robust Multi-Agent Learning by Adaptable Auxiliary Multi-Agent Adversary Generation. CoRR abs/2305.05116 (2023) - [i69]Lei Yuan, Ziqian Zhang, Ke Xue, Hao Yin, Feng Chen, Cong Guan, Lihe Li, Chao Qian, Yang Yu:
Robust multi-agent coordination via evolutionary generation of auxiliary adversarial attackers. CoRR abs/2305.05909 (2023) - [i68]Ziqian Zhang, Lei Yuan, Lihe Li, Ke Xue, Chengxing Jia, Cong Guan, Chao Qian, Yang Yu:
Fast Teammate Adaptation in the Presence of Sudden Policy Change. CoRR abs/2305.05911 (2023) - [i67]Lei Yuan, Tao Jiang, Lihe Li, Feng Chen, Zongzhang Zhang, Yang Yu:
Robust Multi-agent Communication via Multi-view Message Certification. CoRR abs/2305.13936 (2023) - [i66]Lei Yuan, Lihe Li, Ziqian Zhang, Fuxiang Zhang, Cong Guan, Yang Yu:
Multi-agent Continual Coordination via Progressive Task Contextualization. CoRR abs/2305.13937 (2023) - [i65]Jing-Cheng Pang, Pengyuan Wang, Kaiyuan Li, Xiong-Hui Chen, Jiacheng Xu, Zongzhang Zhang, Yang Yu:
Language Model Self-improvement by Reinforcement Learning Contemplation. CoRR abs/2305.14483 (2023) - [i64]Yu-Ren Liu, Biwei Huang, Zheng-Mao Zhu, Hong-Long Tian, Mingming Gong, Yang Yu, Kun Zhang:
Learning World Models with Identifiable Factorization. CoRR abs/2306.06561 (2023) - [i63]Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo:
Provably Efficient Adversarial Imitation Learning with Unknown Transitions. CoRR abs/2306.06563 (2023) - [i62]Yuhang Ran, Yi-Chen Li, Fuxiang Zhang, Zongzhang Zhang, Yang Yu:
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning. CoRR abs/2306.06569 (2023) - [i61]Chenxiao Gao, Chenyang Wu, Mingjun Cao, Rui Kong, Zongzhang Zhang, Yang Yu:
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning. CoRR abs/2309.05915 (2023) - [i60]Lei Yuan, Lihe Li, Ziqian Zhang, Feng Chen, Tianyi Zhang, Cong Guan, Yang Yu, Zhi-Hua Zhou:
Learning to Coordinate with Anyone. CoRR abs/2309.12633 (2023) - [i59]Fan-Ming Luo, Tian Xu, Xingchen Cao, Yang Yu:
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning. CoRR abs/2310.05422 (2023) - [i58]Xiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li, Haoran Shi, Yu-Yan Xu, Zhihao Ye, Si-Hang Yang, Anqi Huang, Kai Xu, Zongzhang Zhang, Yang Yu:
Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments. CoRR abs/2310.05712 (2023) - [i57]Cong Guan, Lichao Zhang, Chunpeng Fan, Yichen Li, Feng Chen, Lihe Li, Yunjia Tian, Lei Yuan, Yang Yu:
Efficient Human-AI Coordination via Preparatory Language-based Convention. CoRR abs/2311.00416 (2023) - [i56]Lei Yuan, Ziqian Zhang, Lihe Li, Cong Guan, Yang Yu:
A Survey of Progress on Cooperative Multi-agent Reinforcement Learning in Open Environment. CoRR abs/2312.01058 (2023) - 2022
- [j30]Yu-Ren Liu, Yi-Qi Hu, Hong Qian, Chao Qian, Yang Yu:
ZOOpt: a toolbox for derivative-free optimization. Sci. China Inf. Sci. 65(10) (2022) - [j29]Ruo-Ze Liu, Zhen-Jia Pang, Zhou-Yu Meng, Wenhai Wang, Yang Yu, Tong Lu:
On Efficient Reinforcement Learning for Full-length Game of StarCraft II. J. Artif. Intell. Res. 75: 213-260 (2022) - [j28]Yi-Feng Zhang
, Fan-Ming Luo, Yang Yu:
Improve generated adversarial imitation learning with reward variance regularization. Mach. Learn. 111(3): 977-995 (2022) - [j27]Yi-Qi Hu
, Xu-Hui Liu
, Shu-Qiao Li
, Yang Yu:
Cascaded Algorithm Selection With Extreme-Region UCB Bandit. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6782-6794 (2022) - [j26]Tian Xu
, Ziniu Li, Yang Yu:
Error Bounds of Imitating Policies and Environments for Reinforcement Learning. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6968-6980 (2022) - [j25]Ruo-Ze Liu
, Haifeng Guo, Xiaozhong Ji, Yang Yu, Zhen-Jia Pang, Zitai Xiao, Yuzhou Wu, Tong Lu
:
Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning. IEEE Trans. Games 14(2): 294-307 (2022) - [j24]Xin Jin, Yanping Xie, Xiu-Shen Wei
, Borui Zhao, Yongshun Zhang, Xiaoyang Tan
, Yang Yu:
A Lightweight Encoder-Decoder Path for Deep Residual Networks. IEEE Trans. Neural Networks Learn. Syst. 33(2): 866-878 (2022) - [c93]Fan-Ming Luo, Shengyi Jiang, Yang Yu, Zongzhang Zhang, Yi-Feng Zhang:
Adapt to Environment Sudden Changes by Learning a Context Sensitive Policy. AAAI 2022: 7637-7646 - [c92]Zheng-Mao Zhu, Shengyi Jiang, Yu-Ren Liu, Yang Yu, Kun Zhang:
Invariant Action Effect Model for Reinforcement Learning. AAAI 2022: 9260-9268 - [c91]Lei Yuan, Jianhao Wang, Fuxiang Zhang, Chenghe Wang, Zongzhang Zhang, Yang Yu, Chongjie Zhang:
Multi-Agent Incentive Communication via Decentralized Teammate Modeling. AAAI 2022: 9466-9474 - [c90]Yang Yu, Rui Jin, Hao Yin, Keke Gai, Zijian Zhang:
A Searchable Re-encryption-based Scheme for Massive Data Transactions. CSCloud/EdgeCom 2022: 135-140 - [c89]Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang:
Context-Aware Sparse Deep Coordination Graphs. ICLR 2022 - [c88]Siyuan Li, Jin Zhang, Jianhao Wang, Yang Yu, Chongjie Zhang:
Active Hierarchical Exploration with Stable Subgoal Representation Learning. ICLR 2022 - [c87]Hang Zhao, Yang Yu, Kai Xu:
Learning Efficient Online 3D Bin Packing on Packing Configuration Trees. ICLR 2022 - [c86]Hong Qian, Xu-Hui Liu, Chen-Xi Su, Aimin Zhou, Yang Yu:
The Teaching Dimension of Regularized Kernel Learners. ICML 2022: 17984-18002 - [c85]Di Xue, Lei Yuan, Zongzhang Zhang, Yang Yu:
Efficient Multi-Agent Communication via Shapley Message Value. IJCAI 2022: 578-584 - [c84]Lei Yuan, Chenghe Wang, Jianhao Wang, Fuxiang Zhang, Feng Chen, Cong Guan, Zongzhang Zhang, Chongjie Zhang, Yang Yu:
Multi-Agent Concentrative Coordination with Decentralized Task Representation. IJCAI 2022: 599-605 - [c83]Ke Xue, Jiacheng Xu, Lei Yuan, Miqing Li, Chao Qian, Zongzhang Zhang, Yang Yu:
Multi-agent Dynamic Algorithm Configuration. NeurIPS 2022 - [c82]Cong Guan, Feng Chen, Lei Yuan, Chenghe Wang, Hao Yin, Zongzhang Zhang, Yang Yu:
Efficient Multi-agent Communication via Self-supervised Information Aggregation. NeurIPS 2022 - [c81]Rongjun Qin, Xingyuan Zhang, Songyi Gao, Xiong-Hui Chen, Zewen Li, Weinan Zhang, Yang Yu:
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning. NeurIPS 2022 - [c80]Chenyang Wu, Tianci Li, Zongzhang Zhang, Yang Yu:
Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning. NeurIPS 2022 - [e4]João Gama, Tianrui Li
, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng:
Advances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Chengdu, China, May 16-19, 2022, Proceedings, Part I. Lecture Notes in Computer Science 13280, Springer 2022, ISBN 978-3-031-05932-2 [contents] - [e3]João Gama, Tianrui Li
, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng:
Advances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Chengdu, China, May 16-19, 2022, Proceedings, Part II. Lecture Notes in Computer Science 13281, Springer 2022, ISBN 978-3-031-05935-3 [contents] - [e2]João Gama, Tianrui Li
, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng:
Advances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Chengdu, China, May 16-19, 2022, Proceedings, Part III. Lecture Notes in Computer Science 13282, Springer 2022, ISBN 978-3-031-05980-3 [contents] - [i55]Ziniu Li, Tian Xu, Yang Yu, Zhi-Quan Luo:
Rethinking ValueDice: Does It Really Improve Performance? CoRR abs/2202.02468 (2022) - [i54]Rongjun Qin, Feng Chen, Tonghan Wang, Lei Yuan, Xiaoran Wu, Zongzhang Zhang, Chongjie Zhang, Yang Yu:
Multi-Agent Policy Transfer via Task Relationship Modeling. CoRR abs/2203.04482 (2022) - [i53]Ziniu Li, Tian Xu, Yang Yu:
A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle. CoRR abs/2203.11489 (2022) - [i52]Fan-Ming Luo, Xingchen Cao, Yang Yu:
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble. CoRR abs/2206.00238 (2022) - [i51]Zheng-Mao Zhu, Xiong-Hui Chen, Hong-Long Tian, Kun Zhang, Yang Yu:
Offline Reinforcement Learning with Causal Structured World Models. CoRR abs/2206.01474 (2022) - [i50]Xue-Kun Jin, Xu-Hui Liu, Shengyi Jiang, Yang Yu:
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning. CoRR abs/2206.02000 (2022) - [i49]Xiong-Hui Chen, Yang Yu, Zheng-Mao Zhu, Zhihua Yu, Zhenjun Chen, Chenghe Wang, Yinan Wu, Hongqiu Wu, Rong-Jun Qin, Ruijin Ding, Fangsheng Huang:
Adversarial Counterfactual Environment Model Learning. CoRR abs/2206.04890 (2022) - [i48]Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu:
A Survey on Model-based Reinforcement Learning. CoRR abs/2206.09328 (2022) - [i47]Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo:
Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis. CoRR abs/2208.01899 (2022) - [i46]Ke Xue, Yutong Wang, Lei Yuan, Cong Guan, Chao Qian, Yang Yu:
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution. CoRR abs/2208.04957 (2022) - [i45]Rong-Jun Qin, Fan-Ming Luo, Hong Qian, Yang Yu:
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games. CoRR abs/2208.09452 (2022) - [i44]Ruo-Ze Liu, Zhen-Jia Pang, Zhou-Yu Meng, Wenhai Wang, Yang Yu, Tong Lu:
On Efficient Reinforcement Learning for Full-length Game of StarCraft II. CoRR abs/2209.11553 (2022) - [i43]Zhengbang Zhu, Rongjun Qin, Junjie Huang, Xinyi Dai, Yang Yu, Yong Yu, Weinan Zhang:
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems. CoRR abs/2210.05662 (2022) - [i42]Ke Xue, Jiacheng Xu
, Lei Yuan, Miqing Li, Chao Qian, Zongzhang Zhang, Yang Yu:
Multi-agent Dynamic Algorithm Configuration. CoRR abs/2210.06835 (2022) - [i41]Yang Yu, Qi Liu, Likang Wu, Runlong Yu, Sanshi Lei Yu, Zaixi Zhang:
Untargeted Attack against Federated Recommendation Systems via Poisonous Item Embeddings and the Defense. CoRR abs/2212.05399 (2022) - 2021
- [j23]Anxiang Zeng, Han Yu, Qing Da, Yusen Zhan, Yang Yu, Jingren Zhou, Chunyan Miao:
Improving Search Engine Efficiency through Contextual Factor Selection. AI Mag. 42(2): 50-58 (2021) - [j22]Chao Qian
, Chao Bian, Yang Yu, Ke Tang, Xin Yao:
Analysis of Noisy Evolutionary Optimization When Sampling Fails. Algorithmica 83(4): 940-975 (2021) - [j21]Chao Bian, Chao Qian, Yang Yu, Ke Tang:
On the robustness of median sampling in noisy evolutionary optimization. Sci. China Inf. Sci. 64(5) (2021) - [j20]Lei Bu
, Yongjuan Liang, Zhunyi Xie, Hong Qian, Yi-Qi Hu, Yang Yu, Xin Chen, Xuandong Li:
Machine learning steered symbolic execution framework for complex software code. Formal Aspects Comput. 33(3): 301-323 (2021) - [j19]Wenjie Shang
, Qingyang Li, Zhiwei (Tony) Qin, Yang Yu, Yiping Meng, Jieping Ye:
Partially observable environment estimation with uplift inference for reinforcement learning based recommendation. Mach. Learn. 110(9): 2603-2640 (2021) - [j18]Hugo Jair Escalante
, Quanming Yao
, Wei-Wei Tu, Nelishia Pillay, Rong Qu
, Yang Yu, Neil Houlsby:
Guest Editorial: Automated Machine Learning. IEEE Trans. Pattern Anal. Mach. Intell. 43(9): 2887-2890 (2021) - [c79]Chenyang Wu, Rui Kong, Guoyu Yang, Xianghan Kong, Zongzhang Zhang, Yang Yu, Dong Li, Wulong Liu:
LB-DESPOT: Efficient Online POMDP Planning Considering Lower Bound in Action Selection (Student Abstract). AAAI 2021: 15927-15928 - [c78]Feng Xu, Shengyi Jiang, Hao Yin, Zongzhang Zhang, Yang Yu, Ming Li, Dong Li, Wulong Liu:
Enhancing Context-Based Meta-Reinforcement Learning Algorithms via An Efficient Task Encoder (Student Abstract). AAAI 2021: 15937-15938 - [c77]Jianhao Wang, Zhizhou Ren, Terry Liu, Yang Yu, Chongjie Zhang:
QPLEX: Duplex Dueling Multi-Agent Q-Learning. ICLR 2021 - [c76]Chao Bian, Chao Qian, Frank Neumann, Yang Yu:
Fast Pareto Optimization for Subset Selection with Dynamic Cost Constraints. IJCAI 2021: 2191-2197 - [c75]Weijie Shen, Lei Yuan, Junfu Huang, Songyi Gao, Yuyang Huang, Yang Yu:
Sequential and Dynamic constraint Contrastive Learning for Reinforcement Learning. IJCNN 2021: 1-9 - [c74]Xiong-Hui Chen, Yang Yu, Qingyang Li, Fan-Ming Luo, Zhiwei (Tony) Qin, Wenjie Shang, Jieping Ye:
Offline Model-based Adaptable Policy Learning. NeurIPS 2021: 8432-8443 - [c73]Xiong-Hui Chen, Shengyi Jiang, Feng Xu, Zongzhang Zhang, Yang Yu:
Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning. NeurIPS 2021: 12520-12532 - [c72]Xu-Hui Liu, Zhenghai Xue, Jing-Cheng Pang, Shengyi Jiang, Feng Xu, Yang Yu:
Regret Minimization Experience Replay in Off-Policy Reinforcement Learning. NeurIPS 2021: 17604-17615 - [c71]Chenyang Wu, Guoyu Yang, Zongzhang Zhang, Yang Yu, Dong Li, Wulong Liu, Jianye Hao:
Adaptive Online Packing-guided Search for POMDPs. NeurIPS 2021: 28419-28430 - [i40]Rongjun Qin, Songyi Gao, Xingyuan Zhang, Zhen Xu, Shengkai Huang, Zewen Li, Weinan Zhang, Yang Yu:
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning. CoRR abs/2102.00714 (2021) - [i39]Hong Qian, Yang Yu:
Derivative-Free Reinforcement Learning: A Review. CoRR abs/2102.05710 (2021) - [i38]Ruo-Ze Liu, Wenhai Wang, Yanjie Shen, Zhiqi Li, Yang Yu, Tong Lu:
An Introduction of mini-AlphaStar. CoRR abs/2104.06890 (2021) - [i37]Zhenghai Xue, Xu-Hui Liu, Jing-Cheng Pang, Shengyi Jiang, Feng Xu, Yang Yu:
Regret Minimization Experience Replay. CoRR abs/2105.07253 (2021) - [i36]Jing-Cheng Pang, Tian Xu, Shengyi Jiang, Yu-Ren Liu, Yang Yu:
Sparsity Prior Regularized Q-learning for Sparse Action Tasks. CoRR abs/2105.08666 (2021) - [i35]Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang:
Context-Aware Sparse Deep Coordination Graphs. CoRR abs/2106.02886 (2021) - [i34]Tian Xu, Ziniu Li, Yang Yu:
Nearly Minimax Optimal Adversarial Imitation Learning with Known and Unknown Transitions. CoRR abs/2106.10424 (2021) - [i33]Yongqing Gao, Guangda Huzhang, Weijie Shen, Yawen Liu, Wen-Ji Zhou, Qing Da, Dan Shen, Yang Yu:
Imitate TheWorld: A Search Engine Simulation Platform. CoRR abs/2107.07693 (2021) - [i32]Zhao-Hua Li, Yang Yu, Yingfeng Chen, Ke Chen, Zhipeng Hu, Changjie Fan:
Neural-to-Tree Policy Distillation with Policy Improvement Criterion. CoRR abs/2108.06898 (2021) - [i31]Jiahan Cao, Lei Yuan, Jianhao Wang, Shaowei Zhang, Chongjie Zhang, Yang Yu, De-Chuan Zhan:
LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates. CoRR abs/2109.12508 (2021) - [i30]Qixin Zhang, Wenbing Ye, Zaiyi Chen, Haoyuan Hu, Enhong Chen, Yang Yu:
Online Allocation with Two-sided Resource Constraints. CoRR abs/2112.13964 (2021) - 2020
- [j17]Yi-Qi Hu
, Yang Yu:
A technical view on neural architecture search. Int. J. Mach. Learn. Cybern. 11(4): 795-811 (2020) - [j16]Chao Bian, Chao Qian, Ke Tang, Yang Yu:
Running time analysis of the (1+1)-EA for robust linear optimization. Theor. Comput. Sci. 843: 57-72 (2020) - [c70]Chao Bian, Chao Feng, Chao Qian, Yang Yu:
An Efficient Evolutionary Algorithm for Subset Selection with General Cost Constraints. AAAI 2020: 3267-3274 - [c69]