default search action
Shie Mannor
Person information
- affiliation (PhD 2002): Technion - Israel Institute of Technology, Department of Electrical Engineering, Haifa, Israel
- affiliation: Nvidia Research, Tel Aviv-Yafo, Israel
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [c263]Uri Gadot, Esther Derman, Navdeep Kumar, Maxence Mohamed Elfatihi, Kfir Levy, Shie Mannor:
Solving Non-rectangular Reward-Robust MDPs via Frequency Regularization. AAAI 2024: 21090-21098 - [c262]Navdeep Kumar, Priyank Agrawal, Kfir Yehuda Levy, Shie Mannor:
Policy Gradient with Tree Search (PGTS) in Reinforcement Learning Evades Local Maxima. Tiny Papers @ ICLR 2024 - [c261]Navdeep Kumar, Ilnura Usmanova, Kfir Yehuda Levy, Shie Mannor:
Towards Faster Global Convergence of Robust Policy Gradient Methods. Tiny Papers @ ICLR 2024 - [c260]Navdeep Kumar, Kaixin Wang, Uri Gadot, Kfir Yehuda Levy, Shie Mannor:
Learning the Uncertainty Set in Robust Markov Decision Process. Tiny Papers @ ICLR 2024 - [c259]Navdeep Kumar, Kaixin Wang, Utkarsh Pratiush, Kfir Yehuda Levy, Shie Mannor:
Policy Gradient for Reinforcement Learning with General Utilities. Tiny Papers @ ICLR 2024 - [c258]David Valensi, Esther Derman, Shie Mannor, Gal Dalal:
Tree Search-Based Policy Optimization under Stochastic Execution Delay. ICLR 2024 - [c257]Lior Cohen, Kaixin Wang, Bingyi Kang, Shie Mannor:
Improving Token-Based World Models with Parallel Observation Prediction. ICML 2024 - [c256]Yihan Du, Anna Winnicki, Gal Dalal, Shie Mannor, R. Srikant:
Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization. ICML 2024 - [c255]Uri Gadot, Kaixin Wang, Navdeep Kumar, Kfir Yehuda Levy, Shie Mannor:
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel. ICML 2024 - [c254]Mark Kozdoba, Binyamin Perets, Shie Mannor:
Sobolev Space Regularised Pre Density Models. ICML 2024 - [c253]Navdeep Kumar, Kaixin Wang, Kfir Yehuda Levy, Shie Mannor:
Efficient Value Iteration for s-rectangular Robust Markov Decision Processes. ICML 2024 - [c252]Jeongyeol Kwon, Yonathan Efroni, Shie Mannor, Constantine Caramanis:
Prospective Side Information for Latent MDPs. ICML 2024 - [i207]Lior Cohen, Kaixin Wang, Bingyi Kang, Shie Mannor:
Improving Token-Based World Models with Parallel Observation Prediction. CoRR abs/2402.05643 (2024) - [i206]Nitsan Soffair, Dotan Di Castro, Orly Avner, Shie Mannor:
SQT - std Q-target. CoRR abs/2402.05950 (2024) - [i205]Yihan Du, Anna Winnicki, Gal Dalal, Shie Mannor, R. Srikant:
Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization. CoRR abs/2402.10342 (2024) - [i204]Navdeep Kumar, Yashaswini Murthy, Itai Shufaro, Kfir Y. Levy, R. Srikant, Shie Mannor:
On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes. CoRR abs/2403.06806 (2024) - [i203]David Valensi, Esther Derman, Shie Mannor, Gal Dalal:
Tree Search-Based Policy Optimization under Stochastic Execution Delay. CoRR abs/2404.05440 (2024) - [i202]Itai Shufaro, Nadav Merlis, Nir Weinberger, Shie Mannor:
On Bits and Bandits: Quantifying the Regret-Information Trade-off. CoRR abs/2405.16581 (2024) - [i201]Jeongyeol Kwon, Shie Mannor, Constantine Caramanis, Yonathan Efroni:
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation. CoRR abs/2406.01389 (2024) - [i200]Assaf Hallak, Gal Dalal, Chen Tessler, Kelly Guo, Shie Mannor, Gal Chechik:
PlaMo: Plan and Move in Rich 3D Physical Environments. CoRR abs/2406.18237 (2024) - [i199]Guy Lutsker, Gal Sapir, Anastasia Godneva, Smadar Shilo, Jerry R. Greenfield, Dorit Samocha-Bonet, Shie Mannor, Eli Meirom, Gal Chechik, Hagai Rossman, Eran Segal:
From Glucose Patterns to Health Outcomes: A Generalizable Foundation Model for Continuous Glucose Monitor Data Analysis. CoRR abs/2408.11876 (2024) - 2023
- [j80]Michael Lutter, Boris Belousov, Shie Mannor, Dieter Fox, Animesh Garg, Jan Peters:
Continuous-Time Fitted Value Iteration for Robust Policies. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 5534-5548 (2023) - [c251]Aviv Rosenberg, Assaf Hallak, Shie Mannor, Gal Chechik, Gal Dalal:
Planning and Learning with Adaptive Lookahead. AAAI 2023: 9606-9613 - [c250]Pranav Khanna, Guy Tennenholtz, Nadav Merlis, Shie Mannor, Chen Tessler:
Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning. AAMAS 2023: 2430-2432 - [c249]Benjamin Fuhrer, Yuval Shpigelman, Chen Tessler, Shie Mannor, Gal Chechik, Eitan Zahavi, Gal Dalal:
Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs. CCGrid 2023: 331-343 - [c248]Yuval Atzmon, Eli A. Meirom, Shie Mannor, Gal Chechik:
Learning to Initiate and Reason in Event-Driven Cascading Processes. ICML 2023: 1218-1243 - [c247]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Reward-Mixing MDPs with Few Latent Contexts are Learnable. ICML 2023: 18057-18082 - [c246]Ofir Nabati, Guy Tennenholtz, Shie Mannor:
Representation-Driven Reinforcement Learning. ICML 2023: 25588-25603 - [c245]Binyamin Perets, Mark Kozdoba, Shie Mannor:
Learning Hidden Markov Models When the Locations of Missing Observations are Unknown. ICML 2023: 27642-27667 - [c244]Kaixin Wang, Daquan Zhou, Jiashi Feng, Shie Mannor:
PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient. ICML 2023: 36694-36713 - [c243]Yaosheng Fu, Evgeny Bolotin, Aamer Jaleel, Gal Dalal, Shie Mannor, Jacob Subag, Noam Korem, Michael Behar, David W. Nellans:
AutoScratch: ML-Optimized Cache Management for Inference-Oriented GPUs. MLSys 2023 - [c242]Stav Belogolovsky, Ido Greenberg, Danny Eytan, Shie Mannor:
Individualized Dosing Dynamics via Neural Eigen Decomposition. NeurIPS 2023 - [c241]Ido Greenberg, Shie Mannor, Gal Chechik, Eli A. Meirom:
Train Hard, Fight Easy: Robust Meta Reinforcement Learning. NeurIPS 2023 - [c240]Ido Greenberg, Netanel Yannay, Shie Mannor:
Optimization or Architecture: How to Hack Kalman Filtering. NeurIPS 2023 - [c239]Navdeep Kumar, Esther Derman, Matthieu Geist, Kfir Y. Levy, Shie Mannor:
Policy Gradient for Rectangular Robust Markov Decision Processes. NeurIPS 2023 - [c238]Chen Tessler, Yoni Kasten, Yunrong Guo, Shie Mannor, Gal Chechik, Xue Bin Peng:
CALM: Conditional Adversarial Latent Models for Directable Virtual Characters. SIGGRAPH (Conference Paper Track) 2023: 37:1-37:9 - [i198]Shie Mannor, Aviv Tamar:
Towards Deployable RL - What's Broken with RL Research and a Potential Fix. CoRR abs/2301.01320 (2023) - [i197]Ido Greenberg, Shie Mannor, Gal Chechik, Eli A. Meirom:
Train Hard, Fight Easy: Robust Meta Reinforcement Learning. CoRR abs/2301.11147 (2023) - [i196]Gal Dalal, Assaf Hallak, Gugan Thoppe, Shie Mannor, Gal Chechik:
SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search. CoRR abs/2301.13236 (2023) - [i195]Navdeep Kumar, Esther Derman, Matthieu Geist, Kfir Levy, Shie Mannor:
Policy Gradient for s-Rectangular Robust Markov Decision Processes. CoRR abs/2301.13589 (2023) - [i194]Navdeep Kumar, Kfir Levy, Kaixin Wang, Shie Mannor:
An Efficient Solution to s-Rectangular Robust Markov Decision Processes. CoRR abs/2301.13642 (2023) - [i193]Esther Derman, Yevgeniy Men, Matthieu Geist, Shie Mannor:
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization. CoRR abs/2303.06654 (2023) - [i192]Chen Tessler, Yoni Kasten, Yunrong Guo, Shie Mannor, Gal Chechik, Xue Bin Peng:
CALM: Conditional Adversarial Latent Models for Directable Virtual Characters. CoRR abs/2305.02195 (2023) - [i191]Ofir Nabati, Guy Tennenholtz, Shie Mannor:
Representation-Driven Reinforcement Learning. CoRR abs/2305.19922 (2023) - [i190]Kaixin Wang, Uri Gadot, Navdeep Kumar, Kfir Levy, Shie Mannor:
Robust Reinforcement Learning via Adversarial Kernel Approximation. CoRR abs/2306.05859 (2023) - [i189]Stav Belogolovsky, Ido Greenberg, Danny Eytan, Shie Mannor:
Individualized Dosing Dynamics via Neural Eigen Decomposition. CoRR abs/2306.14020 (2023) - [i188]Mark Kozdoba, Binyamin Perets, Shie Mannor:
Implicitly Normalized Explicitly Regularized Density Estimation. CoRR abs/2307.13763 (2023) - [i187]Uri Gadot, Esther Derman, Navdeep Kumar, Maxence Mohamed Elfatihi, Kfir Levy, Shie Mannor:
Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization. CoRR abs/2309.01107 (2023) - [i186]Ido Greenberg, Netanel Yannay, Shie Mannor:
Optimization or Architecture: How to Hack Kalman Filtering. CoRR abs/2310.00675 (2023) - [i185]Jeongyeol Kwon, Yonathan Efroni, Shie Mannor, Constantine Caramanis:
Prospective Side Information for Latent MDPs. CoRR abs/2310.07596 (2023) - 2022
- [j79]Chen Tessler, Yuval Shpigelman, Gal Dalal, Amit Mandelbaum, Doron Haritan Kazakov, Benjamin Fuhrer, Gal Chechik, Shie Mannor:
Reinforcement Learning for Datacenter Congestion Control. SIGMETRICS Perform. Evaluation Rev. 49(2): 43-46 (2022) - [c237]Lior Shani, Tom Zahavy, Shie Mannor:
Online Apprenticeship Learning. AAAI 2022: 8240-8248 - [c236]Roy Zohar, Shie Mannor, Guy Tennenholtz:
Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning. AAAI 2022: 9278-9285 - [c235]Chen Tessler, Yuval Shpigelman, Gal Dalal, Amit Mandelbaum, Doron Haritan Kazakov, Benjamin Fuhrer, Gal Chechik, Shie Mannor:
Reinforcement Learning for Datacenter Congestion Control. AAAI 2022: 12615-12621 - [c234]Péter Karkus, Boris Ivanovic, Shie Mannor, Marco Pavone:
DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles. CoRL 2022: 2170-2180 - [c233]Shie Mannor:
Reinforcement Learning for Extended Intelligence. ICINCO 2022: 5 - [c232]Guy Tennenholtz, Assaf Hallak, Gal Dalal, Shie Mannor, Gal Chechik, Uri Shalit:
On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning. ICLR 2022 - [c231]Shirli Di-Castro Shashua, Shie Mannor, Dotan Di Castro:
Analysis of Stochastic Processes through Replay Buffers. ICML 2022: 5039-5060 - [c230]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms. ICML 2022: 11772-11789 - [c229]Eli A. Meirom, Haggai Maron, Shie Mannor, Gal Chechik:
Optimizing Tensor Network Contraction Using Reinforcement Learning. ICML 2022: 15278-15292 - [c228]Kaixin Wang, Navdeep Kumar, Kuangqi Zhou, Bryan Hooi, Jiashi Feng, Shie Mannor:
The Geometry of Robust Value Functions. ICML 2022: 22727-22751 - [c227]Mohammadi Zaki, Avi Mohan, Aditya Gopalan, Shie Mannor:
Actor-Critic based Improper Reinforcement Learning. ICML 2022: 25867-25919 - [c226]Ido Greenberg, Yinlam Chow, Mohammad Ghavamzadeh, Shie Mannor:
Efficient Risk-Averse Reinforcement Learning. NeurIPS 2022 - [c225]Mark Kozdoba, Edward Moroshko, Shie Mannor, Yacov Crammer:
Finite Sample Analysis Of Dynamic Regression Parameter Learning. NeurIPS 2022 - [c224]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Tractable Optimality in Episodic Latent MABs. NeurIPS 2022 - [c223]Guy Tennenholtz, Shie Mannor:
Uncertainty Estimation Using Riemannian Model Dynamics for Offline Reinforcement Learning. NeurIPS 2022 - [c222]Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal:
Reinforcement Learning with a Terminator. NeurIPS 2022 - [i184]Aviv Rosenberg, Assaf Hallak, Shie Mannor, Gal Chechik, Gal Dalal:
Planning and Learning with Adaptive Lookahead. CoRR abs/2201.12403 (2022) - [i183]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms. CoRR abs/2201.12700 (2022) - [i182]Kaixin Wang, Navdeep Kumar, Kuangqi Zhou, Bryan Hooi, Jiashi Feng, Shie Mannor:
The Geometry of Robust Value Functions. CoRR abs/2201.12929 (2022) - [i181]Stav Belogolovsky, Ido Greenberg, Danny Eytan, Shie Mannor:
Continuous Forecasting via Neural Eigen Decomposition of Stochastic Dynamics. CoRR abs/2202.00117 (2022) - [i180]Yuval Atzmon, Eli A. Meirom, Shie Mannor, Gal Chechik:
Learning to reason about and to act on physical cascading events. CoRR abs/2202.01108 (2022) - [i179]Binyamin Perets, Mark Kozdoba, Shie Mannor:
Whats Missing? Learning Hidden Markov Models When the Locations of Missing Observations are Unknown. CoRR abs/2203.06527 (2022) - [i178]Eli A. Meirom, Haggai Maron, Shie Mannor, Gal Chechik:
Optimizing Tensor Network Contraction Using Reinforcement Learning. CoRR abs/2204.09052 (2022) - [i177]Ido Greenberg, Yinlam Chow, Mohammad Ghavamzadeh, Shie Mannor:
Efficient Risk-Averse Reinforcement Learning. CoRR abs/2205.05138 (2022) - [i176]Navdeep Kumar, Kfir Levy, Kaixin Wang, Shie Mannor:
Efficient Policy Iteration for Robust Markov Decision Processes via Regularization. CoRR abs/2205.14327 (2022) - [i175]Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal:
Reinforcement Learning with a Terminator. CoRR abs/2205.15376 (2022) - [i174]Shirli Di-Castro Shashua, Shie Mannor, Dotan Di Castro:
Analysis of Stochastic Processes through Replay Buffers. CoRR abs/2206.12848 (2022) - [i173]Benjamin Fuhrer, Yuval Shpigelman, Chen Tessler, Shie Mannor, Gal Chechik, Eitan Zahavi, Gal Dalal:
Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs. CoRR abs/2207.02295 (2022) - [i172]Mohammadi Zaki, Avinash Mohan, Aditya Gopalan, Shie Mannor:
Actor-Critic based Improper Reinforcement Learning. CoRR abs/2207.09090 (2022) - [i171]Gal Dalal, Assaf Hallak, Shie Mannor, Gal Chechik:
SoftTreeMax: Policy Gradient with Tree Search. CoRR abs/2209.13966 (2022) - [i170]Navdeep Kumar, Kaixin Wang, Kfir Levy, Shie Mannor:
Policy Gradient for Reinforcement Learning with General Utilities. CoRR abs/2210.00991 (2022) - [i169]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Reward-Mixing MDPs with a Few Latent Contexts are Learnable. CoRR abs/2210.02594 (2022) - [i168]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Tractable Optimality in Episodic Latent MABs. CoRR abs/2210.03528 (2022) - [i167]Péter Karkus, Boris Ivanovic, Shie Mannor, Marco Pavone:
DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles. CoRR abs/2212.06437 (2022) - 2021
- [j78]Stav Belogolovsky, Philip Korsunsky, Shie Mannor, Chen Tessler, Tom Zahavy:
Inverse reinforcement learning in contextual MDPs. Mach. Learn. 110(9): 2295-2334 (2021) - [c221]Yonathan Efroni, Nadav Merlis, Shie Mannor:
Reinforcement Learning with Trajectory Feedback. AAAI 2021: 7288-7295 - [c220]Nadav Merlis, Shie Mannor:
Lenient Regret for Multi-Armed Bandits. AAAI 2021: 8950-8957 - [c219]Avi Mohan, Shie Mannor, Arman C. Kizilkale:
On the Volatility of Optimal Control Policies of a Class of Linear Quadratic Regulators. ACC 2021: 4533-4540 - [c218]Roi Pony, Itay Naeh, Shie Mannor:
Over-the-Air Adversarial Flickering Attacks Against Video Recognition Networks. CVPR 2021: 515-524 - [c217]Esther Derman, Gal Dalal, Shie Mannor:
Acting in Delayed Environments with Non-Stationary Markov Policies. ICLR 2021 - [c216]Shauharda Khadka, Estelle Aflalo, Mattias Marder, Avrech Ben-David, Santiago Miret, Shie Mannor, Tamir Hazan, Hanlin Tang, Somdeb Majumdar:
Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning. ICLR 2021 - [c215]Yonathan Efroni, Nadav Merlis, Aadirupa Saha, Shie Mannor:
Confidence-Budget Matching for Sequential Budgeted Learning. ICML 2021: 2937-2947 - [c214]Ido Greenberg, Shie Mannor:
Detecting Rewards Deterioration in Episodic Reinforcement Learning. ICML 2021: 3842-3853 - [c213]Michael Lutter, Shie Mannor, Jan Peters, Dieter Fox, Animesh Garg:
Value Iteration in Continuous Actions, States and Time. ICML 2021: 7224-7234 - [c212]Eli A. Meirom, Haggai Maron, Shie Mannor, Gal Chechik:
Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks. ICML 2021: 7565-7577 - [c211]Ofir Nabati, Tom Zahavy, Shie Mannor:
Online Limited Memory Neural-Linear Bandits with Likelihood Matching. ICML 2021: 7905-7915 - [c210]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Reinforcement Learning in Reward-Mixing MDPs. NeurIPS 2021: 2253-2264 - [c209]Gal Dalal, Assaf Hallak, Steven Dalton, Iuri Frosio, Shie Mannor, Gal Chechik:
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction. NeurIPS 2021: 5518-5530 - [c208]Shirli Di-Castro Shashua, Dotan Di Castro, Shie Mannor:
Sim and Real: Better Together. NeurIPS 2021: 6868-6880 - [c207]Esther Derman, Matthieu Geist, Shie Mannor:
Twice regularized MDPs and the equivalence between robustness and regularization. NeurIPS 2021: 22274-22287 - [c206]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
RL for Latent MDPs: Regret Guarantees and a Lower Bound. NeurIPS 2021: 24523-24534 - [c205]Michael Lutter, Shie Mannor, Jan Peters, Dieter Fox, Animesh Garg:
Robust Value Iteration for Continuous Control Tasks. Robotics: Science and Systems 2021 - [c204]Nir Baram, Guy Tennenholtz, Shie Mannor:
Action redundancy in reinforcement learning. UAI 2021: 376-385 - [c203]Guy Tennenholtz, Uri Shalit, Shie Mannor, Yonathan Efroni:
Bandits with partially observable confounded data. UAI 2021: 430-439 - [c202]Harsh Agrawal, Eli A. Meirom, Yuval Atzmon, Shie Mannor, Gal Chechik:
Known unknowns: Learning novel concepts using reasoning-by-elimination. UAI 2021: 504-514 - [i166]Esther Derman, Gal Dalal, Shie Mannor:
Acting in Delayed Environments with Non-Stationary Markov Policies. CoRR abs/2101.11992 (2021) - [i165]Yonathan Efroni, Nadav Merlis, Aadirupa Saha, Shie Mannor:
Confidence-Budget Matching for Sequential Budgeted Learning. CoRR abs/2102.03400 (2021) - [i164]Ofir Nabati, Tom Zahavy, Shie Mannor:
Online Limited Memory Neural-Linear Bandits with Likelihood Matching. CoRR abs/2102.03799 (2021) - [i163]Mark Kozdoba, Shie Mannor:
Dimension Free Generalization Bounds for Non Linear Metric Learning. CoRR abs/2102.03802 (2021) - [i162]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
RL for Latent MDPs: Regret Guarantees and a Lower Bound. CoRR abs/2102.04939 (2021) - [i161]Lior Shani, Tom Zahavy, Shie Mannor:
Online Apprenticeship Learning. CoRR abs/2102.06924 (2021) - [i160]Mohammadi Zaki, Avinash Mohan, Aditya Gopalan, Shie Mannor:
Improper Learning with Gradient-based Policy Optimization. CoRR abs/2102.08201 (2021) - [i159]Chen Tessler, Yuval Shpigelman, Gal Dalal, Amit Mandelbaum, Doron Haritan Kazakov, Benjamin Fuhrer, Gal Chechik, Shie Mannor:
Reinforcement Learning for Datacenter Congestion Control. CoRR abs/2102.09337 (2021) - [i158]Guy Tennenholtz, Nir Baram, Shie Mannor:
GELATO: Geometrically Enriched Latent Model for Offline Reinforcement Learning. CoRR abs/2102.11327 (2021) - [i157]Nir Baram, Guy Tennenholtz, Shie Mannor:
Action Redundancy in Reinforcement Learning. CoRR abs/2102.11329 (2021) - [i156]Nir Baram, Guy Tennenholtz, Shie Mannor:
Maximum Entropy Reinforcement Learning with Mixture Policies. CoRR abs/2103.10176 (2021) - [i155]Ido Greenberg, Shie Mannor, Netanel Yannay:
Using Kalman Filter The Right Way: Noise Estimation Is Not Optimal. CoRR abs/2104.02372 (2021) - [i154]Mohammadi Zaki, Avi Mohan, Aditya Gopalan, Shie Mannor:
Better than the Best: Gradient-based Improper Reinforcement Learning for Network Scheduling. CoRR abs/2105.00210 (2021) - [i153]Michael Lutter, Shie Mannor, Jan Peters, Dieter Fox, Animesh Garg:
Value Iteration in Continuous Actions, States and Time. CoRR abs/2105.04682 (2021) - [i152]