default search action

combined dblp search
author search
venue search
publication search

ask others

Prashanth L. A.

Prashanth Lakshmanrao Ananthapadmanabharao

> Home > Persons

Person information

affiliation: University of Maryland
affiliation: INRIA Lille - Nord Europe
affiliation: Indian Institute of Science, Department of Computer Science and Automation

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j20]
- view
  authority control:
- export record
  dblp key:
  - journals/ftopt/AB25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ftopt/AB25
Prashanth L. A., Shalabh Bhatnagar:
Gradient-Based Algorithms for Zeroth-Order Optimization. Found. Trends Optim. 8(1-3): 1-332 (2025)
[j19]
- view
  authority control:
- export record
  dblp key:
  - journals/mor/HegdeMAJ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/mor/HegdeMAJ25
Vishwajit Hegde, Arvind S. Menon, Prashanth L. A., Krishna P. Jagannathan:
Online Estimation and Optimization of Utility-Based Shortfall Risk. Math. Oper. Res. 50(4): 2470-2501 (2025)
[j18]
- view
  authority control:
- export record
  dblp key:
  - journals/tac/PachalBA25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tac/PachalBA25
Soumen Pachal, Shalabh Bhatnagar, Prashanth L. A.:
Generalized Simultaneous Perturbation-Based Gradient Search With Reduced Estimator Bias. IEEE Trans. Autom. Control. 70(7): 4687-4702 (2025)
[c30]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/TatliMAST25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/TatliMAST25
Meltem Tatli, Arpan Mukherjee, Prashanth L. A., Karthikeyan Shanmugam, Ali Tajer:
Risk-sensitive Bandits: Arm Mixture Optimality and Regret-efficient Algorithms. AISTATS 2025: 3871-3879
[i40]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-08896
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-08896
Meltem Tatli, Arpan Mukherjee, Prashanth L. A., Karthikeyan Shanmugam, Ali Tajer:
Risk-sensitive Bandits: Arm Mixture Optimality and Regret-efficient Algorithms. CoRR abs/2503.08896 (2025)
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-20877
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-20877
Meltem Tatli, Arpan Mukherjee, Prashanth L. A., Karthikeyan Shanmugam, Ali Tajer:
Preference-centric Bandits: Optimality of Mixtures and Regret-efficient Algorithms. CoRR abs/2504.20877 (2025)
[i38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-01101
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-01101
Sumedh Gupte, Prashanth L. A., Sanjay P. Bhat:
Learning to optimize convex risk measures: The cases of utility-based shortfall risk and optimized certainty equivalent risk. CoRR abs/2506.01101 (2025)
[i37]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-07249
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-07249
Soumen Pachal, Mizhaan Prajit Maniyar, Prashanth L. A.:
Policy Newton methods for Distortion Riskmetrics. CoRR abs/2508.07249 (2025)
2024
[j17]
- view
  authority control:
- export record
  dblp key:
  - journals/automatica/MondalAB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/automatica/MondalAB24
Akash Mondal, Prashanth L. A., Shalabh Bhatnagar:
Truncated Cauchy random perturbations for smoothed functional-based stochastic optimization. Autom. 162: 111528 (2024)
[c29]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/ManiyarAMB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/ManiyarAMB24
Mizhaan Prajit Maniyar, Prashanth L. A., Akash Mondal, Shalabh Bhatnagar:
A Cubic-regularized Policy Newton Algorithm for Reinforcement Learning. AISTATS 2024: 4708-4716
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/cdc/GupteAB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cdc/GupteAB24
Sumedh Gupte, Prashanth L. A., Sanjay P. Bhat:
Optimization of Utility-based Shortfall Risk: A Non-asymptotic Viewpoint. CDC 2024: 1075-1080
[c27]
- view
- export record
  dblp key:
  - conf/icml/AgrawalAM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/AgrawalAM24
Shubhada Agrawal, Prashanth L. A., Siva Theja Maguluri:
Policy Evaluation for Variance in Average Reward Reinforcement Learning. ICML 2024: 471-502
[c26]
- view
- export record
  dblp key:
  - conf/icml/ThoppeAB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ThoppeAB24
Gugan Thoppe, Prashanth L. A., Sanjay P. Bhat:
Risk Estimation in a Markov Cost Process: Lower and Upper Bounds. ICML 2024: 48124-48138
[i36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-20933
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-20933
Ayon Ghosh, Prashanth L. A., Krishna P. Jagannathan:
Concentration Bounds for Optimized Certainty Equivalent Risk Estimation. CoRR abs/2405.20933 (2024)
[i35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07892
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07892
Tejaram Sangadi, Prashanth L. A., Krishna P. Jagannathan:
Finite Time Analysis of Temporal Difference Learning for Mean-Variance in a Discounted MDP. CoRR abs/2406.07892 (2024)
2023
[j16]
- view
  authority control:
- export record
  dblp key:
  - journals/tac/BhavsarA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tac/BhavsarA23
Nirav Bhavsar, Prashanth L. A.:
Nonasymptotic Bounds for Stochastic Optimization With Biased Noisy Gradient Oracles. IEEE Trans. Autom. Control. 68(3): 1628-1641 (2023)
[c25]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/PatilANP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/PatilANP23
Gandharv Patil, Prashanth L. A., Dheeraj Nagaraj, Doina Precup:
Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation. AISTATS 2023: 5438-5448
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/ciss/BhatnagarA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ciss/BhatnagarA23
Shalabh Bhatnagar, Prashanth L. A.:
Generalized Simultaneous Perturbation Stochastic Approximation with Reduced Estimator Bias. CISS 2023: 1-6
[c23]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/uai/VijayanA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/VijayanA23
Nithia Vijayan, Prashanth L. A.:
A policy gradient approach for optimization of smooth risk measures. UAI 2023: 2168-2178
[i34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-10951
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-10951
Mizhaan Prajit Maniyar, Akash Mondal, Prashanth L. A., Shalabh Bhatnagar:
A Cubic-regularized Policy Newton Algorithm for Reinforcement Learning. CoRR abs/2304.10951 (2023)
[i33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-11389
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-11389
Sanjay Bhat, Prashanth L. A., Gugan Thoppe:
VaR\ and CVaR Estimation in a Markov Cost Process: Lower and Upper Bounds. CoRR abs/2310.11389 (2023)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-18743
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-18743
Sumedh Gupte, Prashanth L. A., Sanjay P. Bhat:
Optimization of utility-based shortfall risk: A non-asymptotic viewpoint. CoRR abs/2310.18743 (2023)
2022
[j15]
- view
  authority control:
- export record
  dblp key:
  - journals/ftml/A022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ftml/A022
Prashanth L. A., Michael C. Fu:
Risk-Sensitive Reinforcement Learning via Policy Gradient Search. Found. Trends Mach. Learn. 15(5): 537-693 (2022)
[j14]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/AB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/AB22
Prashanth L. A., Sanjay P. Bhat:
A Wasserstein Distance Approach for Concentration of Empirical Risk Estimates. J. Mach. Learn. Res. 23: 238:1-238:61 (2022)
[c22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/TanAJ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/TanAJ22
Vincent Y. F. Tan, Prashanth L. A., Krishna P. Jagannathan:
A Survey of Risk-Aware Multi-Armed Bandits. IJCAI 2022: 5623-5629
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-11046
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-11046
Nithia Vijayan, Prashanth L. A.:
Approximate gradient ascent methods for distortion risk measures. CoRR abs/2202.11046 (2022)
[i30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-16810
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-16810
Dipayan Sen, Prashanth L. A., Aditya Gopalan:
Adaptive Estimation of Random Vectors with Bandit Feedback. CoRR abs/2203.16810 (2022)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-05843
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-05843
Vincent Y. F. Tan, Prashanth L. A., Krishna P. Jagannathan:
A Survey of Risk-Aware Multi-Armed Bandits. CoRR abs/2205.05843 (2022)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-00290
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-00290
Akash Mondal, Prashanth L. A., Shalabh Bhatnagar:
A Gradient Smoothed Functional Algorithm with Truncated Cauchy Random Perturbations for Stochastic Optimization. CoRR abs/2208.00290 (2022)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-05918
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-05918
Gandharv Patil, Prashanth L. A., Dheeraj Nagaraj, Doina Precup:
Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation. CoRR abs/2210.05918 (2022)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-10477
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-10477
Shalabh Bhatnagar, Prashanth L. A.:
Generalized Simultaneous Perturbation Stochastic Approximation with Reduced Estimator Bias. CoRR abs/2212.10477 (2022)
2021
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/ml/AKM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ml/AKM21
Prashanth L. A., Nathaniel Korda, Rémi Munos:
Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling. Mach. Learn. 110(3): 559-618 (2021)
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/scl/VijayanA21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/scl/VijayanA21
Nithia Vijayan, Prashanth L. A.:
Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint. Syst. Control. Lett. 155: 104988 (2021)
[c21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/PandeyAB21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/PandeyAB21
Ajay Kumar Pandey, Prashanth L. A., Sanjay P. Bhat:
Estimation of Spectral Risk Measures. AAAI 2021: 12166-12173
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2101-02137
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-02137
Nithia Vijayan, Prashanth L. A.:
Smoothed functional-based gradient algorithms for off-policy reinforcement learning. CoRR abs/2101.02137 (2021)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-04422
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-04422
Nithia Vijayan, Prashanth L. A.:
Likelihood ratio-based policy gradient methods for distorted risk measures: A non-asymptotic analysis. CoRR abs/2107.04422 (2021)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-08805
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-08805
Arvind S. Menon, Prashanth L. A., Krishna P. Jagannathan:
Online Estimation and Optimization of Utility-Based Shortfall Risk. CoRR abs/2111.08805 (2021)
2020
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/tac/ABBFM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tac/ABBFM20
Prashanth L. A., Shalabh Bhatnagar, Nirav Bhavsar, Michael C. Fu, Steven I. Marcus:
Random Directions Stochastic Approximation With Deterministic Perturbations. IEEE Trans. Autom. Control. 65(6): 2450-2465 (2020)
[c20]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/AJK20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/AJK20
Prashanth L. A., Krishna P. Jagannathan, Ravi Kumar Kolla:
Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions. ICML 2020: 5577-5586
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-11440
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-11440
Nirav Bhavsar, Prashanth L. A.:
Non-Asymptotic Bounds for Zeroth-Order Stochastic Optimization. CoRR abs/2002.11440 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/orl/KollaABJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/orl/KollaABJ19
Ravi Kumar Kolla, Prashanth L. A., Sanjay P. Bhat, Krishna P. Jagannathan:
Concentration bounds for empirical conditional value-at-risk: The unbounded case. Oper. Res. Lett. 47(1): 16-20 (2019)
[c19]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/BodaA19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BodaA19
Vinay Praneeth Boda, Prashanth L. A.:
Correlated bandits or: How to minimize mean-squared error online. ICML 2019: 686-694
[c18]
- view
- export record
  dblp key:
  - conf/nips/BhatA19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BhatA19
Sanjay P. Bhat, Prashanth L. A.:
Concentration of risk measures: A Wasserstein distance approach. NeurIPS 2019: 11739-11748
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1901-00997
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-00997
Ravi Kumar Kolla, Prashanth L. A., Krishna P. Jagannathan:
Risk-aware Multi-armed Bandits Using Conditional Value-at-Risk. CoRR abs/1901.00997 (2019)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-02953
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-02953
Vinay Praneeth Boda, Prashanth L. A.:
Correlated bandits or: How to minimize mean-squared error online. CoRR abs/1902.02953 (2019)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-10709
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-10709
Sanjay P. Bhat, Prashanth L. A.:
Improved Concentration Bounds for Conditional Value-at-Risk and Cumulative Prospect Theory using Wasserstein distance. CoRR abs/1902.10709 (2019)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-10398
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-10398
Ajay Kumar Pandey, Prashanth L. A., Sanjay P. Bhat:
Estimation of Spectral Risk Measures. CoRR abs/1912.10398 (2019)
2018
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/tac/JieAFMS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tac/JieAFMS18
Cheng Jie, Prashanth L. A., Michael C. Fu, Steven I. Marcus, Csaba Szepesvári:
Stochastic Optimization in a Cumulative Prospect Theory Framework. IEEE Trans. Autom. Control. 63(9): 2867-2882 (2018)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1808-01739
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-01739
Ravi Kumar Kolla, Prashanth L. A., Sanjay P. Bhat, Krishna P. Jagannathan:
Concentration bounds for empirical conditional value-at-risk: The unbounded case. CoRR abs/1808.01739 (2018)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1808-02871
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-02871
Prashanth L. A., Shalabh Bhatnagar, Nirav Bhavsar, Michael C. Fu, Steven I. Marcus:
Random directions stochastic approximation with deterministic perturbations. CoRR abs/1808.02871 (2018)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1810-09126
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-09126
Prashanth L. A., Michael C. Fu:
Risk-Sensitive Reinforcement Learning: A Constrained Optimization Viewpoint. CoRR abs/1810.09126 (2018)
2017
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/tac/ABFM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tac/ABFM17
Prashanth L. A., Shalabh Bhatnagar, Michael C. Fu, Steven I. Marcus:
Adaptive System Optimization Using Random Directions Stochastic Approximation. IEEE Trans. Autom. Control. 62(5): 2223-2238 (2017)
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/GopalanAFM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/GopalanAFM17
Aditya Gopalan, Prashanth L. A., Michael C. Fu, Steven I. Marcus:
Weighted Bandits or: How Bandits Learn Distorted Values That Are Not Expected. AAAI 2017: 1941-1947
2016
[j7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ml/AG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ml/AG16
Prashanth L. A., Mohammad Ghavamzadeh:
Variance-constrained actor-critic algorithms for discounted and average reward MDPs. Mach. Learn. 105(3): 367-417 (2016)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/scl/APBC16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/scl/APBC16
Prashanth L. A., H. L. Prasad, Shalabh Bhatnagar, Prakash Chandra:
A constrained optimization perspective on actor-critic algorithms and application to network routing. Syst. Control. Lett. 92: 46-51 (2016)
[c16]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/HuAGS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/HuAGS16
Xiaowei Hu, Prashanth L. A., András György, Csaba Szepesvári:
(Bandit) Convex Optimization with Biased Noisy Gradient Oracles. AISTATS 2016: 819-828
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/cdc/ReddyAB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cdc/ReddyAB16
Sai Koti Reddy Danda, Prashanth L. A., Shalabh Bhatnagar:
Improved Hessian estimation for adaptive random directions stochastic approximation. CDC 2016: 3682-3687
[c14]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/AJFMS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/AJFMS16
Prashanth L. A., Cheng Jie, Michael C. Fu, Steven I. Marcus, Csaba Szepesvári:
Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control. ICML 2016: 1406-1415
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/HuAGS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HuAGS16
Xiaowei Hu, Prashanth L. A., András György, Csaba Szepesvári:
(Bandit) Convex Optimization with Biased Noisy Gradient Oracles. CoRR abs/1609.07087 (2016)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/GopalanAFM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/GopalanAFM16
Aditya Gopalan, Prashanth L. A., Michael C. Fu, Steven I. Marcus:
Weighted bandits or: How bandits learn distorted values that are not expected. CoRR abs/1611.10283 (2016)
2015
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/jota/BhatnagarA15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jota/BhatnagarA15
Shalabh Bhatnagar, Prashanth L. A.:
Simultaneous Perturbation Newton Algorithms for Simulation Optimization. J. Optim. Theory Appl. 164(2): 621-643 (2015)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/simulation/APDBD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/simulation/APDBD15
Prashanth L. A., H. L. Prasad, Nirmit Desai, Shalabh Bhatnagar, Gargi Dasgupta:
Simultaneous perturbation methods for adaptive labor staffing in service systems. Simul. 91(5): 432-455 (2015)
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/KordaAM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/KordaAM15
Nathaniel Korda, Prashanth L. A., Rémi Munos:
Fast Gradient Descent for Drifting Least Squares Regression, with Application to Bandits. AAAI 2015: 2708-2714
[c12]
- view
  - electronic edition @ acm.org
  - details & citations
- export record
  dblp key:
  - conf/atal/PrasadAB15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/PrasadAB15
H. L. Prasad, Prashanth L. A., Shalabh Bhatnagar:
Two-Timescale Algorithms for Learning Nash Equilibria in General-Sum Stochastic Games. AAMAS 2015: 1371-1379
[c11]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KordaA15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KordaA15
Nathaniel Korda, Prashanth L. A.:
On TD(0) with function approximation: Concentration bounds and a centered variant with exponential convergence. ICML 2015: 626-634
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/AB15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/AB15
Prashanth L. A., Shalabh Bhatnagar:
Adaptive system optimization using (simultaneous) random directions stochastic approximation. CoRR abs/1502.05577 (2015)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/ACFM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ACFM15
Prashanth L. A., Cheng Jie, Michael C. Fu, Steven I. Marcus:
Cumulative Prospect Theory Meets Reinforcement Learning: Estimation and Control. CoRR abs/1506.02632 (2015)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/APBC15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/APBC15
Prashanth L. A., H. L. Prasad, Shalabh Bhatnagar, Prakash Chandra:
A constrained optimization perspective on actor critic algorithms and application to network routing. CoRR abs/1507.07984 (2015)
2014
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/winet/ACB14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/winet/ACB14
Prashanth L. A., Abhranil Chatterjee, Shalabh Bhatnagar:
Two timescale convergent Q-learning for sleep-scheduling in wireless sensor networks. Wirel. Networks 20(8): 2589-2604 (2014)
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/alt/A14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/alt/A14
Prashanth L. A.:
Policy Gradients for CVaR-Constrained MDPs. ALT 2014: 155-169
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/cdc/FonteneauA14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cdc/FonteneauA14
Raphael Fonteneau, Prashanth L. A.:
Simultaneous perturbation algorithms for batch off-policy search. CDC 2014: 2622-2627
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/comsnets/ACB14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/comsnets/ACB14
Prashanth L. A., Abhranil Chatterjee, Shalabh Bhatnagar:
Adaptive sleep-wake control using reinforcement learning in sensor networks. COMSNETS 2014: 1-8
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/pkdd/AKM14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/pkdd/AKM14
Prashanth L. A., Nathaniel Korda, Rémi Munos:
Fast LSTD Using Stochastic Approximation: Finite Time Analysis and Application to Traffic Control. ECML/PKDD (2) 2014: 66-81
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/PrasadAB14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/PrasadAB14
H. L. Prasad, Prashanth L. A., Shalabh Bhatnagar:
Algorithms for Nash Equilibria in General-Sum Stochastic Games. CoRR abs/1401.2086 (2014)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/FonteneauA14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/FonteneauA14
Raphael Fonteneau, Prashanth L. A.:
Simultaneous Perturbation Algorithms for Batch Off-Policy Search. CoRR abs/1403.4514 (2014)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/AG14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/AG14
Prashanth L. A., Mohammad Ghavamzadeh:
Actor-Critic Algorithms for Risk-Sensitive Reinforcement Learning. CoRR abs/1403.6530 (2014)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/A14a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/A14a
Prashanth L. A.:
Policy Gradients for CVaR-Constrained MDPs. CoRR abs/1405.2690 (2014)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/KordaP14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/KordaP14
Nathaniel Korda, Prashanth L. A.:
On TD(0) with function approximation: Concentration bounds and a centered variant with exponential convergence. CoRR abs/1411.3224 (2014)
2013
[c6]
- view
  - electronic edition @ acm.org
  - details & citations
- export record
  dblp key:
  - conf/atal/AnanthapadmanabharaoPDB13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/AnanthapadmanabharaoPDB13
Prashanth Lakshmanrao Ananthapadmanabharao, Horabailu Laxminarayana Prasad, Nirmit Desai, Shalabh Bhatnagar:
Mechanisms for hostile agents with capacity constraints. AAMAS 2013: 659-666
[c5]
- view
- export record
  dblp key:
  - conf/nips/LAG13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LAG13
Prashanth L. A., Mohammad Ghavamzadeh:
Actor-Critic Algorithms for Risk-Sensitive MDPs. NIPS 2013: 252-260
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/PrashanthKM13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/PrashanthKM13
Prashanth L. A., Nathaniel Korda, Rémi Munos:
Analysis of stochastic approximation for efficient least squares regression and LSTD. CoRR abs/1306.2557 (2013)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/KordaPM13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/KordaPM13
Nathaniel Korda, Prashanth L. A., Rémi Munos:
Online gradient descent for least squares regression: Non-asymptotic bounds and application to bandits. CoRR abs/1307.3176 (2013)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/AnanthapadmanabharaoCB13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/AnanthapadmanabharaoCB13
Prashanth Lakshmanrao Ananthapadmanabharao, Abhranil Chatterjee, Shalabh Bhatnagar:
Reinforcement Learning for Sleep-Wake Scheduling in Sensor Networks. CoRR abs/1312.7292 (2013)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/PrashanthPDBD13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/PrashanthPDBD13
Prashanth L. A., H. L. Prasad, Nirmit Desai, Shalabh Bhatnagar, Gargi Dasgupta:
Simultaneous Perturbation Methods for Adaptive Labor Staffing in Service Systems. CoRR abs/1312.7430 (2013)
2012
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/tvt/PrashanthB12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tvt/PrashanthB12
Prashanth L. A., Shalabh Bhatnagar:
Threshold Tuning Using Stochastic Optimization for Graded Signal Control. IEEE Trans. Veh. Technol. 61(9): 3865-3880 (2012)
2011
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tits/PrashanthB11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tits/PrashanthB11
Prashanth L. A., Shalabh Bhatnagar:
Reinforcement Learning With Function Approximation for Traffic Signal Control. IEEE Trans. Intell. Transp. Syst. 12(2): 412-421 (2011)
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/icsoc/PrashanthPDBD11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icsoc/PrashanthPDBD11
Prashanth L. A., H. L. Prasad, Nirmit Desai, Shalabh Bhatnagar, Gargi Banerjee Dasgupta:
Stochastic Optimization for Adaptive Labor Staffing in Service Systems. ICSOC 2011: 487-494
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/itsc/AB11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/itsc/AB11
Prashanth L. A., Shalabh Bhatnagar:
Reinforcement learning with average cost for adaptive control of traffic lights at intersections. ITSC 2011: 1640-1645

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2008
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/ccnc/ADG08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ccnc/ADG08
Prashanth L. A., Sajal Kumar Das, K. Gopinath:
MAC Design for Heterogeneous Application Support in OFDM Based Wireless Systems. CCNC 2008: 412-413
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/comsware/PrashanthG08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/comsware/PrashanthG08
Prashanth L. A., K. Gopinath:
OFDM-MAC algorithms and their impact on TCP performance in next generation mobile networks. COMSWARE 2008: 133-140

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.