Dinesh Manocha
Person information
- affiliation: University of Maryland at College Park, MD, USA
- affiliation (former): University of North Carolina at Chapel Hill, NC, USA
2020 – today
- 2025
- [j248]Daeun Song, Jing Liang, Amirreza Payandeh, Amir Hossain Raj, Xuesu Xiao, Dinesh Manocha:
VLM-Social-Nav: Socially Aware Robot Navigation Through Scoring Using Vision-Language Models. IEEE Robotics Autom. Lett. 10(1): 508-515 (2025)
- 2024
- [j247]Vishnu Sashank Dorbala, James F. Mullen Jr., Dinesh Manocha:
Can an Embodied Agent Find Your "Cat-shaped Mug"? LLM-Based Zero-Shot Object Navigation. IEEE Robotics Autom. Lett. 9(5): 4083-4090 (2024)
- [j246]Mohamed Elnoor, Adarsh Jagan Sathyamoorthy, Kasun Weerakoon, Dinesh Manocha:
ProNav: Proprioceptive Traversability Estimation for Legged Robot Navigation in Outdoor Environments. IEEE Robotics Autom. Lett. 9(8): 7190-7197 (2024)
- [j245]James F. Mullen Jr., Prasoon Goyal, Robinson Piramuthu, Michael Johnston, Dinesh Manocha, Reza Ghanadan:
"Don't Forget to Put the Milk Back!" Dataset for Enabling Embodied Agents to Detect Anomalous Situations. IEEE Robotics Autom. Lett. 9(10): 9087-9094 (2024)
- [j244]Geonsun Lee, Dae Yeol Lee, Guan-Ming Su, Dinesh Manocha:
"May I Speak?": Multi-Modal Attention Guidance in Social VR Group Conversations. IEEE Trans. Vis. Comput. Graph. 30(5): 2287-2297 (2024)
- [j243]Mohammad R. Saeedpour-Parizi, Niall L. Williams, Tim Wong, Phillip Guan, Dinesh Manocha, Ian M. Erkelens:
Perceptual Thresholds for Radial Optic Flow Distortion in Near-Eye Stereoscopic Displays. IEEE Trans. Vis. Comput. Graph. 30(5): 2570-2579 (2024)
- [j242]Elizabeth Childs, Ferzam Mohammad, Logan Stevens, Hugo Burbelo, Amanuel Awoke, Nicholas Rewkowski, Dinesh Manocha:
An Overview of Enhancing Distance Learning Through Emerging Augmented and Virtual Reality Technologies. IEEE Trans. Vis. Comput. Graph. 30(8): 4480-4496 (2024)
- [c483]Jaehoon Choi, Yonghan Lee, Hyungtae Lee, Heesung Kwon, Dinesh Manocha:
MeshGS: Adaptive Mesh-Aligned Gaussian Splatting for High-Quality Rendering. ACCV (9) 2024: 262-279
- [c482]Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, S. Sakshi, Sanjoy Chowdhury, Dinesh Manocha:
ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations. ACL (Findings) 2024: 386-406
- [c481]Sreyan Ghosh, Utkarsh Tyagi, Sonal Kumar, Chandra Kiran Reddy Evuru, Ramaneswaran S., S. Sakshi, Dinesh Manocha:
ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions. ACL (1) 2024: 726-748
- [c480]Pooja Guhan, Uttaran Bhattacharya, Somdeb Sarkhel, Vahid Azizi, Xiang Chen, Saayan Mitra, Aniket Bera, Dinesh Manocha:
TAME-RD: Text Assisted Replication of Image Multi-Adjustments for Reverse Designing. ACL (Findings) 2024: 10710-10727
- [c479]Puneet Mathur, Zhe Liu, Ke Li, Yingyi Ma, Gil Keren, Zeeshan Ahmed, Dinesh Manocha, Xuedong Zhang:
DOC-RAG: ASR Language Model Personalization with Domain-Distributed Co-occurrence Retrieval Augmentation. LREC/COLING 2024: 5132-5139
- [c478]Puneet Mathur, Vlad I. Morariu, Aparna Garimella, Franck Dernoncourt, Jiuxiang Gu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha, Rajiv Jain:
DocScript: Document-level Script Event Prediction. LREC/COLING 2024: 5140-5155
- [c477]Samyak Jain, Parth Chhabra, Atula Tejaswi Neerkaje, Puneet Mathur, Ramit Sawhney, Shivam Agarwal, Preslav Nakov, Sudheer Chava, Dinesh Manocha:
Saliency-Aware Interpolative Augmentation for Multimodal Financial Prediction. LREC/COLING 2024: 14285-14297
- [c476]Uttaran Bhattacharya, Aniket Bera, Dinesh Manocha:
Speech2UnifiedExpressions: Synchronous Synthesis of Co-Speech Affective Face and Body Expressions from Affordable Inputs. CVPR Workshops 2024: 1877-1887
- [c475]Jaehoon Choi, Rajvi Shah, Qinbo Li, Yipeng Wang, Ayush Saraf, Changil Kim, Jia-Bin Huang, Dinesh Manocha, Suhib Alsisan, Johannes Kopf:
LTM: Lightweight Textured Mesh Extraction and Refinement of Large Unbounded Scenes for Efficient Storage and Real-Time Rendering. CVPR 2024: 5053-5063
- [c474]Tianrui Guan, Fuxiao Liu, Xiyang Wu, Ruiqi Xian, Zongxia Li, Xiaoyu Liu, Xijun Wang, Lichang Chen, Furong Huang, Yaser Yacoob, Dinesh Manocha, Tianyi Zhou:
Hallusionbench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models. CVPR 2024: 14375-14385
- [c473]Sanjoy Chowdhury, Sayan Nag, K. J. Joseph, Balaji Vasan Srinivasan, Dinesh Manocha:
MELFuSION: Synthesizing Music from Image and Language Cues Using Diffusion Models. CVPR 2024: 26816-26825
- [c472]Anton Ratnarajah, Sreyan Ghosh, Sonal Kumar, Purva Chiniya, Dinesh Manocha:
AV-RIR: Audio-Visual Room Impulse Response Estimation. CVPR 2024: 27154-27165
- [c471]Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta, Jun Chen, Mohamed Elhoseiny, Ruohan Gao, Dinesh Manocha:
MEERKAT: Audio-Visual Large Language Model for Grounding in Space and Time. ECCV (64) 2024: 52-70
- [c470]Pooja Guhan, Tsung-Wei Huang, Guan-Ming Su, Subhadra Gopalakrishnan, Dinesh Manocha:
V-Trans4Style: Visual Transition Recommendation for Video Production Style Adaptation. ECCV (80) 2024: 191-206
- [c469]Sreyan Ghosh, Sonal Kumar, Ashish Seth, Chandra Kiran Reddy Evuru, Utkarsh Tyagi, S. Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha:
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities. EMNLP 2024: 6288-6313
- [c468]Ashish Seth, Ramaneswaran Selvakumar, S. Sakshi, Sonal Kumar, Sreyan Ghosh, Dinesh Manocha:
EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning. EMNLP 2024: 6386-6400
- [c467]Xiyang Wu, Tianrui Guan, Dianqi Li, Shuaiyi Huang, Xiaoyu Liu, Xijun Wang, Ruiqi Xian, Abhinav Shrivastava, Furong Huang, Jordan L. Boyd-Graber, Tianyi Zhou, Dinesh Manocha:
AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language Models. EMNLP (Findings) 2024: 8395-8419
- [c466]Manan Suri, Puneet Mathur, Franck Dernoncourt, Rajiv Jain, Vlad I. Morariu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha:
DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding. EMNLP 2024: 15485-15505
- [c465]Soumya Suvra Ghosal, Samyadeep Basu, Soheil Feizi, Dinesh Manocha:
IntCoOp: Interpretability-Aware Vision-Language Prompt Tuning. EMNLP 2024: 19584-19601
- [c464]Sreyan Ghosh, Sonal Kumar, Chandra Kiran Reddy Evuru, Ramani Duraiswami, Dinesh Manocha:
Recap: Retrieval-Augmented Audio Captioning. ICASSP 2024: 1161-1165
- [c463]Ashish Seth, Sreyan Ghosh, Srinivasan Umesh, Dinesh Manocha:
Stable Distillation: Regularizing Continued Pre-Training for Low-Resource Automatic Speech Recognition. ICASSP 2024: 10821-10825
- [c462]Ashish Seth, Sreyan Ghosh, Srinivasan Umesh, Dinesh Manocha:
FusDom: Combining in-Domain and Out-of-Domain Knowledge for Continuous Self-Supervised Learning. ICASSP 2024: 12572-12576
- [c461]Souradip Chakraborty, Amrit S. Bedi, Alec Koppel, Huazheng Wang, Dinesh Manocha, Mengdi Wang, Furong Huang:
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback. ICLR 2024
- [c460]Sreyan Ghosh, Ashish Seth, Sonal Kumar, Utkarsh Tyagi, Chandra Kiran Reddy Evuru, Ramaneswaran S., Sakshi Singh, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha:
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models. ICLR 2024
- [c459]Manan Suri, Puneet Mathur, Ramit Sawhney, Preslav Nakov, Dinesh Manocha:
Doc2Command: Furthering Language Guided Document Editing. Tiny Papers @ ICLR 2024
- [c458]Souradip Chakraborty, Amrit S. Bedi, Sicheng Zhu, Bang An, Dinesh Manocha, Furong Huang:
Position: On the Possibilities of AI-Generated Text Detection. ICML 2024
- [c457]Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Dinesh Manocha, Furong Huang, Amrit S. Bedi, Mengdi Wang:
MaxMin-RLHF: Alignment with Diverse Human Preferences. ICML 2024
- [c456]Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Ramaneswaran S., Deepali Aneja, Zeyu Jin, Ramani Duraiswami, Dinesh Manocha:
A Closer Look at the Limitations of Instruction Tuning. ICML 2024
- [c455]Bhrij Patel, Wesley A. Suttle, Alec Koppel, Vaneet Aggarwal, Brian M. Sadler, Dinesh Manocha, Amrit S. Bedi:
Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles. ICML 2024
- [c454]Jing Liang, Peng Gao, Xuesu Xiao, Adarsh Jagan Sathyamoorthy, Mohamed Elnoor, Ming C. Lin, Dinesh Manocha:
MTG: Mapless Trajectory Generator with Traversability Coverage for Outdoor Navigation. ICRA 2024: 2396-2402
- [c453]Nare Karapetyan, Ahmad Bilal Asghar, Amisha Bhaskar, Guangyao Shi, Dinesh Manocha, Pratap Tokekar:
AG-Cvg: Coverage Planning with a Mobile Recharging UGV and an Energy-Constrained UAV. ICRA 2024: 2617-2623
- [c452]Christopher Maxey, Jaehoon Choi, Hyungtae Lee, Dinesh Manocha, Heesung Kwon:
UAV-Sim: NeRF-based Synthetic Data Generation for UAV-based Perception. ICRA 2024: 5323-5329
- [c451]Senthil Hariharan Arul, Jong Jin Park, Vishnu Prem, Yang Zhang, Dinesh Manocha:
Unconstrained Model Predictive Control for Robot Navigation under Uncertainty. ICRA 2024: 9321-9327
- [c450]Kasun Weerakoon, Adarsh Jagan Sathyamoorthy, Mohamed Elnoor, Dinesh Manocha:
VAPOR: Legged Robot Navigation in Unstructured Outdoor Environments using Offline Reinforcement Learning. ICRA 2024: 10344-10350
- [c449]Adarsh Jagan Sathyamoorthy, Kasun Weerakoon, Mohamed Elnoor, Mason Russell, Jason L. Pusey, Dinesh Manocha:
MIM: Indoor and Outdoor Navigation in Complex Environments Using Multi-Layer Intensity Maps. ICRA 2024: 10917-10924
- [c448]Biao Jia, Dinesh Manocha:
Sim-to-Real Robotic Sketching using Behavior Cloning and Reinforcement Learning. ICRA 2024: 18272-18278
- [c447]Jing Liang, Amirreza Payandeh, Daeun Song, Xuesu Xiao, Dinesh Manocha:
DTG : Diffusion-based Trajectory Generation for Mapless Global Navigation. IROS 2024: 5340-5347
- [c446]Senthil Hariharan Arul, Dhruva Kumar, Vivek Sugirtharaj, Richard Kim, Xuewei Tony Qi, Rajasimman Madhivanan, Arnie Sen, Dinesh Manocha:
VLPG-Nav: Object Navigation Using Visual Language Pose Graph and Object Localization Probability Maps. IROS 2024: 7625-7632
- [c445]Mohamed Elnoor, Kasun Weerakoon, Adarsh Jagan Sathyamoorthy, Tianrui Guan, Vignesh Rajagopal, Dinesh Manocha:
AMCO: Adaptive Multimodal Coupling of Vision and Proprioception for Quadruped Robot Navigation in Outdoor Environments. IROS 2024: 7687-7694
- [c444]Senthil Hariharan Arul, Amrit Singh Bedi, Dinesh Manocha:
When, What, and with Whom to Communicate: Enhancing RL-based Multi-Robot Navigation through Selective Communication. IROS 2024: 7695
- [c443]Tianrui Guan, Ruiqi Xian, Xijun Wang, Xiyang Wu, Mohamed Elnoor, Daeun Song, Dinesh Manocha:
AGL-Net: Aerial-Ground Cross-Modal Global Localization with Varying Scales. IROS 2024: 8161
- [c442]Chak Lam Shek, Xiyang Wu, Wesley A. Suttle, Carl E. Busart, Erin G. Zaroukian, Dinesh Manocha, Pratap Tokekar, Amrit Singh Bedi:
LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments. IROS 2024: 9612-9619
- [c441]Xijun Wang, Ruiqi Xian, Tianrui Guan, Fuxiao Liu, Dinesh Manocha:
SCP: Soft Conditional Prompt Learning for Aerial Video Action Recognition. IROS 2024: 10967-10974
- [c440]Adarsh Jagan Sathyamoorthy, Kasun Weerakoon, Mohamed Elnoor, Anuj Zore, Brian Ichter, Fei Xia, Jie Tan, Wenhao Yu, Dinesh Manocha:
CoNVOI: Context-aware Navigation using Vision Language Models in Outdoor and Indoor Environments. IROS 2024: 13837-13844
- [c439]Jing Liang, Zhuo Deng, Zheming Zhou, Omid Ghasemalizadeh, Dinesh Manocha, Min Sun, Cheng-Hao Kuo, Arnie Sen:
PoCo: Point Context Cluster for RGBD Indoor Place Recognition. IROS 2024: 14180-14187
- [c438]Vishnu Sashank Dorbala, Sanjoy Chowdhury, Dinesh Manocha:
Can LLM's Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis. NAACL (Short Papers) 2024: 258-271
- [c437]Sonal Kumar, Sreyan Ghosh, S. Sakshi, Utkarsh Tyagi, Dinesh Manocha:
Do Vision-Language Models Understand Compound Nouns? NAACL (Short Papers) 2024: 519-527
- [c436]Chandra Kiran Reddy Evuru, Sreyan Ghosh, Sonal Kumar, Ramaneswaran S., Utkarsh Tyagi, Dinesh Manocha:
CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP. NAACL-HLT (Findings) 2024: 3754-3769
- [c435]Anton Ratnarajah, Dinesh Manocha:
Listen2Scene: Interactive material-aware binaural sound propagation for reconstructed 3D scenes. VR 2024: 254-264
- [c434]Geonsun Lee, Jennifer Healey, Dinesh Manocha:
DocuBits: VR Document Decomposition for Procedural Task Completion. VR 2024: 309-319
- [c433]Ruiqi Xian, Xijun Wang, Dinesh Manocha:
MITFAS: Mutual Information based Temporal Feature Alignment and Sampling for Aerial Video Action Recognition. WACV 2024: 6611-6620
- [c432]Ruiqi Xian, Xijun Wang, Divya Kothandaraman, Dinesh Manocha:
PMI Sampler: Patch Similarity Guided Frame Selection For Aerial Action Recognition. WACV 2024: 6967-6976
- [i324]Geonsun Lee, Dae Yeol Lee, Guan-Ming Su, Dinesh Manocha:
"May I Speak?": Multi-modal Attention Guidance in Social VR Group Conversations. CoRR abs/2401.15507 (2024)
- [i323]Geonsun Lee, Jennifer Healey, Dinesh Manocha:
DocuBits: VR Document Decomposition for Procedural Task Completion. CoRR abs/2401.15510 (2024)
- [i322]Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Ramaneswaran S., Deepali Aneja, Zeyu Jin, Ramani Duraiswami, Dinesh Manocha:
A Closer Look at the Limitations of Instruction Tuning. CoRR abs/2402.05119 (2024)
- [i321]Mohammad R. Saeedpour-Parizi, Niall L. Williams, Tim Wong, Phillip Guan, Dinesh Manocha, Ian M. Erkelens:
Perceptual Thresholds for Radial Optic Flow Distortion in Near-Eye Stereoscopic Displays. CoRR abs/2402.07916 (2024)
- [i320]Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Furong Huang, Dinesh Manocha, Amrit Singh Bedi, Mengdi Wang:
MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences. CoRR abs/2402.08925 (2024)
- [i319]Xiyang Wu, Ruiqi Xian, Tianrui Guan, Jing Liang, Souradip Chakraborty, Fuxiao Liu, Brian M. Sadler, Dinesh Manocha, Amrit Singh Bedi:
On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities. CoRR abs/2402.10340 (2024)
- [i318]Peihong Yu, Manav Mishra, Alec Koppel, Carl E. Busart, Priya Narayan, Dinesh Manocha, Amrit S. Bedi, Pratap Tokekar:
Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning. CoRR abs/2403.08936 (2024)
- [i317]Jing Liang, Amirreza Payandeh, Daeun Song, Xuesu Xiao, Dinesh Manocha:
DTG : Diffusion-based Trajectory Generation for Mapless Global Navigation. CoRR abs/2403.09900 (2024)
- [i316]Vishnu Sashank Dorbala, Bhrij Patel, Amrit Singh Bedi, Dinesh Manocha:
Right Place, Right Time! Towards ObjectNav for Non-Stationary Goals. CoRR abs/2403.09905 (2024)
- [i315]Vishnu Sashank Dorbala, Sanjoy Chowdhury, Dinesh Manocha:
Can LLMs Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis. CoRR abs/2403.11487 (2024)
- [i314]Bhrij Patel, Wesley A. Suttle, Alec Koppel, Vaneet Aggarwal, Brian M. Sadler, Amrit Singh Bedi, Dinesh Manocha:
Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic. CoRR abs/2403.11925 (2024)
- [i313]James F. Mullen Jr., Dinesh Manocha:
Towards Robots That Know When They Need Help: Affordance-Based Uncertainty for Large Language Model Planners. CoRR abs/2403.13198 (2024)
- [i312]Mohamed Elnoor, Kasun Weerakoon, Adarsh Jagan Sathyamoorthy, Tianrui Guan, Vignesh Rajagopal, Dinesh Manocha:
AMCO: Adaptive Multimodal Coupling of Vision and Proprioception for Quadruped Robot Navigation in Outdoor Environments. CoRR abs/2403.13235 (2024)
- [i311]Adarsh Jagan Sathyamoorthy, Kasun Weerakoon, Mohamed Elnoor, Anuj Zore, Brian Ichter, Fei Xia, Jie Tan, Wenhao Yu, Dinesh Manocha:
CoNVOI: Context-aware Navigation using Vision Language Models in Outdoor and Indoor Environments. CoRR abs/2403.15637 (2024)
- [i310]Daeun Song, Jing Liang, Amirreza Payandeh, Xuesu Xiao, Dinesh Manocha:
Socially Aware Robot Navigation through Scoring Using Vision-Language Models. CoRR abs/2404.00210 (2024)
- [i309]Chandra Kiran Reddy Evuru, Sreyan Ghosh, Sonal Kumar, Ramaneswaran S., Utkarsh Tyagi, Dinesh Manocha:
CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP. CoRR abs/2404.00415 (2024)
- [i308]Sonal Kumar, Sreyan Ghosh, Sakshi Singh, Utkarsh Tyagi, Dinesh Manocha:
Do Vision-Language Models Understand Compound Nouns? CoRR abs/2404.00419 (2024)
- [i307]Jing Liang, Zhuo Deng, Zheming Zhou, Omid Ghasemalizadeh, Dinesh Manocha, Min Sun, Cheng-Hao Kuo, Arnie Sen:
PoCo: Point Context Cluster for RGBD Indoor Place Recognition. CoRR abs/2404.02885 (2024)
- [i306]Tianrui Guan, Ruiqi Xian, Xijun Wang, Xiyang Wu, Mohamed Elnoor, Daeun Song, Dinesh Manocha:
AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales. CoRR abs/2404.03187 (2024)
- [i305]James F. Mullen Jr., Prasoon Goyal, Robinson Piramuthu, Michael Johnston, Dinesh Manocha, Reza Ghanadan:
"Don't forget to put the milk back!" Dataset for Enabling Embodied Agents to Detect Anomalous Situations. CoRR abs/2404.08827 (2024)
- [i304]Christopher Maxey, Jaehoon Choi, Yonghan Lee, Hyungtae Lee, Dinesh Manocha, Heesung Kwon:
TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes. CoRR abs/2405.02762 (2024)
- [i303]Vishnu Sashank Dorbala, Prasoon Goyal, Robinson Piramuthu, Michael Johnston, Dinesh Manocha, Reza Ghanadan:
S-EQA: Tackling Situational Queries in Embodied Question Answering. CoRR abs/2405.04732 (2024)
- [i302]Tianrui Guan, Yurou Yang, Harry Cheng, Muyuan Lin, Richard Kim, Rajasimman Madhivanan, Arnie Sen, Dinesh Manocha:
LOC-ZSON: Language-driven Object-Centric Zero-Shot Object Retrieval and Navigation. CoRR abs/2405.05363 (2024)
- [i301]Divya Kothandaraman, Ming C. Lin, Dinesh Manocha:
Prompt Mixing in Diffusion Models using the Black Scholes Algorithm. CoRR abs/2405.13685 (2024)
- [i300]Divya Kothandaraman, Kihyuk Sohn, Ruben Villegas, Paul Voigtlaender, Dinesh Manocha, Mohammad Babaeizadeh:
Text Prompting for Multi-Concept Video Customization by Autoregressive Generation. CoRR abs/2405.13951 (2024)
- [i299]Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto, Zeyu Jin, Dinesh Manocha:
VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap. CoRR abs/2405.15683 (2024)
- [i298]Nilesh Suriyarachchi, Rohan Chandra, Arya Anantula, John S. Baras, Dinesh Manocha:
GAMEOPT+: Improving Fuel Efficiency in Unregulated Heterogeneous Traffic Intersections via Optimal Multi-agent Cooperative Control. CoRR abs/2405.16430 (2024)
- [i297]Ruichen Wang, Dinesh Manocha:
EM-GANSim: Real-time and Accurate EM Simulation Using Conditional GANs for 3D Indoor Scenes. CoRR abs/2405.17366 (2024)
- [i296]Souradip Chakraborty, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Singh Bedi, Furong Huang:
Transfer Q Star: Principled Decoding for LLM Alignment. CoRR abs/2405.20495 (2024)
- [i295]Sreyan Ghosh, Utkarsh Tyagi, Sonal Kumar, Chandra Kiran Reddy Evuru, S. Ramaneswaran, Sakshi Singh, Dinesh Manocha:
ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions. CoRR abs/2406.04286 (2024)
- [i294]Sreyan Ghosh, Sonal Kumar, Ashish Seth, Purva Chiniya, Utkarsh Tyagi, Ramani Duraiswami, Dinesh Manocha:
LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition. CoRR abs/2406.04432 (2024)
- [i293]Sanjoy Chowdhury, Sayan Nag, K. J. Joseph, Balaji Vasan Srinivasan, Dinesh Manocha:
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models. CoRR abs/2406.04673 (2024)
- [i292]Xiyang Wu, Tianrui Guan, Dianqi Li, Shuaiyi Huang, Xiaoyu Liu, Xijun Wang, Ruiqi Xian, Abhinav Shrivastava, Furong Huang, Jordan Lee Boyd-Graber, Tianyi Zhou, Dinesh Manocha:
AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models. CoRR abs/2406.10900 (2024)
- [i291]Bhrij Patel, Vishnu Sashank Dorbala, Dinesh Manocha, Amrit Singh Bedi:
Embodied Question Answering via Multi-LLM Systems. CoRR abs/2406.10918 (2024)
- [i290]Sreyan Ghosh, Sonal Kumar, Ashish Seth, Chandra Kiran Reddy Evuru, Utkarsh Tyagi, Sakshi Singh, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha:
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities. CoRR abs/2406.11768 (2024)
- [i289]Soumya Suvra Ghosal, Samyadeep Basu, Soheil Feizi, Dinesh Manocha:
IntCoOp: Interpretability-Aware Vision-Language Prompt Tuning. CoRR abs/2406.13683 (2024)
- [i288]Uttaran Bhattacharya, Aniket Bera, Dinesh Manocha:
Speech2UnifiedExpressions: Synchronous Synthesis of Co-Speech Affective Face and Body Expressions from Affordable Inputs. CoRR abs/2406.18068 (2024)
- [i287]Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta, Jun Chen, Mohamed Elhoseiny, Ruohan Gao, Dinesh Manocha:
Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time. CoRR abs/2407.01851 (2024)
- [i286]Jing Liang, Zhuo Deng, Zheming Zhou, Min Sun, Omid Ghasemalizadeh, Cheng-Hao Kuo, Arnie Sen, Dinesh Manocha:
CSCPR: Cross-Source-Context Indoor RGB-D Place Recognition. CoRR abs/2407.17457 (2024)
- [i285]Vishnu Sashank Dorbala, Vishnu Dutt Sharma, Pratap Tokekar, Dinesh Manocha:
Is Generative Communication between Embodied Agents Good for Zero-Shot ObjectNav? CoRR abs/2408.01877 (2024)
- [i284]Daeun Song, Jing Liang, Xuesu Xiao, Dinesh Manocha:
TGS: Trajectory Generation and Selection using Vision Language Models in Mapless Outdoor Environments. CoRR abs/2408.02454 (2024)
- [i283]Kasun Weerakoon, Adarsh Jagan Sathyamoorthy, Mohamed Elnoor, Anuj Zore, Dinesh Manocha:
TOPGN: Real-time Transparent Obstacle Detection using Lidar Point Cloud Intensity for Autonomous Robot Navigation. CoRR abs/2408.05608 (2024)
- [i282]Taewon Kang, Divya Kothandaraman, Dinesh Manocha, Ming C. Lin:
Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance. CoRR abs/2408.06157 (2024)
- [i281]Senthil Hariharan Arul, Dhruva Kumar, Vivek Sugirtharaj, Richard Kim, Xuewei Qi, Rajasimman Madhivanan, Arnie Sen, Dinesh Manocha:
VLPG-Nav: Object Navigation Using Visual Language Pose Graph and Object Localization Probability Maps. CoRR abs/2408.08301 (2024)
- [i280]Sreyan Ghosh, Sonal Kumar, Chandra Kiran Reddy Evuru, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha:
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds. CoRR abs/2409.09213 (2024)
- [i279]Jing Liang, Dibyendu Das, Daeun Song, Md Nahid Hasan Shuvo, Mohammad Durrani, Karthik Taranath, Ivan Penskiy, Dinesh Manocha, Xuesu Xiao:
GND: Global Navigation Dataset with Multi-Modal Perception and Multi-Category Traversability in Outdoor Campus Environments. CoRR abs/2409.14262 (2024)
- [i278]Divya Kothandaraman, Kuldeep Kulkarni, Sumit Shekhar, Balaji Vasan Srinivasan, Dinesh Manocha:
ImPoster: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models. CoRR abs/2409.15650 (2024)
- [i277]Kasun Weerakoon, Mohamed Elnoor, Gershom Seneviratne, Vignesh Rajagopal, Senthil Hariharan Arul, Jing Liang, Mohamed Khalid M. Jaffar, Dinesh Manocha:
BehAV: Behavioral Rule Guided Autonomy Using VLMs for Robot Navigation in Outdoor Scenes. CoRR abs/2409.16484 (2024)
- [i276]Gershom Seneviratne, Kasun Weerakoon, Mohamed Elnoor, Vignesh Rajagopal, Harshavarthan Varatharajan, Mohamed Khalid M. Jaffar, Jason Pusey, Dinesh Manocha:
CROSS-GAiT: Cross-Attention-Based Multimodal Representation Fusion for Parametric Gait Adaptation in Complex Terrains. CoRR abs/2409.17262 (2024)
- [i275]Ruiqi Xian, Xiyang Wu, Tianrui Guan, Xijun Wang, Boqing Gong, Dinesh Manocha:
SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining.