Ranjay Krishna
2020 – today
- 2024
- [c50]Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long T. Le, Abhishek Kumar, James R. Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister:
Found in the middle: Calibrating Positional Attention Bias Improves Long Context Utilization. ACL (Findings) 2024: 14982-14995 - [c49]Kalyani Marathe, Mahtab Bigverdi, Nishat Khan, Tuhin Kundu, Patrick Howe, Sharan Ranjit S, Anand Bhattad, Aniruddha Kembhavi, Linda G. Shapiro, Ranjay Krishna:
MIMIC: Masked Image Modeling with Image Correspondences. CVPR Workshops 2024: 718-727 - [c48]Yushi Hu, Otilia Stretcu, Chun-Ta Lu, Krishnamurthy Viswanathan, Kenji Hata, Enming Luo, Ranjay Krishna, Ariel Fuxman:
Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models. CVPR 2024: 9590-9601 - [c47]Mehmet Saygin Seyfioglu, Wisdom Oluchi Ikezogwo, Fatemeh Ghezloo, Ranjay Krishna, Linda G. Shapiro:
Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Open-Source Histopathology Videos. CVPR 2024: 13183-13192 - [c46]Chenhao Zheng, Jieyu Zhang, Aniruddha Kembhavi, Ranjay Krishna:
Iterated Learning Improves Compositionality in Large Vision-Language Models. CVPR 2024: 13785-13795 - [c45]Kiana Ehsani, Tanmay Gupta, Rose Hendrix, Jordi Salvador, Luca Weihs, Kuo-Hao Zeng, Kunal Pratap Singh, Yejin Kim, Winson Han, Alvaro Herrasti, Ranjay Krishna, Dustin Schwenk, Eli VanderBilt, Aniruddha Kembhavi:
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World. CVPR 2024: 16238-16250 - [c44]Yue Yang, Fan-Yun Sun, Luca Weihs, Eli VanderBilt, Alvaro Herrasti, Winson Han, Jiajun Wu, Nick Haber, Ranjay Krishna, Lingjie Liu, Chris Callison-Burch, Mark Yatskar, Aniruddha Kembhavi, Christopher Clark:
Holodeck: Language Guided Generation of 3D Embodied AI Environments. CVPR 2024: 16277-16287 - [c43]Imad Eddine Toubal, Aditya Avinash, Neil Gordon Alldrin, Jan Dlabal, Wenlei Zhou, Enming Luo, Otilia Stretcu, Hao Xiong, Chun-Ta Lu, Howard Zhou, Ranjay Krishna, Ariel Fuxman, Tom Duerig:
Modeling Collaborator: Enabling Subjective Vision Classification with Minimal Human Effort via LLM Tool-Use. CVPR 2024: 17553-17563 - [c42]Jaemin Cho, Yushi Hu, Jason M. Baldridge, Roopal Garg, Peter Anderson, Ranjay Krishna, Mohit Bansal, Jordi Pont-Tuset, Su Wang:
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation. ICLR 2024 - [c41]Ainaz Eftekhar, Kuo-Hao Zeng, Jiafei Duan, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna:
Selective Visual Representations Improve Convergence and Generalization for Embodied AI. ICLR 2024 - [c40]Shaokun Zhang, Jieyu Zhang, Jiale Liu, Linxin Song, Chi Wang, Ranjay Krishna, Qingyun Wu:
Offline Training of Language Model Agents with Functions as Learnable Weights. ICML 2024 - [c39]Jun Wang, Chun-Cheng Chang, Jiafei Duan, Dieter Fox, Ranjay Krishna:
EVE: Enabling Anyone to Train Robots using Augmented Reality. UIST 2024: 34:1-34:13 - [c38]Wei Qiao, Tushar Dogra, Otilia Stretcu, Yu-Han Lyu, Tiantian Fang, Dongjin Kwon, Chun-Ta Lu, Enming Luo, Yuan Wang, Chih-Chun Chia, Ariel Fuxman, Fangzhou Wang, Ranjay Krishna, Mehmet Tek:
Scaling Up LLM Reviews for Google Ads Content Moderation. WSDM 2024: 1174-1175 - [i78]Wilbert Pumacay, Ishika Singh, Jiafei Duan, Ranjay Krishna, Jesse Thomason, Dieter Fox:
THE COLOSSEUM: A Benchmark for Evaluating Generalization for Robotic Manipulation. CoRR abs/2402.08191 (2024) - [i77]Shaokun Zhang, Jieyu Zhang, Jiale Liu, Linxin Song, Chi Wang, Ranjay Krishna, Qingyun Wu:
Training Language Model Agents without Modifying Language Models. CoRR abs/2402.11359 (2024) - [i76]Wei Qiao, Tushar Dogra, Otilia Stretcu, Yu-Han Lyu, Tiantian Fang, Dongjin Kwon, Chun-Ta Lu, Enming Luo, Yuan Wang, Chih-Chun Chia, Ariel Fuxman, Fangzhou Wang, Ranjay Krishna, Mehmet Tek:
Scaling Up LLM Reviews for Google Ads Content Moderation. CoRR abs/2402.14590 (2024) - [i75]Imad Eddine Toubal, Aditya Avinash, Neil Gordon Alldrin, Jan Dlabal, Wenlei Zhou, Enming Luo, Otilia Stretcu, Hao Xiong, Chun-Ta Lu, Howard Zhou, Ranjay Krishna, Ariel Fuxman, Tom Duerig:
Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use. CoRR abs/2403.02626 (2024) - [i74]Zixian Ma, Weikai Huang, Jieyu Zhang, Tanmay Gupta, Ranjay Krishna:
m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks. CoRR abs/2403.11085 (2024) - [i73]Xiang Fan, Anand Bhattad, Ranjay Krishna:
Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion. CoRR abs/2403.14617 (2024) - [i72]Chenhao Zheng, Jieyu Zhang, Aniruddha Kembhavi, Ranjay Krishna:
Iterated Learning Improves Compositionality in Large Vision-Language Models. CoRR abs/2404.02145 (2024) - [i71]Jun Wang, Chun-Cheng Chang, Jiafei Duan, Dieter Fox, Ranjay Krishna:
EVE: Enabling Anyone to Train Robot using Augmented Reality. CoRR abs/2404.06089 (2024) - [i70]Xingyu Fu, Yushi Hu, Bangzheng Li, Yu Feng, Haoyu Wang, Xudong Lin, Dan Roth, Noah A. Smith, Wei-Chiu Ma, Ranjay Krishna:
BLINK: Multimodal Large Language Models Can See but Not Perceive. CoRR abs/2404.12390 (2024) - [i69]Ankit Vani, Bac Nguyen, Samuel Lavoie, Ranjay Krishna, Aaron C. Courville:
SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision. CoRR abs/2404.15721 (2024) - [i68]Roopal Garg, Andrea Burns, Burcu Karagol Ayan, Yonatan Bitton, Ceslee Montgomery, Yasumasa Onoe, Andrew Bunner, Ranjay Krishna, Jason Baldridge, Radu Soricut:
ImageInWords: Unlocking Hyper-Detailed Image Descriptions. CoRR abs/2405.02793 (2024) - [i67]Thao Nguyen, Matthew Wallingford, Sebastin Santy, Wei-Chiu Ma, Sewoong Oh, Ludwig Schmidt, Pang Wei Koh, Ranjay Krishna:
Multilingual Diversity Improves Vision-Language Representations. CoRR abs/2405.16915 (2024) - [i66]Ethan Shen, Alan Fan, Sarah M. Pratt, Jae Sung Park, Matthew Wallingford, Sham M. Kakade, Ari Holtzman, Ranjay Krishna, Ali Farhadi, Aditya Kusupati:
Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass. CoRR abs/2405.18400 (2024) - [i65]Scott Geng, Cheng-Yu Hsieh, Vivek Ramanujan, Matthew Wallingford, Chun-Liang Li, Pang Wei Koh, Ranjay Krishna:
The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better. CoRR abs/2406.05184 (2024) - [i64]Yushi Hu, Weijia Shi, Xingyu Fu, Dan Roth, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Ranjay Krishna:
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models. CoRR abs/2406.09403 (2024) - [i63]Wentao Yuan, Jiafei Duan, Valts Blukis, Wilbert Pumacay, Ranjay Krishna, Adithyavairavan Murali, Arsalan Mousavian, Dieter Fox:
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics. CoRR abs/2406.10721 (2024) - [i62]Jieyu Zhang, Weikai Huang, Zixian Ma, Oscar Michel, Dong He, Tanmay Gupta, Wei-Chiu Ma, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna:
Task Me Anything. CoRR abs/2406.11775 (2024) - [i61]Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long T. Le, Abhishek Kumar, James R. Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister:
Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization. CoRR abs/2406.16008 (2024) - [i60]Jiafei Duan, Wentao Yuan, Wilbert Pumacay, Yi Ru Wang, Kiana Ehsani, Dieter Fox, Ranjay Krishna:
Manipulate-Anything: Automating Real-World Robots using Vision-Language Models. CoRR abs/2406.18915 (2024) - [i59]Yu-Guan Hsieh, Cheng-Yu Hsieh, Shih-Ying Yeh, Louis Béthune, Hadipour Ansari, Pavan Kumar Anasosalu Vasu, Chun-Liang Li, Ranjay Krishna, Oncel Tuzel, Marco Cuturi:
Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions. CoRR abs/2407.06723 (2024) - [i58]Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James R. Glass:
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps. CoRR abs/2407.07071 (2024) - [i57]Zuyan Liu, Benlin Liu, Jiahui Wang, Yuhao Dong, Guangyi Chen, Yongming Rao, Ranjay Krishna, Jiwen Lu:
Efficient Inference of Vision Instruction-Following Models with Elastic Cache. CoRR abs/2407.18121 (2024) - [i56]Benlin Liu, Yuhao Dong, Yiqin Wang, Yongming Rao, Yansong Tang, Wei-Chiu Ma, Ranjay Krishna:
Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model. CoRR abs/2408.00754 (2024) - [i55]Enhao Zhang, Nicole Sullivan, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska:
Self-Enhancing Video Data Management System for Compositional Events with Large Language Models [Technical Report]. CoRR abs/2408.02243 (2024) - [i54]Matt Deitke, Christopher Clark, Sangho Lee, Rohun Tripathi, Yue Yang, Jae Sung Park, Mohammadreza Salehi, Niklas Muennighoff, Kyle Lo, Luca Soldaini, Jiasen Lu, Taira Anderson, Erin Bransom, Kiana Ehsani, Huong Ngo, Yen-Sung Chen, Ajay Patel, Mark Yatskar, Chris Callison-Burch, Andrew Head, Rose Hendrix, Favyen Bastani, Eli VanderBilt, Nathan Lambert, Yvonne Chou, Arnavi Chheda, Jenna Sparks, Sam Skjonsberg, Michael Schmitz, Aaron Sarnat, Byron Bischoff, Pete Walsh, Chris Newell, Piper Wolters, Tanmay Gupta, Kuo-Hao Zeng, Jon Borchardt, Dirk Groeneveld, Jen Dumas, Crystal Nam, Sophie Lebrecht, Caitlin Wittlif, Carissa Schoenick, Oscar Michel, Ranjay Krishna, Luca Weihs, Noah A. Smith, Hannaneh Hajishirzi, Ross B. Girshick, Ali Farhadi, Aniruddha Kembhavi:
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models. CoRR abs/2409.17146 (2024) - [i53]Jiafei Duan, Wilbert Pumacay, Nishanth Kumar, Yi Ru Wang, Shulin Tian, Wentao Yuan, Ranjay Krishna, Dieter Fox, Ajay Mandlekar, Yijie Guo:
AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation. CoRR abs/2410.00371 (2024) - 2023
- [j7]Helena Vasconcelos, Matthew Jörke, Madeleine Grunde-McLaughlin, Tobias Gerstenberg, Michael S. Bernstein, Ranjay Krishna:
Explanations Can Reduce Overreliance on AI Systems During Decision-Making. Proc. ACM Hum. Comput. Interact. 7(CSCW1): 1-38 (2023) - [j6]Song Bai, Philip H. S. Torr, Ranjay Krishna, Li Fei-Fei, Abhinav Gupta, Song-Chun Zhu:
Guest Editorial: Introduction to the Special Section on Graphs in Vision and Pattern Analysis. IEEE Trans. Pattern Anal. Mach. Intell. 45(6): 6867-6869 (2023) - [j5]Enhao Zhang, Maureen Daum, Dong He, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska:
EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions. Proc. VLDB Endow. 16(11): 2714-2727 (2023) - [j4]Enhao Zhang, Maureen Daum, Dong He, Manasi Ganti, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska:
EQUI-VOCAL Demonstration: Synthesizing Video Queries from User Interactions. Proc. VLDB Endow. 16(12): 3978-3981 (2023) - [j3]Maureen Daum, Enhao Zhang, Dong He, Stephen Mussmann, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska:
VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building. Proc. VLDB Endow. 16(13): 4188-4201 (2023) - [c37]Cheng-Yu Hsieh, Chun-Liang Li, Chih-Kuan Yeh, Hootan Nakhost, Yasuhisa Fujii, Alex Ratner, Ranjay Krishna, Chen-Yu Lee, Tomas Pfister:
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes. ACL (Findings) 2023: 8003-8017 - [c36]Jiafei Duan, Yi Ru Wang, Mohit Shridhar, Dieter Fox, Ranjay Krishna:
AR2-D2: Training a Robot Without a Robot. CoRL 2023: 2838-2848 - [c35]Zixian Ma, Jerry Hong, Mustafa Omer Gul, Mona Gandhi, Irena Gao, Ranjay Krishna:
CREPE: Can Vision-Language Foundation Models Reason Compositionally? CVPR 2023: 10910-10921 - [c34]Yushi Hu, Benlin Liu, Jungo Kasai, Yizhong Wang, Mari Ostendorf, Ranjay Krishna, Noah A. Smith:
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering. ICCV 2023: 20349-20360 - [c33]Otilia Stretcu, Edward Vendrow, Kenji Hata, Krishnamurthy Viswanathan, Vittorio Ferrari, Sasan Tavakkol, Wenlei Zhou, Aditya Avinash, Enming Luo, Neil Gordon Alldrin, MohammadHossein Bateni, Gabriel Berger, Andrew Bunner, Chun-Ta Lu, Javier A Rey, Giulia DeSalvo, Ranjay Krishna, Ariel Fuxman:
Agile Modeling: From Concept to Classifier in Minutes. ICCV 2023: 22266-22277 - [c32]Samir Yitzhak Gadre, Gabriel Ilharco, Alex Fang, Jonathan Hayase, Georgios Smyrnis, Thao Nguyen, Ryan Marten, Mitchell Wortsman, Dhruba Ghosh, Jieyu Zhang, Eyal Orgad, Rahim Entezari, Giannis Daras, Sarah M. Pratt, Vivek Ramanujan, Yonatan Bitton, Kalyani Marathe, Stephen Mussmann, Richard Vencu, Mehdi Cherti, Ranjay Krishna, Pang Wei Koh, Olga Saukh, Alexander J. Ratner, Shuran Song, Hannaneh Hajishirzi, Ali Farhadi, Romain Beaumont, Sewoong Oh, Alex Dimakis, Jenia Jitsev, Yair Carmon, Vaishaal Shankar, Ludwig Schmidt:
DataComp: In search of the next generation of multimodal datasets. NeurIPS 2023 - [c31]Cheng-Yu Hsieh, Jieyu Zhang, Zixian Ma, Aniruddha Kembhavi, Ranjay Krishna:
SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality. NeurIPS 2023 - [c30]Wisdom Oluchi Ikezogwo, Mehmet Saygin Seyfioglu, Fatemeh Ghezloo, Dylan Stefan Chan Geva, Fatwir Sheikh Mohammed, Pavan Kumar Anand, Ranjay Krishna, Linda G. Shapiro:
Quilt-1M: One Million Image-Text Pairs for Histopathology. NeurIPS 2023 - [c29]Oscar Michel, Anand Bhattad, Eli VanderBilt, Ranjay Krishna, Aniruddha Kembhavi, Tanmay Gupta:
OBJECT 3DIT: Language-guided 3D-aware Image Editing. NeurIPS 2023 - [c28]Arijit Ray, Filip Radenovic, Abhimanyu Dubey, Bryan A. Plummer, Ranjay Krishna, Kate Saenko:
Cola: A Benchmark for Compositional Text-to-image Retrieval. NeurIPS 2023 - [c27]Yue Yu, Yuchen Zhuang, Jieyu Zhang, Yu Meng, Alexander J. Ratner, Ranjay Krishna, Jiaming Shen, Chao Zhang:
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias. NeurIPS 2023 - [i52]Enhao Zhang, Maureen Daum, Dong He, Magdalena Balazinska, Brandon Haynes, Ranjay Krishna:
EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions [Technical Report]. CoRR abs/2301.00929 (2023) - [i51]Maureen Daum, Enhao Zhang, Dong He, Stephen Mussmann, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska:
VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building. CoRR abs/2303.04068 (2023) - [i50]Yushi Hu, Benlin Liu, Jungo Kasai, Yizhong Wang, Mari Ostendorf, Ranjay Krishna, Noah A. Smith:
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering. CoRR abs/2303.11897 (2023) - [i49]Samir Yitzhak Gadre, Gabriel Ilharco, Alex Fang, Jonathan Hayase, Georgios Smyrnis, Thao Nguyen, Ryan Marten, Mitchell Wortsman, Dhruba Ghosh, Jieyu Zhang, Eyal Orgad, Rahim Entezari, Giannis Daras, Sarah M. Pratt, Vivek Ramanujan, Yonatan Bitton, Kalyani Marathe, Stephen Mussmann, Richard Vencu, Mehdi Cherti, Ranjay Krishna, Pang Wei Koh, Olga Saukh, Alexander Ratner, Shuran Song, Hannaneh Hajishirzi, Ali Farhadi, Romain Beaumont, Sewoong Oh, Alex Dimakis, Jenia Jitsev, Yair Carmon, Vaishaal Shankar, Ludwig Schmidt:
DataComp: In search of the next generation of multimodal datasets. CoRR abs/2304.14108 (2023) - [i48]Cheng-Yu Hsieh, Chun-Liang Li, Chih-Kuan Yeh, Hootan Nakhost, Yasuhisa Fujii, Alexander Ratner, Ranjay Krishna, Chen-Yu Lee, Tomas Pfister:
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes. CoRR abs/2305.02301 (2023) - [i47]Arijit Ray, Filip Radenovic, Abhimanyu Dubey, Bryan A. Plummer, Ranjay Krishna, Kate Saenko:
COLA: How to adapt vision-language models to Compose Objects Localized with Attributes? CoRR abs/2305.03689 (2023) - [i46]Wisdom Oluchi Ikezogwo, Mehmet Saygin Seyfioglu, Fatemeh Ghezloo, Dylan Stefan Chan Geva, Fatwir Sheikh Mohammed, Pavan Kumar Anand, Ranjay Krishna, Linda G. Shapiro:
Quilt-1M: One Million Image-Text Pairs for Histopathology. CoRR abs/2306.11207 (2023) - [i45]Jiafei Duan, Yi Ru Wang, Mohit Shridhar, Dieter Fox, Ranjay Krishna:
AR2-D2: Training a Robot Without a Robot. CoRR abs/2306.13818 (2023) - [i44]Cheng-Yu Hsieh, Jieyu Zhang, Zixian Ma, Aniruddha Kembhavi, Ranjay Krishna:
SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality. CoRR abs/2306.14610 (2023) - [i43]Kalyani Marathe, Mahtab Bigverdi, Nishat Khan, Tuhin Kundu, Aniruddha Kembhavi, Linda G. Shapiro, Ranjay Krishna:
MIMIC: Masked Image Modeling with Image Correspondences. CoRR abs/2306.15128 (2023) - [i42]Yue Yu, Yuchen Zhuang, Jieyu Zhang, Yu Meng, Alexander Ratner, Ranjay Krishna, Jiaming Shen, Chao Zhang:
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias. CoRR abs/2306.15895 (2023) - [i41]Oscar Michel, Anand Bhattad, Eli VanderBilt, Ranjay Krishna, Aniruddha Kembhavi, Tanmay Gupta:
OBJECT 3DIT: Language-guided 3D-aware Image Editing. CoRR abs/2307.11073 (2023) - [i40]Cheng-Yu Hsieh, Si-An Chen, Chun-Liang Li, Yasuhisa Fujii, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister:
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models. CoRR abs/2308.00675 (2023) - [i39]Jieyu Zhang, Ranjay Krishna, Ahmed Hassan Awadallah, Chi Wang:
EcoAssistant: Using LLM Assistant More Affordably and Accurately. CoRR abs/2310.03046 (2023) - [i38]Andre Ye, Sebastin Santy, Jena D. Hwang, Amy X. Zhang, Ranjay Krishna:
Cultural and Linguistic Diversity Improves Visual Representations. CoRR abs/2310.14356 (2023) - [i37]Jaemin Cho, Yushi Hu, Roopal Garg, Peter Anderson, Ranjay Krishna, Jason Baldridge, Mohit Bansal, Jordi Pont-Tuset, Su Wang:
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation. CoRR abs/2310.18235 (2023) - [i36]Ryan Liu, Howard Yen, Raja Marjieh, Thomas L. Griffiths, Ranjay Krishna:
Improving Interpersonal Communication by Simulating Audiences with Language Models. CoRR abs/2311.00687 (2023) - [i35]Ainaz Eftekhar, Kuo-Hao Zeng, Jiafei Duan, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna:
Selective Visual Representations Improve Convergence and Generalization for Embodied AI. CoRR abs/2311.04193 (2023) - [i34]Jiao Sun, Deqing Fu, Yushi Hu, Su Wang, Royi Rassin, Da-Cheng Juan, Dana Alon, Charles Herrmann, Sjoerd van Steenkiste, Ranjay Krishna, Cyrus Rashtchian:
DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback. CoRR abs/2311.17946 (2023) - [i33]Dina Bashkirova, Arijit Ray, Rupayan Mallick, Sarah Adel Bargal, Jianming Zhang, Ranjay Krishna, Kate Saenko:
Lasagna: Layered Score Distillation for Disentangled Object Relighting. CoRR abs/2312.00833 (2023) - [i32]Kiana Ehsani, Tanmay Gupta, Rose Hendrix, Jordi Salvador, Luca Weihs, Kuo-Hao Zeng, Kunal Pratap Singh, Yejin Kim, Winson Han, Alvaro Herrasti, Ranjay Krishna, Dustin Schwenk, Eli VanderBilt, Aniruddha Kembhavi:
Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World. CoRR abs/2312.02976 (2023) - [i31]Yushi Hu, Otilia Stretcu, Chun-Ta Lu, Krishnamurthy Viswanathan, Kenji Hata, Enming Luo, Ranjay Krishna, Ariel Fuxman:
Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models. CoRR abs/2312.03052 (2023) - [i30]Mehmet Saygin Seyfioglu, Wisdom Oluchi Ikezogwo, Fatemeh Ghezloo, Ranjay Krishna, Linda G. Shapiro:
Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Open-Source Histopathology Videos. CoRR abs/2312.04746 (2023) - [i29]Yue Yang, Fan-Yun Sun, Luca Weihs, Eli VanderBilt, Alvaro Herrasti, Winson Han, Jiajun Wu, Nick Haber, Ranjay Krishna, Lingjie Liu, Chris Callison-Burch, Mark Yatskar, Aniruddha Kembhavi, Christopher Clark:
Holodeck: Language Guided Generation of 3D Embodied AI Environments. CoRR abs/2312.09067 (2023) - [i28]Madeleine Grunde-McLaughlin, Michelle S. Lam, Ranjay Krishna, Daniel S. Weld, Jeffrey Heer:
Designing LLM Chains by Adapting Techniques from Crowdsourcing Workflows. CoRR abs/2312.11681 (2023) - 2022
- [c26]Maureen Daum, Enhao Zhang, Dong He, Magdalena Balazinska, Brandon Haynes, Ranjay Krishna, Apryle Craig, Aaron Wirsing:
VOCAL: Video Organization and Interactive Compositional AnaLytics. CIDR 2022 - [c25]Mona Gandhi, Mustafa Omer Gul, Eva Prakash, Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala:
Measuring Compositional Consistency for Video Question Answering. CVPR 2022: 5036-5045 - [c24]Zixian Ma, Rose E. Wang, Fei-Fei Li, Michael S. Bernstein, Ranjay Krishna:
ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward. NeurIPS 2022 - [i27]Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala:
AGQA 2.0: An Updated Benchmark for Compositional Spatio-Temporal Reasoning. CoRR abs/2204.06105 (2022) - [i26]Mona Gandhi, Mustafa Omer Gul, Eva Prakash, Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala:
Measuring Compositional Consistency for Video Question Answering. CoRR abs/2204.07190 (2022) - [i25]Zixian Ma, Rose E. Wang, Li Fei-Fei, Michael S. Bernstein, Ranjay Krishna:
ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward. CoRR abs/2210.04365 (2022) - [i24]Helena Vasconcelos, Matthew Jörke, Madeleine Grunde-McLaughlin, Tobias Gerstenberg, Michael S. Bernstein, Ranjay Krishna:
Explanations Can Reduce Overreliance on AI Systems During Decision-Making. CoRR abs/2212.06823 (2022) - [i23]Zixian Ma, Jerry Hong, Mustafa Omer Gul, Mona Gandhi, Irena Gao, Ranjay Krishna:
CREPE: Can Vision-Language Foundation Models Reason Compositionally? CoRR abs/2212.07796 (2022) - 2021
- [b1]Ranjay Krishna:
Visual intelligence through human learning. Stanford University, USA, 2021 - [c23]Siddharth Karamcheti, Ranjay Krishna, Li Fei-Fei, Christopher D. Manning:
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering. ACL/IJCNLP (1) 2021: 7265-7281 - [c22]Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala:
AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning. CVPR 2021: 11287-11297 - [i22]Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala:
AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning. CoRR abs/2103.16002 (2021) - [i21]Siddharth Karamcheti, Ranjay Krishna, Li Fei-Fei, Christopher D. Manning:
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering. CoRR abs/2107.02331 (2021) - [i20]Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ B. Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri S. Chatterji, Annie S. Chen, Kathleen Creel, Jared Quincy Davis, Dorottya Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh, Li Fei-Fei, Chelsea Finn, Trevor Gale, Lauren E. Gillespie, Karan Goel, Noah D. Goodman, Shelby Grossman, Neel Guha, Tatsunori Hashimoto, Peter Henderson, John Hewitt, Daniel E. Ho, Jenny Hong, Kyle Hsu, Jing Huang, Thomas Icard, Saahil Jain, Dan Jurafsky, Pratyusha Kalluri, Siddharth Karamcheti, Geoff Keeling, Fereshte Khani, Omar Khattab, Pang Wei Koh, Mark S. Krass, Ranjay Krishna, Rohith Kuditipudi, et al.:
On the Opportunities and Risks of Foundation Models. CoRR abs/2108.07258 (2021) - [i19]Ranjay Krishna, Mitchell L. Gordon, Li Fei-Fei, Michael S. Bernstein:
Visual Intelligence through Human Interaction. CoRR abs/2111.06913 (2021) - 2020
- [j2]Pranav Khadpe, Ranjay Krishna, Li Fei-Fei, Jeffrey T. Hancock, Michael S. Bernstein:
Conceptual Metaphors Impact Perceptions of Human-AI Collaboration. Proc. ACM Hum. Comput. Interact. 4(CSCW2): 163:1-163:26 (2020) - [c21]Rachel Gardner, Maya Varma, Clare Zhu, Ranjay Krishna:
Determining Question-Answer Plausibility in Crowdsourced Datasets Using Multi-Task Learning. W-NUT@EMNLP 2020: 22-27 - [c20]Jingwei Ji, Ranjay Krishna, Li Fei-Fei, Juan Carlos Niebles:
Action Genome: Actions As Compositions of Spatio-Temporal Scene Graphs. CVPR 2020: 10233-10244 - [i18]Pranav Khadpe, Ranjay Krishna, Li Fei-Fei, Jeffrey T. Hancock, Michael S. Bernstein:
Conceptual Metaphors Impact Perceptions of Human-AI Collaboration. CoRR abs/2008.02311 (2020) - [i17]Rachel Gardner, Maya Varma, Clare Zhu, Ranjay Krishna:
Determining Question-Answer Plausibility in Crowdsourced Datasets Using Multi-Task Learning. CoRR abs/2011.04883 (2020)
2010 – 2019
- 2019
- [c19]Michelle S. Lam, Grace B. Young, Catherine Y. Xu, Ranjay Krishna, Michael S. Bernstein:
Eevee: Transforming Images by Bridging High-level Goals and Low-level Edit Operations. CHI Extended Abstracts 2019 - [c18]Ranjay Krishna, Michael S. Bernstein, Li Fei-Fei:
Information Maximizing Visual Question Generation. CVPR 2019: 2008-2018 - [c17]Junwon Park, Ranjay Krishna, Pranav Khadpe, Li Fei-Fei, Michael S. Bernstein:
AI-Based Request Augmentation to Increase Crowdsourcing Participation. HCOMP 2019: 115-124 - [c16]Ranjay Krishna, Vincent S. Chen, Paroma Varma, Michael S. Bernstein, Christopher Ré, Li Fei-Fei:
Scene Graph Prediction With Limited Labels. ICCV 2019: 2580-2590 - [c15]Apoorva Dornadula, Austin Narcomey, Ranjay Krishna, Michael S. Bernstein, Li Fei-Fei:
Visual Relationships as Functions: Enabling Few-Shot Scene Graph Prediction. ICCV Workshops 2019: 1730-1739 - [c14]Vincent S. Chen, Paroma Varma, Ranjay Krishna, Michael S. Bernstein, Christopher Ré, Li Fei-Fei:
Scene Graph Prediction with Limited Labels. ICCV Workshops 2019: 1772-1782 - [c13]Sharon Zhou, Mitchell L. Gordon, Ranjay Krishna, Austin Narcomey, Durim Morina, Michael S. Bernstein:
HYPE: Human-eYe Perceptual Evaluation of Generative Models. DGS@ICLR 2019 - [c12]Sharon Zhou, Mitchell L. Gordon, Ranjay Krishna, Austin Narcomey, Li Fei-Fei, Michael S. Bernstein:
HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models. NeurIPS 2019: 3444-3456 - [i16]Ranjay Krishna, Michael S. Bernstein, Li Fei-Fei:
Information Maximizing Visual Question Generation. CoRR abs/1903.11207 (2019) - [i15]Sharon Zhou, Mitchell L. Gordon, Ranjay Krishna, Austin Narcomey, Durim Morina, Michael S. Bernstein:
HYPE: Human eYe Perceptual Evaluation of Generative Models. CoRR abs/1904.01121 (2019) - [i14]Vincent S. Chen, Paroma Varma, Ranjay Krishna, Michael S. Bernstein, Christopher Ré, Li Fei-Fei:
Scene Graph Prediction with Limited Labels. CoRR abs/1904.11622 (2019) - [i13]Apoorva Dornadula, Austin Narcomey, Ranjay Krishna, Michael S. Bernstein, Li Fei-Fei:
Visual Relationships as Functions: Enabling Few-Shot Scene Graph Prediction. CoRR abs/1906.04876 (2019) - [i12]Khaled Jedoui, Ranjay Krishna, Michael S. Bernstein, Li Fei-Fei:
Deep Bayesian Active Learning for Multiple Correct Outputs. CoRR abs/1912.01119 (2019) - [i11]Jingwei Ji, Ranjay Krishna, Li Fei-Fei, Juan Carlos Niebles:
Action Genome: Actions as Composition of Spatio-temporal Scene Graphs. CoRR abs/1912.06992 (2019) - 2018
- [c11]Ranjay Krishna, Ines Chami, Michael S. Bernstein, Li Fei-Fei:
Referring Relationships. CVPR 2018: 6867-6876 - [c10]Ranjay Krishna, Donsuk Lee, Li Fei-Fei, Michael S. Bernstein:
Engagement Learning: Expanding Visual Knowledge by Engaging Online Participants. UIST (Adjunct Volume) 2018: 87-89 - [i10]Ranjay Krishna, Ines Chami, Michael S. Bernstein, Li Fei-Fei:
Referring Relationships. CoRR abs/1803.10362 (2018) - [i9]Bernard Ghanem, Juan Carlos Niebles, Cees Snoek, Fabian Caba Heilbron, Humam Alwassel, Victor Escorcia, Ranjay Krishna, Shyamal Buch, Cuong Duc Dao:
The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary. CoRR abs/1808.03766 (2018) - 2017
- [j1]Ranjay Krishna, Yuke Zhu, Oliver Groth, Justin Johnson, Kenji Hata, Joshua Kravitz, Stephanie Chen, Yannis Kalantidis, Li-Jia Li, David A. Shamma, Michael S. Bernstein, Li Fei-Fei:
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations. Int. J. Comput. Vis. 123(1): 32-73 (2017) - [c9]Kenji Hata, Ranjay Krishna, Li Fei-Fei, Michael S. Bernstein:
A Glimpse Far into the Future: Understanding Long-term Crowd Worker Quality. CSCW 2017: 889-901 - [c8]Jonathan Krause, Justin Johnson, Ranjay Krishna, Li Fei-Fei:
A Hierarchical Approach for Generating Descriptive Image Paragraphs. CVPR 2017: 3337-3345 - [c7]Ranjay Krishna, Kenji Hata, Frederic Ren, Li Fei-Fei, Juan Carlos Niebles:
Dense-Captioning Events in Videos. ICCV 2017: 706-715 - [c6]Rajan Vaish, Snehalkumar (Neil) S. Gaikwad, Geza Kovacs, Andreas Veit, Ranjay Krishna, Imanol Arrieta Ibarra, Camelia Simoiu, Michael J. Wilber, Serge J. Belongie, Sharad Goel, James Davis, Michael S. Bernstein:
Crowd Research: Open and Scalable University Laboratories. UIST 2017: 829-843 - [i8]Ranjay Krishna, Kenji Hata, Frederic Ren, Li Fei-Fei, Juan Carlos Niebles:
Dense-Captioning Events in Videos. CoRR abs/1705.00754 (2017) - [i7]Bernard Ghanem, Juan Carlos Niebles, Cees Snoek, Fabian Caba Heilbron, Humam Alwassel, Ranjay Krishna, Victor Escorcia, Kenji Hata, Shyamal Buch:
ActivityNet Challenge 2017 Summary. CoRR abs/1710.08011 (2017) - 2016
- [c5]Ranjay A. Krishna, Kenji Hata, Stephanie Chen, Joshua Kravitz, David A. Shamma, Li Fei-Fei, Michael S. Bernstein:
Embracing Error to Enable Rapid Crowdsourcing. CHI 2016: 3167-3179 - [c4]Cewu Lu, Ranjay Krishna, Michael S. Bernstein, Li Fei-Fei:
Visual Relationship Detection with Language Priors. ECCV (1) 2016: 852-869 - [i6]Ranjay Krishna, Kenji Hata, Stephanie Chen, Joshua Kravitz, David A. Shamma, Li Fei-Fei, Michael S. Bernstein:
Embracing Error to Enable Rapid Crowdsourcing. CoRR abs/1602.04506 (2016) - [i5]Ranjay Krishna, Yuke Zhu, Oliver Groth, Justin Johnson, Kenji Hata, Joshua Kravitz, Stephanie Chen, Yannis Kalantidis, Li-Jia Li, David A. Shamma, Michael S. Bernstein, Li Fei-Fei:
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations. CoRR abs/1602.07332 (2016) - [i4]Cewu Lu, Ranjay Krishna, Michael S. Bernstein, Li Fei-Fei:
Visual Relationship Detection with Language Priors. CoRR abs/1608.00187 (2016) - [i3]Kenji Hata, Ranjay Krishna, Li Fei-Fei, Michael S. Bernstein:
A Glimpse Far into the Future: Understanding Long-term Crowd Worker Accuracy. CoRR abs/1609.04855 (2016) - [i2]Jonathan Krause, Justin Johnson, Ranjay Krishna, Li Fei-Fei:
A Hierarchical Approach for Generating Descriptive Image Paragraphs. CoRR abs/1611.06607 (2016) - 2015
- [c3]Sebastian Schuster, Ranjay Krishna, Angel X. Chang, Li Fei-Fei, Christopher D. Manning:
Generating Semantically Precise Scene Graphs from Textual Descriptions for Improved Image Retrieval. VL@EMNLP 2015: 70-80 - [c2]Justin Johnson, Ranjay Krishna, Michael Stark, Li-Jia Li, David A. Shamma, Michael S. Bernstein, Li Fei-Fei:
Image retrieval using scene graphs. CVPR 2015: 3668-3678 - [c1]Snehal (Neil) Gaikwad, Durim Morina, Rohit Nistala, Megha Agarwal, Alison Cossette, Radhika Bhanu, Saiph Savage, Vishwajeet Narwal, Karan Rajpal, Jeff Regino, Aditi Mithal, Adam Ginzberg, Aditi Nath, Karolina R. Ziulkoski, Trygve Cossette, Dilrukshi Gamage, Angela Richmond-Fuller, Ryo Suzuki, Jeerel Herrejón, Kevin Le, Claudia Flores-Saviaga, Haritha Thilakarathne, Kajal Gupta, William Dai, Ankita Sastry, Shirish Goyal, Thejan Rajapakshe, Niki Abolhassani, Angela Xie, Abigail Reyes, Surabhi Ingle, Verónica Jaramillo, Martin Godínez, Walter Ángel, Carlos Toxtli, Juan Flores, Asmita Gupta, Vineet Sethia, Diana Padilla, Kristy Milland, Kristiono Setyadi, Nuwan Wajirasena, Muthitha Batagoda, Rolando Cruz, James Damon, Divya Nekkanti, Tejas Sarma, Mohamed Saleh, Gabriela Gongora-Svartzman, Soroosh Bateni, Gema Toledo Barrera, Alex Peña, Ryan Compton, Deen Aariff, Luis Palacios, Manuela Paula Ritter, Nisha K. K., Alan C. Kay, Jana Uhrmeister, Srivalli Nistala, Milad Esfahani, Elsa Bakiu, Christopher Diemert, Luca Matsumoto, Manik Singh, Krupa Patel, Ranjay Krishna, Geza Kovacs, Rajan Vaish, Michael S. Bernstein:
Daemo: A Self-Governed Crowdsourcing Marketplace. UIST (Adjunct Volume) 2015: 101-102 - [i1]Kenji Hata, Sherman Leung, Ranjay Krishna, Michael S. Bernstein, Li Fei-Fei:
SentenceRacer: A Game with a Purpose for Image Sentence Annotation. CoRR abs/1508.07053 (2015)
last updated on 2024-11-11 21:23 CET by the dblp team
all metadata released as open data under CC0 1.0 license