Michael S. Ryoo
Person information
- affiliation: Stony Brook University, Department of Computer Science, NY, USA
- affiliation: Google Brain
- affiliation: Indiana University Bloomington, IN, USA
- affiliation: NASA Jet Propulsion Laboratory (JPL), Pasadena, CA, USA
- affiliation (PhD 2008): University of Texas at Austin, Computer and Vision Research Center, TX, USA
- affiliation: Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea
2020 – today
- 2024
- [c103]Kanchana Ranasinghe, Satya Narayan Shukla, Omid Poursaeed, Michael S. Ryoo, Tsung-Yu Lin:
Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs. CVPR 2024: 12977-12987 - [c102]Kumara Kahatapitiya, Anurag Arnab, Arsha Nagrani, Michael S. Ryoo:
VicTR: Video-conditioned Text Representations for Activity Recognition. CVPR 2024: 18547-18558 - [c101]Ryan D. Burgert, Brian L. Price, Jason Kuen, Yijun Li, Michael S. Ryoo:
MAGICK: A Large-Scale Captioned Dataset from Matting Generated Images Using Chroma Keying. CVPR 2024: 22595-22604 - [c100]A. J. Piergiovanni, Isaac Noble, Dahun Kim, Michael S. Ryoo, Victor Gomes, Anelia Angelova:
Mirasol3B: A Multimodal Autoregressive Model for Time-Aligned and Contextual Modalities. CVPR 2024: 26794-26804 - [c99]Cristina Mata, Kanchana Ranasinghe, Michael S. Ryoo:
CoPT: Unsupervised Domain Adaptive Segmentation Using Domain-Agnostic Text Embeddings. ECCV (62) 2024: 424-440 - [c98]Isabel Leal, Krzysztof Choromanski, Deepali Jain, Avinava Dubey, Jake Varley, Michael S. Ryoo, Yao Lu, Frederick Liu, Vikas Sindhwani, Quan Vuong, Tamás Sarlós, Ken Oslund, Karol Hausman, Kanishka Rao:
SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention. ICRA 2024: 6920-6927 - [c97]Xiang Li, Varun Belagali, Jinghuan Shang, Michael S. Ryoo:
Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning. ICRA 2024: 16841-16849 - [c96]Ryan Burgert, Xiang Li, Abe Leite, Kanchana Ranasinghe, Michael S. Ryoo:
Diffusion Illusions: Hiding Images in Plain Sight. SIGGRAPH (Conference Paper Track) 2024: 131 - [c95]Jongwoo Park, Kumara Kahatapitiya, Donghyun Kim, Shivchander Sudalairaj, Quanfu Fan, Michael S. Ryoo:
Grafting Vision Transformers. WACV 2024: 1134-1143 - [c94]Srijan Das, Tanmay Jain, Dominick Reilly, Pranav Balaji, Soumyajit Karmakar, Shyam Marjit, Xiang Li, Abhijit Das, Michael S. Ryoo:
Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders. WACV 2024: 6864-6874 - [i83]Kumara Kahatapitiya, Kanchana Ranasinghe, Jongwoo Park, Michael S. Ryoo:
Language Repository for Long Video Understanding. CoRR abs/2403.14622 (2024) - [i82]Kanchana Ranasinghe, Xiang Li, Kumara Kahatapitiya, Michael S. Ryoo:
Understanding Long Videos in One Multimodal Language Model Pass. CoRR abs/2403.16998 (2024) - [i81]Kanchana Ranasinghe, Satya Narayan Shukla, Omid Poursaeed, Michael S. Ryoo, Tsung-Yu Lin:
Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs. CoRR abs/2404.07449 (2024) - [i80]Jongwoo Park, Kanchana Ranasinghe, Kumara Kahatapitiya, Wonjeong Ryoo, Donghyun Kim, Michael S. Ryoo:
Too Many Frames, not all Useful: Efficient Strategies for Long-Form Video QA. CoRR abs/2406.09396 (2024) - [i79]Xiang Li, Cristina Mata, Jongwoo Park, Kumara Kahatapitiya, Yoo Sung Jang, Jinghuan Shang, Kanchana Ranasinghe, Ryan Burgert, Mu Cai, Yong Jae Lee, Michael S. Ryoo:
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy. CoRR abs/2406.20095 (2024) - [i78]Le Xue, Manli Shu, Anas Awadalla, Jun Wang, An Yan, Senthil Purushwalkam, Honglu Zhou, Viraj Prabhu, Yutong Dai, Michael S. Ryoo, Shrikant Kendre, Jieyu Zhang, Can Qin, Shu Zhang, Chia-Chih Chen, Ning Yu, Juntao Tan, Tulika Manoj Awalgaonkar, Shelby Heinecke, Huan Wang, Yejin Choi, Ludwig Schmidt, Zeyuan Chen, Silvio Savarese, Juan Carlos Niebles, Caiming Xiong, Ran Xu:
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models. CoRR abs/2408.08872 (2024) - [i77]Can Qin, Congying Xia, Krithika Ramakrishnan, Michael S. Ryoo, Lifu Tu, Yihao Feng, Manli Shu, Honglu Zhou, Anas Awadalla, Jun Wang, Senthil Purushwalkam, Le Xue, Yingbo Zhou, Huan Wang, Silvio Savarese, Juan Carlos Niebles, Zeyuan Chen, Ran Xu, Caiming Xiong:
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations. CoRR abs/2408.12590 (2024) - [i76]Michael S. Ryoo, Honglu Zhou, Shrikant Kendre, Can Qin, Le Xue, Manli Shu, Silvio Savarese, Ran Xu, Caiming Xiong, Juan Carlos Niebles:
xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs. CoRR abs/2410.16267 (2024)
- 2023
- [j15]Jinghuan Shang, Xiang Li, Kumara Kahatapitiya, Yu-Cheol Lee, Michael S. Ryoo:
StARformer: Transformer With State-Action-Reward Representations for Robot Learning. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 12862-12877 (2023) - [c93]Kumara Kahatapitiya, Zhou Ren, Haoxiang Li, Zhenyu Wu, Michael S. Ryoo, Gang Hua:
Weakly-Guided Self-Supervised Pretraining for Temporal Activity Detection. AAAI 2023: 1078-1086 - [c92]Rui Dai, Srijan Das, Michael S. Ryoo, François Brémond:
Attributes-Aware Network for Temporal Action Detection. BMVC 2023: 114-116 - [c91]Brianna Zitkovich, Tianhe Yu, Sichun Xu, Peng Xu, Ted Xiao, Fei Xia, Jialin Wu, Paul Wohlhart, Stefan Welker, Ayzaan Wahid, Quan Vuong, Vincent Vanhoucke, Huong T. Tran, Radu Soricut, Anikait Singh, Jaspiar Singh, Pierre Sermanet, Pannag R. Sanketi, Grecia Salazar, Michael S. Ryoo, Krista Reymann, Kanishka Rao, Karl Pertsch, Igor Mordatch, Henryk Michalewski, Yao Lu, Sergey Levine, Lisa Lee, Tsang-Wei Edward Lee, Isabel Leal, Yuheng Kuang, Dmitry Kalashnikov, Ryan Julian, Nikhil J. Joshi, Alex Irpan, Brian Ichter, Jasmine Hsu, Alexander Herzog, Karol Hausman, Keerthana Gopalakrishnan, Chuyuan Fu, Pete Florence, Chelsea Finn, Kumar Avinava Dubey, Danny Driess, Tianli Ding, Krzysztof Marcin Choromanski, Xi Chen, Yevgen Chebotar, Justice Carbajal, Noah Brown, Anthony Brohan, Montserrat Gonzalez Arenas, Kehang Han:
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control. CoRL 2023: 2165-2183 - [c90]Michael S. Ryoo, Keerthana Gopalakrishnan, Kumara Kahatapitiya, Ted Xiao, Kanishka Rao, Austin Stone, Yao Lu, Julian Ibarz, Anurag Arnab:
Token Turing Machines. CVPR 2023: 19070-19081 - [c89]Ramyad Hadidi, Jiashen Cao, Michael S. Ryoo, Hyesoon Kim:
Reducing Inference Latency with Concurrent Architectures for Image Recognition at Edge. EDGE 2023: 245-254 - [c88]Andy Zeng, Maria Attarian, Brian Ichter, Krzysztof Marcin Choromanski, Adrian Wong, Stefan Welker, Federico Tombari, Aveek Purohit, Michael S. Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Pete Florence:
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language. ICLR 2023 - [c87]Boyuan Chen, Fei Xia, Brian Ichter, Kanishka Rao, Keerthana Gopalakrishnan, Michael S. Ryoo, Austin Stone, Daniel Kappler:
Open-vocabulary Queryable Scene Representations for Real World Planning. ICRA 2023: 11509-11522 - [c86]Alan Wu, Michael S. Ryoo:
Energy-Based Models for Cross-Modal Localization using Convolutional Transformers. ICRA 2023: 11726-11733 - [c85]Kumara Kahatapitiya, Michael S. Ryoo:
SWAT: Spatial Structure Within and Among Tokens. IJCAI 2023: 956-964 - [c84]Srijan Das, Michael S. Ryoo:
Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning. MVA 2023: 1-6 - [c83]Kanchana Ranasinghe, Michael S. Ryoo:
Language-based Action Concept Spaces Improve Video Self-Supervised Learning. NeurIPS 2023 - [c82]Jinghuan Shang, Michael S. Ryoo:
Active Vision Reinforcement Learning under Limited Visual Observability. NeurIPS 2023 - [c81]Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Joseph Dabis, Chelsea Finn, Keerthana Gopalakrishnan, Karol Hausman, Alexander Herzog, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Tomas Jackson, Sally Jesmonth, Nikhil J. Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Isabel Leal, Kuang-Huei Lee, Sergey Levine, Yao Lu, Utsav Malla, Deeksha Manjunath, Igor Mordatch, Ofir Nachum, Carolina Parada, Jodilyn Peralta, Emily Perez, Karl Pertsch, Jornell Quiambao, Kanishka Rao, Michael S. Ryoo, Grecia Salazar, Pannag R. Sanketi, Kevin Sayed, Jaspiar Singh, Sumedh Sontakke, Austin Stone, Clayton Tan, Huong T. Tran, Vincent Vanhoucke, Steve Vega, Quan Vuong, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Tianhe Yu, Brianna Zitkovich:
RT-1: Robotics Transformer for Real-World Control at Scale. Robotics: Science and Systems 2023 - [c80]Srijan Das, Michael S. Ryoo:
ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints. WACV 2023: 5562-5572 - [i75]Kumara Kahatapitiya, Anurag Arnab, Arsha Nagrani, Michael S. Ryoo:
VicTR: Video-conditioned Text Representations for Activity Recognition. CoRR abs/2304.02560 (2023) - [i74]Jinghuan Shang, Michael S. Ryoo:
Active Reinforcement Learning under Limited Visual Observability. CoRR abs/2306.00975 (2023) - [i73]Alan Wu, Michael S. Ryoo:
Energy-Based Models for Cross-Modal Localization using Convolutional Transformers. CoRR abs/2306.04021 (2023) - [i72]Xiang Li, Varun Belagali, Jinghuan Shang, Michael S. Ryoo:
Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning. CoRR abs/2307.01849 (2023) - [i71]Kanchana Ranasinghe, Michael S. Ryoo:
Language-based Action Concept Spaces Improve Video Self-Supervised Learning. CoRR abs/2307.10922 (2023) - [i70]Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Xi Chen, Krzysztof Choromanski, Tianli Ding, Danny Driess, Avinava Dubey, Chelsea Finn, Pete Florence, Chuyuan Fu, Montse Gonzalez Arenas, Keerthana Gopalakrishnan, Kehang Han, Karol Hausman, Alexander Herzog, Jasmine Hsu, Brian Ichter, Alex Irpan, Nikhil J. Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Isabel Leal, Lisa Lee, Tsang-Wei Edward Lee, Sergey Levine, Yao Lu, Henryk Michalewski, Igor Mordatch, Karl Pertsch, Kanishka Rao, Krista Reymann, Michael S. Ryoo, Grecia Salazar, Pannag Sanketi, Pierre Sermanet, Jaspiar Singh, Anikait Singh, Radu Soricut, Huong T. Tran, Vincent Vanhoucke, Quan Vuong, Ayzaan Wahid, Stefan Welker, Paul Wohlhart, Jialin Wu, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Tianhe Yu, Brianna Zitkovich:
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control. CoRR abs/2307.15818 (2023) - [i69]Rui Dai, Srijan Das, Michael S. Ryoo, François Brémond:
AAN: Attributes-Aware Network for Temporal Action Detection. CoRR abs/2309.00696 (2023) - [i68]Srijan Das, Tanmay Jain, Dominick Reilly, Pranav Balaji, Soumyajit Karmakar, Shyam Marjit, Xiang Li, Abhijit Das, Michael S. Ryoo:
Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders. CoRR abs/2310.20704 (2023) - [i67]A. J. Piergiovanni, Isaac Noble, Dahun Kim, Michael S. Ryoo, Victor Gomes, Anelia Angelova:
Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities. CoRR abs/2311.05698 (2023) - [i66]Isabel Leal, Krzysztof Choromanski, Deepali Jain, Avinava Dubey, Jake Varley, Michael S. Ryoo, Yao Lu, Frederick Liu, Vikas Sindhwani, Quan Vuong, Tamás Sarlós, Ken Oslund, Karol Hausman, Kanishka Rao:
SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention. CoRR abs/2312.01990 (2023) - [i65]Ryan Burgert, Xiang Li, Abe Leite, Kanchana Ranasinghe, Michael S. Ryoo:
Diffusion Illusions: Hiding Images in Plain Sight. CoRR abs/2312.03817 (2023)
- 2022
- [c79]Ryan Burgert, Jinghuan Shang, Xiang Li, Michael S. Ryoo:
TRITON: Neural Neural Textures for Better Sim2Real. CoRL 2022: 2215-2225 - [c78]Kanchana Ranasinghe, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, Michael S. Ryoo:
Self-supervised Video Transformer. CVPR 2022: 2864-2874 - [c77]Rui Dai, Srijan Das, Kumara Kahatapitiya, Michael S. Ryoo, François Brémond:
MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection. CVPR 2022: 20009-20019 - [c76]A. J. Piergiovanni, Kairo Morton, Weicheng Kuo, Michael S. Ryoo, Anelia Angelova:
Video Question Answering with Iterative Video-Text Co-tokenization. ECCV (36) 2022: 76-94 - [c75]Jinghuan Shang, Kumara Kahatapitiya, Xiang Li, Michael S. Ryoo:
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning. ECCV (39) 2022: 462-479 - [c74]Krzysztof Marcin Choromanski, Han Lin, Haoxian Chen, Arijit Sehanobish, Yuanzhe Ma, Deepali Jain, Jake Varley, Andy Zeng, Michael S. Ryoo, Valerii Likhosherstov, Dmitry Kalashnikov, Vikas Sindhwani, Adrian Weller:
Hybrid Random Features. ICLR 2022 - [c73]Xiang Li, Jinghuan Shang, Srijan Das, Michael S. Ryoo:
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels? NeurIPS 2022 - [c72]Jinghuan Shang, Srijan Das, Michael S. Ryoo:
Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space. NeurIPS 2022 - [i64]Andy Zeng, Adrian Wong, Stefan Welker, Krzysztof Choromanski, Federico Tombari, Aveek Purohit, Michael S. Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Pete Florence:
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language. CoRR abs/2204.00598 (2022) - [i63]Xiang Li, Jinghuan Shang, Srijan Das, Michael S. Ryoo:
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels? CoRR abs/2206.05266 (2022) - [i62]Jinghuan Shang, Srijan Das, Michael S. Ryoo:
Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space. CoRR abs/2206.11895 (2022) - [i61]Ryan Burgert, Jinghuan Shang, Xiang Li, Michael S. Ryoo:
Neural Neural Textures Make Sim2Real Consistent. CoRR abs/2206.13500 (2022) - [i60]Srijan Das, Michael S. Ryoo:
Video + CLIP Baseline for Ego4D Long-term Action Anticipation. CoRR abs/2207.00579 (2022) - [i59]A. J. Piergiovanni, Kairo Morton, Weicheng Kuo, Michael S. Ryoo, Anelia Angelova:
Video Question Answering with Iterative Video-Text Co-Tokenization. CoRR abs/2208.00934 (2022) - [i58]Boyuan Chen, Fei Xia, Brian Ichter, Kanishka Rao, Keerthana Gopalakrishnan, Michael S. Ryoo, Austin Stone, Daniel Kappler:
Open-vocabulary Queryable Scene Representations for Real World Planning. CoRR abs/2209.09874 (2022) - [i57]Jongwoo Park, Kumara Kahatapitiya, Donghyun Kim, Shivchander Sudalairaj, Quanfu Fan, Michael S. Ryoo:
Grafting Vision Transformers. CoRR abs/2210.15943 (2022) - [i56]Michael S. Ryoo, Keerthana Gopalakrishnan, Kumara Kahatapitiya, Ted Xiao, Kanishka Rao, Austin Stone, Yao Lu, Julian Ibarz, Anurag Arnab:
Token Turing Machines. CoRR abs/2211.09119 (2022) - [i55]Ryan Burgert, Kanchana Ranasinghe, Xiang Li, Michael S. Ryoo:
Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors. CoRR abs/2211.13224 (2022) - [i54]Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Joseph Dabis, Chelsea Finn, Keerthana Gopalakrishnan, Karol Hausman, Alexander Herzog, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Tomas Jackson, Sally Jesmonth, Nikhil J. Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Isabel Leal, Kuang-Huei Lee, Sergey Levine, Yao Lu, Utsav Malla, Deeksha Manjunath, Igor Mordatch, Ofir Nachum, Carolina Parada, Jodilyn Peralta, Emily Perez, Karl Pertsch, Jornell Quiambao, Kanishka Rao, Michael S. Ryoo, Grecia Salazar, Pannag Sanketi, Kevin Sayed, Jaspiar Singh, Sumedh Sontakke, Austin Stone, Clayton Tan, Huong T. Tran, Vincent Vanhoucke, Steve Vega, Quan Vuong, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Tianhe Yu, Brianna Zitkovich:
RT-1: Robotics Transformer for Real-World Control at Scale. CoRR abs/2212.06817 (2022)
- 2021
- [c71]A. J. Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan Essa:
Unsupervised Discovery of Actions in Instructional Videos. BMVC 2021: 283 - [c70]Juhana Kangaspunta, A. J. Piergiovanni, Rico Jonschkowski, Michael S. Ryoo, Anelia Angelova:
Adaptive Intermediate Representations for Video Understanding. CVPR Workshops 2021: 1602-1612 - [c69]A. J. Piergiovanni, Michael S. Ryoo:
Recognizing Actions in Videos From Unseen Viewpoints. CVPR 2021: 4124-4132 - [c68]Kumara Kahatapitiya, Michael S. Ryoo:
Coarse-Fine Networks for Temporal Activity Detection in Videos. CVPR 2021: 8385-8394 - [c67]A. J. Piergiovanni, Vincent Casser, Michael S. Ryoo, Anelia Angelova:
4D-Net for Learned Multi-Modal Alignment. ICCV 2021: 15415-15425 - [c66]Iretiayo Akinola, Anelia Angelova, Yao Lu, Yevgen Chebotar, Dmitry Kalashnikov, Jacob Varley, Julian Ibarz, Michael S. Ryoo:
Visionary: Vision architecture discovery for robot learning. ICRA 2021: 10779-10785 - [c65]Jinghuan Shang, Michael S. Ryoo:
Self-Supervised Disentangled Representation Learning for Third-Person Imitation Learning. IROS 2021: 214-221 - [c64]Michael S. Ryoo, A. J. Piergiovanni, Anurag Arnab, Mostafa Dehghani, Anelia Angelova:
TokenLearner: Adaptive Space-Time Tokenization for Videos. NeurIPS 2021: 12786-12797 - [i53]Kumara Kahatapitiya, Michael S. Ryoo:
Coarse-Fine Networks for Temporal Activity Detection in Videos. CoRR abs/2103.01302 (2021) - [i52]Iretiayo Akinola, Anelia Angelova, Yao Lu, Yevgen Chebotar, Dmitry Kalashnikov, Jacob Varley, Julian Ibarz, Michael S. Ryoo:
Visionary: Vision architecture discovery for robot learning. CoRR abs/2103.14633 (2021) - [i51]A. J. Piergiovanni, Michael S. Ryoo:
Recognizing Actions in Videos from Unseen Viewpoints. CoRR abs/2103.16516 (2021) - [i50]Juhana Kangaspunta, A. J. Piergiovanni, Rico Jonschkowski, Michael S. Ryoo, Anelia Angelova:
Adaptive Intermediate Representations for Video Understanding. CoRR abs/2104.07135 (2021) - [i49]A. J. Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan A. Essa:
Unsupervised Action Segmentation for Instructional Videos. CoRR abs/2106.03738 (2021) - [i48]Michael S. Ryoo, A. J. Piergiovanni, Anurag Arnab, Mostafa Dehghani, Anelia Angelova:
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos? CoRR abs/2106.11297 (2021) - [i47]A. J. Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan A. Essa:
Unsupervised Discovery of Actions in Instructional Videos. CoRR abs/2106.14733 (2021) - [i46]Jinghuan Shang, Michael S. Ryoo:
Self-Supervised Disentangled Representation Learning for Third-Person Imitation Learning. CoRR abs/2108.01069 (2021) - [i45]A. J. Piergiovanni, Vincent Casser, Michael S. Ryoo, Anelia Angelova:
4D-Net for Learned Multi-Modal Alignment. CoRR abs/2109.01066 (2021) - [i44]Krzysztof Choromanski, Haoxian Chen, Han Lin, Yuanzhe Ma, Arijit Sehanobish, Deepali Jain, Michael S. Ryoo, Jake Varley, Andy Zeng, Valerii Likhosherstov, Dmitry Kalashnikov, Vikas Sindhwani, Adrian Weller:
Hybrid Random Features. CoRR abs/2110.04367 (2021) - [i43]Jinghuan Shang, Michael S. Ryoo:
StARformer: Transformer with State-Action-Reward Representations. CoRR abs/2110.06206 (2021) - [i42]Kumara Kahatapitiya, Zhou Ren, Haoxiang Li, Zhenyu Wu, Michael S. Ryoo:
Self-supervised Pretraining with Classification Labels for Temporal Activity Detection. CoRR abs/2111.13675 (2021) - [i41]Kumara Kahatapitiya, Michael S. Ryoo:
SWAT: Spatial Structure Within and Among Tokens. CoRR abs/2111.13677 (2021) - [i40]Kanchana Ranasinghe, Muzammal Naseer, Salman H. Khan, Fahad Shahbaz Khan, Michael S. Ryoo:
Self-supervised Video Transformer. CoRR abs/2112.01514 (2021) - [i39]Rui Dai, Srijan Das, Kumara Kahatapitiya, Michael S. Ryoo, François Brémond:
MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection. CoRR abs/2112.03902 (2021) - [i38]Srijan Das, Michael S. Ryoo:
ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints. CoRR abs/2112.03905 (2021) - [i37]Srijan Das, Michael S. Ryoo:
STC-mix: Space, Time, Channel mixing for Self-supervised Video Representation. CoRR abs/2112.03906 (2021)
- 2020
- [j14]Alan Wu, A. J. Piergiovanni, Michael S. Ryoo:
Model-Based Robot Imitation with Future Image Similarity. Int. J. Comput. Vis. 128(5): 1360-1374 (2020) - [j13]Alan Wu, A. J. Piergiovanni, Michael S. Ryoo:
Correction to: Model-Based Robot Imitation with Future Image Similarity. Int. J. Comput. Vis. 128(5): 1375 (2020) - [j12]Ramyad Hadidi, Jiashen Cao, Michael S. Ryoo, Hyesoon Kim:
Toward Collaborative Inferencing of Deep Neural Networks on Internet-of-Things Devices. IEEE Internet Things J. 7(6): 4950-4960 (2020) - [c63]A. J. Piergiovanni, Anelia Angelova, Michael S. Ryoo:
Differentiable Grammars for Videos. AAAI 2020: 11874-11881 - [c62]A. J. Piergiovanni, Anelia Angelova, Michael S. Ryoo:
Evolving Losses for Unsupervised Video Representation Learning. CVPR 2020: 130-139 - [c61]Xiaofang Wang, Xuehan Xiong, Maxim Neumann, A. J. Piergiovanni, Michael S. Ryoo, Anelia Angelova, Kris M. Kitani, Wei Hua:
AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification. ECCV (8) 2020: 449-465 - [c60]A. J. Piergiovanni, Anelia Angelova, Alexander Toshev, Michael S. Ryoo:
Adversarial Generative Grammars for Human Activity Prediction. ECCV (2) 2020: 507-523 - [c59]Michael S. Ryoo, A. J. Piergiovanni, Juhana Kangaspunta, Anelia Angelova:
AssembleNet++: Assembling Modality Representations via Attention Connections. ECCV (20) 2020: 654-671 - [c58]Xiuye Gu, Weixin Luo, Michael S. Ryoo, Yong Jae Lee:
Password-Conditioned Anonymization and Deanonymization with Face Identity Transformers. ECCV (23) 2020: 727-743 - [c57]Michael S. Ryoo, A. J. Piergiovanni, Mingxing Tan, Anelia Angelova:
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures. ICLR 2020 - [c56]A. J. Piergiovanni, Michael S. Ryoo:
AViD Dataset: Anonymized Videos from Diverse Countries. NeurIPS 2020 - [c55]A. J. Piergiovanni, Michael S. Ryoo:
Learning Multimodal Representations for Unseen Activities. WACV 2020: 506-515 - [i36]A. J. Piergiovanni, Anelia Angelova, Michael S. Ryoo:
Evolving Losses for Unsupervised Video Representation Learning. CoRR abs/2002.12177 (2020) - [i35]Ramyad Hadidi, Bahar Asgari, Jiashen Cao, Younmin Bae, Hyojong Kim, Michael S. Ryoo, Hyesoon Kim:
Edge-Tailored Perception: Fast Inferencing in-the-Edge with Efficient Model Distribution. CoRR abs/2003.06464 (2020) - [i34]A. J. Piergiovanni, Michael S. Ryoo:
AViD Dataset: Anonymized Videos from Diverse Countries. CoRR abs/2007.05515 (2020) - [i33]Xiaofang Wang, Xuehan Xiong, Maxim Neumann, A. J. Piergiovanni, Michael S. Ryoo, Anelia Angelova, Kris M. Kitani, Wei Hua:
AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification. CoRR abs/2007.12034 (2020) - [i32]A. J. Piergiovanni, Anelia Angelova, Alexander Toshev, Michael S. Ryoo:
Adversarial Generative Grammars for Human Activity Prediction. CoRR abs/2008.04888 (2020) - [i31]Michael S. Ryoo, A. J. Piergiovanni, Juhana Kangaspunta, Anelia Angelova:
AssembleNet++: Assembling Modality Representations via Attention Connections. CoRR abs/2008.08072 (2020) - [i30]Ramyad Hadidi, Jiashen Cao, Michael S. Ryoo, Hyesoon Kim:
Reducing Inference Latency with Concurrent Architectures for Image Recognition. CoRR abs/2011.07092 (2020)
2010 – 2019
- 2019
- [c54]Alan Wu, A. J. Piergiovanni, Michael S. Ryoo:
Model-based Behavioral Cloning with Future Image Similarity Learning. CoRL 2019: 1062-1077 - [c53]A. J. Piergiovanni, Michael S. Ryoo:
Early Detection of Injuries in MLB Pitchers From Video. CVPR Workshops 2019: 2431-2438 - [c52]A. J. Piergiovanni, Michael S. Ryoo:
Representation Flow for Action Recognition. CVPR 2019: 9945-9953 - [c51]