default search action
Rogério Feris
Rogério Schmidt Feris
Person information
- affiliation: IBM T. J. Watson Research Center, Hawthorne, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [c160]Junmo Kang, Hongyin Luo, Yada Zhu, Jacob A. Hansen, James R. Glass, David D. Cox, Alan Ritter, Rogério Feris, Leonid Karlinsky:
Self-Specialization: Uncovering Latent Expertise within Large Language Models. ACL (Findings) 2024: 2681-2706 - [c159]James Seale Smith, Lazar Valkov, Shaunak Halbe, Vyshnavi Gutta, Rogério Feris, Zsolt Kira, Leonid Karlinsky:
Adaptive Memory Replay for Continual Learning. CVPR Workshops 2024: 3605-3615 - [c158]Brian Chen, Nina Shvetsova, Andrew Rouditchenko, Daniel Kondermann, Samuel Thomas, Shih-Fu Chang, Rogério Feris, James R. Glass, Hilde Kuehne:
What, When, and Where? Self-Supervised Spatio- Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions. CVPR 2024: 18419-18429 - [c157]Aaron K. Baughman, Eduardo Morales, Rahul Agarwal, Gozde Akay, Rogério Feris, Tony Johnson, Stephen Hammer, Leonid Karlinsky:
Large Scale Generative AI Text Applied to Sports and Music. KDD 2024: 4784-4792 - [c156]Bowen Pan, Rameswar Panda, SouYoung Jin, Rogério Feris, Aude Oliva, Phillip Isola, Yoon Kim:
LangNav: Language as a Perceptual Representation for Navigation. NAACL-HLT (Findings) 2024: 950-974 - [c155]Ximeng Sun, Rameswar Panda, Chun-Fu Richard Chen, Naigang Wang, Bowen Pan, Aude Oliva, Rogério Feris, Kate Saenko:
Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths. WACV 2024: 946-956 - [i105]Zexue He, Leonid Karlinsky, Donghyun Kim, Julian J. McAuley, Dmitry Krotov, Rogério Feris:
CAMELoT: Towards Large Language Models with Training-Free Consolidated Associative Memory. CoRR abs/2402.13449 (2024) - [i104]Aaron K. Baughman, Stephen Hammer, Rahul Agarwal, Gozde Akay, Eduardo Morales, Tony Johnson, Leonid Karlinsky, Rogério Feris:
Large Scale Generative AI Text Applied to Sports and Music. CoRR abs/2402.15514 (2024) - [i103]James Seale Smith, Lazar Valkov, Shaunak Halbe, Vyshnavi Gutta, Rogério Feris, Zsolt Kira, Leonid Karlinsky:
Adaptive Memory Replay for Continual Learning. CoRR abs/2404.12526 (2024) - [i102]Runqian Wang, Soumya Ghosh, David D. Cox, Diego Antognini, Aude Oliva, Rogério Feris, Leonid Karlinsky:
Trans-LoRA: towards data-free Transferable Parameter Efficient Finetuning. CoRR abs/2405.17258 (2024) - [i101]Irene Huang, Wei Lin, Muhammad Jehanzeb Mirza, Jacob A. Hansen, Sivan Doveh, Victor Ion Butoi, Roei Herzig, Assaf Arbelle, Hilde Kuehne, Trevor Darrell, Chuang Gan, Aude Oliva, Rogério Feris, Leonid Karlinsky:
ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs. CoRR abs/2406.08164 (2024) - [i100]Wei Lin, Muhammad Jehanzeb Mirza, Sivan Doveh, Rogério Feris, Raja Giryes, Sepp Hochreiter, Leonid Karlinsky:
Comparison Visual Instruction Tuning. CoRR abs/2406.09240 (2024) - [i99]Andrew Rouditchenko, Yuan Gong, Samuel Thomas, Leonid Karlinsky, Hilde Kuehne, Rogério Feris, James R. Glass:
Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation. CoRR abs/2406.10082 (2024) - [i98]Junmo Kang, Leonid Karlinsky, Hongyin Luo, Zhen Wang, Jacob A. Hansen, Jim Glass, David D. Cox, Rameswar Panda, Rogério Feris, Alan Ritter:
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts. CoRR abs/2406.12034 (2024) - [i97]Nasim Borazjanizadeh, Roei Herzig, Trevor Darrell, Rogério Feris, Leonid Karlinsky:
Navigating the Labyrinth: Evaluating and Enhancing LLMs' Ability to Reason About Search Problems. CoRR abs/2406.12172 (2024) - [i96]Matt Stallone, Vaibhav Saxena, Leonid Karlinsky, Bridget McGinn, Tim Bula, Mayank Mishra, Adriana Meza Soria, Gaoyuan Zhang, Aditya Prasad, Yikang Shen, Saptha Surendran, Shanmukha C. Guttula, Hima Patel, Parameswaran Selvam, Xuan-Hong Dang, Yan Koyfman, Atin Sood, Rogério Feris, Nirmit Desai, David D. Cox, Ruchir Puri, Rameswar Panda:
Scaling Granite Code Models to 128K Context. CoRR abs/2407.13739 (2024) - [i95]Muhammad Jehanzeb Mirza, Mengjie Zhao, Zhuoyuan Mao, Sivan Doveh, Wei Lin, Paul Gavrikov, Michael Dorkenwald, Shiqi Yang, Saurav Jha, Hiromi Wakaki, Yuki Mitsufuji, Horst Possegger, Rogério Feris, Leonid Karlinsky, James R. Glass:
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models. CoRR abs/2410.06154 (2024) - 2023
- [c154]Zexue He, Graeme Blackwood, Rameswar Panda, Julian J. McAuley, Rogério Feris:
Synthetic Pre-Training Tasks for Neural Machine Translation. ACL (Findings) 2023: 8080-8098 - [c153]Sivan Doveh, Assaf Arbelle, Sivan Harary, Eli Schwartz, Roei Herzig, Raja Giryes, Rogério Feris, Rameswar Panda, Shimon Ullman, Leonid Karlinsky:
Teaching Structured Vision & Language Concepts to Vision & Language Models. CVPR 2023: 2657-2668 - [c152]James Seale Smith, Leonid Karlinsky, Vyshnavi Gutta, Paola Cascante-Bonilla, Donghyun Kim, Assaf Arbelle, Rameswar Panda, Rogério Feris, Zsolt Kira:
CODA-Prompt: COntinual Decomposed Attention-Based Prompting for Rehearsal-Free Continual Learning. CVPR 2023: 11909-11919 - [c151]James Seale Smith, Paola Cascante-Bonilla, Assaf Arbelle, Donghyun Kim, Rameswar Panda, David D. Cox, Diyi Yang, Zsolt Kira, Rogério Feris, Leonid Karlinsky:
ConStruct-VL: Data-Free Continual Structured VL Concepts Learning. CVPR 2023: 14994-15004 - [c150]Roei Herzig, Alon Mendelson, Leonid Karlinsky, Assaf Arbelle, Rogério Feris, Trevor Darrell, Amir Globerson:
Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene Graphs. EMNLP 2023: 14077-14098 - [c149]Andrew Rouditchenko, Yung-Sung Chuang, Nina Shvetsova, Samuel Thomas, Rogério Feris, Brian Kingsbury, Leonid Karlinsky, David Harwath, Hilde Kuehne, James R. Glass:
C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval. ICASSP 2023: 1-5 - [c148]Wei Lin, Leonid Karlinsky, Nina Shvetsova, Horst Possegger, Mateusz Kozinski, Rameswar Panda, Rogério Feris, Hilde Kuehne, Horst Bischof:
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge. ICCV 2023: 2839-2850 - [c147]Kaihong Wang, Donghyun Kim, Rogério Feris, Margrit Betke:
CDAC: Cross-domain Attention Consistency in Transformer for Domain Adaptive Semantic Segmentation. ICCV 2023: 11485-11495 - [c146]Paola Cascante-Bonilla, Khaled Shehada, James Seale Smith, Sivan Doveh, Donghyun Kim, Rameswar Panda, Gül Varol, Aude Oliva, Vicente Ordonez, Rogério Feris, Leonid Karlinsky:
Going Beyond Nouns With Vision & Language Models Using Synthetic Data. ICCV 2023: 20098-20108 - [c145]Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogério Feris, David Daniel Cox, Zhangyang Wang, Yoon Kim:
Learning to Grow Pretrained Models for Efficient Transformer Training. ICLR 2023 - [c144]Zhen Wang, Rameswar Panda, Leonid Karlinsky, Rogério Feris, Huan Sun, Yoon Kim:
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning. ICLR 2023 - [c143]Andrew Rouditchenko, Sameer Khurana, Samuel Thomas, Rogério Feris, Leonid Karlinsky, Hilde Kuehne, David Harwath, Brian Kingsbury, James R. Glass:
Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages. INTERSPEECH 2023: 2268-2272 - [c142]Sivan Doveh, Assaf Arbelle, Sivan Harary, Roei Herzig, Donghyun Kim, Paola Cascante-Bonilla, Amit Alfassy, Rameswar Panda, Raja Giryes, Rogério Feris, Shimon Ullman, Leonid Karlinsky:
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models. NeurIPS 2023 - [c141]Muhammad Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Horst Possegger, Mateusz Kozinski, Rogério Feris, Horst Bischof:
LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections. NeurIPS 2023 - [c140]Howard Zhong, Samarth Mishra, Donghyun Kim, SouYoung Jin, Rameswar Panda, Hilde Kuehne, Leonid Karlinsky, Venkatesh Saligrama, Aude Oliva, Rogério Feris:
Learning Human Action Recognition Representations Without Real Humans. NeurIPS 2023 - [c139]Tianhong Li, Lijie Fan, Yuan Yuan, Hao He, Yonglong Tian, Rogério Feris, Piotr Indyk, Dina Katabi:
Addressing Feature Suppression in Unsupervised Visual Representations. WACV 2023: 1411-1420 - [c138]Aadarsh Sahoo, Rameswar Panda, Rogério Feris, Kate Saenko, Abir Das:
Select, Label, and Mix: Learning Discriminative Invariant Feature Representations for Partial Domain Adaptation. WACV 2023: 4199-4208 - [i94]Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogério Feris, David Daniel Cox, Zhangyang Wang, Yoon Kim:
Learning to Grow Pretrained Models for Efficient Transformer Training. CoRR abs/2303.00980 (2023) - [i93]Zhen Wang, Rameswar Panda, Leonid Karlinsky, Rogério Feris, Huan Sun, Yoon Kim:
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning. CoRR abs/2303.02861 (2023) - [i92]Wei Lin, Leonid Karlinsky, Nina Shvetsova, Horst Possegger, Mateusz Kozinski, Rameswar Panda, Rogério Feris, Hilde Kuehne, Horst Bischof:
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge. CoRR abs/2303.08914 (2023) - [i91]Kuniaki Saito, Donghyun Kim, Piotr Teterwak, Rogério Feris, Kate Saenko:
Mind the Backbone: Minimizing Backbone Distortion for Robust Object Detection. CoRR abs/2303.14744 (2023) - [i90]Brian Chen, Nina Shvetsova, Andrew Rouditchenko, Daniel Kondermann, Samuel Thomas, Shih-Fu Chang, Rogério Feris, James R. Glass, Hilde Kuehne:
What, when, and where? - Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions. CoRR abs/2303.16990 (2023) - [i89]Paola Cascante-Bonilla, Khaled Shehada, James Seale Smith, Sivan Doveh, Donghyun Kim, Rameswar Panda, Gül Varol, Aude Oliva, Vicente Ordonez, Rogério Feris, Leonid Karlinsky:
Going Beyond Nouns With Vision & Language Models Using Synthetic Data. CoRR abs/2303.17590 (2023) - [i88]Roei Herzig, Alon Mendelson, Leonid Karlinsky, Assaf Arbelle, Rogério Feris, Trevor Darrell, Amir Globerson:
Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene Graphs. CoRR abs/2305.06343 (2023) - [i87]Andrew Rouditchenko, Sameer Khurana, Samuel Thomas, Rogério Feris, Leonid Karlinsky, Hilde Kuehne, David Harwath, Brian Kingsbury, James R. Glass:
Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages. CoRR abs/2305.12606 (2023) - [i86]Muhammad Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Mateusz Kozinski, Horst Possegger, Rogério Feris, Horst Bischof:
LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections. CoRR abs/2305.18287 (2023) - [i85]Sivan Doveh, Assaf Arbelle, Sivan Harary, Roei Herzig, Donghyun Kim, Paola Cascante-Bonilla, Amit Alfassy, Rameswar Panda, Raja Giryes, Rogério Feris, Shimon Ullman, Leonid Karlinsky:
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models. CoRR abs/2305.19595 (2023) - [i84]Muhammad Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Horst Possegger, Rogério Feris, Horst Bischof:
TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification. CoRR abs/2309.06809 (2023) - [i83]Junmo Kang, Hongyin Luo, Yada Zhu, James R. Glass, David D. Cox, Alan Ritter, Rogério Feris, Leonid Karlinsky:
Self-Specialization: Uncovering Latent Expertise within Large Language Models. CoRR abs/2310.00160 (2023) - [i82]Bowen Pan, Rameswar Panda, SouYoung Jin, Rogério Feris, Aude Oliva, Phillip Isola, Yoon Kim:
LangNav: Language as a Perceptual Representation for Navigation. CoRR abs/2310.07889 (2023) - [i81]Howard Zhong, Samarth Mishra, Donghyun Kim, SouYoung Jin, Rameswar Panda, Hilde Kuehne, Leonid Karlinsky, Venkatesh Saligrama, Aude Oliva, Rogério Feris:
Learning Human Action Recognition Representations Without Real Humans. CoRR abs/2311.06231 (2023) - 2022
- [j21]Joshua K. Lee, Yuheng Bu, Prasanna Sattigeri, Rameswar Panda, Gregory W. Wornell, Leonid Karlinsky, Rogério Schmidt Feris:
A Maximal Correlation Framework for Fair Machine Learning. Entropy 24(4): 461 (2022) - [j20]Mathew Monfort, Bowen Pan, Kandan Ramakrishnan, Alex Andonian, Barry A. McNamara, Alex Lascelles, Quanfu Fan, Dan Gutfreund, Rogério Schmidt Feris, Aude Oliva:
Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding. IEEE Trans. Pattern Anal. Mach. Intell. 44(12): 9434-9445 (2022) - [j19]Eli Schwartz, Leonid Karlinsky, Rogério Feris, Raja Giryes, Alexander M. Bronstein:
Baby steps towards few-shot learning with multiple semantics. Pattern Recognit. Lett. 160: 142-147 (2022) - [c137]Paola Cascante-Bonilla, Hui Wu, Letao Wang, Rogério Feris, Vicente Ordonez:
Sim VQA: Exploring Simulated Environments for Visual Question Answering. CVPR 2022: 5046-5056 - [c136]Yi Li, Rameswar Panda, Yoon Kim, Chun-Fu Richard Chen, Rogério Feris, David D. Cox, Nuno Vasconcelos:
VALHALLA: Visual Hallucination for Machine Translation. CVPR 2022: 5206-5216 - [c135]Sivan Harary, Eli Schwartz, Assaf Arbelle, Peter W. J. Staar, Shady Abu-Hussein, Elad Amrani, Roei Herzig, Amit Alfassy, Raja Giryes, Hilde Kuehne, Dina Katabi, Kate Saenko, Rogério Feris, Leonid Karlinsky:
Unsupervised Domain Generalization by Learning a Bridge Across Domains. CVPR 2022: 5270-5280 - [c134]Tianhong Li, Peng Cao, Yuan Yuan, Lijie Fan, Yuzhe Yang, Rogério Feris, Piotr Indyk, Dina Katabi:
Targeted Supervised Contrastive Learning for Long-Tailed Recognition. CVPR 2022: 6908-6918 - [c133]Samarth Mishra, Rameswar Panda, Cheng Perng Phoo, Chun-Fu Richard Chen, Leonid Karlinsky, Kate Saenko, Venkatesh Saligrama, Rogério Schmidt Feris:
Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data. CVPR 2022: 9184-9194 - [c132]Nina Shvetsova, Brian Chen, Andrew Rouditchenko, Samuel Thomas, Brian Kingsbury, Rogério Feris, David Harwath, James R. Glass, Hilde Kuehne:
Everything at Once - Multi-modal Fusion Transformer for Video Retrieval. CVPR 2022: 19988-19997 - [c131]Joshua K. Lee, Yuheng Bu, Prasanna Sattigeri, Rameswar Panda, Gregory W. Wornell, Leonid Karlinsky, Rogério Feris:
A Maximal Correlation Approach to Imposing Fairness in Machine Learning. ICASSP 2022: 3523-3527 - [c130]Amit Alfassy, Assaf Arbelle, Oshri Halimi, Sivan Harary, Roei Herzig, Eli Schwartz, Rameswar Panda, Michele Dolfi, Christoph Auer, Peter W. J. Staar, Kate Saenko, Rogério Feris, Leonid Karlinsky:
FETA: Towards Specializing Foundational Models for Expert Task Applications. NeurIPS 2022 - [c129]Manel Baradad, Chun-Fu Richard Chen, Jonas Wulff, Tongzhou Wang, Rogério Feris, Antonio Torralba, Phillip Isola:
Procedural Image Programs for Representation Learning. NeurIPS 2022 - [c128]Yo-whan Kim, Samarth Mishra, SouYoung Jin, Rameswar Panda, Hilde Kuehne, Leonid Karlinsky, Venkatesh Saligrama, Kate Saenko, Aude Oliva, Rogério Feris:
How Transferable are Video Representations Based on Synthetic Data? NeurIPS 2022 - [i80]Paola Cascante-Bonilla, Hui Wu, Letao Wang, Rogério Feris, Vicente Ordonez:
SimVQA: Exploring Simulated Environments for Visual Question Answering. CoRR abs/2203.17219 (2022) - [i79]Yi Li, Rameswar Panda, Yoon Kim, Chun-Fu Chen, Rogério Feris, David D. Cox, Nuno Vasconcelos:
VALHALLA: Visual Hallucination for Machine Translation. CoRR abs/2206.00100 (2022) - [i78]Amit Alfassy, Assaf Arbelle, Oshri Halimi, Sivan Harary, Roei Herzig, Eli Schwartz, Rameswar Panda, Michele Dolfi, Christoph Auer, Kate Saenko, Peter W. J. Staar, Rogério Feris, Leonid Karlinsky:
FETA: Towards Specializing Foundation Models for Expert Task Applications. CoRR abs/2209.03648 (2022) - [i77]Andrew Rouditchenko, Yung-Sung Chuang, Nina Shvetsova, Samuel Thomas, Rogério Feris, Brian Kingsbury, Leonid Karlinsky, David Harwath, Hilde Kuehne, James R. Glass:
C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval. CoRR abs/2210.03625 (2022) - [i76]James Seale Smith, Paola Cascante-Bonilla, Assaf Arbelle, Donghyun Kim, Rameswar Panda, David D. Cox, Diyi Yang, Zsolt Kira, Rogério Feris, Leonid Karlinsky:
ConStruct-VL: Data-Free Continual Structured VL Concepts Learning. CoRR abs/2211.09790 (2022) - [i75]Sivan Doveh, Assaf Arbelle, Sivan Harary, Rameswar Panda, Roei Herzig, Eli Schwartz, Donghyun Kim, Raja Giryes, Rogério Feris, Shimon Ullman, Leonid Karlinsky:
Teaching Structured Vision&Language Concepts to Vision&Language Models. CoRR abs/2211.11733 (2022) - [i74]James Seale Smith, Leonid Karlinsky, Vyshnavi Gutta, Paola Cascante-Bonilla, Donghyun Kim, Assaf Arbelle, Rameswar Panda, Rogério Feris, Zsolt Kira:
CODA-Prompt: COntinual Decomposed Attention-based Prompting for Rehearsal-Free Continual Learning. CoRR abs/2211.13218 (2022) - [i73]Kaihong Wang, Donghyun Kim, Rogério Feris, Kate Saenko, Margrit Betke:
Exploring Consistency in Cross-Domain Transformer for Domain Adaptive Semantic Segmentation. CoRR abs/2211.14703 (2022) - [i72]Manel Baradad, Chun-Fu Chen, Jonas Wulff, Tongzhou Wang, Rogério Feris, Antonio Torralba, Phillip Isola:
Procedural Image Programs for Representation Learning. CoRR abs/2211.16412 (2022) - [i71]Zexue He, Graeme Blackwood, Rameswar Panda, Julian J. McAuley, Rogério Feris:
Synthetic Pre-Training Tasks for Neural Machine Translation. CoRR abs/2212.09864 (2022) - 2021
- [j18]Sivan Doveh, Eli Schwartz, Chao Xue, Rogério Feris, Alexander M. Bronstein, Raja Giryes, Leonid Karlinsky:
MetAdapt: Meta-learned task-adaptive architecture for few-shot classification. Pattern Recognit. Lett. 149: 130-136 (2021) - [c127]Leonid Karlinsky, Joseph Shtok, Amit Alfassy, Moshe Lichtenstein, Sivan Harary, Eli Schwartz, Sivan Doveh, Prasanna Sattigeri, Rogério Feris, Alex M. Bronstein, Raja Giryes:
StarNet: towards Weakly Supervised Few-Shot Object Detection. AAAI 2021: 1743-1753 - [c126]Rameswar Panda, Michele Merler, Mayoore S. Jaiswal, Hui Wu, Kandan Ramakrishnan, Ulrich Finkler, Chun-Fu (Richard) Chen, Minsik Cho, Rogério Feris, David S. Kung, Bishwaranjan Bhattacharjee:
NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search. AAAI 2021: 9294-9302 - [c125]Jiachen Li, Bowen Cheng, Rogério Feris, Jinjun Xiong, Thomas S. Huang, Wen-Mei Hwu, Humphrey Shi:
Pseudo-IoU: Improving Label Assignment in Anchor-Free Object Detection. CVPR Workshops 2021: 2378-2387 - [c124]Spencer Whitehead, Hui Wu, Heng Ji, Rogério Feris, Kate Saenko:
Separating Skills and Concepts for Novel Visual Question Answering. CVPR 2021: 5632-5641 - [c123]Chun-Fu (Richard) Chen, Rameswar Panda, Kandan Ramakrishnan, Rogério Feris, John Cohn, Aude Oliva, Quanfu Fan:
Deep Analysis of CNN-Based Spatio-Temporal Representations for Action Recognition. CVPR 2021: 6165-6175 - [c122]Guy Bukchin, Eli Schwartz, Kate Saenko, Ori Shahar, Rogério Feris, Raja Giryes, Leonid Karlinsky:
Fine-Grained Angular Contrastive Learning With Coarse Labels. CVPR 2021: 8730-8740 - [c121]Ankit Singh, Omprakash Chakraborty, Ashutosh Varshney, Rameswar Panda, Rogério Feris, Kate Saenko, Abir Das:
Semi-Supervised Action Recognition With Temporal Contrastive Learning. CVPR 2021: 10389-10399 - [c120]Hui Wu, Yupeng Gao, Xiaoxiao Guo, Ziad Al-Halah, Steven Rennie, Kristen Grauman, Rogério Feris:
Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback. CVPR 2021: 11307-11317 - [c119]Mathew Monfort, SouYoung Jin, Alexander H. Liu, David Harwath, Rogério Feris, James R. Glass, Aude Oliva:
Spoken Moments: Learning Joint Audio-Visual Representations From Video Descriptions. CVPR 2021: 14871-14881 - [c118]Assaf Arbelle, Sivan Doveh, Amit Alfassy, Joseph Shtok, Guy Lev, Eli Schwartz, Hilde Kuehne, Hila Barak Levi, Prasanna Sattigeri, Rameswar Panda, Chun-Fu Chen, Alex M. Bronstein, Kate Saenko, Shimon Ullman, Raja Giryes, Rogério Feris, Leonid Karlinsky:
Detector-Free Weakly Supervised Grounding by Separation. ICCV 2021: 1781-1792 - [c117]Ximeng Sun, Rameswar Panda, Chun-Fu (Richard) Chen, Aude Oliva, Rogério Feris, Kate Saenko:
Dynamic Network Quantization for Efficient Video Inference. ICCV 2021: 7355-7365 - [c116]Rameswar Panda, Chun-Fu (Richard) Chen, Quanfu Fan, Ximeng Sun, Kate Saenko, Aude Oliva, Rogério Feris:
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition. ICCV 2021: 7556-7565 - [c115]Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie W. Boggust, Rameswar Panda, Brian Kingsbury, Rogério Feris, David Harwath, James R. Glass, Michael Picheny, Shih-Fu Chang:
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos. ICCV 2021: 7992-8001 - [c114]Ashraful Islam, Chun-Fu Chen, Rameswar Panda, Leonid Karlinsky, Richard J. Radke, Rogério Feris:
A Broad Study on the Transferability of Visual Representations with Contrastive Learning. ICCV 2021: 8825-8835 - [c113]Yue Meng, Rameswar Panda, Chung-Ching Lin, Prasanna Sattigeri, Leonid Karlinsky, Kate Saenko, Aude Oliva, Rogério Feris:
AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition. ICLR 2021 - [c112]Bowen Pan, Rameswar Panda, Camilo Luciano Fosco, Chung-Ching Lin, Alex J. Andonian, Yue Meng, Kate Saenko, Aude Oliva, Rogério Feris:
VA-RED2: Video Adaptive Redundancy Reduction. ICLR 2021 - [c111]Andrew Rouditchenko, Angie W. Boggust, David Harwath, Brian Chen, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Hilde Kuehne, Rameswar Panda, Rogério Schmidt Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James R. Glass:
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos. Interspeech 2021: 1584-1588 - [c110]Andrew Rouditchenko, Angie W. Boggust, David Harwath, Samuel Thomas, Hilde Kuehne, Brian Chen, Rameswar Panda, Rogério Feris, Brian Kingsbury, Michael Picheny, James R. Glass:
Cascaded Multilingual Audio-Visual Learning from Videos. Interspeech 2021: 3006-3010 - [c109]Ashraful Islam, Chun-Fu (Richard) Chen, Rameswar Panda, Leonid Karlinsky, Rogério Feris, Richard J. Radke:
Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data. NeurIPS 2021: 3584-3595 - [c108]Bowen Pan, Rameswar Panda, Yifan Jiang, Zhangyang Wang, Rogério Feris, Aude Oliva:
IA-RED$^2$: Interpretability-Aware Redundancy Reduction for Vision Transformers. NeurIPS 2021: 24898-24911 - [i70]Ankit Singh, Omprakash Chakraborty, Ashutosh Varshney, Rameswar Panda, Rogério Feris, Kate Saenko, Abir Das:
Semi-Supervised Action Recognition with Temporal Contrastive Learning. CoRR abs/2102.02751 (2021) - [i69]Yue Meng, Rameswar Panda, Chung-Ching Lin, Prasanna Sattigeri, Leonid Karlinsky, Kate Saenko, Aude Oliva, Rogério Feris:
AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition. CoRR abs/2102.05775 (2021) - [i68]Bowen Pan, Rameswar Panda, Camilo Fosco, Chung-Ching Lin, Alex Andonian, Yue Meng, Kate Saenko, Aude Oliva, Rogério Feris:
VA-RED2: Video Adaptive Redundancy Reduction. CoRR abs/2102.07887 (2021) - [i67]Ximeng Sun, Rameswar Panda, Chun-Fu Chen, Naigang Wang, Bowen Pan, Kailash Gopalakrishnan, Aude Oliva, Rogério Feris, Kate Saenko:
All at Once Network Quantization via Collaborative Knowledge Transfer. CoRR abs/2103.01435 (2021) - [i66]Ashraful Islam, Chun-Fu Chen, Rameswar Panda, Leonid Karlinsky, Richard J. Radke, Rogério Feris:
A Broad Study on the Transferability of Visual Representations with Contrastive Learning. CoRR abs/2103.13517 (2021) - [i65]Assaf Arbelle, Sivan Doveh, Amit Alfassy, Joseph Shtok, Guy Lev, Eli Schwartz, Hilde Kuehne, Hila Barak Levi, Prasanna Sattigeri, Rameswar Panda, Chun-Fu Chen, Alex M. Bronstein, Kate Saenko, Shimon Ullman, Raja Giryes, Rogério Feris, Leonid Karlinsky:
Detector-Free Weakly Supervised Grounding by Separation. CoRR abs/2104.09829 (2021) - [i64]Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie W. Boggust, Rameswar Panda, Brian Kingsbury, Rogério Schmidt Feris, David Harwath, James R. Glass, Michael Picheny, Shih-Fu Chang:
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos. CoRR abs/2104.12671 (2021) - [i63]Jiachen Li, Bowen Cheng, Rogério Feris, Jinjun Xiong, Thomas S. Huang, Wen-Mei Hwu, Humphrey Shi:
Pseudo-IoU: Improving Label Assignment in Anchor-Free Object Detection. CoRR abs/2104.14082 (2021) - [i62]Mathew Monfort, SouYoung Jin, Alexander H. Liu, David Harwath, Rogério Feris, James R. Glass, Aude Oliva:
Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions. CoRR abs/2105.04489 (2021) - [i61]Rameswar Panda, Chun-Fu Chen, Quanfu Fan, Ximeng Sun, Kate Saenko, Aude Oliva, Rogério Feris:
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition. CoRR abs/2105.05165 (2021) - [i60]Ashraful Islam, Chun-Fu Chen, Rameswar Panda, Leonid Karlinsky, Rogério Feris, Richard J. Radke:
Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data. CoRR abs/2106.07807 (2021) - [i59]Bowen Pan, Yifan Jiang, Rameswar Panda, Zhangyang Wang, Rogério Feris, Aude Oliva:
IA-RED2: Interpretability-Aware Redundancy Reduction for Vision Transformers.