


default search action
Heng Tao Shen
Hengtao Shen – 申恒涛
Person information
- unicode name: 申恒涛
- affiliation: University of Electronic Science and Technology of China, School of Computer Science and Engineering, Chengdu, China
- affiliation (2004 - 2017): University of Queensland, Brisbane, Australia
- affiliation (PhD 2004): National University of Singapore, Singapore
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [j278]Yujie Mo
, Heng Tao Shen, Xiaofeng Zhu:
Unsupervised multi-view graph representation learning with dual weight-net. Inf. Fusion 114: 102669 (2025) - [j277]Yujie Mo
, Heng Tao Shen, Xiaofeng Zhu:
Efficient self-supervised heterogeneous graph representation learning with reconstruction. Inf. Fusion 117: 102846 (2025) - [j276]Xun Jiang
, Xing Xu
, Huimin Lu
, Lianghua He
, Heng Tao Shen
:
Joint Objective and Subjective Fuzziness Denoising for Multimodal Sentiment Analysis. IEEE Trans. Fuzzy Syst. 33(1): 15-27 (2025) - [j275]Pengpeng Zeng
, Haonan Zhang
, Lianli Gao
, Xiangpeng Li, Jin Qian, Heng Tao Shen
:
Visual Commonsense-Aware Representation Network for Video Captioning. IEEE Trans. Neural Networks Learn. Syst. 36(1): 1092-1103 (2025) - 2024
- [j274]Lifeng Sun, Xinhang Song, Shuqiang Jiang, Lili Wang, Hengtao Shen:
Preface to the Special Issue on Multimodal Collaborative Perception and Fusion Technology. Int. J. Softw. Informatics 14(2): 119-122 (2024) - [j273]Jingjing Li
, Zhiqi Yu
, Zhekai Du
, Lei Zhu
, Heng Tao Shen
:
A Comprehensive Survey on Source-Free Domain Adaptation. IEEE Trans. Pattern Anal. Mach. Intell. 46(8): 5743-5762 (2024) - [j272]Yuhui Wu
, Guoqing Wang
, Shaochong Liu, Yang Yang
, Wei Liu
, Xiongxin Tang
, Shuhang Gu
, Chongyi Li
, Heng Tao Shen
:
Towards a Flexible Semantic Guided Model for Single Image Enhancement and Restoration. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 9921-9939 (2024) - [j271]Chaofan Zheng
, Lianli Gao
, Xinyu Lyu
, Pengpeng Zeng
, Abdulmotaleb El-Saddik
, Heng Tao Shen
:
Dual-Branch Hybrid Learning Network for Unbiased Scene Graph Generation. IEEE Trans. Circuits Syst. Video Technol. 34(3): 1743-1756 (2024) - [j270]Shenshen Li
, Xing Xu
, Xun Jiang
, Fumin Shen
, Xin Liu
, Heng Tao Shen
:
Multi-Grained Attention Network With Mutual Exclusion for Composed Query-Based Image Retrieval. IEEE Trans. Circuits Syst. Video Technol. 34(4): 2959-2972 (2024) - [j269]Zeyu Ma, Ziqiang Zheng
, Jiwei Wei
, Yang Yang
, Heng Tao Shen
:
Instance-Dictionary Learning for Open-World Object Detection in Autonomous Driving Scenarios. IEEE Trans. Circuits Syst. Video Technol. 34(5): 3395-3408 (2024) - [j268]Haonan Zhang
, Pengpeng Zeng
, Lianli Gao
, Xinyu Lyu
, Jingkuan Song
, Heng Tao Shen
:
SPT: Spatial Pyramid Transformer for Image Captioning. IEEE Trans. Circuits Syst. Video Technol. 34(6): 4829-4842 (2024) - [j267]Ziqiang Zheng
, Hao Ren
, Yang Wu, Weichuan Zhang
, Hong Lu
, Yang Yang
, Heng Tao Shen
:
Fully Unsupervised Domain-Agnostic Image Retrieval. IEEE Trans. Circuits Syst. Video Technol. 34(6): 5077-5090 (2024) - [j266]Yin Tang, Tao Chen
, Xiruo Jiang, Yazhou Yao
, Guo-Sen Xie
, Heng Tao Shen
:
Holistic Prototype Attention Network for Few-Shot Video Object Segmentation. IEEE Trans. Circuits Syst. Video Technol. 34(8): 6699-6709 (2024) - [j265]Yahui Xu
, Jiwei Wei
, Yi Bin
, Yang Yang
, Zeyu Ma, Heng Tao Shen
:
Set of Diverse Queries With Uncertainty Regularization for Composed Image Retrieval. IEEE Trans. Circuits Syst. Video Technol. 34(10): 10494-10506 (2024) - [j264]Mingfeng Zha
, Feiyang Fu
, Yunqiang Pei, Guoqing Wang, Tianyu Li
, Xiongxin Tang
, Yang Yang
, Heng Tao Shen
:
Dual Domain Perception and Progressive Refinement for Mirror Detection. IEEE Trans. Circuits Syst. Video Technol. 34(11): 11942-11953 (2024) - [j263]Haonan Zhang
, Pengpeng Zeng
, Lianli Gao
, Jingkuan Song
, Heng Tao Shen
:
Ump: Unified Modality-Aware Prompt Tuning for Text-Video Retrieval. IEEE Trans. Circuits Syst. Video Technol. 34(11): 11954-11964 (2024) - [j262]Yixuan Zhou
, Yi Qu
, Xing Xu
, Fumin Shen
, Jingkuan Song
, Heng Tao Shen
:
BatchNorm-Based Weakly Supervised Video Anomaly Detection. IEEE Trans. Circuits Syst. Video Technol. 34(12): 13642-13654 (2024) - [j261]Feiyu Chen
, Jie Shao
, Anjie Zhu, Deqiang Ouyang
, Xueliang Liu
, Heng Tao Shen
:
Modeling Hierarchical Uncertainty for Multimodal Emotion Recognition in Conversation. IEEE Trans. Cybern. 54(1): 187-198 (2024) - [j260]Yujie Li
, Xun Jiang
, Xing Xu
, Huimin Lu
, Heng Tao Shen
:
Fuzzy Multimodal Graph Reasoning for Human-Centric Instructional Video Grounding. IEEE Trans. Fuzzy Syst. 32(9): 5046-5059 (2024) - [j259]Quan Rui
, Shiyuan He
, Tianyu Li
, Guoqing Wang
, Ningjuan Ruan, Lin Mei, Yang Yang
, Heng Tao Shen
:
Density-Aware Cloud Removal of Remote Sensing Imagery Using a Global-Local Fusion Transformer. IEEE Trans. Geosci. Remote. Sens. 62: 1-11 (2024) - [j258]Jiefu Chen
, Tong Chen, Xing Xu
, Jingran Zhang
, Yang Yang
, Heng Tao Shen
:
Coreset Learning-Based Sparse Black-Box Adversarial Attack for Video Recognition. IEEE Trans. Inf. Forensics Secur. 19: 1547-1560 (2024) - [j257]Guobao Xiao
, Zhimin Tang
, Hanlin Guo
, Jun Yu
, Heng Tao Shen
:
FAFusion: Learning for Infrared and Visible Image Fusion via Frequency Awareness. IEEE Trans. Instrum. Meas. 73: 1-11 (2024) - [j256]Mengmeng Jing
, Jingjing Li
, Ke Lu
, Lei Zhu
, Heng Tao Shen
:
Visually Source-Free Domain Adaptation via Adversarial Style Matching. IEEE Trans. Image Process. 33: 1032-1044 (2024) - [j255]Zheng Wang
, Xing Xu
, Jiwei Wei
, Ning Xie
, Yang Yang
, Heng Tao Shen
:
Semantics Disentangling for Cross-Modal Retrieval. IEEE Trans. Image Process. 33: 2226-2237 (2024) - [j254]Xuanhan Wang
, Xiaojia Chen
, Lianli Gao
, Jingkuan Song
, Heng Tao Shen:
CPI-Parser: Integrating Causal Properties Into Multiple Human Parsing. IEEE Trans. Image Process. 33: 5771-5782 (2024) - [j253]Kumie Gedamu
, Yanli Ji
, Yang Yang
, Jie Shao
, Heng Tao Shen:
Self-Supervised Sub-Action Parsing Network for Semi-Supervised Action Quality Assessment. IEEE Trans. Image Process. 33: 6057-6070 (2024) - [j252]Lei Zhu
, Chaoqun Zheng
, Weili Guan
, Jingjing Li
, Yang Yang
, Heng Tao Shen
:
Multi-Modal Hashing for Efficient Multimedia Retrieval: A Survey. IEEE Trans. Knowl. Data Eng. 36(1): 239-260 (2024) - [j251]Yang Xu
, Lei Zhu
, Jingjing Li
, Fengling Li
, Heng Tao Shen
:
Temporal Social Graph Network Hashing for Efficient Recommendation. IEEE Trans. Knowl. Data Eng. 36(7): 3541-3555 (2024) - [j250]Yan Dai
, Xiaojia Chen
, Xuanhan Wang
, Minghui Pang
, Lianli Gao
, Heng Tao Shen
:
ReSParser: Fully Convolutional Multiple Human Parsing With Representative Sets. IEEE Trans. Multim. 26: 1384-1394 (2024) - [j249]Shuaiqi Jing
, Haonan Zhang
, Pengpeng Zeng
, Lianli Gao
, Jingkuan Song
, Heng Tao Shen
:
Memory-Based Augmentation Network for Video Captioning. IEEE Trans. Multim. 26: 2367-2379 (2024) - [j248]Jinghan Ru
, Jun Tian
, Chengwei Xiao
, Jingjing Li
, Heng Tao Shen
:
Imbalanced Open Set Domain Adaptation via Moving-Threshold Estimation and Gradual Alignment. IEEE Trans. Multim. 26: 2504-2514 (2024) - [j247]Congrui Li
, Ziqiang Zheng
, Yi Bin
, Guoqing Wang
, Yang Yang
, Xuesheng Li
, Heng Tao Shen
:
Pixel Bleach Network for Detecting Face Forgery Under Compression. IEEE Trans. Multim. 26: 2585-2597 (2024) - [j246]Yan Dai
, Beitao Chen
, Lianli Gao
, Jingkuan Song
, Heng Tao Shen
:
DMH-CL: Dynamic Model Hardness Based Curriculum Learning for Complex Pose Estimation. IEEE Trans. Multim. 26: 3180-3193 (2024) - [j245]Jiwei Wei
, Yang Yang
, Xiang Guan
, Xing Xu
, Guoqing Wang
, Heng Tao Shen
:
Runge-Kutta Guided Feature Augmentation for Few-Sample Learning. IEEE Trans. Multim. 26: 7349-7358 (2024) - [j244]Huafeng Liu
, Mengmeng Sheng
, Zeren Sun
, Yazhou Yao
, Xian-Sheng Hua
, Heng Tao Shen
:
Learning With Imbalanced Noisy Data by Preventing Bias in Sample Selection. IEEE Trans. Multim. 26: 7426-7437 (2024) - [j243]Shiyuan He
, Jiwei Wei
, Chaoning Zhang
, Xing Xu
, Jingkuan Song
, Yang Yang
, Heng Tao Shen
:
Boosting Adversarial Training with Hardness-Guided Attack Strategy. IEEE Trans. Multim. 26: 7748-7760 (2024) - [j242]Jian Huang
, Yanli Ji
, Zhen Qin
, Yang Yang
, Heng Tao Shen
:
Dominant SIngle-Modal SUpplementary Fusion (SIMSUF) for Multimodal Sentiment Analysis. IEEE Trans. Multim. 26: 8383-8394 (2024) - [j241]Xun Jiang
, Xing Xu
, Zailei Zhou
, Yang Yang
, Fumin Shen
, Heng Tao Shen
:
Zero-Shot Video Moment Retrieval With Angular Reconstructive Text Embeddings. IEEE Trans. Multim. 26: 9657-9670 (2024) - [j240]Yahui Xu
, Yi Bin
, Jiwei Wei
, Yang Yang
, Guoqing Wang
, Heng Tao Shen
:
Align and Retrieve: Composition and Decomposition Learning in Image Retrieval With Text Feedback. IEEE Trans. Multim. 26: 9936-9948 (2024) - [j239]Zheng Wang
, Zhenwei Gao, Mengqun Han, Yang Yang
, Heng Tao Shen
:
Estimating the Semantics via Sector Embedding for Image-Text Retrieval. IEEE Trans. Multim. 26: 10342-10353 (2024) - [j238]Jiwei Wei
, Chen Pan
, Shiyuan He
, Guoqing Wang
, Yang Yang
, Heng Tao Shen
:
Towards Robust Person Re-Identification by Adversarial Training With Dynamic Attack Strategy. IEEE Trans. Multim. 26: 10367-10380 (2024) - [j237]Dan Zhang
, Zhekai Du
, Jingjing Li
, Lei Zhu
, Heng Tao Shen
:
Domain-Adaptive Energy-Based Models for Generalizable Face Anti-Spoofing. IEEE Trans. Multim. 26: 10474-10488 (2024) - [j236]Ke Liu
, Jiwei Wei
, Jie Zou
, Peng Wang
, Yang Yang
, Heng Tao Shen
:
Improving Pre-Trained Model-Based Speech Emotion Recognition From a Low-Level Speech Feature Perspective. IEEE Trans. Multim. 26: 10623-10636 (2024) - [j235]Ran Ran
, Jiwei Wei
, Chaoning Zhang
, Guoqing Wang
, Yang Yang
, Heng Tao Shen
:
Adaptive Multi-scale Degradation-Based Attack for Boosting the Adversarial Transferability. IEEE Trans. Multim. 26: 10979-10990 (2024) - [j234]Xiruo Jiang
, Yazhou Yao
, Xili Dai
, Fumin Shen
, Liqiang Nie
, Heng Tao Shen
:
Anti-Collapse Loss for Deep Metric Learning. IEEE Trans. Multim. 26: 11139-11150 (2024) - [j233]Yalan Ye
, Tongjie Pan
, Qianhe Meng, Jingjing Li
, Heng Tao Shen
:
Online Unsupervised Domain Adaptation via Reducing Inter- and Intra-Domain Discrepancies. IEEE Trans. Neural Networks Learn. Syst. 35(1): 884-898 (2024) - [j232]Xun Jiang
, Xing Xu
, Jingran Zhang
, Fumin Shen
, Zuo Cao
, Heng Tao Shen
:
SDN: Semantic Decoupling Network for Temporal Language Grounding. IEEE Trans. Neural Networks Learn. Syst. 35(5): 6598-6612 (2024) - [j231]Zheng Wang
, Xing Xu
, Yin Zhang
, Yang Yang
, Heng Tao Shen
:
Complex Relation Embedding for Scene Graph Generation. IEEE Trans. Neural Networks Learn. Syst. 35(6): 8321-8335 (2024) - [j230]Liang Peng
, Yujie Mo
, Jie Xu
, Jialie Shen
, Xiaoshuang Shi
, Xiaoxiao Li
, Heng Tao Shen
, Xiaofeng Zhu
:
GRLC: Graph Representation Learning With Constraints. IEEE Trans. Neural Networks Learn. Syst. 35(6): 8609-8622 (2024) - [j229]Yan Dai
, Xuanhan Wang
, Lianli Gao
, Jingkuan Song
, Feng Zheng
, Heng Tao Shen
:
Overcoming Data Deficiency for Multi-Person Pose Estimation. IEEE Trans. Neural Networks Learn. Syst. 35(8): 10857-10868 (2024) - [j228]Haonan Luo
, Guosheng Lin
, Fumin Shen
, Xingguo Huang
, Yazhou Yao
, Hengtao Shen
:
Robust-EQA: Robust Learning for Embodied Question Answering With Noisy Labels. IEEE Trans. Neural Networks Learn. Syst. 35(9): 12083-12094 (2024) - [j227]Hongzu Su
, Jingjing Li
, Zhekai Du
, Lei Zhu
, Ke Lu
, Heng Tao Shen
:
Cross-domain Recommendation via Dual Adversarial Adaptation. ACM Trans. Inf. Syst. 42(3): 83:1-83:26 (2024) - [j226]Tianshi Wang
, Fengling Li
, Lei Zhu
, Jingjing Li
, Zheng Zhang
, Heng Tao Shen
:
Invisible Black-Box Backdoor Attack against Deep Cross-Modal Hashing Retrieval. ACM Trans. Inf. Syst. 42(4): 111:1-111:27 (2024) - [c264]Shenshen Li, Chen He, Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen:
Adaptive Uncertainty-Based Learning for Text-Based Person Retrieval. AAAI 2024: 3172-3180 - [c263]Ziyang Lu, Yunqiang Pei, Guoqing Wang, Peiwei Li, Yang Yang, Yinjie Lei, Heng Tao Shen:
ScanERU: Interactive 3D Visual Grounding Based on Embodied Reference Understanding. AAAI 2024: 3936-3944 - [c262]Mingfeng Zha, Yunqiang Pei, Guoqing Wang, Tianyu Li
, Yang Yang, Wenbin Qian, Heng Tao Shen:
Weakly-Supervised Mirror Detection via Scribble Annotations. AAAI 2024: 6953-6961 - [c261]Lei Wang, Yi Hu, Jiabang He, Xing Xu, Ning Liu, Hui Liu, Heng Tao Shen:
T-SciQ: Teaching Multimodal Chain-of-Thought Reasoning via Large Language Model Signals for Science Question Answering. AAAI 2024: 19162-19170 - [c260]Fei Kong, Jinhao Duan, Lichao Sun, Hao Cheng, Renjing Xu, Hengtao Shen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu:
ACT-Diffusion: Efficient Adversarial Consistency Training for One-Step Diffusion Models. CVPR 2024: 8890-8899 - [c259]Ji Zhang, Shihan Wu, Lianli Gao, Heng Tao Shen, Jingkuan Song:
DePT: Decoupled Prompt Tuning. CVPR 2024: 12924-12933 - [c258]Kaipeng Fang, Jingkuan Song, Lianli Gao, Pengpeng Zeng, Zhi-Qi Cheng, Xiyao Li, Heng Tao Shen:
ProS: Prompting-to-Simulate Generalized Knowledge for Universal Cross-Domain Retrieval. CVPR 2024: 17292-17301 - [c257]Bowen Tang, Zheng Wang, Yi Bin, Qi Dou, Yang Yang, Heng Tao Shen:
Ensemble Diversity Facilitates Adversarial Transferability. CVPR 2024: 24377-24386 - [c256]Zixian Gao, Xun Jiang, Xing Xu, Fumin Shen, Yujie Li, Heng Tao Shen:
Embracing Unimodal Aleatoric Uncertainty for Robust Multimodal Fusion. CVPR 2024: 26866-26875 - [c255]Renming Huang, Yunqiang Pei, Guoqing Wang, Yangming Zhang, Yang Yang, Peng Wang, Hengtao Shen:
Diffusion Models as Optimizers for Efficient Planning in Offline RL. ECCV (51) 2024: 1-17 - [c254]Zhiyuan Wang, Jinhao Duan, Lu Cheng, Yue Zhang, Qingni Wang, Xiaoshuang Shi, Kaidi Xu, Heng Tao Shen, Xiaofeng Zhu:
ConU: Conformal Uncertainty in Large Language Models with Correctness Coverage Guarantees. EMNLP (Findings) 2024: 6886-6898 - [c253]Zetao Zheng, Jie Shao, Shilong Deng, Anjie Zhu, Heng Tao Shen, Xiaofang Zhou:
Cross-Insight Trader: A Trading Approach Integrating Policies with Diverse Investment Horizons for Portfolio Management. ICDE 2024: 4685-4698 - [c252]Zetao Zheng, Jie Shao, Feiyu Chen, Anjie Zhu, Shilong Deng, Heng Tao Shen:
HIT: Solving Partial Index Tracking via Hierarchical Reinforcement Learning. ICDE 2024: 4709-4721 - [c251]Fei Kong, Jinhao Duan, Ruipeng Ma, Heng Tao Shen, Xiaoshuang Shi, Xiaofeng Zhu, Kaidi Xu:
An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization. ICLR 2024 - [c250]Yujie Mo, Feiping Nie, Ping Hu, Heng Tao Shen, Zheng Zhang, Xinchao Wang, Xiaofeng Zhu:
Self-Supervised Heterogeneous Graph Learning: a Homophily and Heterogeneity View. ICLR 2024 - [c249]Mengmeng Zhan, Zongqian Wu, Rongyao Hu, Ping Hu, Heng Tao Shen, Xiaofeng Zhu:
Towards Dynamic-Prompting Collaboration for Source-Free Domain Adaptation. IJCAI 2024: 1643-1651 - [c248]Yunqiang Pei
, Kaiyue Zhang
, Hongrong Yang
, Yong Tao
, Qihang Tang
, Jialei Tang
, Guoqing Wang
, Zhitao Liu
, Ning Xie
, Peng Wang
, Yang Yang
, Hengtao Shen
:
Improving Interaction Comfort in Authoring Task in AR-HRI through Dynamic Dual-Layer Interaction Adjustment. ACM Multimedia 2024: 88-97 - [c247]Haonan Zhang
, Pengpeng Zeng
, Lianli Gao
, Jingkuan Song
, Heng Tao Shen
:
MPT: Multi-grained Prompt Tuning for Text-Video Retrieval. ACM Multimedia 2024: 1206-1214 - [c246]Yuhui Wu
, Guoqing Wang
, Zhiwen Wang
, Yang Yang
, Tianyu Li
, Malu Zhang
, Chongyi Li
, Heng Tao Shen
:
JoReS-Diff: Joint Retinex and Semantic Priors in Diffusion Model for Low-light Image Enhancement. ACM Multimedia 2024: 1810-1818 - [c245]Zhiwen Wang
, Yuhui Wu
, Zheng Wang
, Jiwei Wei
, Tianyu Li
, Guoqing Wang
, Yang Yang
, Hengtao Shen
:
Cascaded Adversarial Attack: Simultaneously Fooling Rain Removal and Semantic Segmentation Networks. ACM Multimedia 2024: 2136-2145 - [c244]Yi Bin
, Junrong Liao
, Yujuan Ding
, Haoxuan Li
, Yang Yang
, See-Kiong Ng
, Heng Tao Shen
:
Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning. ACM Multimedia 2024: 4630-4639 - [c243]Yunqiang Pei
, Jialei Tang
, Qihang Tang
, Mingfeng Zha
, Dongyu Xie
, Guoqing Wang
, Zhitao Liu
, Ning Xie
, Peng Wang
, Yang Yang
, Hengtao Shen
:
Emotion Recognition in HMDs: A Multi-task Approach Using Physiological Signals and Occluded Faces. ACM Multimedia 2024: 5977-5986 - [c242]Xun Jiang
, Zhuoyuan Wei
, Shenshen Li
, Xing Xu
, Jingkuan Song
, Heng Tao Shen
:
Counterfactually Augmented Event Matching for De-biased Temporal Sentence Grounding. ACM Multimedia 2024: 6472-6481 - [c241]Jin Sun
, Xiaoshuang Shi
, Zhiyuan Wang
, Kaidi Xu
, Heng Tao Shen
, Xiaofeng Zhu
:
Caterpillar: A Pure-MLP Architecture with Shifted-Pillars-Concatenation. ACM Multimedia 2024: 7123-7132 - [c240]Yi Bin
, Wenhao Shi
, Yujuan Ding
, Zhiqiang Hu
, Zheng Wang
, Yang Yang
, See-Kiong Ng
, Heng Tao Shen
:
GalleryGPT: Analyzing Paintings with Large Multimodal Models. ACM Multimedia 2024: 7734-7743 - [c239]Peng Yin
, Xiaosu Zhu
, Jingkuan Song
, Lianli Gao
, Heng Tao Shen
:
SI-BiViT: Binarizing Vision Transformers with Spatial Interaction. ACM Multimedia 2024: 8169-8178 - [c238]Zixian Gao
, Disen Hu
, Xun Jiang
, Huimin Lu
, Heng Tao Shen
, Xing Xu
:
Enhanced Experts with Uncertainty-Aware Routing for Multimodal Sentiment Analysis. ACM Multimedia 2024: 9650-9659 - [c237]Cheng Chen, Junchen Zhu, Xu Luo, Hengtao Shen, Jingkuan Song, Lianli Gao:
CoIN: A Benchmark of Continual Instruction Tuning for Multimodel Large Language Models. NeurIPS 2024 - [c236]Wei Dong, Yuan Sun, Yiting Yang, Xing Zhang, Zhijun Lin, Qingsen Yan, Haokui Zhang, Peng Wang, Yang Yang, Hengtao Shen:
Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation. NeurIPS 2024 - [c235]Xinyu Lyu, Beitao Chen, Lianli Gao, Hengtao Shen, Jingkuan Song:
Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization. NeurIPS 2024 - [c234]Kai Wang
, Jiayang Liu
, Xing Xu
, Jingkuan Song
, Xin Liu
, Heng Tao Shen
:
Unsupervised Cross-Domain Image Retrieval with Semantic-Attended Mixture-of-Experts. SIGIR 2024: 197-207 - [c233]Yunqiang Pei, Bowen Jiang, Kaiyue Zhang, Ziyang Lu, Mingfeng Zha, Guoqing Wang, Zhitao Liu, Ning Xie, Yang Yang, Hengtao Shen:
Toward Optimized AR-Based Human-Robot Interaction Ergonomics: Modeling and Predicting Interaction Comfort. VR Workshops 2024: 797-798 - [i121]Huafeng Liu, Mengmeng Sheng, Zeren Sun, Yazhou Yao, Xian-Sheng Hua, Heng Tao Shen:
Learning with Imbalanced Noisy Data by Preventing Bias in Sample Selection. CoRR abs/2402.11242 (2024) - [i120]Cheng Chen, Junchen Zhu, Xu Luo, Hengtao Shen, Lianli Gao, Jingkuan Song:
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model. CoRR abs/2403.08350 (2024) - [i119]Meixuan Li, Tianyu Li, Guoqing Wang, Peng Wang, Yang Yang, Heng Tao Shen:
Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning. CoRR abs/2403.10252 (2024) - [i118]Beitao Chen, Xinyu Lyu, Lianli Gao, Jingkuan Song, Heng Tao Shen:
Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization. CoRR abs/2405.15356 (2024) - [i117]Zhiyuan Wang, Jinhao Duan, Lu Cheng, Yue Zhang, Qingni Wang, Hengtao Shen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu:
ConU: Conformal Uncertainty in Large Language Models with Correctness Coverage Guarantees. CoRR abs/2407.00499 (2024) - [i116]Xiruo Jiang, Yazhou Yao, Xili Dai, Fumin Shen, Xian-Sheng Hua, Heng Tao Shen:
Anti-Collapse Loss for Deep Metric Learning Based on Coding Rate Metric. CoRR abs/2407.03106 (2024) - [i115]Renming Huang, Yunqiang Pei, Guoqing Wang, Yangming Zhang, Yang Yang, Peng Wang, Hengtao Shen:
Diffusion Models as Optimizers for Efficient Planning in Offline RL. CoRR abs/2407.16142 (2024) - [i114]Yi Bin, Junrong Liao, Yujuan Ding, Haoxuan Li, Yang Yang, See-Kiong Ng, Heng Tao Shen:
Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning. CoRR abs/2408.00305 (2024) - [i113]Yi Bin, Wenhao Shi, Yujuan Ding, Zhiqiang Hu, Zheng Wang, Yang Yang, See-Kiong Ng, Heng Tao Shen:
GalleryGPT: Analyzing Paintings with Large Multimodal Models. CoRR abs/2408.00491 (2024) - [i112]Yujia Wu, Yiming Shi, Jiwei Wei, Chengwei Sun, Yuyang Zhou, Yang Yang, Heng Tao Shen:
DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion. CoRR abs/2408.06740 (2024) - [i111]Yixuan Zhou, Xing Xu, Zhe Sun, Jingkuan Song, Andrzej Cichocki, Heng Tao Shen:
VQ-Flow: Taming Normalizing Flows for Multi-Class Anomaly Detection via Hierarchical Vector Quantization. CoRR abs/2409.00942 (2024) - [i110]Renming Huang, Shaochong Liu, Yunqiang Pei, Peng Wang, Guoqing Wang, Yang Yang, Hengtao Shen:
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance. CoRR abs/2409.03996 (2024) - [i109]Run Luo, Haonan Zhang, Longze Chen, Ting-En Lin, Xiong Liu, Yuchuan Wu, Min Yang, Minzheng Wang, Pengpeng Zeng, Lianli Gao, Heng Tao Shen, Yunshui Li, Xiaobo Xia, Fei Huang, Jingkuan Song, Yongbin Li:
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct. CoRR abs/2409.05840 (2024) - [i108]Xiaorui Sun, Jun Liu, Heng Tao Shen, Xiaofeng Zhu, Ping Hu:
On Efficient Variants of Segment Anything Model: A Survey. CoRR abs/2410.04960 (2024) - [i107]Xiao Cai, Pengpeng Zeng, Lianli Gao, Junchen Zhu, Jiaxin Zhang, Sitong Su, Heng Tao Shen, Jingkuan Song:
SeMv-3D: Towards Semantic and Mutil-view Consistency simultaneously for General Text-to-3D Generation with Triplane Priors. CoRR abs/2410.07658 (2024) - [i106]Wei Dong, Yuan Sun, Yiting Yang, Xing Zhang, Zhijun Lin, Qingsen Yan, Haokui Zhang, Peng Wang, Yang Yang, Hengtao Shen:
Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation. CoRR abs/2410.22952 (2024) - [i105]Sitong Su, Xiao Cai, Lianli Gao, Pengpeng Zeng, Qinhong Du, Mengqi Li, Heng Tao Shen, Jingkuan Song:
GT23D-Bench: A Comprehensive General Text-to-3D Generation Benchmark. CoRR abs/2412.09997 (2024) - [i104]Shihan Wu, Ji Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Heng Tao Shen:
Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves. CoRR abs/2412.11509 (2024) - 2023
- [j225]