default search action

combined dblp search
author search
venue search
publication search

ask others

Haonan Zhang 0003

> Home > Persons

Person information

affiliation: University of Electronic Science and Technology of China (UESTC), Future Media Center, School of Computer Science and Engineering, Chengdu, China
affiliation: Sichuan Artificial Intelligence Research Institute, Yibin, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Journal Articles

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/tcsv/QinZGZZS25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcsv/QinZGZZS25
Yixin Qin, Lei Zhao, Lianli Gao, Haonan Zhang, Pengpeng Zeng, Heng Tao Shen:
Temporal-Guided Mixture-of-Experts for Zero-Shot Video Question Answering. IEEE Trans. Circuits Syst. Video Technol. 35(9): 9003-9016 (2025)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/tip/ZhangZGSDLS25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tip/ZhangZGSDLS25
Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Yihang Duan, Xinyu Lyu, Heng Tao Shen:
Text-Video Retrieval With Global-LocalSemantic Consistent Learning. IEEE Trans. Image Process. 34: 3463-3474 (2025)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/ZengZGLQS25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/ZengZGLQS25
Pengpeng Zeng, Haonan Zhang, Lianli Gao, Xiangpeng Li, Jin Qian, Heng Tao Shen:
Visual Commonsense-Aware Representation Network for Video Captioning. IEEE Trans. Neural Networks Learn. Syst. 36(1): 1092-1103 (2025)
2024
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/tcsv/ZhangZGLSS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcsv/ZhangZGLSS24
Haonan Zhang, Pengpeng Zeng, Lianli Gao, Xinyu Lyu, Jingkuan Song, Heng Tao Shen:
SPT: Spatial Pyramid Transformer for Image Captioning. IEEE Trans. Circuits Syst. Video Technol. 34(6): 4829-4842 (2024)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/tcsv/ZhangZGSS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcsv/ZhangZGSS24
Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Heng Tao Shen:
Ump: Unified Modality-Aware Prompt Tuning for Text-Video Retrieval. IEEE Trans. Circuits Syst. Video Technol. 34(11): 11954-11964 (2024)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/JingZZGSS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/JingZZGSS24
Shuaiqi Jing, Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Heng Tao Shen:
Memory-Based Augmentation Network for Video Captioning. IEEE Trans. Multim. 26: 2367-2379 (2024)
2023
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/pr/ZhangZHQSG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pr/ZhangZHQSG23
Haonan Zhang, Pengpeng Zeng, Yuxuan Hu, Jin Qian, Jingkuan Song, Lianli Gao:
Learning visual question answering on controlled semantic noisy labels. Pattern Recognit. 138: 109339 (2023)
2022
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tip/ZengZGSS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tip/ZengZGSS22
Pengpeng Zeng, Haonan Zhang, Lianli Gao, Jingkuan Song, Heng Tao Shen:
Video Question Answering With Prior Knowledge and Object-Sensitive Learning. IEEE Trans. Image Process. 31: 5936-5948 (2022)

Conference and Workshop Papers

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c7]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/LuoZCLLWYLWZGSL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LuoZCLLWYLWZGSL25
Run Luo, Haonan Zhang, Longze Chen, Ting-En Lin, Xiong Liu, Yuchuan Wu, Min Yang, Yongbin Li, Minzheng Wang, Pengpeng Zeng, Lianli Gao, Heng Tao Shen, Yunshui Li, Hamid Alinejad-Rokny, Xiaobo Xia, Jingkuan Song, Fei Huang:
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct. ACL (Findings) 2025: 19655-19682
[c6]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/ZhangLLWLZQFYGS25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZhangLLWLZQFYGS25
Haonan Zhang, Run Luo, Xiong Liu, Yuchuan Wu, Ting-En Lin, Pengpeng Zeng, Qiang Qu, Feiteng Fang, Min Yang, Lianli Gao, Jingkuan Song, Fei Huang, Yongbin Li:
OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction. ACL (1) 2025: 26318-26331
2024
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/NiLLZZS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/NiLLZZS24
Hao Ni, Ping Lai, Yuke Li, Pengpeng Zeng, Haonan Zhang, Jingkuan Song:
Pedestrian Attributes Recognition for UAV-Human. ICME Workshops 2024: 1-5
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZGSS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZGSS24
Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Heng Tao Shen:
MPT: Multi-grained Prompt Tuning for Text-Video Retrieval. ACM Multimedia 2024: 1206-1214
2023
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangGZHS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangGZHS23
Haonan Zhang, Lianli Gao, Pengpeng Zeng, Alan Hanjalic, Heng Tao Shen:
Depth-Aware Sparse Transformer for Video-Language Learning. ACM Multimedia 2023: 4778-4787
2022
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/ZengZSG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/ZengZSG22
Pengpeng Zeng, Haonan Zhang, Jingkuan Song, Lianli Gao:
S2 Transformer for Image Captioning. IJCAI 2022: 1608-1614
[c1]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiSGZZL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiSGZZL22
Hao Li, Jingkuan Song, Lianli Gao, Pengpeng Zeng, Haonan Zhang, Gongfu Li:
A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval. NeurIPS 2022

Informal and Other Publications

see FAQ

What is the meaning of the colors in the publication lists?

2025
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-04561
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-04561
Run Luo, Ting-En Lin, Haonan Zhang, Yuchuan Wu, Xiong Liu, Min Yang, Yongbin Li, Longze Chen, Jiaming Li, Lei Zhang, Yangyi Chen, Hamid Alinejad-Rokny, Fei Huang:
OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis. CoRR abs/2501.04561 (2025)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-20277
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-20277
Haonan Zhang, Run Luo, Xiong Liu, Yuchuan Wu, Ting-En Lin, Pengpeng Zeng, Qiang Qu, Feiteng Fang, Min Yang, Lianli Gao, Jingkuan Song, Fei Huang, Yongbin Li:
OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction. CoRR abs/2505.20277 (2025)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-23923
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-23923
Feiteng Fang, Ting-En Lin, Yuchuan Wu, Xiong Liu, Xiang Huang, Dingwei Chen, Jing Ye, Haonan Zhang, Liang Zhu, Hamid Alinejad-Rokny, Min Yang, Fei Huang, Yongbin Li:
ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents. CoRR abs/2505.23923 (2025)
2024
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-05840
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-05840
Run Luo, Haonan Zhang, Longze Chen, Ting-En Lin, Xiong Liu, Yuchuan Wu, Min Yang, Minzheng Wang, Pengpeng Zeng, Lianli Gao, Heng Tao Shen, Yunshui Li, Xiaobo Xia, Fei Huang, Jingkuan Song, Yongbin Li:
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct. CoRR abs/2409.05840 (2024)
2022
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-09469
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-09469
Pengpeng Zeng, Haonan Zhang, Lianli Gao, Xiangpeng Li, Jin Qian, Heng Tao Shen:
Visual Commonsense-aware Representation Network for Video Captioning. CoRR abs/2211.09469 (2022)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.