Peng Jin 0001
Person information
- affiliation: Peking University, School of Electronic and Computer Engineering, Shenzhen, China
Other persons with the same name
- Peng Jin — disambiguation page
2020 – today
- 2025
  - [j2] Peng Jin, Hao Li, Li Yuan, Shuicheng Yan, Jie Chen: Hierarchical Banzhaf Interaction for General Video-Language Representation Learning. IEEE Trans. Pattern Anal. Mach. Intell. 47(3): 2125-2139 (2025)
  - [c17] Zesen Cheng, Kehan Li, Hao Li, Peng Jin, Xiawu Zheng, Chang Liu, Jie Chen: Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation. AAAI 2025: 2482-2490
  - [c16] Peng Jin, Bo Zhu, Li Yuan, Shuicheng Yan: MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts. ICLR 2025
  - [i26] Hongyu Zhang, Yufan Deng, Shenghai Yuan, Peng Jin, Zesen Cheng, Yian Zhao, Chang Liu, Jie Chen: MagicComp: Training-free Dual-Phase Refinement for Compositional Video Generation. CoRR abs/2503.14428 (2025)
- 2024
  - [c15] Zesen Cheng, Kehan Li, Peng Jin, Siheng Li, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen: Parallel Vertex Diffusion for Unified Visual Grounding. AAAI 2024: 1326-1334
  - [c14] Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li: RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter. ACL (Findings) 2024: 7160-7174
  - [c13] Peng Jin, Ryuichi Takanobu, Wancai Zhang, Xiaochun Cao, Li Yuan: Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding. CVPR 2024: 13700-13710
  - [c12] Hao Li, Yanhao Jia, Peng Jin, Zesen Cheng, Kehan Li, Jialu Sui, Chang Liu, Li Yuan: FreestyleRet: Retrieving Images from Style-Diversified Queries. ECCV (23) 2024: 258-274
  - [c11] Junwu Zhang, Zhenyu Tang, Yatian Pang, Xinhua Cheng, Peng Jin, Yida Wei, Xing Zhou, Munan Ning, Li Yuan: Repaint123: Fast and High-Quality One Image to 3D Generation with Progressive Controllable Repainting. ECCV (25) 2024: 303-320
  - [c10] Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Runyi Yu, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen: Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation. ECCV (25) 2024: 392-409
  - [c9] Zhongwei Wan, Ziang Wu, Che Liu, Jinfa Huang, Zhihong Zhu, Peng Jin, Longyue Wang, Li Yuan: LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference. EMNLP (Findings) 2024: 4065-4078
  - [i25] Zesen Cheng, Kehan Li, Hao Li, Peng Jin, Chang Liu, Xiawu Zheng, Rongrong Ji, Jie Chen: Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation. CoRR abs/2401.09732 (2024)
  - [i24] Bin Lin, Zhenyu Tang, Yang Ye, Jiaxi Cui, Bin Zhu, Peng Jin, Junwu Zhang, Munan Ning, Li Yuan: MoE-LLaVA: Mixture of Experts for Large Vision-Language Models. CoRR abs/2401.15947 (2024)
  - [i23] Bin Zhu, Peng Jin, Munan Ning, Bin Lin, Jinfa Huang, Qi Song, Jiaxi Cui, Junwu Zhang, Zhenyu Tang, Mingjun Pan, Xing Zhou, Li Yuan: LLMBind: A Unified Modality-Task Integration Framework. CoRR abs/2402.14891 (2024)
  - [i22] Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li: RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter. CoRR abs/2405.19465 (2024)
  - [i21] Zhongwei Wan, Ziang Wu, Che Liu, Jinfa Huang, Zhihong Zhu, Peng Jin, Longyue Wang, Li Yuan: LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference. CoRR abs/2406.18139 (2024)
  - [i20] Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Runyi Yu, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen: Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation. CoRR abs/2407.10528 (2024)
  - [i19] Haoran Tang, Meng Cao, Jinfa Huang, Ruyang Liu, Peng Jin, Ge Li, Xiaodan Liang: MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval. CoRR abs/2408.10575 (2024)
  - [i18] Peng Jin, Bo Zhu, Li Yuan, Shuicheng Yan: MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts. CoRR abs/2410.07348 (2024)
  - [i17] Peng Jin, Bo Zhu, Li Yuan, Shuicheng Yan: MoH: Multi-Head Attention as Mixture-of-Head Attention. CoRR abs/2410.11842 (2024)
  - [i16] Yatian Pang, Peng Jin, Shuo Yang, Bin Lin, Bin Zhu, Zhenyu Tang, Liuhan Chen, Francis E. H. Tay, Ser-Nam Lim, Harry Yang, Li Yuan: Next Patch Prediction for Autoregressive Visual Generation. CoRR abs/2412.15321 (2024)
  - [i15] Peng Jin, Hao Li, Li Yuan, Shuicheng Yan, Jie Chen: Hierarchical Banzhaf Interaction for General Video-Language Representation Learning. CoRR abs/2412.20964 (2024)
- 2023
  - [j1] Hao Li, Jinfa Huang, Peng Jin, Guoli Song, Qi Wu, Jie Chen: Weakly-Supervised 3D Spatial Reasoning for Text-Based Visual Question Answering. IEEE Trans. Image Process. 32: 3367-3382 (2023)
  - [c8] Peng Jin, Jinfa Huang, Pengfei Xiong, Shangxuan Tian, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen: Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning. CVPR 2023: 2472-2482
  - [c7] Kehan Li, Yian Zhao, Zhennan Wang, Zesen Cheng, Peng Jin, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen: Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation. ICCV 2023: 666-676
  - [c6] Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Xiangyang Ji, Chang Liu, Li Yuan, Jie Chen: DiffusionRet: Generative Text-Video Retrieval with Diffusion Model. ICCV 2023: 2470-2481
  - [c5] Zesen Cheng, Peng Jin, Hao Li, Kehan Li, Siheng Li, Xiangyang Ji, Chang Liu, Jie Chen: WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image Segmentation. IJCAI 2023: 636-644
  - [c4] Peng Jin, Hao Li, Zesen Cheng, Jinfa Huang, Zhennan Wang, Li Yuan, Chang Liu, Jie Chen: Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment. IJCAI 2023: 938-946
  - [c3] Hao Li, Peng Jin, Zesen Cheng, Songyang Zhang, Kai Chen, Zhennan Wang, Chang Liu, Jie Chen: TG-VQA: Ternary Game of Video Question Answering. IJCAI 2023: 1044-1052
  - [c2] Peng Jin, Yang Wu, Yanbo Fan, Zhongqian Sun, Wei Yang, Li Yuan: Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs. NeurIPS 2023
  - [i14] Zesen Cheng, Kehan Li, Peng Jin, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen: Parallel Vertex Diffusion for Unified Visual Grounding. CoRR abs/2303.07216 (2023)
  - [i13] Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Xiangyang Ji, Chang Liu, Li Yuan, Jie Chen: DiffusionRet: Generative Text-Video Retrieval with Diffusion Model. CoRR abs/2303.09867 (2023)
  - [i12] Kehan Li, Yian Zhao, Zhennan Wang, Zesen Cheng, Peng Jin, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen: Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation. CoRR abs/2303.13399 (2023)
  - [i11] Peng Jin, Jinfa Huang, Pengfei Xiong, Shangxuan Tian, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen: Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning. CoRR abs/2303.14369 (2023)
  - [i10] Hao Li, Peng Jin, Zesen Cheng, Songyang Zhang, Kai Chen, Zhennan Wang, Chang Liu, Jie Chen: TG-VQA: Ternary Game of Video Question Answering. CoRR abs/2305.10049 (2023)
  - [i9] Peng Jin, Hao Li, Zesen Cheng, Jinfa Huang, Zhennan Wang, Li Yuan, Chang Liu, Jie Chen: Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment. CoRR abs/2305.12218 (2023)
  - [i8] Zesen Cheng, Peng Jin, Hao Li, Kehan Li, Siheng Li, Xiangyang Ji, Chang Liu, Jie Chen: WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image Segmentation. CoRR abs/2306.10750 (2023)
  - [i7] Peng Jin, Yang Wu, Yanbo Fan, Zhongqian Sun, Yang Wei, Li Yuan: Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs. CoRR abs/2311.01015 (2023)
  - [i6] Peng Jin, Ryuichi Takanobu, Caiwan Zhang, Xiaochun Cao, Li Yuan: Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding. CoRR abs/2311.08046 (2023)
  - [i5] Bin Lin, Yang Ye, Bin Zhu, Jiaxi Cui, Munan Ning, Peng Jin, Li Yuan: Video-LLaVA: Learning United Visual Representation by Alignment Before Projection. CoRR abs/2311.10122 (2023)
  - [i4] Hao Li, Curise Jia, Peng Jin, Zesen Cheng, Kehan Li, Jialu Sui, Chang Liu, Li Yuan: FreestyleRet: Retrieving Images from Style-Diversified Queries. CoRR abs/2312.02428 (2023)
  - [i3] Junwu Zhang, Zhenyu Tang, Yatian Pang, Xinhua Cheng, Peng Jin, Yida Wei, Munan Ning, Li Yuan: Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting. CoRR abs/2312.13271 (2023)
- 2022
  - [c1] Peng Jin, Jinfa Huang, Fenglin Liu, Xian Wu, Shen Ge, Guoli Song, David A. Clifton, Jie Chen: Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations. NeurIPS 2022
  - [i2] Hao Li, Jinfa Huang, Peng Jin, Guoli Song, Qi Wu, Jie Chen: Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering. CoRR abs/2209.10326 (2022)
  - [i1] Peng Jin, Jinfa Huang, Fenglin Liu, Xian Wu, Shen Ge, Guoli Song, David A. Clifton, Jie Chen: Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations. CoRR abs/2211.11427 (2022)
last updated on 2025-05-27 22:03 CEST by the dblp team
all metadata released as open data under CC0 1.0 license