


Jingqun Tang
2020 – today
- 2025
[c12]An-Lan Wang, Bin Shan, Wei Shi, Kun-Yu Lin, Xiang Fei, Guozhi Tang, Lei Liao, Jingqun Tang, Can Huang, Wei-Shi Zheng:
ParGo: Bridging Vision-Language with Partial and Global Views. AAAI 2025: 7491-7499
[c11]Wenhao Sun, Xue-Mei Dong, Benlei Cui, Jingqun Tang:
Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance. AAAI 2025: 20734-20742
[c10]Xiang Fei, Jinghui Lu, Qi Sun, Hao Feng, Yanjie Wang, Wei Shi, An-Lan Wang, Jingqun Tang, Can Huang:
Advancing Sequential Numerical Prediction in Autoregressive Models. ACL (2) 2025: 562-574
[c9]Jinghui Lu, Haiyang Yu, Yanjie Wang, Yongjie Ye, Jingqun Tang, Ziwei Yang, Binghong Wu, Qi Liu, Hao Feng, Han Wang, Hao Liu, Can Huang:
A Bounding Box is Worth One Token - Interleaving Layout and Text in a Large Language Model for Document Understanding. ACL (Findings) 2025: 7252-7273
[c8]Jingqun Tang, Qi Liu, Yongjie Ye, Jinghui Lu, Shu Wei, An-Lan Wang, Chunhui Lin, Hao Feng, Zhen Zhao, Yanjie Wang, Yuliang Liu, Hao Liu, Xiang Bai, Can Huang:
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. ACL (Findings) 2025: 7748-7763
[c7]Hao Feng, Shu Wei, Xiang Fei, Wei Shi, Yingdong Han, Lei Liao, Jinghui Lu, Binghong Wu, Qi Liu, Chunhui Lin, Jingqun Tang, Hao Liu, Can Huang:
Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting. ACL (Findings) 2025: 21919-21936
[i22]Ling Fu, Biao Yang, Zhebin Kuang, Jiajun Song, Yuzhe Li, Linghao Zhu, Qidi Luo, Xinyu Wang, Hao Lu, Mingxin Huang, Zhang Li, Guozhi Tang, Bin Shan, Chunhui Lin, Qi Liu, Binghong Wu, Hao Feng, Hao Liu, Can Huang, Jingqun Tang, Wei Chen, Lianwen Jin, Yuliang Liu, Xiang Bai:
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning. CoRR abs/2501.00321 (2025)
[i21]Han Wang, Yongjie Ye, Bingru Li, Yuxiang Nie, Jinghui Lu, Jingqun Tang, Yanjie Wang, Can Huang:
Vision as LoRA. CoRR abs/2503.20680 (2025)
[i20]Dong Guo, Faming Wu, Feida Zhu, Fuxing Leng, Guang Shi, Haobin Chen, Haoqi Fan, Jian Wang, Jianyu Jiang, Jiawei Wang, Jingji Chen, Jingjia Huang, Kang Lei, Liping Yuan, Lishu Luo, Pengfei Liu, Qinghao Ye, Rui Qian, Shen Yan, Shixiong Zhao, Shuai Peng, Shuangye Li, Sihang Yuan, Sijin Wu, Tianheng Cheng, Weiwei Liu, Wenqian Wang, Xianhan Zeng, Xiao Liu, Xiaobo Qin, Xiaohan Ding, Xiaojun Xiao, Xiaoying Zhang, Xuanwei Zhang, Xuehan Xiong, Yanghua Peng, Yangrui Chen, Yanwei Li, Yanxu Hu, Yi Lin, Yiyuan Hu, Yiyuan Zhang, Youbin Wu, Yu Li, Yudong Liu, Yue Ling, Yujia Qin, Zanbo Wang, Zhiwu He, Aoxue Zhang, Bairen Yi, Bencheng Liao, Can Huang, Can Zhang, Chaorui Deng, Chaoyi Deng, Cheng Lin, Cheng Yuan, Chenggang Li, Chenhui Gou, Chenwei Lou, Chengzhi Wei, Chundian Liu, Chunyuan Li, Deyao Zhu, Donghong Zhong, Feng Li, Feng Zhang, Gang Wu, Guodong Li, Guohong Xiao, Haibin Lin, Haihua Yang, Haoming Wang, Heng Ji, Hongxiang Hao, Hui Shen, Huixia Li, Jiahao Li, Jialong Wu, Jianhua Zhu, Jianpeng Jiao, Jiashi Feng, Jiaze Chen, Jianhui Duan, Jihao Liu, Jin Zeng, Jingqun Tang, Jingyu Sun, Joya Chen, Jun Long, Junda Feng, Junfeng Zhan, Junjie Fang, Junting Lu, Kai Hua, Kai Liu, Kai Shen, Kaiyuan Zhang, Ke Shen:
Seed1.5-VL Technical Report. CoRR abs/2505.07062 (2025)
[i19]An-Lan Wang, Jingqun Tang, Liao Lei, Hao Feng, Qi Liu, Xiang Fei, Jinghui Lu, Han Wang, Weiwei Liu, Hao Liu, Yuliang Liu, Xiang Bai, Can Huang:
WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild? CoRR abs/2505.11015 (2025)
[i18]Xiang Fei, Jinghui Lu, Qi Sun, Hao Feng, Yanjie Wang, Wei Shi, An-Lan Wang, Jingqun Tang, Can Huang:
Advancing Sequential Numerical Prediction in Autoregressive Models. CoRR abs/2505.13077 (2025)
[i17]Hao Feng, Shu Wei, Xiang Fei, Wei Shi, Yingdong Han, Lei Liao, Jinghui Lu, Binghong Wu, Qi Liu, Chunhui Lin, Jingqun Tang, Hao Liu, Can Huang:
Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting. CoRR abs/2505.14059 (2025)
[i16]Jinghui Lu, Haiyang Yu, Siliang Xu, Shiwei Ran, Guozhi Tang, Siqi Wang, Bin Shan, Teng Fu, Hao Feng, Jingqun Tang, Han Wang, Can Huang:
Prolonged Reasoning Is Not All You Need: Certainty-Based Adaptive Routing for Efficient LLM/MLLM Reasoning. CoRR abs/2505.15154 (2025)
[i15]Weitao Jia, Jinghui Lu, Haiyang Yu, Siqi Wang, Guozhi Tang, An-Lan Wang, Weijie Yin, Dingkang Yang, Yuxiang Nie, Bin Shan, Hao Feng, Irene Li, Kun Yang, Han Wang, Jingqun Tang, Teng Fu, Changhong Jin, Chao Feng, Xiaohui Lv, Can Huang:
MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement. CoRR abs/2508.09670 (2025)
[i14]Haiyang Yu, Yuchuan Wu, Fan Shi, Lei Liao, Jinghui Lu, Xiaodong Ge, Han Wang, Minghan Zhuo, Xuecheng Wu, Xiang Fei, Hao Feng, Guozhi Tang, An-Lan Wang, Hanshen Zhu, Yangfan He, Quanhuan Liang, Liyuan Meng, Chao Feng, Can Huang, Jingqun Tang, Bin Li:
Benchmarking Vision-Language Models on Chinese Ancient Documents: From OCR to Knowledge Reasoning. CoRR abs/2509.09731 (2025)
- 2024
[j2]Hao Feng, Qi Liu, Hao Liu, Jingqun Tang, Wengang Zhou, Houqiang Li, Can Huang:
DocPedia: unleashing the power of large multimodal model in the frequency domain for versatile document understanding. Sci. China Inf. Sci. 67(12) (2024)
[c6]Zhen Zhao, Jingqun Tang, Chunhui Lin, Binghong Wu, Can Huang, Hao Liu, Xin Tan, Zhizhong Zhang, Yuan Xie:
Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer. CVPR 2024: 15567-15576
[c5]Weichao Zhao, Hao Feng, Qi Liu, Jingqun Tang, Binghong Wu, Lei Liao, Shu Wei, Yongjie Ye, Hao Liu, Wengang Zhou, Houqiang Li, Can Huang:
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy. NeurIPS 2024
[c4]Zhen Zhao, Jingqun Tang, Binghong Wu, Chunhui Lin, Shu Wei, Hao Liu, Xin Tan, Zhizhong Zhang, Can Huang, Yuan Xie:
Harmonizing Visual Text Comprehension and Generation. NeurIPS 2024
[i13]Jingqun Tang, Chunhui Lin, Zhen Zhao, Shu Wei, Binghong Wu, Qi Liu, Hao Feng, Yang Li, Siqi Wang, Lei Liao, Wei Shi, Yuliang Liu, Hao Liu, Yuan Xie, Xiang Bai, Can Huang:
TextSquare: Scaling up Text-Centric Visual Instruction Tuning. CoRR abs/2404.12803 (2024)
[i12]Jingqun Tang, Qi Liu, Yongjie Ye, Jinghui Lu, Shu Wei, Chunhui Lin, Wanqing Li, Mohamad Fitri Faiz Bin Mahmood, Hao Feng, Zhen Zhao, Yanjie Wang, Yuliang Liu, Hao Liu, Xiang Bai, Can Huang:
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. CoRR abs/2405.11985 (2024)
[i11]Weichao Zhao, Hao Feng, Qi Liu, Jingqun Tang, Shu Wei, Binghong Wu, Lei Liao, Yongjie Ye, Hao Liu, Houqiang Li, Can Huang:
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy. CoRR abs/2406.01326 (2024)
[i10]Jinghui Lu, Haiyang Yu, Yanjie Wang, Yongjie Ye, Jingqun Tang, Ziwei Yang, Binghong Wu, Qi Liu, Hao Feng, Han Wang, Hao Liu, Can Huang:
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding. CoRR abs/2407.01976 (2024)
[i9]Zhen Zhao, Jingqun Tang, Binghong Wu, Chunhui Lin, Shu Wei, Hao Liu, Xin Tan, Zhizhong Zhang, Can Huang, Yuan Xie:
Harmonizing Visual Text Comprehension and Generation. CoRR abs/2407.16364 (2024)
[i8]An-Lan Wang, Bin Shan, Wei Shi, Kun-Yu Lin, Xiang Fei, Guozhi Tang, Lei Liao, Jingqun Tang, Can Huang, Wei-Shi Zheng:
ParGo: Bridging Vision-Language with Partial and Global Views. CoRR abs/2408.12928 (2024)
[i7]Bin Shan, Xiang Fei, Wei Shi, An-Lan Wang, Guozhi Tang, Lei Liao, Jingqun Tang, Xiang Bai, Can Huang:
MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark. CoRR abs/2410.11538 (2024)
[i6]Wenhao Sun, Benlei Cui, Xue-Mei Dong, Jingqun Tang:
Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance. CoRR abs/2412.12974 (2024)
- 2023
[j1]Yuliang Liu, Jiaxin Zhang, Dezhi Peng, Mingxin Huang, Xinyu Wang, Jingqun Tang, Can Huang, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin:
SPTS v2: Single-Point Scene Text Spotting. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15665-15679 (2023)
[i5]Yuliang Liu, Jiaxin Zhang, Dezhi Peng, Mingxin Huang, Xinyu Wang, Jingqun Tang, Can Huang, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin:
SPTS v2: Single-Point Scene Text Spotting. CoRR abs/2301.01635 (2023)
[i4]Hao Feng, Zijian Wang, Jingqun Tang, Jinghui Lu, Wengang Zhou, Houqiang Li, Can Huang:
UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding. CoRR abs/2308.11592 (2023)
[i3]Zhen Zhao, Jingqun Tang, Chunhui Lin, Binghong Wu, Hao Liu, Zhizhong Zhang, Xin Tan, Can Huang, Yuan Xie:
Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer. CoRR abs/2311.13120 (2023)
- 2022
[c3]Jingqun Tang, Wenqing Zhang, Hongye Liu, Mingkun Yang, Bo Jiang, Guanglong Hu, Xiang Bai:
Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection. CVPR 2022: 4553-4562
[c2]Jingqun Tang, Wenming Qian, Luchuan Song, Xiena Dong, Lan Li, Xiang Bai:
Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning. ECCV (28) 2022: 233-248
[c1]Jingqun Tang, Su Qiao, Benlei Cui, Yuhang Ma, Sheng Zhang, Dimitrios Kanoulas:
You Can even Annotate Text with Voice: Transcription-only-Supervised Text Spotting. ACM Multimedia 2022: 4154-4163
[i2]Jingqun Tang, Wenqing Zhang, Hongye Liu, Mingkun Yang, Bo Jiang, Guanglong Hu, Xiang Bai:
Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection. CoRR abs/2203.15221 (2022)
[i1]Jingqun Tang, Wenming Qian, Luchuan Song, Xiena Dong, Lan Li, Xiang Bai:
Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning. CoRR abs/2207.11934 (2022)
last updated on 2025-11-04 22:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license







