


default search action
Hao Feng 0009
Person information
- affiliation: University of Science and Technology of China, Department of Electronic Engineering and Information Science, CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, Hefei, China
Other persons with the same name
- Hao Feng — disambiguation page
- Hao Feng 0001 — Microsoft, Redmond, WA, USA (and 1 more)
- Hao Feng 0002
— Intel Labs, Hillsboro, OR, USA (and 1 more)
- Hao Feng 0003
— Tianjin University, State Key Laboratory of Precision Measurement Technology and Instrument, China
- Hao Feng 0004
— Huazhong University of Science and Technology, School of Electrical and Electronic Engineering, State Key Laboratory of Advanced Electromagnetic Engineering and Technology, Wuhan, China
- Hao Feng 0005
— Case Western Reserve University School of Medicine, Department of Population and Quantitative Health Sciences, Cleveland, OH, USA (and 1 more)
- Hao Feng 0006 — Google, San Francisco, USA (and 1 more)
- Hao Feng 0007
— Tsinghua University, School of Software, Beijing, China
- Hao Feng 0008
— Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun, China
- Hao Feng 0010
— Hainan University, School of Computer Science and Technology, China (and 1 more)
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j7]Hao Feng
, Wendi Wang
, Shaokai Liu
, Jiajun Deng
, Wengang Zhou
, Houqiang Li
:
DeepEraser: Deep Iterative Context Mining for Generic Text Eraser. IEEE Trans. Multim. 27: 1914-1925 (2025) - [i24]Ling Fu, Biao Yang, Zhebin Kuang, Jiajun Song, Yuzhe Li, Linghao Zhu, Qidi Luo, Xinyu Wang, Hao Lu, Mingxin Huang, Zhang Li, Guozhi Tang, Bin Shan, Chunhui Lin, Qi Liu, Binghong Wu, Hao Feng, Hao Liu, Can Huang, Jingqun Tang, Wei Chen, Lianwen Jin, Yuliang Liu
, Xiang Bai:
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning. CoRR abs/2501.00321 (2025) - [i23]Bozhi Luan, Wengang Zhou, Hao Feng, Zhe Wang, Xiaosong Li, Houqiang Li:
Multi-Cue Adaptive Visual Token Pruning for Large Vision-Language Models. CoRR abs/2503.08019 (2025) - 2024
- [j6]Hao Feng, Qi Liu
, Hao Liu, Jingqun Tang, Wengang Zhou, Houqiang Li, Can Huang:
DocPedia: unleashing the power of large multimodal model in the frequency domain for versatile document understanding. Sci. China Inf. Sci. 67(12) (2024) - [j5]Yonghui Wang, Wengang Zhou, Hao Feng, Li Li, Houqiang Li:
Progressive Recurrent Network for shadow removal. Comput. Vis. Image Underst. 238: 103861 (2024) - [j4]Shaokai Liu
, Hao Feng, Wengang Zhou
:
Rethinking Supervision in Document Unwarping: A Self-Consistent Flow-Free Approach. IEEE Trans. Circuits Syst. Video Technol. 34(6): 4817-4828 (2024) - [j3]Hao Feng, Keyi Zhou, Wengang Zhou
, Yufei Yin
, Jiajun Deng, Qi Sun, Houqiang Li
:
Recurrent Generic Contour-Based Instance Segmentation With Progressive Learning. IEEE Trans. Circuits Syst. Video Technol. 34(9): 7947-7961 (2024) - [j2]Hao Feng
, Shaokai Liu
, Jiajun Deng
, Wengang Zhou
, Houqiang Li
:
Deep Unrestricted Document Image Rectification. IEEE Trans. Multim. 26: 6142-6154 (2024) - [c8]Xiaoyu Qiu
, Hao Feng
, Yuechen Wang
, Wengang Zhou
, Houqiang Li
:
Progressive Multi-modal Conditional Prompt Tuning. ICMR 2024: 46-54 - [c7]Weichao Zhao, Hao Feng, Qi Liu, Jingqun Tang, Binghong Wu, Lei Liao, Shu Wei, Yongjie Ye, Hao Liu, Wengang Zhou, Houqiang Li, Can Huang:
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy. NeurIPS 2024 - [i22]Hao Feng, Wendi Wang, Shaokai Liu, Jiajun Deng, Wengang Zhou, Houqiang Li:
DeepEraser: Deep Iterative Context Mining for Generic Text Eraser. CoRR abs/2402.19108 (2024) - [i21]Bozhi Luan, Hao Feng, Hong Chen, Yonghui Wang, Wengang Zhou, Houqiang Li:
TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding. CoRR abs/2404.09797 (2024) - [i20]Xiaoyu Qiu, Hao Feng, Yuechen Wang, Wengang Zhou, Houqiang Li:
Progressive Multi-modal Conditional Prompt Tuning. CoRR abs/2404.11864 (2024) - [i19]Jingqun Tang, Chunhui Lin, Zhen Zhao, Shu Wei, Binghong Wu, Qi Liu, Hao Feng, Yang Li, Siqi Wang, Lei Liao, Wei Shi, Yuliang Liu
, Hao Liu, Yuan Xie, Xiang Bai, Can Huang:
TextSquare: Scaling up Text-Centric Visual Instruction Tuning. CoRR abs/2404.12803 (2024) - [i18]Jingqun Tang, Qi Liu, Yongjie Ye, Jinghui Lu, Shu Wei, Chunhui Lin, Wanqing Li, Mohamad Fitri Faiz Bin Mahmood, Hao Feng, Zhen Zhao, Yanjie Wang, Yuliang Liu
, Hao Liu, Xiang Bai, Can Huang:
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. CoRR abs/2405.11985 (2024) - [i17]Weichao Zhao, Hao Feng, Qi Liu, Jingqun Tang, Shu Wei, Binghong Wu, Lei Liao, Yongjie Ye, Hao Liu, Houqiang Li, Can Huang:
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy. CoRR abs/2406.01326 (2024) - [i16]Zhaokang Liao, Hao Feng, Shaokai Liu, Wengang Zhou, Houqiang Li:
RoFIR: Robust Fisheye Image Rectification Framework Impervious to Optical Center Deviation. CoRR abs/2406.18927 (2024) - [i15]Jinghui Lu, Haiyang Yu, Yanjie Wang, Yongjie Ye, Jingqun Tang, Ziwei Yang, Binghong Wu, Qi Liu, Hao Feng, Han Wang, Hao Liu, Can Huang:
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding. CoRR abs/2407.01976 (2024) - [i14]Keyi Zhou, Li Li, Wengang Zhou, Yonghui Wang, Hao Feng, Houqiang Li:
LaneTCA: Enhancing Video Lane Detection with Temporal Context Aggregation. CoRR abs/2408.13852 (2024) - [i13]Yonghui Wang, Wengang Zhou, Hao Feng, Houqiang Li:
AdaptVision: Dynamic Input Scaling in MLLMs for Versatile Scene Understanding. CoRR abs/2408.16986 (2024) - 2023
- [j1]Wendi Wang
, Hao Feng
, Wengang Zhou
, Zhaokang Liao
, Houqiang Li
:
Model-Aware Pre-Training for Radial Distortion Rectification. IEEE Trans. Image Process. 32: 5764-5778 (2023) - [c6]Hao Feng, Wendi Wang, Jiajun Deng, Wengang Zhou, Li Li, Houqiang Li:
SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning. ICCV 2023: 12384-12393 - [c5]Huijie Yao, Wengang Zhou, Hao Feng, Hezhen Hu, Hao Zhou, Houqiang Li:
Sign Language Translation with Iterative Prototype. ICCV 2023: 15546-15555 - [c4]Shaokai Liu, Hao Feng, Wengang Zhou, Houqiang Li, Cong Liu, Feng Wu:
DocMAE: Document Image Rectification via Self-supervised Representation Learning. ICME 2023: 1613-1618 - [i12]Hao Feng, Wengang Zhou, Yufei Yin, Jiajun Deng, Qi Sun, Houqiang Li:
Recurrent Contour-based Instance Segmentation with Progressive Learning. CoRR abs/2301.08898 (2023) - [i11]Hao Feng
, Shaokai Liu, Jiajun Deng, Wengang Zhou, Houqiang Li:
Deep Unrestricted Document Image Rectification. CoRR abs/2304.08796 (2023) - [i10]Shaokai Liu, Hao Feng
, Wengang Zhou, Houqiang Li, Cong Liu, Feng Wu:
DocMAE: Document Image Rectification via Self-supervised Representation Learning. CoRR abs/2304.10341 (2023) - [i9]Hao Feng
, Wendi Wang, Jiajun Deng, Wengang Zhou, Li Li, Houqiang Li:
SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning. CoRR abs/2308.09040 (2023) - [i8]Hao Feng, Zijian Wang, Jingqun Tang, Jinghui Lu, Wengang Zhou, Houqiang Li, Can Huang:
UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding. CoRR abs/2308.11592 (2023) - [i7]Huijie Yao, Wengang Zhou, Hao Feng, Hezhen Hu, Hao Zhou, Houqiang Li:
Sign Language Translation with Iterative Prototype. CoRR abs/2308.12191 (2023) - [i6]Yonghui Wang, Wengang Zhou, Hao Feng, Li Li, Houqiang Li:
Progressive Recurrent Network for Shadow Removal. CoRR abs/2311.00455 (2023) - [i5]Hao Feng, Qi Liu, Hao Liu, Wengang Zhou, Houqiang Li, Can Huang:
DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding. CoRR abs/2311.11810 (2023) - [i4]Yonghui Wang, Wengang Zhou, Hao Feng, Keyi Zhou, Houqiang Li:
Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs. CoRR abs/2311.13194 (2023) - 2022
- [c3]Hao Feng
, Wengang Zhou, Jiajun Deng, Yuechen Wang, Houqiang Li:
Geometric Representation Learning for Document Image Rectification. ECCV (37) 2022: 475-492 - [c2]Sanjing Shen, Hao Feng
, Wengang Zhou, Houqiang Li:
PolyTracker: Progressive Contour Regression for Multiple Object Tracking and Segmentation. PRCV (4) 2022: 633-645 - [i3]Hao Feng, Wengang Zhou, Jiajun Deng, Yuechen Wang, Houqiang Li:
Geometric Representation Learning for Document Image Rectification. CoRR abs/2210.08161 (2022) - 2021
- [c1]Hao Feng
, Yuechen Wang, Wengang Zhou, Jiajun Deng, Houqiang Li:
DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction. ACM Multimedia 2021: 273-281 - [i2]Hao Feng, Yuechen Wang, Wengang Zhou, Jiajun Deng, Houqiang Li:
DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction. CoRR abs/2110.12942 (2021) - [i1]Hao Feng, Wengang Zhou, Jiajun Deng, Qi Tian, Houqiang Li:
DocScanner: Robust Document Image Rectification with Progressive Learning. CoRR abs/2110.14968 (2021)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-05-19 21:07 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint