


default search action
Yihao Ding
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
[j4]Zechuan Li, Hongshan Yu, Yihao Ding, Yan Li, Yong He, Naveed Akhtar
:
Embodied intelligence for 3D understanding: A survey on 3D Scene question answering. Inf. Fusion 126: 103624 (2026)- 2025
[j3]Fan Ye, Xuan Hu
, Yihao Ding, Feifei Liu:
Pseudo-labeling and knowledge-guided contrastive learning for radiology report generation. J. Biomed. Informatics 172: 104941 (2025)
[c14]Yanbei Jiang, Yihao Ding, Chao Lei, Jiayang Ao, Jey Han Lau, Krista A. Ehinger:
Beyond Perception: Evaluating Abstract Visual Reasoning through Multi-Stage Task. ACL (Findings) 2025: 13-45
[c13]Zihan Xu, Haotian Ma, Yihao Ding, Gongbo Zhang, Chunhua Weng, Yifan Peng:
Natural Language Processing in Support of Evidence-based Medicine: A Scoping Review. ACL (Findings) 2025: 21421-21443
[c12]Lorenzo Vaiani
, Yihao Ding
, Luca Cagliero
, Jean Lee
, Paolo Garza
, Josiah Poon
, Soyeon Caren Han
:
KIEPrompter: Leveraging Lightweight Models' Predictions for Cost-Effective Key Information Extraction using Vision LLMs. CIKM 2025: 2925-2934
[c11]Zechuan Li, Hongshan Yu, Yihao Ding, Jinhao Qiao, Basim Azam
, Naveed Akhtar:
GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector. CVPR 2025: 27211-27221
[c10]Yihao Ding, Soyeon Caren Han, Yan Li, Josiah Poon:
VRD-IU: Lessons from Visually Rich Document Intelligence and Understanding. IJCAI 2025: 11039-11043
[i18]Zechuan Li, Hongshan Yu, Yihao Ding, Yan Li, Yong He, Naveed Akhtar:
Embodied Intelligence for 3D Understanding: A Survey on 3D Scene Question Answering. CoRR abs/2502.00342 (2025)
[i17]Zechuan Li, Hongshan Yu, Yihao Ding, Jinhao Qiao, Basim Azam
, Naveed Akhtar:
GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector. CoRR abs/2503.15211 (2025)
[i16]Yanbei Jiang, Yihao Ding, Chao Lei, Jiayang Ao, Jey Han Lau, Krista A. Ehinger:
Beyond Perception: Evaluating Abstract Visual Reasoning through Multi-Stage Task. CoRR abs/2505.21850 (2025)
[i15]Zihan Xu
, Haotian Ma, Gongbo Zhang, Yihao Ding, Chunhua Weng, Yifan Peng:
Natural Language Processing in Support of Evidence-based Medicine: A Scoping Review. CoRR abs/2505.22280 (2025)
[i14]Yihao Ding, Soyeon Caren Han, Yan Li, Josiah Poon:
VRD-IU: Lessons from Visually Rich Document Intelligence and Understanding. CoRR abs/2506.01388 (2025)
[i13]Yihao Ding, Siwen Luo, Yue Dai, Yanbei Jiang, Zechuan Li, Geoffrey Martin, Yifan Peng:
A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends. CoRR abs/2507.09861 (2025)
[i12]Jiwon Park, Seohyun Pyeon, Jinwoo Kim, Rina Carines Cabral, Yihao Ding, Soyeon Caren Han:
DocHop-QA: Towards Multi-Hop Reasoning over Multimodal Document Collections. CoRR abs/2508.15851 (2025)
[i11]Yihao Ding, Soyeon Caren Han, Yanbei Jiang, Yan Li, Zechuan Li, Yifan Peng:
SynDoc: A Hybrid Discriminative-Generative Framework for Enhancing Synthetic Domain-Adaptive Document Key Information Extraction. CoRR abs/2509.23273 (2025)- 2024
[j2]Kunze Wang, Yihao Ding
, Soyeon Caren Han:
Graph neural networks for text classification: a survey. Artif. Intell. Rev. 57(8): 190 (2024)
[c9]Tianyi Chen
, Feiqi Cao, Yihao Ding
, Soyeon Caren Han:
The Language Model Can Have the Personality: Joint Learning for Personality Enhanced Language Model (Student Abstract). AAAI 2024: 23454-23455
[c8]Yihao Ding
, Lorenzo Vaiani, Soyeon Caren Han, Jean Lee
, Paolo Garza, Josiah Poon, Luca Cagliero:
3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding. ACL (Findings) 2024: 15233-15244
[c7]Yihao Ding, Kaixuan Ren, Jiabin Huang, Siwen Luo, Soyeon Caren Han:
MMVQA: A Comprehensive Dataset for Investigating Multipage Multimodal Information Retrieval in PDF-based Visual Question Answering. IJCAI 2024: 6243-6251
[i10]Yihao Ding, Lorenzo Vaiani, Soyeon Caren Han, Jean Lee
, Paolo Garza, Josiah Poon, Luca Cagliero:
M3-VRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding. CoRR abs/2402.17983 (2024)
[i9]Yihao Ding, Kaixuan Ren
, Jiabin Huang, Siwen Luo, Soyeon Caren Han:
PDF-MVQA: A Dataset for Multimodal Information Retrieval in PDF-based Visual Question Answering. CoRR abs/2404.12720 (2024)
[i8]Yihao Ding, Jean Lee, Soyeon Caren Han:
Deep Learning based Visually Rich Document Content Understanding: A Survey. CoRR abs/2408.01287 (2024)
[i7]Yihao Ding, Soyeon Caren Han, Zechuan Li, Hyunsuk Chung:
DAViD: Domain Adaptive Visually-Rich Document Understanding with Synthetic Insights. CoRR abs/2410.01609 (2024)- 2023
[j1]Jie Yang, Yihao Ding
, Siqu Long, Josiah Poon, Soyeon Caren Han:
DDI-MuG: Multi-aspect graphs for drug-drug interaction extraction. Frontiers Digit. Health 5 (2023)
[c6]Soyeon Caren Han
, Yihao Ding
, Siwen Luo
, Josiah Poon
, Hee-Guen Yoon
, Zhe Huang
, Paul Duuring
, Eun-Jung Holden
:
Workshop on Document Intelligence Understanding. CIKM 2023: 5273-5276
[c5]Yihao Ding
, Siwen Luo
, Hyunsuk Chung, Soyeon Caren Han:
PDF-VQA: A New Dataset for Real-World VQA on PDF Documents. ECML/PKDD (6) 2023: 585-601
[c4]Yihao Ding
, Siqu Long
, Jiabin Huang
, Kaixuan Ren
, Xingxiang Luo
, Hyunsuk Chung
, Soyeon Caren Han
:
Form-NLU: Dataset for the Form Natural Language Understanding. SIGIR 2023: 2807-2816
[i6]Yihao Ding, Siqu Long, Jiabin Huang, Kaixuan Ren
, Xingxiang Luo, Hyunsuk Chung, Soyeon Caren Han:
Form-NLU: Dataset for the Form Language Understanding. CoRR abs/2304.01577 (2023)
[i5]Yihao Ding, Siwen Luo, Hyunsuk Chung, Soyeon Caren Han:
PDFVQA: A New Dataset for Real-World VQA on PDF Documents. CoRR abs/2304.06447 (2023)
[i4]Kunze Wang, Yihao Ding, Soyeon Caren Han:
Graph Neural Networks for Text Classification: A Survey. CoRR abs/2304.11534 (2023)
[i3]Soyeon Caren Han, Yihao Ding, Siwen Luo, Josiah Poon, Hee-Guen Yoon, Zhe Huang, Paul Duuring
, Eun-Jung Holden:
Workshop on Document Intelligence Understanding. CoRR abs/2307.16369 (2023)- 2022
[c3]Jie Yang, Yihao Ding, Siqu Long, Josiah Poon, Soyeon Caren Han:
DDI-MuG: Multi-aspect Graphs for Drug-Drug Interaction Extraction. LOUHI@EMNLP 2022: 127-137
[c2]Siwen Luo
, Yihao Ding, Siqu Long, Josiah Poon, Soyeon Caren Han:
Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis. COLING 2022: 2906-2916
[c1]Yihao Ding
, Zhe Huang, Runlin Wang, Yanhang Zhang, Xianru Chen, Yuzhong Ma, Hyunsuk Chung, Soyeon Caren Han:
V-Doc : Visual questions answers with Documents. CVPR 2022: 21460-21466
[i2]Yihao Ding, Zhe Huang, Runlin Wang, Yanhang Zhang, Xianru Chen, Yuzhong Ma, Hyunsuk Chung, Soyeon Caren Han:
V-Doc : Visual questions answers with Documents. CoRR abs/2205.13724 (2022)
[i1]Siwen Luo, Yihao Ding, Siqu Long, Soyeon Caren Han, Josiah Poon:
Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis. CoRR abs/2208.10970 (2022)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-12-28 00:20 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







