


default search action
Xinyu Zhang 0018
Person information
- affiliation: University of Waterloo, David R. Cheriton School of Computer Science, Canada
Other persons with the same name
- Xinyu Zhang (aka: Xin-yu Zhang, Xin-Yu Zhang) — disambiguation page
- Xinyu Zhang 0001
— Tsinghua University, State Key Laboratory of Automotive Safety and Energy, School of Vehicle and Mobility, Information Technology Center, Beijing, China
- Xinyu Zhang 0002
— East China Normal University, School of Computer Science and Software Engineering, Shanghai, China (and 3 more)
- Xinyu Zhang 0003
— University of California San Diego, USA (and 3 more)
- Xinyu Zhang 0004
— Yale University School of Medicine, Department of Psychiatry, New Haven, CT, USA (and 1 more)
- Xinyu Zhang 0005
— Sinopec International Petroleum Exploration and Production Corporation, Beijing, China (and 1 more)
- Xinyu Zhang 0006
— Beihang University, State Key Laboratory of Software Development Environment, Beijing, China
- Xinyu Zhang 0007
— Beijing Jiaotong University, China
- Xinyu Zhang 0008
— University of Nottingham, Ningbo, China
- Xinyu Zhang 0009 — University of California, San Diego, USA
- Xinyu Zhang 0010
— National University of Defense Technology, College of Electronic Science and Engineering, Changsha, China
- Xinyu Zhang 0011
— Shanghai University, School of Computer Engineering and Science, China
- Xinyu Zhang 0012
— Hunan University of Technology and Business, Changsha, China (and 1 more)
- Xinyu Zhang 0013
— Harbin Engineering University, China
- Xinyu Zhang 0015
— Tongji University, College of Electronics and Information Engineering, China
- Xinyu Zhang 0016
— Zhejiang University, Zhejiang, China
- Xinyu Zhang 0017
— Monash University, Australia
- Xinyu Zhang 0019
— Huawei, Distributed and Parallel Software Lab, Huawei Poisson Lab, Hangzhou, China (and 1 more)
- Xinyu Zhang 0020
(aka: Xin-Yu Zhang 0020) — Dalian Maritime University, Maritime Intelligent Transportation Research Team, Navigation College, China
- Xinyu Zhang 0021
— Xi'an Jiaotong University, Key Laboratory of Intelligent Networks and Network Security, SPKLSTN Laboratory, China
- Xinyu Zhang 0022
— University of Science and Technology of China, Hefei, Anhui, China
- Xinyu Zhang 0023
(aka: Xin-Yu Zhang 0023) — Nankai University, College of Computer Science, School of Mathematical Sciences, TKLNDST, Tianjin, China
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [c26]Andrew Liu, Edward Xu, Crystina Zhang, Jimmy Lin:
The Impact of Incidental Multilingual Text on Cross-Lingual Transfer in Monolingual Retrieval. ECIR (3) 2025: 165-173 - [c25]Crystina Zhang, Sebastian Hofstätter, Patrick Lewis, Raphael Tang, Jimmy Lin:
Rank-Without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models. ECIR (2) 2025: 233-247 - [c24]Crystina Zhang, Jing Lu, Vinh Q. Tran, Tal Schuster, Donald Metzler, Jimmy Lin:
Tomato, Tomahto, Tomate: Do Multilingual Language Models Understand Based on Subword-Level Semantic Concepts? NAACL (Findings) 2025: 1821-1837 - [i22]Kenneth C. Enevoldsen, Isaac Chung, Imene Kerboua, Márton Kardos, Ashwin Mathur, David Stap, Jay Gala, Wissam Siblini, Dominik Krzeminski, Genta Indra Winata, Saba Sturua, Saiteja Utpala, Mathieu Ciancone, Marion Schaeffer, Gabriel Sequeira, Diganta Misra, Shreeya Dhakal, Jonathan Rystrøm, Roman Solomatin, Ömer Çagatan, Akash Kundu, Martin Bernstorff, Shitao Xiao, Akshita Sukhlecha, Bhavish Pahwa, Rafal Poswiata, Kranthi Kiran GV, Shawon Ashraf, Daniel Auras, Björn Plüster, Jan Philipp Harries, Loïc Magne, Isabelle Mohr, Mariya Hendriksen, Dawei Zhu, Hippolyte Gisserot-Boukhlef, Tom Aarsen, Jan Kostkan, Konrad Wojtasik, Taemin Lee, Marek Suppa, Crystina Zhang, Roberta Rocca, Mohammed Hamdy, Andrianos Michail, John Yang, Manuel Faysse, Aleksei Vatolin, Nandan Thakur, Manan Dey, Dipam Vasani, Pranjal A. Chitale, Simone Tedeschi, Nguyen Tai, Artem Snegirev, Michael Günther, Mengzhou Xia, Weijia Shi, Xing Han Lù, Jordan Clive, Gayatri Krishnakumar, Anna Maksimova, Silvan Wehrli
, Maria Tikhonova, Henil Panchal, Aleksandr Abramov, Malte Ostendorff, Zheng Liu, Simon Clematide, Lester James V. Miranda, Alena Fenogenova, Guangyu Song, Ruqiya Bin Safi
, Wen-Ding Li, Alessia Borghini, Federico Cassano, Hongjin Su, Jimmy Lin, Howard Yen, Lasse Hansen, Sara Hooker, Chenghao Xiao, Vaibhav Adlakha, Orion Weller, Siva Reddy, Niklas Muennighoff:
MMTEB: Massive Multilingual Text Embedding Benchmark. CoRR abs/2502.13595 (2025) - [i21]Zhichao Xu, Fengran Mo, Zhiqi Huang, Crystina Zhang, Puxuan Yu, Bei Wang, Jimmy Lin, Vivek Srikumar:
A Survey of Model Architectures in Information Retrieval. CoRR abs/2502.14822 (2025) - 2024
- [j2]Xinyu Zhang
, Kelechi Ogueji
, Xueguang Ma
, Jimmy Lin
:
Toward Best Practices for Training Multilingual Dense Retrieval Models. ACM Trans. Inf. Syst. 42(2): 39:1-39:33 (2024) - [c23]Jingcong Liang, Rong Ye, Meng Han, Ruofei Lai, Xinyu Zhang, Xuanjing Huang, Zhongyu Wei:
Debatrix: Multi-dimensional Debate Judge with Iterative Chronological Analysis Based on LLM. ACL (Findings) 2024: 14575-14595 - [c22]Libo Sun, Siyuan Wang, Meng Han, Ruofei Lai, Xinyu Zhang, Xuanjing Huang, Zhongyu Wei:
Multi-Objective Forward Reasoning and Multi-Reward Backward Refinement for Product Review Summarization. LREC/COLING 2024: 11944-11955 - [c21]Raphael Tang, Xinyu Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture:
Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation. EMNLP 2024: 5441-5454 - [c20]Nandan Thakur, Luiz Bonifacio, Xinyu Zhang, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Boxing Chen, Mehdi Rezagholizadeh, Jimmy Lin:
"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation. EMNLP (Findings) 2024: 12508-12526 - [c19]Wenyan Li, Xinyu Zhang, Jiaang Li, Qiwei Peng, Raphael Tang, Li Zhou, Weijia Zhang, Guimin Hu, Yifei Yuan, Anders Søgaard, Daniel Hershcovich, Desmond Elliott:
FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture. EMNLP 2024: 19077-19095 - [c18]Xinyu Zhang, Minghan Li, Jimmy Lin:
CELI: Simple yet Effective Approach to Enhance Out-of-Domain Generalization of Cross-Encoders. NAACL (Short Papers) 2024: 188-196 - [c17]Raphael Tang, Xinyu Zhang, Xueguang Ma, Jimmy Lin, Ferhan Ture:
Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models. NAACL-HLT 2024: 2327-2340 - [c16]Mofetoluwa Adeyemi
, Akintunde Oladipo
, Xinyu Zhang
, David Alfonso-Hermelo
, Mehdi Rezagholizadeh
, Boxing Chen
, Abdul-Hakeem Omotayo
, Idris Abdulmumin
, Naome A. Etori
, Toyib Babatunde Musa
, Samuel Fanijo
, Oluwabusayo Olufunke Awoyomi
, Saheed Abdullahi Salahudeen
, Labaran Adamu Mohammed
, Daud Olamide Abolade
, Falalu Ibrahim Lawan
, Maryam Sabo Abubakar
, Ruqayya Nasir Iro
, Amina Abubakar Imam
, Shafie Abdi Mohamed
, Hanad Mohamud Mohamed
, Tunde Oluwaseyi Ajayi
, Jimmy Lin
:
CIRAL: A Test Collection for CLIR Evaluations in African Languages. SIGIR 2024: 293-302 - [i20]Jingcong Liang
, Rong Ye, Meng Han, Ruofei Lai, Xinyu Zhang, Xuanjing Huang, Zhongyu Wei:
Debatrix: Multi-dimensinal Debate Judge with Iterative Chronological Analysis Based on LLM. CoRR abs/2403.08010 (2024) - [i19]Raphael Tang, Xinyu Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture:
Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation. CoRR abs/2406.08482 (2024) - [i18]Wenyan Li, Xinyu Zhang, Jiaang Li, Qiwei Peng, Raphael Tang, Li Zhou, Weijia Zhang, Guimin Hu, Yifei Yuan, Anders Søgaard, Daniel Hershcovich, Desmond Elliott:
FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture. CoRR abs/2406.11030 (2024) - [i17]Xinyu Zhang, Jing Lu, Vinh Q. Tran, Tal Schuster, Donald Metzler, Jimmy Lin:
Tomato, Tomahto, Tomate: Measuring the Role of Shared Semantics among Subwords in Multilingual Language Models. CoRR abs/2411.04530 (2024) - 2023
- [j1]Xinyu Zhang, Nandan Thakur, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Mehdi Rezagholizadeh, Jimmy Lin:
MIRACL: A Multilingual Retrieval Dataset Covering 18 Diverse Languages. Trans. Assoc. Comput. Linguistics 11: 1114-1131 (2023) - [c15]Ehsan Kamalloo, Xinyu Zhang, Odunayo Ogundepo, Nandan Thakur, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Jimmy Lin:
Evaluating Embedding APIs for Information Retrieval. ACL (industry) 2023: 518-526 - [c14]Aleksandra Piktus, Odunayo Ogundepo, Christopher Akiki, Akintunde Oladipo, Xinyu Zhang, Hailey Schoelkopf, Stella Biderman, Martin Potthast, Jimmy Lin:
GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration. ACL (demo) 2023: 588-598 - [c13]Christopher Akiki, Odunayo Ogundepo, Aleksandra Piktus, Xinyu Zhang, Akintunde Oladipo, Jimmy Lin, Martin Potthast:
Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face. EMNLP (Demos) 2023: 140-148 - [c12]Mofetoluwa Adeyemi
, Akintunde Oladipo
, Xinyu Zhang
, David Alfonso-Hermelo
, Mehdi Rezagholizadeh
, Boxing Chen
, Jimmy Lin
:
CIRAL at FIRE 2023: Cross-Lingual Information Retrieval for African Languages. FIRE 2023: 4-6 - [c11]Mofetoluwa Adeyemi, Akintunde Oladipo, Xinyu Crystina Zhang, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Boxing Chen, Jimmy Lin:
Overview of the CIRAL Track at FIRE 2023: Cross-lingual Information Retrieval for African Languages. FIRE (Working Notes) 2023: 118-136 - [i16]Xinyu Zhang, Minghan Li, Jimmy Lin:
Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction. CoRR abs/2302.06589 (2023) - [i15]Christopher Akiki, Odunayo Ogundepo, Aleksandra Piktus, Xinyu Zhang, Akintunde Oladipo, Jimmy Lin, Martin Potthast:
Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face. CoRR abs/2302.14534 (2023) - [i14]Jimmy Lin, David Alfonso-Hermelo, Vitor Jeronymo, Ehsan Kamalloo, Carlos Lassance, Rodrigo Nogueira, Odunayo Ogundepo, Mehdi Rezagholizadeh, Nandan Thakur, Jheng-Hong Yang, Xinyu Zhang:
Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval. CoRR abs/2304.01019 (2023) - [i13]Xueguang Ma, Xinyu Zhang, Ronak Pradeep, Jimmy Lin:
Zero-Shot Listwise Document Reranking with a Large Language Model. CoRR abs/2305.02156 (2023) - [i12]Ehsan Kamalloo, Xinyu Zhang, Odunayo Ogundepo, Nandan Thakur, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Jimmy Lin:
Evaluating Embedding APIs for Information Retrieval. CoRR abs/2305.06300 (2023) - [i11]Aleksandra Piktus, Odunayo Ogundepo, Christopher Akiki, Akintunde Oladipo, Xinyu Zhang, Hailey Schoelkopf, Stella Biderman, Martin Potthast, Jimmy Lin:
GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration. CoRR abs/2306.01481 (2023) - [i10]Ehsan Kamalloo, Aref Jafari, Xinyu Zhang, Nandan Thakur, Jimmy Lin:
HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution. CoRR abs/2307.16883 (2023) - [i9]Raphael Tang, Xinyu Zhang, Xueguang Ma, Jimmy Lin, Ferhan Ture:
Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models. CoRR abs/2310.07712 (2023) - [i8]Raphael Tang, Xinyu Zhang, Jimmy Lin, Ferhan Ture:
What Do Llamas Really Think? Revealing Preference Biases in Language Model Representations. CoRR abs/2311.18812 (2023) - [i7]Xinyu Zhang, Sebastian Hofstätter, Patrick Lewis, Raphael Tang, Jimmy Lin:
Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models. CoRR abs/2312.02969 (2023) - [i6]Nandan Thakur, Luiz Bonifacio, Xinyu Zhang, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Boxing Chen, Mehdi Rezagholizadeh, Jimmy Lin:
NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation. CoRR abs/2312.11361 (2023) - 2022
- [c10]Ronak Pradeep, Yuqi Liu, Xinyu Zhang, Yilin Li, Andrew Yates, Jimmy Lin:
Squeezing Water from a Stone: A Bag of Tricks for Further Improving Cross-Encoder Effectiveness for Reranking. ECIR (1) 2022: 655-670 - [c9]Minghan Li
, Xinyu Zhang, Ji Xin, Hongyang Zhang, Jimmy Lin:
Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking. EMNLP 2022: 333-345 - [c8]Odunayo Ogundepo, Xinyu Zhang, Shuo Sun, Kevin Duh, Jimmy Lin:
AfriCLIRMatrix: Enabling Cross-Lingual Information Retrieval for African Languages. EMNLP 2022: 8721-8728 - [c7]Jimmy Lin, David Alfonso-Hermelo, Vitor Jeronymo, Ehsan Kamalloo, Carlos Lassance, Rodrigo Nogueira, Odunayo Ogundepo, Mehdi Rezagholizadeh, Nandan Thakur, Jheng-Hong Yang, Xinyu Zhang:
Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval. TREC 2022 - [i5]Xinyu Zhang, Kelechi Ogueji, Xueguang Ma, Jimmy Lin:
Towards Best Practices for Training Multilingual Dense Retrieval Models. CoRR abs/2204.02363 (2022) - [i4]Minghan Li, Xinyu Zhang, Ji Xin, Hongyang Zhang, Jimmy Lin:
Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking. CoRR abs/2205.09638 (2022) - [i3]Odunayo Ogundepo, Xinyu Zhang, Jimmy Lin:
Better Than Whitespace: Information Retrieval for Languages without Custom Tokenizers. CoRR abs/2210.05481 (2022) - [i2]Xinyu Zhang, Nandan Thakur, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Mehdi Rezagholizadeh, Jimmy Lin:
Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages. CoRR abs/2210.09984 (2022) - 2021
- [c6]Wei Zhong, Xinyu Zhang, Ji Xin, Richard Zanibbi, Jimmy Lin:
Approach Zero and Anserini at the CLEF-2021 ARQMath Track: Applying Substructure Search and BM25 on Operator Tree Path Tokens. CLEF (Working Notes) 2021: 133-156 - [c5]Xinyu Zhang, Andrew Yates, Jimmy Lin:
Comparing Score Aggregation Approaches for Document Retrieval with Pretrained Transformers. ECIR (2) 2021: 150-163 - [i1]Xinyu Zhang, Xueguang Ma, Peng Shi, Jimmy Lin:
Mr. TyDi: A Multi-lingual Benchmark for Dense Retrieval. CoRR abs/2108.08787 (2021) - 2020
- [c4]Andrew Yates, Kevin Martin Jose, Xinyu Zhang, Jimmy Lin:
Flexible IR Pipelines with Capreolus. CIKM 2020: 3181-3188 - [c3]Xinyu Zhang, Andrew Yates, Jimmy Lin:
A Little Bit Is Worse Than None: Ranking with Limited Training Data. SustaiNLP@EMNLP 2020: 107-112 - [c2]Ronak Pradeep, Xueguang Ma, Xinyu Zhang, Hang Cui, Ruizhou Xu, Rodrigo Nogueira, Jimmy Lin:
H2oloo at TREC 2020: When all you got is a hammer... Deep Learning, Health Misinformation, and Precision Medicine. TREC 2020 - [c1]Andrew Yates, Siddhant Arora, Xinyu Zhang, Wei Yang, Kevin Martin Jose, Jimmy Lin:
Capreolus: A Toolkit for End-to-End Neural Ad Hoc Retrieval. WSDM 2020: 861-864
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-05-30 20:37 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint