Остановите войну!
for scientists:
default search action
Nan Tang 0001
- > Home > Persons > Nan Tang 0001
Publications
- 2023
- [i17]Mohammad Shahmeer Ahmad, Zan Ahmad Naeem, Mohamed Y. Eltabakh, Mourad Ouzzani, Nan Tang:
RetClean: Retrieval-Based Data Cleaning Using Foundation Models and Data Lakes. CoRR abs/2303.16909 (2023) - 2022
- [j42]Xuedi Qin, Chengliang Chai, Yuyu Luo, Tianyu Zhao, Nan Tang, Guoliang Li, Jianhua Feng, Xiang Yu, Mourad Ouzzani:
Interactively discovering and ranking desired tuples by data exploration. VLDB J. 31(4): 753-777 (2022) - 2021
- [j40]Nan Tang, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du, Guoliang Li, Samuel Madden, Mourad Ouzzani:
RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation. Proc. VLDB Endow. 14(8): 1254-1261 (2021) - [j39]Saravanan Thirumuruganathan, Han Li, Nan Tang, Mourad Ouzzani, Yash Govind, Derek Paulsen, Glenn Fung, AnHai Doan:
Deep Learning for Blocking in Entity Matching: A Design Space Exploration. Proc. VLDB Endow. 14(11): 2459-2472 (2021) - [c59]Xuedi Qin, Chengliang Chai, Yuyu Luo, Tianyu Zhao, Nan Tang, Guoliang Li, Jianhua Feng, Xiang Yu, Mourad Ouzzani:
Ranking Desired Tuples by Database Exploration. ICDE 2021: 1973-1978 - 2020
- [j33]Abdulhakim Ali Qahtan, Nan Tang, Mourad Ouzzani, Yang Cao, Michael Stonebraker:
Pattern Functional Dependencies for Data Cleaning. Proc. VLDB Endow. 13(5): 684-697 (2020) - [j30]El Kindi Rezig, Ashrita Brahmaroutu, Nesime Tatbul, Mourad Ouzzani, Nan Tang, Timothy G. Mattson, Samuel Madden, Michael Stonebraker:
Debugging Large-Scale Data Science Pipelines using Dagger. Proc. VLDB Endow. 13(12): 2993-2996 (2020) - [c56]El Kindi Rezig, Lei Cao, Giovanni Simonini, Maxime Schoemans, Samuel Madden, Nan Tang, Mourad Ouzzani, Michael Stonebraker:
Dagger: A Data (not code) Debugger. CIDR 2020 - [c55]Saravanan Thirumuruganathan, Nan Tang, Mourad Ouzzani, AnHai Doan:
Data Curation with Deep Learning. EDBT 2020: 277-286 - [c50]Mashaal Musleh, Mourad Ouzzani, Nan Tang, AnHai Doan:
CoClean: Collaborative Data Cleaning. SIGMOD Conference 2020: 2757-2760 - [i10]Nan Tang, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du, Guoliang Li, Sam Madden, Mourad Ouzzani:
Relational Pretrained Transformers towards Democratizing Data Preparation [Vision]. CoRR abs/2012.02469 (2020) - 2019
- [j27]El Kindi Rezig, Lei Cao, Michael Stonebraker, Giovanni Simonini, Wenbo Tao, Samuel Madden, Mourad Ouzzani, Nan Tang, Ahmed K. Elmagarmid:
Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics. Proc. VLDB Endow. 12(12): 1954-1957 (2019) - [c49]Dong Deng, Wenbo Tao, Ziawasch Abedjan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Guoliang Li, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Unsupervised String Transformation Learning for Entity Consolidation. ICDE 2019: 196-207 - [c48]Saravanan Thirumuruganathan, Mourad Ouzzani, Nan Tang:
Explaining Entity Resolution Predictions: Where are we and What needs to be done? HILDA@SIGMOD 2019: 10:1-10:6 - [c47]Mohammad Mahdavi, Ziawasch Abedjan, Raul Castro Fernandez, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Raha: A Configuration-Free Error Detection System. SIGMOD Conference 2019: 865-882 - [c46]Abdulhakim Ali Qahtan, Nan Tang, Mourad Ouzzani, Yang Cao, Michael Stonebraker:
ANMAT: Automatic Knowledge Discovery and Error Detection through Pattern Functional Dependencies. SIGMOD Conference 2019: 1977-1980 - [p1]Mourad Ouzzani, Nan Tang, Raul Castro Fernandez:
Data civilizer: end-to-end support for data discovery, integration, and cleaning. Making Databases Work 2019: 291-300 - [i8]Ji Sun, Dong Deng, Ihab F. Ilyas, Guoliang Li, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Technical Report: Optimizing Human Involvement for Entity Matching and Consolidation. CoRR abs/1906.06574 (2019) - [i6]Raul Castro Fernandez, Nan Tang, Mourad Ouzzani, Michael Stonebraker, Samuel Madden:
Dataset-On-Demand: Automatic View Search and Presentation for Data Discovery. CoRR abs/1911.11876 (2019) - 2018
- [j24]Divy Agrawal, Sanjay Chawla, Bertty Contreras-Rojas, Ahmed K. Elmagarmid, Yasser Idris, Zoi Kaoudi, Sebastian Kruse, Ji Lucas, Essam Mansour, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Saravanan Thirumuruganathan, Anis Troudi:
RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! -. Proc. VLDB Endow. 11(11): 1414-1427 (2018) - [j23]Muhammad Ebraheem, Saravanan Thirumuruganathan, Shafiq R. Joty, Mourad Ouzzani, Nan Tang:
Distributed Representations of Tuples for Entity Resolution. Proc. VLDB Endow. 11(11): 1454-1467 (2018) - [c41]Raul Castro Fernandez, Essam Mansour, Abdulhakim Ali Qahtan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Seeping Semantics: Linking Datasets Using Word Embeddings for Data Discovery. ICDE 2018: 989-1000 - [c40]Essam Mansour, Dong Deng, Raul Castro Fernandez, Abdulhakim Ali Qahtan, Wenbo Tao, Ziawasch Abedjan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Building Data Civilizer Pipelines with an Advanced Workflow Engine. ICDE 2018: 1593-1596 - [c38]Abdulhakim Ali Qahtan, Ahmed K. Elmagarmid, Mourad Ouzzani, Nan Tang:
FAHES: Detecting Disguised Missing Values. ICDE 2018: 1609-1612 - [c37]Abdulhakim Ali Qahtan, Ahmed K. Elmagarmid, Raul Castro Fernandez, Mourad Ouzzani, Nan Tang:
FAHES: A Robust Disguised Missing Values Detector. KDD 2018: 2100-2109 - [i5]Saravanan Thirumuruganathan, Nan Tang, Mourad Ouzzani:
Data Curation with Deep Learning [Vision]: Towards Self Driving Data Curation. CoRR abs/1803.01384 (2018) - [i4]Saravanan Thirumuruganathan, Shameem Ahamed Puthiya Parambath, Mourad Ouzzani, Nan Tang, Shafiq R. Joty:
Reuse and Adaptation for Entity Resolution through Transfer Learning. CoRR abs/1809.11084 (2018) - 2017
- [j20]Zuhair Khayyat, William Lucia, Meghna Singh, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Panos Kalnis:
Errata for "Lightning Fast and Space Efficient Inequality Joins" (PVLDB 8(13): 2074-2085). Proc. VLDB Endow. 10(9): 985 (2017) - [j17]Zuhair Khayyat, William Lucia, Meghna Singh, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Panos Kalnis:
Fast and scalable inequality joins. VLDB J. 26(1): 125-150 (2017) - [c35]Dong Deng, Raul Castro Fernandez, Ziawasch Abedjan, Sibo Wang, Michael Stonebraker, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Nan Tang:
The Data Civilizer System. CIDR 2017 - [c31]Saravanan Thirumuruganathan, Laure Berti-Équille, Mourad Ouzzani, Jorge-Arnulfo Quiané-Ruiz, Nan Tang:
UGuide: User-Guided Discovery of FD-Detectable Errors. SIGMOD Conference 2017: 1385-1397 - [c29]Raul Castro Fernandez, Dong Deng, Essam Mansour, Abdulhakim Ali Qahtan, Wenbo Tao, Ziawasch Abedjan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
A Demo of the Data Civilizer System. SIGMOD Conference 2017: 1639-1642 - [i3]Dong Deng, Wenbo Tao, Ziawasch Abedjan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Entity Consolidation: The Golden Record Problem. CoRR abs/1709.10436 (2017) - [i2]Muhammad Ebraheem, Saravanan Thirumuruganathan, Shafiq R. Joty, Mourad Ouzzani, Nan Tang:
DeepER - Deep Entity Resolution. CoRR abs/1710.00597 (2017) - 2016
- [j16]Ziawasch Abedjan, Xu Chu, Dong Deng, Raul Castro Fernandez, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Michael Stonebraker, Nan Tang:
Detecting Data Errors: Where are we and what needs to be done? Proc. VLDB Endow. 9(12): 993-1004 (2016) - [c28]Divy Agrawal, Sanjay Chawla, Ahmed K. Elmagarmid, Zoi Kaoudi, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Mohammed J. Zaki:
Road to Freedom in Big Data Analytics. EDBT 2016: 479-484 - [c25]Divy Agrawal, Mouhamadou Lamine Ba, Laure Berti-Équille, Sanjay Chawla, Ahmed K. Elmagarmid, Hossam Hammady, Yasser Idris, Zoi Kaoudi, Zuhair Khayyat, Sebastian Kruse, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Mohammed J. Zaki:
Rheem: Enabling Multi-Platform Task Execution. SIGMOD Conference 2016: 2069-2072 - 2015
- [j15]Xu Chu, Mourad Ouzzani, John Morcos, Ihab F. Ilyas, Paolo Papotti, Nan Tang, Yin Ye:
KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing. Proc. VLDB Endow. 8(12): 1952-1955 (2015) - [j14]Zuhair Khayyat, William Lucia, Meghna Singh, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Panos Kalnis:
Lightning Fast and Space Efficient Inequality Joins. Proc. VLDB Endow. 8(13): 2074-2085 (2015) - [c22]Zuhair Khayyat, Ihab F. Ilyas, Alekh Jindal, Samuel Madden, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Si Yin:
BigDansing: A System for Big Data Cleansing. SIGMOD Conference 2015: 1215-1230 - [c21]Xu Chu, John Morcos, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Nan Tang, Yin Ye:
KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing. SIGMOD Conference 2015: 1247-1261 - 2014
- [c18]Ahmed K. Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Si Yin:
NADEEF/ER: generic and interactive entity resolution. SIGMOD Conference 2014: 1071-1074 - 2013
- [j10]Amr Ebaid, Ahmed K. Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Si Yin:
NADEEF: A Generalized Data Cleaning System. Proc. VLDB Endow. 6(12): 1218-1221 (2013) - [c15]Michele Dallachiesa, Amr Ebaid, Ahmed Eldawy, Ahmed K. Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Nan Tang:
NADEEF: a commodity data cleaning system. SIGMOD Conference 2013: 541-552 - 2012
- [j8]George Beskales, Gautam Das, Ahmed K. Elmagarmid, Ihab F. Ilyas, Felix Naumann, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang:
The data analytics group at the qatar computing research institute. SIGMOD Rec. 41(4): 33-38 (2012)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-04-12 18:30 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint