AnHai Doan
Person information
- affiliation: University of Wisconsin, Madison, WI, USA
2020 – today
- 2024
- [j47]AnHai Doan:
Technical Perspective: Unicorn: A Unified Multi-Tasking Matching Model. SIGMOD Rec. 53(1): 43 (2024) - 2023
- [j46]Derek Paulsen, Yash Govind, AnHai Doan:
Sparkly: A Simple yet Surprisingly Strong TF/IDF Blocker for Entity Matching. Proc. VLDB Endow. 16(6): 1507-1519 (2023) - [j45]Yuliang Li, Jinfeng Li, Yoshi Suhara, AnHai Doan, Wang-Chiew Tan:
Effective entity matching with transformers. VLDB J. 32(6): 1215-1235 (2023) - 2022
- [j44]Daniel Abadi, Anastasia Ailamaki, David G. Andersen, Peter Bailis, Magdalena Balazinska, Philip A. Bernstein, Peter A. Boncz, Surajit Chaudhuri, Alvin Cheung, AnHai Doan, Luna Dong, Michael J. Franklin, Juliana Freire, Alon Y. Halevy, Joseph M. Hellerstein, Stratos Idreos, Donald Kossmann, Tim Kraska, Sailesh Krishnamurthy, Volker Markl, Sergey Melnik, Tova Milo, C. Mohan, Thomas Neumann, Beng Chin Ooi, Fatma Ozcan, Jignesh M. Patel, Andrew Pavlo, Raluca A. Popa, Raghu Ramakrishnan, Christopher Ré, Michael Stonebraker, Dan Suciu:
The Seattle report on database research. Commun. ACM 65(8): 72-79 (2022) - [j43]Magdalena Balazinska, Surajit Chaudhuri, AnHai Doan, Joseph M. Hellerstein, Hanuma Kodavalla, Ippokratis Pandis, Matei Zaharia:
Cloud Data Systems: What are the Opportunities for the Database Research Community? Proc. VLDB Endow. 15(12): 3826-3827 (2022) - [c72]Adel Ardalan, Derek Paulsen, Amanpreet Singh Saini, Walter Cai, AnHai Doan:
Toward Data Cleaning with a Target Accuracy: A Case Study for Value Normalization. IEEE Big Data 2022: 3975-3981 - 2021
- [j42]Saravanan Thirumuruganathan, Han Li, Nan Tang, Mourad Ouzzani, Yash Govind, Derek Paulsen, Glenn Fung, AnHai Doan:
Deep Learning for Blocking in Entity Matching: A Design Space Exploration. Proc. VLDB Endow. 14(11): 2459-2472 (2021) - [i9]Adel Ardalan, Derek Paulsen, Amanpreet Singh Saini, Walter Cai, AnHai Doan:
Toward Data Cleaning with a Target Accuracy: A Case Study for Value Normalization. CoRR abs/2101.05308 (2021) - 2020
- [j41]AnHai Doan, Pradap Konda, Paul Suganthan G. C., Yash Govind, Derek Paulsen, Kaushik Chandrasekhar, Philip Martinkus, Matthew Christie:
Magellan: toward building ecosystems of entity matching solutions. Commun. ACM 63(8): 83-91 (2020) - [j40]Yuliang Li, Jinfeng Li, Yoshihiko Suhara, AnHai Doan, Wang-Chiew Tan:
Deep Entity Matching with Pre-Trained Language Models. Proc. VLDB Endow. 14(1): 50-60 (2020) - [j39]David Maier, Rachel Pottinger, AnHai Doan, Eduard C. Dragut, Bill Howe, Joanne Lateulere, John Lateulere, Mostafa Milani, Tilmann Rabl, Dan Suciu, Yufei Tao, Wang-Chiew Tan, Kristin Tufte:
Advice from SIGMOD/PODS 2020. SIGMOD Rec. 49(3): 43-54 (2020) - [c71]Saravanan Thirumuruganathan, Nan Tang, Mourad Ouzzani, AnHai Doan:
Data Curation with Deep Learning. EDBT 2020: 277-286 - [c70]Haojun Zhang, Chengliang Chai, AnHai Doan, Paris Koutris, Esteban Arcaute:
Manually Detecting Errors for Data Cleaning Using Adaptive Crowdsourcing Strategies. EDBT 2020: 311-322 - [c69]Mashaal Musleh, Mourad Ouzzani, Nan Tang, AnHai Doan:
CoClean: Collaborative Data Cleaning. SIGMOD Conference 2020: 2757-2760 - [e3]David Maier, Rachel Pottinger, AnHai Doan, Wang-Chiew Tan, Abdussalam Alawini, Hung Q. Ngo:
Proceedings of the 2020 International Conference on Management of Data, SIGMOD Conference 2020, online conference [Portland, OR, USA], June 14-19, 2020. ACM 2020, ISBN 978-1-4503-6735-6 - [i8]Yuliang Li, Jinfeng Li, Yoshihiko Suhara, AnHai Doan, Wang-Chiew Tan:
Deep Entity Matching with Pre-Trained Language Models. CoRR abs/2004.00584 (2020)
2010 – 2019
- 2019
- [j38]Daniel Abadi, Anastasia Ailamaki, David G. Andersen, Peter Bailis, Magdalena Balazinska, Philip A. Bernstein, Peter A. Boncz, Surajit Chaudhuri, Alvin Cheung, AnHai Doan, Luna Dong, Michael J. Franklin, Juliana Freire, Alon Y. Halevy, Joseph M. Hellerstein, Stratos Idreos, Donald Kossmann, Tim Kraska, Sailesh Krishnamurthy, Volker Markl, Sergey Melnik, Tova Milo, C. Mohan, Thomas Neumann, Beng Chin Ooi, Fatma Ozcan, Jignesh M. Patel, Andrew Pavlo, Raluca A. Popa, Raghu Ramakrishnan, Christopher Ré, Michael Stonebraker, Dan Suciu:
The Seattle Report on Database Research. SIGMOD Rec. 48(4): 44-53 (2019) - [c68]Pradap Konda, Sanjay Subramanian Seshadri, Elan Segarra, Brent Hueth, AnHai Doan:
Executing Entity Matching End to End: A Case Study. EDBT 2019: 489-500 - [c67]Yash Govind, Pradap Konda, Paul Suganthan G. C., Philip Martinkus, Palaniappan Nagarajan, Han Li, Aravind Soundararajan, Sidharth Mudgal, Jeffrey R. Ballard, Haojun Zhang, Adel Ardalan, Sanjib Das, Derek Paulsen, Amanpreet Singh Saini, Erik Paulson, Youngchoon Park, Marshall Carter, Mingju Sun, Glenn Moo Fung, AnHai Doan:
Entity Matching Meets Data Science: A Progress Report from the Magellan Project. SIGMOD Conference 2019: 389-403 - 2018
- [j37]Chen Chen, Behzad Golshan, Alon Y. Halevy, Wang-Chiew Tan, AnHai Doan:
BigGorilla: An Open-Source Ecosystem for Data Preparation and Integration. IEEE Data Eng. Bull. 41(2): 10-22 (2018) - [j36]AnHai Doan, Pradap Konda, Paul Suganthan G. C., Adel Ardalan, Jeffrey R. Ballard, Sanjib Das, Yash Govind, Han Li, Philip Martinkus, Sidharth Mudgal, Erik Paulson, Haojun Zhang:
Toward a System Building Agenda for Data Integration (and Data Science). IEEE Data Eng. Bull. 41(2): 35-46 (2018) - [j35]Yash Govind, Erik Paulson, Palaniappan Nagarajan, Paul Suganthan G. C., AnHai Doan, Youngchoon Park, Glenn Fung, Devin Conathan, Marshall Carter, Mingju Sun:
CloudMatcher: A Hands-Off Cloud/Crowd Service for Entity Matching. Proc. VLDB Endow. 11(12): 2042-2045 (2018) - [j34]Paul Suganthan G. C., Adel Ardalan, AnHai Doan, Aditya Akella:
Smurf: Self-Service String Matching Using Random Forests. Proc. VLDB Endow. 12(3): 278-291 (2018) - [c66]Han Li, Pradap Konda, Paul Suganthan G. C., AnHai Doan, Benjamin Snyder, Youngchoon Park, Ganesh Krishnan, Rohit Deep, Vijay Raghavendra:
MatchCatcher: A Debugger for Blocking in Entity Matching. EDBT 2018: 193-204 - [c65]AnHai Doan:
Human-in-the-Loop Data Analysis: A Personal Perspective. HILDA@SIGMOD 2018: 1:1-1:6 - [c64]Sidharth Mudgal, Han Li, Theodoros Rekatsinas, AnHai Doan, Youngchoon Park, Ganesh Krishnan, Rohit Deep, Esteban Arcaute, Vijay Raghavendra:
Deep Learning for Entity Matching: A Design Space Exploration. SIGMOD Conference 2018: 19-34 - 2017
- [j33]Matthew Bernstein, AnHai Doan, Colin N. Dewey:
MetaSRA: normalized human sample-specific metadata for the Sequence Read Archive. Bioinform. 33(18): 2914-2923 (2017) - [c63]Ahmad Pahlavan Tafti, Ehsun Behravesh, Mehdi Assefi, Eric LaRose, Jonathan C. Badger, John Mayer, AnHai Doan, David Page, Peggy L. Peissig:
bigNN: An open-source big data toolkit focused on biomedical sentence classification. IEEE BigData 2017: 3888-3896 - [c62]AnHai Doan:
What is Our Agenda for Data Science? CIDR 2017 - [c61]Eric R. LaRose, Jonathan C. Badger, Pradap Konda, AnHai Doan, Peggy L. Peissig:
Entity Matching Using Magellan: Matching Drug Reference Tables. CRI 2017 - [c60]Fatemah Panahi, Wentao Wu, AnHai Doan, Jeffrey F. Naughton:
Towards Interactive Debugging of Rule-based Entity Matching. EDBT 2017: 354-365 - [c59]AnHai Doan, Adel Ardalan, Jeffrey R. Ballard, Sanjib Das, Yash Govind, Pradap Konda, Han Li, Sidharth Mudgal, Erik Paulson, Paul Suganthan G. C., Haojun Zhang:
Human-in-the-Loop Challenges for Entity Matching: A Midterm Report. HILDA@SIGMOD 2017: 12:1-12:6 - [c58]Sanjib Das, Paul Suganthan G. C., AnHai Doan, Jeffrey F. Naughton, Ganesh Krishnan, Rohit Deep, Esteban Arcaute, Vijay Raghavendra, Youngchoon Park:
Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services. SIGMOD Conference 2017: 1431-1446 - [i7]AnHai Doan, Adel Ardalan, Jeffrey R. Ballard, Sanjib Das, Yash Govind, Pradap Konda, Han Li, Erik Paulson, Paul Suganthan G. C., Haojun Zhang:
Toward a System Building Agenda for Data Integration. CoRR abs/1710.00027 (2017) - 2016
- [j32]Daniel Abadi, Rakesh Agrawal, Anastasia Ailamaki, Magdalena Balazinska, Philip A. Bernstein, Michael J. Carey, Surajit Chaudhuri, Jeffrey Dean, AnHai Doan, Michael J. Franklin, Johannes Gehrke, Laura M. Haas, Alon Y. Halevy, Joseph M. Hellerstein, Yannis E. Ioannidis, H. V. Jagadish, Donald Kossmann, Samuel Madden, Sharad Mehrotra, Tova Milo, Jeffrey F. Naughton, Raghu Ramakrishnan, Volker Markl, Christopher Olston, Beng Chin Ooi, Christopher Ré, Dan Suciu, Michael Stonebraker, Todd Walter, Jennifer Widom:
The Beckman report on database research. Commun. ACM 59(2): 92-99 (2016) - [j31]Pradap Konda, Sanjib Das, Paul Suganthan G. C., AnHai Doan, Adel Ardalan, Jeffrey R. Ballard, Han Li, Fatemah Panahi, Haojun Zhang, Jeffrey F. Naughton, Shishir Prasad, Ganesh Krishnan, Rohit Deep, Vijay Raghavendra:
Magellan: Toward Building Entity Matching Management Systems. Proc. VLDB Endow. 9(12): 1197-1208 (2016) - [j30]Pradap Konda, Sanjib Das, Paul Suganthan G. C., AnHai Doan, Adel Ardalan, Jeffrey R. Ballard, Han Li, Fatemah Panahi, Haojun Zhang, Jeffrey F. Naughton, Shishir Prasad, Ganesh Krishnan, Rohit Deep, Vijay Raghavendra:
Magellan: Toward Building Entity Matching Management Systems over Data Science Stacks. Proc. VLDB Endow. 9(13): 1581-1584 (2016) - 2015
- [c57]Akanksha Baid, Wentao Wu, Chong Sun, AnHai Doan, Jeffrey F. Naughton:
On Debugging Non-Answers in Keyword Search Systems. EDBT 2015: 37-48 - [c56]Paul Suganthan G. C., Chong Sun, Krishna Gayatri K., Haojun Zhang, Frank Yang, Narasimhan Rampalli, Shishir Prasad, Esteban Arcaute, Ganesh Krishnan, Rohit Deep, Vijay Raghavendra, AnHai Doan:
Why Big Data Industrial Systems Need Rules and What We Can Do About It. SIGMOD Conference 2015: 265-276 - 2014
- [j29]Yueh-Hsuan Chiang, AnHai Doan, Jeffrey F. Naughton:
Tracking Entities in the Dynamic World: A Fast Algorithm for Matching Temporal Records. Proc. VLDB Endow. 7(6): 469-480 (2014) - [j28]Chong Sun, Narasimhan Rampalli, Frank Yang, AnHai Doan:
Chimera: Large-Scale Classification using Machine Learning, Rules, and Crowdsourcing. Proc. VLDB Endow. 7(13): 1529-1540 (2014) - [j27]Daniel J. Abadi, Rakesh Agrawal, Anastasia Ailamaki, Magdalena Balazinska, Philip A. Bernstein, Michael J. Carey, Surajit Chaudhuri, Jeffrey Dean, AnHai Doan, Michael J. Franklin, Johannes Gehrke, Laura M. Haas, Alon Y. Halevy, Joseph M. Hellerstein, Yannis E. Ioannidis, H. V. Jagadish, Donald Kossmann, Samuel Madden, Sharad Mehrotra, Tova Milo, Jeffrey F. Naughton, Raghu Ramakrishnan, Volker Markl, Christopher Olston, Beng Chin Ooi, Christopher Ré, Dan Suciu, Michael Stonebraker, Todd Walter, Jennifer Widom:
The Beckman Report on Database Research. SIGMOD Rec. 43(3): 61-70 (2014) - [c55]Chaitanya Gokhale, Sanjib Das, AnHai Doan, Jeffrey F. Naughton, Narasimhan Rampalli, Jude W. Shavlik, Xiaojin Zhu:
Corleone: hands-off crowdsourcing for entity matching. SIGMOD Conference 2014: 601-612 - [c54]Yueh-Hsuan Chiang, AnHai Doan, Jeffrey F. Naughton:
Modeling entity evolution for temporal record matching. SIGMOD Conference 2014: 1175-1186 - 2013
- [j26]Xiaoyong Chai, Omkar Deshpande, Nikesh Garera, Rohit Kumar, Wang Lam, Digvijay S. Lamba, Lu Liu, Mitul Tiwari, Michel Tourn, Zoheb Vacheri, STS Prasad, Sri Subramaniam, Venky Harinarayan, Anand Rajaraman, Adel Ardalan, Sanjib Das, Paul Suganthan G. C., AnHai Doan:
Social Media Analytics: The Kosmix Story. IEEE Data Eng. Bull. 36(3): 4-12 (2013) - [j25]Rohit Kumar, Digvijay S. Lamba, Nikesh Garera, Mitul Tiwari, Xiaoyong Chai, Sanjib Das, Sri Subramaniam, Anand Rajaraman, Venky Harinarayan, AnHai Doan:
Entity Extraction, Linking, Classification, and Tagging for Social Media: A Wikipedia-Based Approach. Proc. VLDB Endow. 6(11): 1126-1137 (2013) - [c53]Omkar Deshpande, Digvijay S. Lamba, Michel Tourn, Sanjib Das, Sri Subramaniam, Anand Rajaraman, Venky Harinarayan, AnHai Doan:
Building, maintaining, and using knowledge bases: a report from the trenches. SIGMOD Conference 2013: 1209-1220 - [c52]AnHai Doan:
Badger: Toward Crowdsourcing the Building of Structured Knowledge Bases. WebDB 2013 - [i6]AnHai Doan, Peter Haddawy:
Sound Abstraction of Probabilistic Actions in The Constraint Mass Assignment Framework. CoRR abs/1302.3574 (2013) - [i5]Peter Haddawy, AnHai Doan, Richard Goodwin:
Efficient Decision-Theoretic Planning: Techniques and Empirical Analysis. CoRR abs/1302.4952 (2013) - [i4]Peter Haddawy, AnHai Doan:
Abstracting Probabilistic Actions. CoRR abs/1302.6812 (2013) - 2012
- [b1]AnHai Doan, Alon Y. Halevy, Zachary G. Ives:
Principles of Data Integration. Morgan Kaufmann 2012, ISBN 978-0-12-416044-6, pp. I-XVIII, 1-497 - [j24]Wang Lam, Lu Liu, STS Prasad, Anand Rajaraman, Zoheb Vacheri, AnHai Doan:
Muppet: MapReduce-Style Processing of Fast Data. Proc. VLDB Endow. 5(12): 1814-1825 (2012) - [i3]Wang Lam, Lu Liu, STS Prasad, Anand Rajaraman, Zoheb Vacheri, AnHai Doan:
Muppet: MapReduce-Style Processing of Fast Data. CoRR abs/1208.4175 (2012) - 2011
- [j23]AnHai Doan, Raghu Ramakrishnan, Alon Y. Halevy:
Crowdsourcing systems on the World-Wide Web. Commun. ACM 54(4): 86-96 (2011) - [j22]Feng Niu, Christopher Ré, AnHai Doan, Jude W. Shavlik:
Tuffy: Scaling up Statistical Inference in Markov Logic Networks using an RDBMS. Proc. VLDB Endow. 4(6): 373-384 (2011) - [j21]AnHai Doan, Michael J. Franklin, Donald Kossmann, Tim Kraska:
Crowdsourcing Applications and Platforms: A Data Management Perspective. Proc. VLDB Endow. 4(12): 1508-1509 (2011) - [i2]Feng Niu, Christopher Ré, AnHai Doan, Jude W. Shavlik:
Tuffy: Scaling up Statistical Inference in Markov Logic Networks using an RDBMS. CoRR abs/1104.3216 (2011) - 2010
- [j20]Akanksha Baid, Ian Rae, Jiexing Li, AnHai Doan, Jeffrey F. Naughton:
Toward Scalable Keyword Search over Relational Data. Proc. VLDB Endow. 3(1): 140-149 (2010) - [c51]Akanksha Baid, Ian Rae, AnHai Doan, Jeffrey F. Naughton:
Toward industrial-strength keyword search systems over relational data. ICDE 2010: 717-720 - [c50]Sihem Amer-Yahia, AnHai Doan, Jon M. Kleinberg, Nick Koudas, Michael J. Franklin:
Crowds, clouds, and algorithms: exploring the human side of "big data" applications. SIGMOD Conference 2010: 1259-1260
2000 – 2009
- 2009
- [j19]Rakesh Agrawal, Anastasia Ailamaki, Philip A. Bernstein, Eric A. Brewer, Michael J. Carey, Surajit Chaudhuri, AnHai Doan, Daniela Florescu, Michael J. Franklin, Hector Garcia-Molina, Johannes Gehrke, Le Gruenwald, Laura M. Haas, Alon Y. Halevy, Joseph M. Hellerstein, Yannis E. Ioannidis, Henry F. Korth, Donald Kossmann, Samuel Madden, Roger Magoulas, Beng Chin Ooi, Tim O'Reilly, Raghu Ramakrishnan, Sunita Sarawagi, Michael Stonebraker, Alexander S. Szalay, Gerhard Weikum:
The Claremont report on database research. Commun. ACM 52(6): 56-65 (2009) - [c49]AnHai Doan, Jeffrey F. Naughton, Akanksha Baid, Xiaoyong Chai, Fei Chen, Ting Chen, Eric Chu, Pedro DeRose, Byron J. Gao, Chaitanya Gokhale, Jiansheng Huang, Warren Shen, Ba-Quy Vuong:
The Case for a Structured Approach to Managing Unstructured Data. CIDR 2009 - [c48]Alpa Jain, Panagiotis G. Ipeirotis, AnHai Doan, Luis Gravano:
Join Optimization of Information Extraction Output: Quality Matters! ICDE 2009: 186-197 - [c47]Risi Thonangi, Hao He, AnHai Doan, Haixun Wang, Jun Yang:
Weighted Proximity Best-Joins for Information Retrieval. ICDE 2009: 234-245 - [c46]Xiaoyong Chai, Ba-Quy Vuong, AnHai Doan, Jeffrey F. Naughton:
Efficiently incorporating user feedback into information extraction and integration programs. SIGMOD Conference 2009: 87-100 - [c45]Fei Chen, Byron J. Gao, AnHai Doan, Jun Yang, Raghu Ramakrishnan:
Optimizing complex extraction programs over evolving text data. SIGMOD Conference 2009: 321-334 - [c44]Eric Chu, Akanksha Baid, Xiaoyong Chai, AnHai Doan, Jeffrey F. Naughton:
Combining keyword search and forms for ad hoc querying of databases. SIGMOD Conference 2009: 349-360 - [p2]Wensheng Wu, AnHai Doan, Clement T. Yu, Weiyi Meng:
Modeling and Extracting Deep-Web Query Interfaces. Advances in Information and Intelligent Systems 2009: 65-90 - [i1]AnHai Doan, Jeffrey F. Naughton, Akanksha Baid, Xiaoyong Chai, Fei Chen, Ting Chen, Eric Chu, Pedro DeRose, Byron J. Gao, Chaitanya Gokhale, Jiansheng Huang, Warren Shen, Ba-Quy Vuong:
The Case for a Structured Approach to Managing Unstructured Data. CoRR abs/0909.1783 (2009) - 2008
- [j18]Jiansheng Huang, Ting Chen, AnHai Doan, Jeffrey F. Naughton:
On the provenance of non-answers to queries over extracted data. Proc. VLDB Endow. 1(1): 736-747 (2008) - [j17]Xiaoyong Chai, Mayssam Sayyadian, AnHai Doan, Arnon Rosenthal, Len Seligman:
Analyzing and revising data integration schemas to improve their matchability. Proc. VLDB Endow. 1(1): 773-784 (2008) - [j16]Sihem Amer-Yahia, Volker Markl, Alon Y. Halevy, AnHai Doan, Gustavo Alonso, Donald Kossmann, Gerhard Weikum:
Databases and Web 2.0 panel at VLDB 2007. SIGMOD Rec. 37(1): 49-52 (2008) - [j15]Rakesh Agrawal, Anastasia Ailamaki, Philip A. Bernstein, Eric A. Brewer, Michael J. Carey, Surajit Chaudhuri, AnHai Doan, Daniela Florescu, Michael J. Franklin, Hector Garcia-Molina, Johannes Gehrke, Le Gruenwald, Laura M. Haas, Alon Y. Halevy, Joseph M. Hellerstein, Yannis E. Ioannidis, Henry F. Korth, Donald Kossmann, Samuel Madden, Roger Magoulas, Beng Chin Ooi, Tim O'Reilly, Raghu Ramakrishnan, Sunita Sarawagi, Michael Stonebraker, Alexander S. Szalay, Gerhard Weikum:
The Claremont report on database research. SIGMOD Rec. 37(3): 9-19 (2008) - [j14]AnHai Doan, Jeffrey F. Naughton, Raghu Ramakrishnan, Akanksha Baid, Xiaoyong Chai, Fei Chen, Ting Chen, Eric Chu, Pedro DeRose, Byron J. Gao, Chaitanya Gokhale, Jiansheng Huang, Warren Shen, Ba-Quy Vuong:
Information extraction challenges in managing unstructured data. SIGMOD Rec. 37(4): 14-20 (2008) - [c43]Robert McCann, Warren Shen, AnHai Doan:
Matching Schemas in Online Communities: A Web 2.0 Approach. ICDE 2008: 110-119 - [c42]Alpa Jain, AnHai Doan, Luis Gravano:
Optimizing SQL Queries over Text Databases. ICDE 2008: 636-645 - [c41]Pedro DeRose, Xiaoyong Chai, Byron J. Gao, Warren Shen, AnHai Doan, Philip Bohannon, Xiaojin Zhu:
Building Community Wikipedias: A Machine-Human Partnership Approach. ICDE 2008: 646-655 - [c40]Fei Chen, AnHai Doan, Jun Yang, Raghu Ramakrishnan:
Efficient Information Extraction over Evolving Text Data. ICDE 2008: 943-952 - [c39]Michael L. Wick, Khashayar Rohanimanesh, Andrew McCallum, AnHai Doan:
A Discriminative Approach to Ontology Mapping. NTII 2008: 16-19 - [c38]AnHai Doan:
Building Structured Web Community Portals Via Extraction, Integration, and Mass Collaboration. PRICAI 2008: 3 - [c37]Warren Shen, Pedro DeRose, Robert McCann, AnHai Doan, Raghu Ramakrishnan:
Toward best-effort information extraction. SIGMOD Conference 2008: 1031-1042 - 2007
- [j13]AnHai Doan, Philip Bohannon, Raghu Ramakrishnan, Xiaoyong Chai, Pedro DeRose, Byron J. Gao, Warren Shen:
User-Centric Research Challenges in Community Information Management Systems. IEEE Data Eng. Bull. 30(2): 32-40 (2007) - [j12]Yoonkyong Lee, Mayssam Sayyadian, AnHai Doan, Arnon Rosenthal:
eTuner: tuning schema matching software using synthetic scenarios. VLDB J. 16(1): 97-122 (2007) - [c36]Pedro DeRose, Warren Shen, Fei Chen, Yoonkyong Lee, Douglas Burdick, AnHai Doan, Raghu Ramakrishnan:
DBLife: A Community Information Management Platform for the Database Research Community (Demo). CIDR 2007: 169-172 - [c35]Warren Shen, Pedro DeRose, Long H. Vu, AnHai Doan, Raghu Ramakrishnan:
Source-aware Entity Matching: A Compositional Approach. ICDE 2007: 196-205 - [c34]Mayssam Sayyadian, Hieu LeKhac, AnHai Doan, Luis Gravano:
Efficient Keyword Search Across Heterogeneous Relational Databases. ICDE 2007: 346-355 - [c33]Alpa Jain, AnHai Doan, Luis Gravano:
SQL Queries Over Unstructured Text Databases. ICDE 2007: 1255-1257 - [c32]Tochukwu Iwuchukwu, David J. DeWitt, AnHai Doan, Jeffrey F. Naughton:
K-Anonymization as Spatial Indexing: Toward Scalable and Incremental Anonymization. ICDE 2007: 1414-1416 - [c31]AnHai Doan:
Data Quality Challenges in Community Systems. QDB 2007: 9 - [c30]Douglas Burdick, AnHai Doan, Raghu Ramakrishnan, Shivakumar Vaithyanathan:
OLAP over Imprecise Data with Domain Constraints. VLDB 2007: 39-50 - [c29]Pedro DeRose, Warren Shen, Fei Chen, AnHai Doan, Raghu Ramakrishnan:
Building Structured Web Community Portals: A Top-Down, Compositional, and Incremental Approach. VLDB 2007: 399-410 - [c28]Warren Shen, AnHai Doan, Jeffrey F. Naughton, Raghu Ramakrishnan:
Declarative Information Extraction Using Datalog with Embedded Extraction Predicates. VLDB 2007: 1033-1044 - [c27]Eric Chu, Akanksha Baid, Ting Chen, AnHai Doan, Jeffrey F. Naughton:
A Relational Approach to Incrementally Extracting and Querying Structure in Unstructured Data. VLDB 2007: 1045-1056 - 2006
- [j11]AnHai Doan, Raghu Ramakrishnan, Fei Chen, Pedro DeRose, Yoonkyong Lee, Robert McCann, Mayssam Sayyadian, Warren Shen:
Community Information Management. IEEE Data Eng. Bull. 29(1): 64-72 (2006) - [c26]Wensheng Wu, AnHai Doan, Clement T. Yu:
WebIQ: Learning from the Web to Match Deep-Web Query Interfaces. ICDE 2006: 44 - [c25]AnHai Doan, Raghu Ramakrishnan, Shivakumar Vaithyanathan:
Managing information extraction: state of the art and research directions. SIGMOD Conference 2006: 799-800 - 2005
- [j10]Natalya Fridman Noy, AnHai Doan, Alon Y. Halevy:
Semantic Integration. AI Mag. 26(1): 7-10 (2005) - [j9]AnHai Doan, Alon Y. Halevy:
Semantic Integration Research in the Database Community: A Brief Survey. AI Mag. 26(1): 83-94 (2005) - [c24]Warren Shen, Xin Li, AnHai Doan:
Constraint-Based Entity Matching. AAAI 2005: 862-867 - [c23]AnHai Doan, Robert McCann, Warren Shen:
Collaborative Development of Information Integration Systems. AAAI Spring Symposium: Knowledge Collection from Volunteer Contributors 2005: 34-41 - [c22]Jayant Madhavan, Philip A. Bernstein, AnHai Doan, Alon Y. Halevy:
Corpus-based Schema Matching. ICDE 2005: 57-68 - [c21]Robert McCann, Alexander Kramnik, Warren Shen, Vanitha Varadarajan, Olu Sobulo, AnHai Doan:
Integrating Data from Disparate Sources: A Mass Collaboration Approach. ICDE 2005: 487-488 - [c20]Wensheng Wu, AnHai Doan, Clement T. Yu:
Merging Interface Schemas on the Deep Web via Clustering Aggregation. ICDM 2005: 801-804 - [c19]Wensheng Wu, AnHai Doan, Clement T. Yu, Weiyi Meng:
Bootstrapping Domain Ontology for Semantic Web Services from Source Web Sites. TES 2005: 11-22 - [c18]Mayssam Sayyadian, Yoonkyong Lee, AnHai Doan, Arnon Rosenthal:
Tuning Schema Matching Software using Synthetic Scenarios. VLDB 2005: 994-1005 - [c17]Robert McCann, Bedoor K. AlShebli, Quoc Le, Hoa Nguyen, Long H. Vu, AnHai Doan:
Mapping Maintenance for Data Integration Systems. VLDB 2005: 1018-1030 - [e2]AnHai Doan, Frank Neven, Robert McCann, Geert Jan Bex:
Proceedings of the Eighth International Workshop on the Web & Databases (WebDB 2005), Baltimore, Maryland, USA, Collocated with ACM SIGMOD/PODS 2005, June 16-17, 2005. 2005 - 2004
- [j8]AnHai Doan, Alon Y. Halevy,