


default search action
Proceedings of the VLDB Endowment, Volume 3
All papers published in volume 3 were presented at the

Number 1 contains papers from research sessions
Number 2 contains papers from industrial sessions and demo sessions
Volume 3, Number 1, September 2010
Keynotes
- Divesh Srivastava, Lukasz Golab, Rick Greer, Theodore Johnson, Joseph Seidel, Vladislav Shkapenyuk, Oliver Spatscheck, Jennifer Yates:
Enabling Real Time Data Analysis. 1-2 - Paul Matsudaira:
High-End Biological Imaging Generates Very Large 3D+ and Dynamic Datasets. 3
10-Year Best Paper Awards
- Junghoo Cho, Hector Garcia-Molina:
Dealing with Web Data: History and Look ahead. 4 - Bettina Kemme, Gustavo Alonso:
Database Replication: a Tale of Research across Communities. 5-12
Research Sessions
Database Security
- Mustafa Canim, Murat Kantarcioglu, Bijit Hore, Sharad Mehrotra:
Building Disclosure Risk Aware Query Optimizers for Relational Databases. 13-24 - Tristan Allard, Nicolas Anciaux, Luc Bouganim, Yanli Guo, Lionel Le Folgoc, Benjamin Nguyen
, Philippe Pucheral, Indrajit Ray, Indrakshi Ray, Shaoyi Yin:
Secure Personal Data Servers: a Vision Paper. 25-35 - Daniel Fabbri
, Kristen LeFevre, Qiang Zhu:
PolicyReplay: Misconfiguration-Response Queries for Data Breach Reporting. 36-47
Parallel and Distributed Databases
- Carlo Curino, Yang Zhang, Evan P. C. Jones, Samuel Madden:
Schism: a Workload-Driven Approach to Database Replication and Partitioning. 48-57 - Lu Qin
, Jeffrey Xu Yu, Lijun Chang:
Ten Thousand SQLs: Parallel Keyword Queries Computing. 58-69 - Alexander Thomson, Daniel J. Abadi
:
The Case for Determinism in Database Systems. 70-80
Data Exchange
- Bogdan Alexe, Mauricio A. Hernández, Lucian Popa, Wang Chiew Tan:
MapMerge: Correlating Independent Schema Mappings. 81-92 - Francesca Spezzano
, Sergio Greco
:
Chase Termination: A Constraints Rewriting Approach. 93-104 - Bruno Marnette, Giansalvatore Mecca, Paolo Papotti:
Scalable Data Exchange with Functional Dependencies. 105-116
Database Services and Applications
- Roy Levin, Yaron Kanza, Eliyahu Safra, Yehoshua Sagiv:
Interactive Route Search in the Presence of Order Constraints. 117-128 - Willis Lang, Jignesh M. Patel:
Energy Management for MapReduce Clusters. 129-139 - Akanksha Baid, Ian Rae, Jiexing Li, AnHai Doan, Jeffrey F. Naughton:
Toward Scalable Keyword Search over Relational Data. 140-149
Data Models and Languages
- Barzan Mozafari, Kai Zeng
, Carlo Zaniolo:
From Regular Expressions to Nested Words: Unifying Languages and Query Execution for Relational and XML Sequences. 150-161 - Torsten Grust, Jan Rittinger, Tom Schreiber:
Avalanche-Safe LINQ Compilation. 162-172 - Wenfei Fan, Jianzhong Li, Shuai Ma, Nan Tang, Wenyuan Yu:
Towards Certain Fixes with Editing Rules and Master Data. 173-184
Semantics
- Melanie Herschel, Mauricio A. Hernández:
Explaining Missing Answers to SPJUA Queries. 185-196 - George Beskales, Ihab F. Ilyas, Lukasz Golab:
Sampling the Repairs of Functional Dependency Violations under Hard Constraints. 197-207 - David Menestrina, Steven Whang, Hector Garcia-Molina:
Evaluating Entity Resolution Results. 208-219
Stream Databases
- Badrish Chandramouli, Jonathan Goldstein, David Maier:
High-Performance Dynamic Pattern Matching over Disordered Streams. 220-231 - Irina Botan, Roozbeh Derakhshan, Nihal Dindar, Laura M. Haas, Renée J. Miller, Nesime Tatbul:
SECRET: A Model for Analysis of the Execution Semantics of Stream Processing Systems. 232-243 - Haopeng Zhang, Yanlei Diao, Neil Immerman:
Recognizing Patterns in Streams with Imprecise Timestamps. 244-255
RDF and Graphs
- Thomas Neumann, Gerhard Weikum:
x-RDF-3X: Fast Querying, High Update Rates, and Consistency for RDF Databases. 256-263 - Wenfei Fan, Jianzhong Li, Shuai Ma, Nan Tang, Yinghui Wu, Yunpeng Wu:
Graph Pattern Matching: From Intractable to Polynomial Time. 264-275 - Hilmi Yildirim, Vineet Chaoji
, Mohammed Javeed Zaki:
GRAIL: Scalable Reachability Index for Large Graphs. 276-284
Middleware Platforms for Data Management
- Yingyi Bu, Bill Howe
, Magdalena Balazinska, Michael D. Ernst:
HaLoop: Efficient Iterative Data Processing on Large Clusters. 285-296 - Michael Benedikt, Georg Gottlob:
The Impact of Virtual Views on Containment. 297-308 - James F. Terwilliger, Lois M. L. Delcambre, David Maier, Jeremy Steinhauer, Scott Britell:
Updatable and Evolvable Transforms for Virtual Databases. 309-319
Novel/Advanced Applications
- Daniel Deutch, Ohad Greenshpan, Tova Milo:
Navigating in Complex Mashed-Up Applications. 320-329 - Sergey Melnik, Andrey Gubarev, Jing Jing Long, Geoffrey Romer, Shiva Shivakumar, Matt Tolton, Theo Vassilakis:
Dremel: Interactive Analysis of Web-Scale Datasets. 330-339 - Peixiang Zhao, Jiawei Han:
On Graph Query Optimization in Large Networks. 340-351
Ranking Queries
- Davide Martinenghi
, Marco Tagliasacchi:
Proximity Rank Join. 352-363 - Akrivi Vlachou, Christos Doulkeridis, Kjetil Nørvåg
, Yannis Kotidis:
Identifying the Most Influential Data Objects with Reverse Top-k Queries. 364-372 - Xin Cao
, Gao Cong
, Christian S. Jensen
:
Retrieving Top-k Prestige-Based Relevant Spatial Web Objects. 373-384
Spatial and Temporal Databases
- Lei Li, B. Aditya Prakash, Christos Faloutsos:
Parsimonious Linear Fingerprinting for Time Series. 385-396 - Rui Zhang, Martin Stradling:
The HV-tree: a Memory Hierarchy Aware Version Index. 397-408 - Sakti Pramanik, Alok Watve, Chad R. Meiners, Alex X. Liu:
Transforming Range Queries To Equivalent Box Queries To Optimize Page Access. 409-416
Record Linkage
- Songtao Guo, Xin Dong, Divesh Srivastava, Rémi Zajac:
Record Linkage with Uniqueness Constraints and Erroneous Values. 417-428 - Ekaterini Ioannou
, Wolfgang Nejdl
, Claudia Niederée
, Yannis Velegrakis
:
On-the-Fly Entity-Aware Query Processing in the Presence of Linkage. 429-438 - Mohamed Yakout, Ahmed K. Elmagarmid, Hazem Elmeleegy, Mourad Ouzzani, Alan Qi:
Behavior Based Record Linkage. 439-448
Experimental Analysis and Performance
- Wook-Shin Han, Jinsoo Lee, Minh-Duc Pham, Jeffrey Xu Yu:
iGraph: A Framework for Comparisons of Disk-Based Graph Indexing Techniques. 449-459 - Jörg Schad, Jens Dittrich, Jorge-Arnulfo Quiané-Ruiz
:
Runtime Measurements in the Cloud: Observing, Analyzing, and Reducing Variance. 460-471 - Dawei Jiang, Beng Chin Ooi, Lei Shi, Sai Wu:
The Performance of MapReduce: An In-depth Study. 472-483 - Hanna Köpcke, Andreas Thor
, Erhard Rahm
:
Evaluation of entity resolution approaches on real-world match problems. 484-493
Cloud Computing
- Tomasz Nykiel, Michalis Potamias, Chaitanya Mishra, George Kollios
, Nick Koudas:
MRShare: Sharing Across Multiple Queries in MapReduce. 494-505 - Hoang Tam Vo, Chun Chen, Beng Chin Ooi:
Towards Elastic Transactional Cloud Storage with Range Query Support. 506-517 - Jens Dittrich, Jorge-Arnulfo Quiané-Ruiz
, Alekh Jindal, Yagiz Kargin, Vinay Setty, Jörg Schad:
Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing). 518-529
Query Processing I
- Nicolas Bruno, Vivek R. Narasayya, Ravishankar Ramamurthy:
Slicing Long-Running Queries. 530-541 - Kostas Tzoumas, Amol Deshpande, Christian S. Jensen
:
Sharing-Aware Horizontal Partitioning for Exploiting Correlations During Query Processing. 542-553 - Andrea Calì, Georg Gottlob, Andreas Pieris:
Advanced Processing for Ontological Queries. 554-565
Data Extraction
- Aditya G. Parameswaran
, Hector Garcia-Molina, Anand Rajaraman:
Towards The Web of Concepts: Extracting Concepts from Large Datasets. 566-577 - Pankaj Gulhane, Rajeev Rastogi, Srinivasan H. Sengamedu, Ashwin Tengli:
Exploiting Content Redundancy for Web Information Extraction. 578-587 - Bin Liu, Laura Chiticariu, Vivian Chu, H. V. Jagadish, Frederick Reiss:
Automatic Rule Refinement for Information Extraction. 588-597
Privacy
- HweeHwa Pang
, Xuhua Ding
, Xiaokui Xiao
:
Embellishing Text Search Queries To Protect User Privacy. 598-607 - Rhonda Chaytor, Ke Wang:
Small Domain Randomization: Same Privacy, More Utility. 608-618 - Stavros Papadopoulos, Spiridon Bakiras
, Dimitris Papadias:
Nearest Neighbor Search with Strong Location Privacy. 619-629
Probabilistic and Uncertain Databases
- Hideaki Kimura, Samuel Madden, Stanley B. Zdonik:
UPI: A Primary Index for Uncertain Databases. 630-637 - Jian Li, Amol Deshpande:
Ranking Continuous Probabilistic Datasets. 638-649 - Xiang Lian
, Lei Chen
:
Set Similarity Join on Probabilistic Data. 650-659
Databases on Modern Hardware
- Louis Woods, Jens Teubner, Gustavo Alonso:
Complex Event Detection at Wire Speed with FPGAs. 660-669 - Wenbin Fang, Bingsheng He
, Qiong Luo
:
Database Compression on Graphics Processors. 670-680 - Ryan Johnson, Ippokratis Pandis, Radu Stoica, Manos Athanassoulis
, Anastasia Ailamaki:
Aether: A Scalable Approach to Logging. 681-692
Data Mining
- Kathy Macropol, Ambuj K. Singh:
Scalable Discovery of Best Clusters on Large Graphs. 693-702 - Alexander J. Smola, Shravan M. Narayanamurthy:
An Architecture for Parallel Topic Models. 703-710 - Dong Xin, Yeye He, Venkatesh Ganti:
Keyword++: A Framework to Improve Keyword Search Over Entity Databases. 711-722
Moving Object Databases
- Zhenhui Li, Bolin Ding, Jiawei Han, Roland Kays
:
Swarm: Mining Relaxed Temporal Moving Object Clusters. 723-734 - Su Chen, Beng Chin Ooi, Zhenjie Zhang:
An Adaptive Updating Protocol for Reducing Moving Object Databases Workload. 735-746 - Georgios Kellaris
, Kyriakos Mouratidis
:
Shortest Path Computation on Air Indexes. 747-757
Probabilistic Data
- Jia Xu, Zhenjie Zhang, Anthony K. H. Tung
, Ge Yu:
Efficient and Effective Similarity Search over Probabilistic Data based on Earth Mover's Distance. 758-769 - Michael Benedikt, Evgeny Kharlamov, Dan Olteanu, Pierre Senellart
:
Probabilistic XML via Markov Chains. 770-781
Fuzzy, Probabilistic and Approximate Databases
- Subi Arumugam, Ravi Jampani, Luis Leopoldo Perez, Fei Xu, Christopher M. Jermaine, Peter J. Haas:
MCDB-R: Risk Analysis in the Database. 782-793 - Michael L. Wick, Andrew McCallum, Gerome Miklau:
Scalable Probabilistic Databases with Factor Graphs and MCMC. 794-804
Discovery and Exploration
- Meihui Zhang, Marios Hadjieleftheriou, Beng Chin Ooi, Cecilia M. Procopiuc, Divesh Srivastava:
On Multi-Column Foreign Key Discovery. 805-814 - Reynold Cheng, Eric Lo, Xuan S. Yang, Ming-Hay Luk, Xiang Li, Xike Xie:
Explore or Exploit? Effective Strategies for Disambiguating Large Databases. 815-825
Information Filtering and Dissemination
- Mohamed A. Soliman, Ihab F. Ilyas, Mina Saleeb:
Building Ranked Mashups of Unstructured Sources with Uncertain Information. 826-837 - Chedy Raïssi, Jian Pei
, Thomas Kister:
Computing Closed Skycubes. 838-847
Query Processing II
- Eric Lo, Nick Cheng, Wing-Kai Hon:
Generating Databases for Query Workloads. 848-859 - Minji Wu, Laure Berti-Équille
, Amélie Marian, Cecilia M. Procopiuc, Divesh Srivastava:
Processing Top-k Join Queries. 860-870 - Xavier Martinez-Palau, David Dominguez-Sal, Josep Lluís Larriba-Pey:
Two-way Replacement Selection. 871-881
XML Data
- Sebastian Maneth, Kim Nguyen:
XPath Whole Query Optimization. 882-893 - Nils Grimsmo, Truls Amundsen Bjørklund, Magnus Lie Hetland:
Fast Optimal Twig Joins. 894-905 - Michael Benedikt, James Cheney:
Destabilizers and Independence of XML Updates. 906-917
Workflows, Transactions and Business Processes
- Ziyang Liu, Qihong Shao, Yi Chen:
Searching Workflows with Hierarchical Views. 918-927 - Ippokratis Pandis, Ryan Johnson, Nikos Hardavellas
, Anastasia Ailamaki:
Data-Oriented Transaction Execution. 928-939 - Daniel Deutch, Tova Milo, Neoklis Polyzotis, Tom Yam:
Optimal Top-K Query Evaluation for Weighted Business Processes. 940-951
Scientific databases
- Guozhang Wang, Marcos Antonio Vaz Salles, Benjamin Sowell, Xun Wang, Tuan Cao, Alan J. Demers, Johannes Gehrke, Walker M. White:
Behavioral Simulations in MapReduce. 952-963 - Tingjian Ge, Stanley B. Zdonik:
A*-tree: A Structure for Storage and Modeling of Uncertain Multidimensional Arrays. 964-974 - Charu C. Aggarwal, Yao Li, Philip S. Yu, Ruoming Jin:
On Dense Pattern Mining in Graph Streams. 975-984
Mobility and Spatial Queries
- Man Lung Yiu, Leong Hou U
, Simonas Saltenis
, Kostas Tzoumas:
Efficient Proximity Detection among Mobile Users via Self-Tuning Policies. 985-996 - Michalis Potamias, Francesco Bonchi, Aristides Gionis, George Kollios
:
k-Nearest Neighbors in Uncertain Graphs. 997-1008 - Xin Cao
, Gao Cong
, Christian S. Jensen
:
Mining Significant Semantic Locations From GPS Data. 1009-1020
Data Anonymization Techniques
- Michael Hay, Vibhor Rastogi, Gerome Miklau, Dan Suciu
:
Boosting the Accuracy of Differentially Private Histograms Through Consistency. 1021-1032 - Jianneng Cao, Panagiotis Karras, Chedy Raïssi, Kian-Lee Tan
:
rho-uncertainty: Inference-Proof Transaction Anonymization. 1033-1044 - Graham Cormode
, Ninghui Li, Tiancheng Li, Divesh Srivastava:
Minimizing Minimality and Maximizing Utility: Analyzing Method-based attacks on Anonymized Data. 1045-1056
Querying and Integrating Probabilistic Databases
- Daisy Zhe Wang, Michael J. Franklin, Minos N. Garofalakis, Joseph M. Hellerstein:
Querying Probabilistic Information Extraction. 1057-1067 - Prithviraj Sen, Amol Deshpande, Lise Getoor:
Read-Once Functions and Query Evaluation in Probabilistic Databases. 1068-1079 - Parag Agrawal, Anish Das Sarma, Jeffrey D. Ullman, Jennifer Widom:
Foundations of Uncertain-Data Integration. 1080-1090
Database Design
- Michael Mathioudakis
, Nilesh Bansal, Nick Koudas:
Identifying, Attributing and Describing Spatial Bursts. 1091-1102 - Hideaki Kimura, George Huo, Alexander Rasin, Samuel Madden, Stanley B. Zdonik:
CORADD: Correlation Aware Database Designer for Materialized Views and Indexes. 1103-1113 - Danupon Nanongkai, Atish Das Sarma, Ashwin Lall, Richard J. Lipton, Jun (Jim) Xu:
Regret-Minimizing Representative Databases. 1114-1124
Query Optimization
- Benjamin Arai, Gautam Das
, Dimitrios Gunopulos
, Vagelis Hristidis
, Nick Koudas:
An Access Cost-Aware Approach for Object Retrieval over Multiple Sources. 1125-1136 - M. Abhirama, Sourjya Bhaumik, Atreyee Dey, Harsh Shrimal, Jayant R. Haritsa:
On the Stability of Plan Costs and the Costs of Plan Stability. 1137-1148 - Herodotos Herodotou, Shivnath Babu:
Xplus: A SQL-Tuning-Aware Query Optimizer. 1149-1160
Graph and Pattern Matching
- Wenfei Fan, Jianzhong Li, Shuai Ma, Hongzhi Wang
, Yinghui Wu:
Graph Homomorphism Revisited for Graph Matching. 1161-1172 - Ramakrishnan Kandhan, Nikhil Teletia, Jignesh M. Patel:
SigMatch: Fast and Scalable Multi-Pattern Matching. 1173-1184 - Shijie Zhang, Jiong Yang, Wei Jin:
SAPPER: Subgraph Indexing and Approximate Matching in Large Graphs. 1185-1194
Indexing Techniques
- Yinan Li, Bingsheng He
, Jun Yang, Qiong Luo
, Ke Yi:
Tree Indexing on Solid State Drives. 1195-1206 - Sai Wu, Dawei Jiang, Beng Chin Ooi, Kun-Lung Wu:
Efficient B-tree Based Indexing for Cloud Data Processing. 1207-1218 - Jiannan Wang, Guoliang Li, Jianhua Feng:
Trie-Join: Efficient Trie-based String Similarity Joins with Edit-Distance Constraints. 1219-1230
Query Processing III
- Mehdi Sharifzadeh, Cyrus Shahabi:
VoR-Tree: R-trees with Voronoi Diagrams for Efficient Processing of Spatial Nearest Neighbor Queries. 1231-1242 - Deepak Padmanabhan, Prasad Deshpande:
Efficient RkNN Retrieval with Arbitrary Non-Metric Similarity Measures. 1243-1254 - Shiming Zhang, Nikos Mamoulis, Ben Kao, David Wai-Lok Cheung:
Efficient Skyline Evaluation over Partially Ordered Domains. 1255-1266
Streaming and Sensor Data
- Mingzhu Wei, Elke A. Rundensteiner, Murali Mani:
Achieving High Output Quality under Limited Resources through Structure-based Spilling in XML Streams. 1267-1278 - Svilen R. Mihaylov, Marie Jacob, Zachary G. Ives, Sudipto Guha:
Dynamic Join Optimization in Multi-Hop Wireless Sensor Networks. 1279-1290 - Mert Akdere, Ugur Çetintemel, Eli Upfal
:
Database-support for Continuous Prediction Queries over Streaming Data. 1291-1301 - Thanh T. L. Tran, Andrew McGregor, Yanlei Diao, Liping Peng, Anna Liu:
Conditioning and Aggregating Uncertain Data Streams: Going Beyond Expectations. 1302-1313
Information Integration and Retrieval