default search action
31st ICDE 2015: Seoul, South Korea
- Johannes Gehrke, Wolfgang Lehner, Kyuseok Shim, Sang Kyun Cha, Guy M. Lohman:
31st IEEE International Conference on Data Engineering, ICDE 2015, Seoul, South Korea, April 13-17, 2015. IEEE Computer Society 2015, ISBN 978-1-4799-7964-6
Keynotes
- Surajit Chaudhuri:
Information at your Fingertips: Only a dream for enterprises? 1-4 - Hector Garcia-Molina:
Data crowdsourcing: Is it for real? 5
Research Session 1: Data Integration
- Chen Jason Zhang, Lei Chen, Yongxin Tong, Zheng Liu:
Cleaning uncertain data with a noisy crowd. 6-17 - Matteo Interlandi, Nan Tang:
Proof positive and negative in data cleaning. 18-29 - Jianmin Wang, Shaoxu Song, Xuemin Lin, Xiaochen Zhu, Jian Pei:
Cleaning structured event logs: A graph repair approach. 30-41 - El Kindi Rezig, Eduard C. Dragut, Mourad Ouzzani, Ahmed K. Elmagarmid:
Query-time record linkage and fusion over Web databases. 42-53
Research Session 2: Data Privacy and Security 1
- Yazhe Wang, Baihua Zheng:
Preserving privacy in social networks against connection fingerprint attacks. 54-65 - An Liu, Kai Zheng, Lu Li, Guanfeng Liu, Lei Zhao, Xiaofang Zhou:
Efficient secure similarity computation on encrypted trajectory data. 66-77 - Erman Pattuk, Murat Kantarcioglu, Huseyin Ulusoy, Bradley A. Malin:
Privacy-aware dynamic feature selection. 78-88 - Xian Li, Xin Luna Dong, Kenneth B. Lyons, Weiyi Meng, Divesh Srivastava:
Scaling up copy detection. 89-100
Research Session 3: Distributed Storage and Processing
- Ling Gu, Minqi Zhou, Zhenjie Zhang, Ming-Chien Shan, Aoying Zhou, Marianne Winslett:
Chronos: An elastic parallel framework for stream benchmark generation and simulation. 101-112 - Sai Wu, Gang Chen, Xianke Zhou, Zhenjie Zhang, Anthony K. H. Tung, Marianne Winslett:
PABIRS: A data access middleware for distributed file systems. 113-124 - Akon Dey, Alan D. Fekete, Uwe Röhm:
Scalable distributed transactions across heterogeneous stores. 125-136 - Muhammad Anis Uddin Nasir, Gianmarco De Francisci Morales, David García-Soriano, Nicolas Kourtellis, Marco Serafini:
The power of both choices: Practical load balancing for distributed stream processing engines. 137-148
Research Session 4: Graph and Timeseries Mining
- Julian Shun, Kanat Tangwongsan:
Multicore triangle computations without tuning. 149-160 - Yang Cao, Wenfei Fan, Jinpeng Huai, Ruizhe Huang:
Making pattern queries bounded in big graphs. 161-172 - Ge Luo, Ke Yi, Siu-Wing Cheng, Zhenguo Li, Wei Fan, Cheng He, Yadong Mu:
Piecewise linear approximation of streaming time series data with max-error guarantees. 173-184 - Charu C. Aggarwal, Philip S. Yu:
On historical diagnosis of sensor streams. 185-194
Research Session 5: Web Search and Crowdsourcing
- Manas Joglekar, Hector Garcia-Molina, Aditya G. Parameswaran:
Comprehensive and reliable crowd assessment algorithms. 195-206 - Chuanfei Xu, Bo Tang, Man Lung Yiu:
Diversified caching for replicated web search engines. 207-218 - Vasilis Verroios, Hector Garcia-Molina:
Entity Resolution with crowd errors. 219-230 - Thanh Tam Nguyen, Nguyen Quoc Viet Hung, Matthias Weidlich, Karl Aberer:
Result selection and summarization for Web Table search. 231-242
Research Session 6: Top-k and Pattern Mining
- Arko Provo Mukherjee, Pan Xu, Srikanta Tirthapura:
Mining maximal cliques from an uncertain graph. 243-254 - Lisi Chen, Gao Cong, Xin Cao, Kian-Lee Tan:
Temporal Spatial-Keyword Top-k publish/subscribe. 255-266 - Jinling Jiang, Hua Lu, Bin Yang, Bin Cui:
Finding top-k local users in geo-tagged social media data. 267-278 - Lei Chen, Xin Lin, Haibo Hu, Christian S. Jensen, Jianliang Xu:
Answering why-not questions on spatial keyword top-k queries. 279-290
Research Session 7: Query Processing 1
- Ziqiang Feng, Eric Lo:
Accelerating aggregation using intra-cycle parallelism. 291-302 - Yongming Luo, George H. L. Fletcher, Jan Hidders, Paul De Bra:
Efficient and scalable trie-based algorithms for computing set containment relations. 303-314 - Renata Borovica-Gajic, Stratos Idreos, Anastasia Ailamaki, Marcin Zukowski, Campbell Fraser:
Smooth Scan: Statistics-oblivious access paths. 315-326 - Hina A. Khan, Mohamed A. Sharaf:
Progressive diversification for column-based data exploration platforms. 327-338
Research Session 8: Graph Query Processing and Systems
- Zhe Fan, Byron Choi, Jianliang Xu, Sourav S. Bhowmick:
Asymmetric structure-preserving subgraph queries for large graphs. 339-350 - Guanfeng Liu, Kai Zheng, Yan Wang, Mehmet A. Orgun, An Liu, Lei Zhao, Xiaofang Zhou:
Multi-Constrained Graph Pattern Matching in large-scale contextual social graphs. 351-362 - Peter Macko, Virendra J. Marathe, Daniel W. Margo, Margo I. Seltzer:
LLAMA: Efficient graph analytics using Large Multiversioned Arrays. 363-374 - Xiaocheng Huang, Zhuowei Bao, Susan B. Davidson, Tova Milo, Xiaojie Yuan:
Answering regular path queries on workflow provenance. 375-386
Research Session 9: Keyword and Top-K Processing
- Long Yuan, Lu Qin, Xuemin Lin, Lijun Chang, Wenjie Zhang:
Diversified top-k clique search. 387-398 - Pericles de Oliveira, Altigran S. da Silva, Edleno Silva de Moura:
Ranking Candidate Networks of relations to improve keyword search over relational databases. 399-410 - Mehdi Kargar, Aijun An, Nick Cercone, Parke Godfrey, Jaroslaw Szlichta, Xiaohui Yu:
Meaningful keyword search in relational databases with large and complex schema. 411-422 - Kai Zheng, Han Su, Bolong Zheng, Shuo Shang, Jiajie Xu, Jiajun Liu, Xiaofang Zhou:
Interactive Top-k Spatial Keyword queries. 423-434
Research Session 10: Query Processing 2
- Arvind Arasu, Ken Eguro, Manas Joglekar, Raghav Kaushik, Donald Kossmann, Ravi Ramamurthy:
Transaction processing on confidential data using cipherbase. 435-446 - Lukas Kircher, Michael Grossniklaus, Christian Grün, Marc H. Scholl:
Efficient structural bulk updates on the Pre/Dist/Size XML encoding. 447-458 - Michael Shekelyan, Gregor Jossé, Matthias Schubert:
Linear path skylines in multicriteria networks. 459-470 - Martin Kaufmann, Peter M. Fischer, Norman May, Chang Ge, Anil K. Goel, Donald Kossmann:
Bi-temporal Timeline Index: A data structure for Processing Queries on bi-temporal data. 471-482
Research Session 11: Strings and Texts
- Rong Zhang, Zhenjie Zhang, Xiaofeng He, Aoying Zhou:
Dish comment summarization based on bilateral topic analysis. 483-494 - Wen Hua, Zhongyuan Wang, Haixun Wang, Kai Zheng, Xiaofang Zhou:
Short text understanding through lexical-semantic analysis. 495-506 - Georgia Koutrika, Lei Liu, Steven J. Simske:
Generating reading orders over document collections. 507-518 - Jin Wang, Guoliang Li, Dong Deng, Yong Zhang, Jianhua Feng:
Two birds with one stone: An efficient hierarchical framework for top-k and threshold-based string similarity search. 519-530
Research Session 12: Recommender Systems
- Saad Aljubayrin, Jianzhong Qi, Christian S. Jensen, Rui Zhang, Zhen He, Zeyi Wen:
The safest path via safe zones. 531-542 - Jian Dai, Bin Yang, Chenjuan Guo, Zhiming Ding:
Personalized route recommendation using big trajectory data. 543-554 - Tanmoy Chakraborty, Natwar Modani, Ramasuri Narayanam, Seema Nagar:
DiSCern: A diversified citation recommendation system for scientific queries. 555-566 - Tuan-Anh Nguyen Pham, Xutao Li, Gao Cong, Zhenjie Zhang:
A general graph-based model for recommendation in event-based social networks. 567-578
Research Session 13: Similarity Search and Join
- Yuhong Li, Leong Hou U, Man Lung Yiu, Zhiguo Gong:
Quick-motif: An efficient and scalable framework for exact motif discovery. 579-590 - Lu Chen, Yunjun Gao, Xinhan Li, Christian S. Jensen, Gang Chen:
Efficient metric indexing for similarity search. 591-602 - Takanori Maehara, Mitsuru Kusumoto, Ken-ichi Kawarabayashi:
Scalable SimRank join algorithm. 603-614 - Jun Hou, Richi Nayak:
Robust clustering of multi-type relational data via a heterogeneous manifold ensemble. 615-626
Research Session 14: Social Network
- Norases Vesdapunt, Hector Garcia-Molina:
Identifying users in social networks with limited information. 627-638 - Yuchen Li, Zhifeng Bao, Guoliang Li, Kian-Lee Tan:
Real time personalized search on social networks. 639-650 - Ke Wu, Song Yang, Kenny Q. Zhu:
False rumors detection on Sina Weibo by propagation structures. 651-662 - Emre Sefer, Carl Kingsford:
Convex Risk Minimization to Infer Networks from probabilistic diffusion data at multiple scales. 663-674
Research Session 15: Spatial Query Processing
- Kaiqi Zhao, Gao Cong, Quan Yuan, Kenny Q. Zhu:
SAR: A sentiment-aspect-region model for user preference analysis in geo-tagged reviews. 675-686 - Kenneth Fuglsang Christensen, Lasse Linnerup Christiansen, Torben Bach Pedersen, Jeppe Pihl:
Searchlight: Context-aware predictive Continuous Querying of moving objects in symbolic space. 687-698 - Dong-Wan Choi, Chin-Wan Chung:
Nearest neighborhood search in spatial databases. 699-710 - Huiqi Hu, Yiqun Liu, Guoliang Li, Jianhua Feng, Kian-Lee Tan:
A location-aware publish/subscribe framework for parameterized spatio-textual subscriptions. 711-722
Research Session 16: Stream
- Yingjun Wu, Kian-Lee Tan:
ChronoStream: Elastic stateful stream computation in the cloud. 723-734 - Jieying She, Yongxin Tong, Lei Chen, Caleb Chen Cao:
Conflict-aware event-participant arrangement. 735-746 - Zheng Li, Tingjian Ge:
PIE: Approximate interleaving event matching over sequences. 747-758 - Chunyao Song, Tingjian Ge:
Window-chained longest common subsequence: Common event matching in sequences. 759-770
Research Session 17: RDF-Processing
- François Goasdoué, Zoi Kaoudi, Ioana Manolescu, Jorge-Arnulfo Quiané-Ruiz, Stamatis Zampetakis:
CliqueSquare: Flat plans for massively parallel RDF queries. 771-782 - Mingzhu Wei, Elke A. Rundensteiner, Murali Mani:
INSURE: An integrated load reduction framework for XML stream processing. 783-794 - Buwen Wu, Yongluan Zhou, Pingpeng Yuan, Ling Liu, Hai Jin:
Scalable SPARQL querying using path partitioning. 795-806 - Günes Aluç, M. Tamer Özsu, Khuzaima Daudjee, Olaf Hartig:
Executing queries over schemaless RDF databases. 807-818
Research Session 18: System-level Techniques
- Mohammadreza Najafi, Mohammad Sadoghi, Hans-Arno Jacobsen:
Configurable hardware-based streaming architecture using Online Programmable-Blocks. 819-830 - Wenqing Lin, Xiaokui Xiao, Xing Xie, Xiaoli Li:
Network motif discovery: A GPU approach. 831-842 - Majed Sahli, Essam Mansour, Tariq Alturkestani, Panos Kalnis:
Automatic tuning of bag-of-tasks applications. 843-854 - Arian Bär, Lukasz Golab, Stefan Ruehrup, Mirko Schiavone, Pedro Casas:
Cache-oblivious scheduling of shared workloads. 855-866
Research Session 19: Foundations of Query Processing
- Alexander Shkapsky, Mohan Yang, Carlo Zaniolo:
Optimizing recursive queries with monotonic aggregates in DeALS. 867-878 - Lukasz Golab, Flip Korn, Feng Li, Barna Saha, Divesh Srivastava:
Size-Constrained Weighted Set Cover. 879-890 - Xiang Ao, Ping Luo, Chengkai Li, Fuzhen Zhuang, Qing He:
Online Frequent Episode Mining. 891-902 - Marius Eich, Guido Moerkotte:
Dynamic programming: The next step. 903-914
Research Session 20: Graph Sampling and Matching
- Yubao Wu, Ruoming Jin, Xiaofeng Zhu, Xiang Zhang:
Finding dense and connected subgraphs in dual networks. 915-926 - Rong-Hua Li, Jeffrey Xu Yu, Lu Qin, Rui Mao, Tan Jin:
On random walk based graph sampling. 927-938 - Junzhou Zhao, John C. S. Lui, Don Towsley, Pinghui Wang, Xiaohong Guan:
A tale of three graphs: Sampling design on hybrid social-affiliation networks. 939-950 - Arlei Silva, Petko Bogdanov, Ambuj K. Singh:
Hierarchical in-network attribute compression via importance sampling. 951-962
Research Session 21: Trajectories
- Han Su, Kai Zheng, Kai Zeng, Jiamin Huang, Shazia Wasim Sadiq, Nicholas Jing Yuan, Xiaofang Zhou:
Making sense of trajectory data: A partition-and-summarization approach. 963-974 - Bolong Zheng, Nicholas Jing Yuan, Kai Zheng, Xing Xie, Shazia Wasim Sadiq, Xiaofang Zhou:
Approximate keyword search in semantic trajectory database. 975-986 - Jiajun Liu, Kun Zhao, Philipp Sommer, Shuo Shang, Brano Kusy, Raja Jurdak:
Bounded Quadrant System: Error-bounded trajectory compression on the go. 987-998 - Sayan Ranu, Deepak P, Aditya D. Telang, Prasad Deshpande, Sriram Raghavan:
Indexing and matching trajectories under inconsistent sampling rates. 999-1010
Research Session 22: Data Privacy and Security 2
- Jianneng Cao, Fang-Yu Rao, Elisa Bertino, Murat Kantarcioglu:
A hybrid private record linkage scheme: Separating differentially private synopses from matching records. 1011-1022 - Zach Jorgensen, Ting Yu, Graham Cormode:
Conservative or liberal? Personalized differential privacy. 1023-1034 - Shengzhi Xu, Sen Su, Xiang Cheng, Zhengyi Li, Li Xiong:
Differentially private frequent sequence mining via sampling-based candidate pruning. 1035-1046
Research Session 23: MapReduce
- Inah Jeon, Evangelos E. Papalexakis, U Kang, Christos Faloutsos:
HaTen2: Billion-scale tensor decompositions. 1047-1058 - Liping Peng, Vuk Ercegovac, Kai Zeng, Peter J. Haas, Andrey Balmin, Yannis Sismanis:
Groupwise analytics via adaptive MapReduce. 1059-1070 - Gregory Buehrer, Roberto L. de Oliveira Jr., David Fuhry, Srinivasan Parthasarathy:
Towards a parameter-free and parallel itemset mining algorithm in linearithmic time. 1071-1082
Research Session 24: Query Processing 3
- Sean Chester, Darius Sidlauskas, Ira Assent, Kenneth S. Bøgh:
Scalable parallelization of skyline computation for multi-core processors. 1083-1094 - Daniel Schall, Theo Härder:
Dynamic physiological partitioning on a shared-nothing database cluster. 1095-1106 - Xiang Wang, Ying Zhang, Wenjie Zhang, Xuemin Lin, Wei Wang:
AP-Tree: Efficiently support continuous spatial-keyword queries over stream. 1107-1118 - Yagiz Kargin, Martin L. Kersten, Stefan Manegold, Holger Pirk:
The DBMS - your big data sommelier. 1119-1130
Research Session 25: Graph-based Data Management
- Jiefeng Cheng, Qin Liu, Zhenguo Li, Wei Fan, John C. S. Lui, Cheng He:
VENUS: Vertex-centric streamlined graph computation on a single PC. 1131-1142 - Wenlei Xie, Yuanyuan Tian, Yannis Sismanis, Andrey Balmin, Peter J. Haas:
Dynamic interaction graphs with probabilistic edge decay. 1143-1154 - Aristides Gionis, Michael Mathioudakis, Antti Ukkonen:
Bump hunting in the dark: Local discrepancy maximization on graphs. 1155-1166
Research Session 26: Cloud Computing
- Sudip Roy, Arnd Christian König, Igor Dvorkin, Manish Kumar:
PerfAugur: Robust diagnostics for performance anomalies in cloud services. 1167-1178 - Quan Pham, Tanu Malik, Boris Glavic, Ian T. Foster:
LDV: Light-weight database virtualization. 1179-1190 - Sebastian Schelter, Juan Soto, Volker Markl, Douglas Burdick, Berthold Reinwald, Alexandre V. Evfimievski:
Efficient sample generation for scalable meta learning. 1191-1202
Research Session 27: Indexing
- David B. Lomet, Faisal Nawab:
High performance temporal indexing on modern hardware. 1203-1214 - Abdeltawab M. Hendawi, Jie Bao, Mohamed F. Mokbel, Mohamed H. Ali:
Predictive tree: An efficient index for predictive queries on road networks. 1215-1226 - Victor Alvarez, Stefan Richter, Xiao Chen, Jens Dittrich:
A comparison of adaptive radix trees and hash tables. 1227-1238
Industry Session 1: Main Memory Databases
- Per-Åke Larson, Eric N. Hanson, Mike Zwilling:
Evolving the architecture of SQL Server for modern hardware trends. 1239-1245 - Ronald Barber, Guy M. Lohman, Vijayshankar Raman, Richard Sidle, Sam Lightstone, Berni Schiefer:
In-memory BLU acceleration in IBM's DB2 and dashDB: Optimized for modern workloads and hardware architectures. 1246-1252 - Tirthankar Lahiri, Shasank Chavan, Maria Colgan, Dinesh Das, Amit Ganesh, Mike Gleeson, Sanket Hase, Allison Holloway, Jesse Kamp, Teck-Hua Lee, Juan Loaiza, Neil MacNaughton, Vineet Marwah, Niloy Mukherjee, Atrayee Mullick, Sujatha Muthulingam, Vivekanandhan Raja, Marty Roth, Ekrem Soylemez, Mohamed Zaït:
Oracle Database In-Memory: A dual format in-memory database. 1253-1258 - Franz Faerber, Jonathan Dees, Martin Weidner, Stefan Bäuerle, Wolfgang Lehner:
Towards a web-scale data management ecosystem demonstrated by SAP HANA. 1259-1267
Industry Session 2: Main Memory and Stream Databases
- Hao Zhang, Gang Chen, Beng Chin Ooi, Weng-Fai Wong, Shensen Wu, Yubin Xia:
"Anti-Caching"-based elastic memory management for Big Data. 1268-1279 - Robert Brunel, Jan Finis, Gerald Franz, Norman May, Alfons Kemper, Thomas Neumann, Franz Färber:
Supporting hierarchical data in SAP HANA. 1280-1291 - Akihiro Yamaguchi, Yukikazu Nakamoto, Kenya Sato, Yoshiharu Ishikawa, Yousuke Watanabe, Shinya Honda, Hiroaki Takada:
AEDSMS: Automotive Embedded Data Stream Management System. 1292-1303
Industry Session 3: Big Data
- Aditi Pandit, Derrick Kondo, David E. Simmen, Anjali Norwood, Tongxin Bai:
Accelerating Big Data analytics with Collaborative Planning in Teradata Aster 6. 1304-1315 - Douglas Lee Schales, Xin Hu, Jiyong Jang, Reiner Sailer, Marc Ph. Stoecklin, Ting Wang:
FCCE: Highly scalable distributed Feature Collection and Correlation Engine for low latency big data analytics. 1316-1327