ACM SIGMOD Conference 2014: Snowbird, UT, USA
- Curtis E. Dyreson, Feifei Li, M. Tamer Özsu:
International Conference on Management of Data, SIGMOD 2014, Snowbird, UT, USA, June 22-27, 2014. ACM 2014, ISBN 978-1-4503-2376-5
Keynote 1
- Eric Sedlar:
How I learned to stop worrying and love compilers. 1-2
Research session 1: transaction processing
- Gene Pang, Tim Kraska, Michael J. Franklin, Alan D. Fekete:
PLANET: making progress with commit processing in unpredictable environments. 3-14
- Jose M. Faleiro, Alexander Thomson, Daniel J. Abadi:
Lazy evaluation of transactions in database systems. 15-26
- Peter Bailis, Alan D. Fekete, Joseph M. Hellerstein, Ali Ghodsi, Ion Stoica:
Scalable atomic visibility with RAMP transactions. 27-38
- Khai Q. Tran, Jeffrey F. Naughton, Bruhathi Sundarmurthy, Dimitris Tsirogiannis:
JECB: a join-extension, code-based approach to OLTP data partitioning. 39-50
Research session 2: social networks 1
- Siyuan Liu, Shuhui Wang, Feida Zhu, Jinbo Zhang, Ramayya Krishnan:
HYDRA: large-scale social identity linkage via heterogeneous behavior modeling. 51-62
- Kaiyu Feng, Gao Cong, Sourav S. Bhowmick, Shuai Ma:
In search of influential event organizers in online social networks. 63-74
- Youze Tang, Xiaokui Xiao, Yanchen Shi:
Influence maximization: near-optimal time complexity meets practical efficiency. 75-86
- Guoliang Li, Shuo Chen, Jianhua Feng, Kian-Lee Tan, Wen-Syan Li:
Efficient location-aware influence maximization. 87-98
Research session 3: spatial data
- Jieming Shi, Nikos Mamoulis, Dingming Wu, David W. Cheung:
Density-based place clustering in geo-social networks. 99-110
- Cheng Long, Raymond Chi-Wing Wong, Bin Zhang, Min Xie:
Hypersphere dominance: an optimal approach. 111-122
- Zitong Chen, Yubao Liu, Raymond Chi-Wing Wong, Jiamin Xiong, Ganglin Mai, Cheng Long:
Efficient algorithms for optimal location queries in road networks. 123-134
- Di Chen, Christian Konrad, Ke Yi, Wei Yu, Qin Zhang:
Robust set reconciliation. 135-146
Industry session 1: real-time/complex data analytics
- Ankit Toshniwal, Siddarth Taneja, Amit Shukla, Karthikeyan Ramasamy, Jignesh M. Patel, Sanjeev Kulkarni, Jason Jackson, Krishna Gade, Maosong Fu, Jake Donham, Nikunj Bhagat, Sailesh Mittal, Dmitriy V. Ryaboy:
Storm@twitter. 147-156
- Fangjin Yang, Eric Tschetter, Xavier Léauté, Nelson Ray, Gian Merlino, Deep Ganguli:
Druid: a real-time analytical data store. 157-168
- Sheng Huang, Yaoliang Chen, Xiaoyan Chen, Kai Liu, Xiaomin Xu, Chen Wang, Kevin Brown, Inge Halilovic:
The next generation operational data historian for IoT based on informix. 169-176
- Rebecca Taft, Manasi Vartak, Nadathur Rajagopalan Satish, Narayanan Sundaram, Samuel Madden, Michael Stonebraker:
GenBase: a complex analytics genomics benchmark. 177-188
Tutorial 1
- Anastasia Ailamaki, Erietta Liarou, Pinar Tözün, Danica Porobic, Iraklis Psaroudakis:
How to stop under-utilization and love multicores. 189-192
Research session 4: streams and complex event processing
- Yasuko Matsubara, Yasushi Sakurai, Christos Faloutsos:
AutoPlait: automatic mining of co-evolving time sequences. 193-204
- Yoshitaka Yamamoto, Koji Iwanuma, Shoshi Fukuda:
Resource-oriented approximation for frequent itemset mining from bursty data streams. 205-216
- Haopeng Zhang, Yanlei Diao, Neil Immerman:
On complexity and optimization of expensive queries in complex event processing. 217-228
- Yingmei Qi, Lei Cao, Medhabi Ray, Elke A. Rundensteiner:
Complex event analytics: online aggregation of stream sequence patterns. 229-240
Research session 5: data analytics
- Arijit Khan, Pouya Yanki, Bojana Dimcheva, Donald Kossmann:
Towards indexing functions: answering scalar product queries. 241-252
- Milos Nikolic, Mohammed Elseidy, Christoph Koch:
LINVIEW: incremental view maintenance for complex analytical queries. 253-264
- Ce Zhang, Arun Kumar, Christopher Ré:
Materialization optimizations for feature selection workloads. 265-276
- Kai Zeng, Shi Gao, Barzan Mozafari, Carlo Zaniolo:
The analytical bootstrap: a new method for fast error estimation in approximate query processing. 277-288
Research session 6: graph and RDF data processing
- Sairam Gurajada, Stephan Seufert, Iris Miliaraki, Martin Theobald:
TriAD: a distributed shared-nothing RDF engine based on asynchronous message passing. 289-300
- Wenfei Fan, Xin Wang, Yinghui Wu:
Querying big graphs within bounded resources. 301-312
- Lei Zou, Ruizhe Huang, Haixun Wang, Jeffrey Xu Yu, Wenqiang He, Dongyan Zhao:
Natural language question answering over RDF: a graph data driven approach. 313-324
- Mitsuru Kusumoto, Takanori Maehara, Ken-ichi Kawarabayashi:
Scalable similarity search for SimRank. 325-336
Industry session 2: query optimization
- Mohamed A. Soliman, Lyublena Antova, Venkatesh Raghavan, Amr El-Helw, Zhongxian Gu, Entong Shen, George C. Caragea, Carlos Garcia-Alvarado, Foyzur Rahman, Michalis Petropoulos, Florian Waas, Sivaramakrishnan Narayanan, Konstantinos Krikellas, Rhonda Baldwin:
Orca: a modular query optimizer architecture for big data. 337-348
- Pedram Ghodsnia, Ivan T. Bowman, Anisoara Nica:
Parallel I/O aware query optimization. 349-360
- Guido Moerkotte, David DeHaan, Norman May, Anisoara Nica, Alexander Böhm:
Exploiting ordered dictionaries to efficiently construct histograms with q-error guarantees in SAP HANA. 361-372
- Lyublena Antova, Amr El-Helw, Mohamed A. Soliman, Zhongxian Gu, Michalis Petropoulos, Florian Waas:
Optimizing queries over partitioned tables in MPP systems. 373-384
Research session 7: multidimensional data
- Spyros Blanas, Kesheng Wu, Surendra Byna, Bin Dong, Arie Shoshani:
Parallel data analysis directly on scientific file formats. 385-396
- Tilmann Zäschke, Christoph Zimmerli, Moira C. Norrie:
The PH-tree: a space-efficient storage structure and multi-dimensional index. 397-408
- Jennie Duggan, Michael Stonebraker:
Incremental elasticity for array databases. 409-420
- Jie Xu, Dmitri V. Kalashnikov, Sharad Mehrotra:
Efficient summarization framework for multi-attribute uncertain data. 421-432
Research session 8: data cleaning
- Ravali Pochampally, Anish Das Sarma, Xin Luna Dong, Alexandra Meliou, Divesh Srivastava:
Fusing data with correlations. 433-444
- Anup Chalamalla, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti:
Descriptive and prescriptive data cleaning. 445-456
- Jiannan Wang, Nan Tang:
Towards dependable data repairing with fixing rules. 457-468
- Jiannan Wang, Sanjay Krishnan, Michael J. Franklin, Ken Goldberg, Tim Kraska, Tova Milo:
A sample-and-clean framework for fast and accurate query processing on dirty data. 469-480
Research session 9: data exploration
- Sameer Agarwal, Henry Milner, Ariel Kleiner, Ameet Talwalkar, Michael I. Jordan, Samuel Madden, Barzan Mozafari, Ion Stoica:
Knowing when you're wrong: building fast and reliable approximate query processing systems. 481-492
- Yanyan Shen, Kaushik Chakrabarti, Surajit Chaudhuri, Bolin Ding, Lev Novik:
Discovering queries based on example tuples. 493-504
- Alexander Kalinin, Ugur Çetintemel, Stanley B. Zdonik:
Interactive data exploration using semantic windows. 505-516
- Kyriaki Dimitriadou, Olga Papaemmanouil, Yanlei Diao:
Explore-by-example: an automatic query steering framework for interactive data exploration. 517-528
Industry session 3: storage management
- Woon-Hak Kang, Sang-Won Lee, Bongki Moon, Yang-Suk Kee, Moonwook Oh:
Durable write cache in flash memory SSD for relational and NoSQL databases. 529-540
- Aakash Goel, Bhuwan Chopra, Ciprian Gerea, Dhruv Mátáni, Josh Metzler, Fahim Ul Haq, Janet L. Wiener:
Fast database restarts at facebook. 541-549
- Khaled Elmeleegy, Christopher Olston, Benjamin C. Reed:
SpongeFiles: mitigating data skew in mapreduce using distributed memory. 551-562
- Richard Michael Grantham Wesley, Pawel Terlecki:
Leveraging compression in the tableau data engine. 563-573
Keynote 2
- Maurice Herlihy:
Fun with hardware transactional memory. 575
Research session 10: crowdsourcing
- Hyunjung Park, Jennifer Widom:
CrowdFill: collecting structured data from the crowd. 577-588
- Yael Amsterdamer, Susan B. Davidson, Tova Milo, Slava Novgorodov, Amit Somech:
OASSIS: query driven crowd mining. 589-600
- Chaitanya Gokhale, Sanjib Das, AnHai Doan, Jeffrey F. Naughton, Narasimhan Rampalli, Jude W. Shavlik, Xiaojin Zhu:
Corleone: hands-off crowdsourcing for entity matching. 601-612
Research session 11: parallel graph processing
- Yingxia Shao, Lei Chen, Bin Cui:
Efficient cohesive subgraphs detection in parallel. 613-624
- Yingxia Shao, Bin Cui, Lei Chen, Lin Ma, Junjie Yao, Ning Xu:
Parallel subgraph listing in a large-scale graph. 625-636
- Jinha Kim, Wook-Shin Han, Sangyeon Lee, Kyungyeol Park, Hwanjo Yu:
OPT: a new framework for overlapped and parallel triangulation in large-scale graphs. 637-648
Research session 12: potpourri
- Yang Chen, Daisy Zhe Wang:
Knowledge expansion over probabilistic knowledge bases. 649-660
- Dongqing Xiao, Mohamed Y. Eltabakh:
InsightNotes: summary-based annotation management in relational databases. 661-672
- Dong Deng, Guoliang Li, Jianhua Feng:
A pivotal prefix based filtering algorithm for string similarity search. 673-684
Demo A
- Astrid Rheinländer, Martin Beckmann, Anja Kunkel, Arvid Heise, Thomas Stoltmann, Ulf Leser:
Versatile optimization of UDF-heavy data flows with sofa. 685-688
- Tim Kiefer, Thomas Kissinger, Benjamin Schlegel, Dirk Habich, Daniel Molka, Wolfgang Lehner:
ERIS live: a NUMA-aware in-memory storage engine for tera-scale multiprocessor systems. 689-692
- Tomas Karnagel, Matthias Hille, Mario Ludwig, Dirk Habich, Wolfgang Lehner, Max Heimel, Volker Markl:
Demonstrating efficient query processing in heterogeneous environments. 693-696
- Tobias Mühlbauer, Wolf Rödiger, Robert Seilbeck, Angelika Reiser, Alfons Kemper, Thomas Neumann:
One DBMS for all: the brawny few and the wimpy crowd. 697-700
- Alkis Simitsis, Kevin Wilkinson, Jason Blais, Joe Walsh:
VQA: vertica query analyzer. 701-704
- Fei Chen, Tere Gonzalez, Jun Li, Manish Marwah, Jim Pruyne, Krishnamurthy Viswanathan, Mijung Kim:
Palette: enabling scalable analytics for big-memory, multicore machines. 705-708
- Fei Li, H. V. Jagadish:
NaLIR: an interactive natural language interface for querying relational databases. 709-712
- Petar Jovanovic, Alkis Simitsis, Kevin Wilkinson:
BabbleFlow: a translator for analytic data flow programs. 713-716
- Justin J. Levandoski, David B. Lomet, Sudipta Sengupta, Adrian Birka, Cristian Diaconu:
Indexing on modern hardware: hekaton and beyond. 717-720
- Chen Jason Zhang, Ziyuan Zhao, Lei Chen, H. V. Jagadish, Caleb Chen Cao:
CrowdMatcher: crowd-assisted schema matching. 721-724
Tutorial 2
- Zoi Kaoudi, Ioana Manolescu:
Cloud-based RDF data management. 725-729
Research session 13: data management over modern hardware
- Badrish Chandramouli, Jonathan Goldstein:
Patience is a virtue: revisiting merge and sort on modern processors. 731-742
- Viktor Leis, Peter A. Boncz, Alfons Kemper, Thomas Neumann:
Morsel-driven parallelism: a NUMA-aware query evaluation framework for the many-core age. 743-754
- Orestis Polychroniou, Kenneth A. Ross:
A comprehensive study of main-memory partitioning and its application to large-scale comparison- and radix-sort. 755-766
- Oliver Arnold, Sebastian Haas, Gerhard P. Fettweis, Benjamin Schlegel, Thomas Kissinger, Wolfgang Lehner:
An application-specific instruction set for accelerating set-oriented database primitives. 767-778
Research session 14: non-traditional data
- Arash Termehchy, Ali Vakilian, Yodsawalai Chodpathumwan, Marianne Winslett:
Which concepts are worth extracting? 779-790
- Curtis E. Dyreson, Sourav S. Bhowmick, Ryan Grapp:
Querying virtual hierarchies using virtual prefix-based numbers. 791-802
- Sumit Gulwani, Mark Marron:
NLyze: interactive programming by natural language for spreadsheet data analysis and manipulation. 803-814
- Daniel Tahara, Thaddeus Diamond, Daniel J. Abadi:
Sinew: a SQL system for multi-structured data. 815-826
Research session 15: mapreduce processing
- Lu Qin, Jeffrey Xu Yu, Lijun Chang, Hong Cheng, Chengqi Zhang, Xuemin Lin:
Scalable big graph processing in MapReduce. 827-838
- Alper Okcan, Mirek Riedewald:
Anti-combining for MapReduce. 839-850
- Jeff LeFevre, Jagan Sankaranarayanan, Hakan Hacigümüs, Jun'ichi Tatemura, Neoklis Polyzotis, Michael J. Carey:
Opportunistic physical design for big data analytics. 851-862
- Roy Levin, Yaron Kanza:
Stratified-sampling over social networks using mapreduce. 863-874
Demo B
- Daniel Halperin, Victor Teixeira de Almeida, Lee Lee Choo, Shumo Chu, Paraschos Koutris, Dominik Moritz, Jennifer Ortiz, Vaspol Ruamviboonsuk, Jingjing Wang, Andrew Whitaker, Shengliang Xu, Magdalena Balazinska, Bill Howe, Dan Suciu:
Demonstration of the Myria big data management service. 881-884
- Aditya G. Parameswaran, Ming Han Teh, Hector Garcia-Molina, Jennifer Widom:
DataSift: a crowd-powered search toolkit. 885-888
- Iraklis Psaroudakis, Manos Athanassoulis, Matthaios Olma, Anastasia Ailamaki:
Reactive and proactive sharing across concurrent analytical queries. 889-892
- Shengqi Yang, Yanan Xie, Yinghui Wu, Tianyi Wu, Huan Sun, Jian Wu, Xifeng Yan:
SLQ: a user-friendly graph querying system. 893-896
- Louai Alarabi, Ahmed Eldawy, Rami Alghamdi, Mohamed F. Mokbel:
TAREEG: a MapReduce-based web service for extracting spatial data from OpenStreetMap. 897-900
- Davide Mottin, Matteo Lissandrini, Yannis Velegrakis, Themis Palpanas:
Searching with XQ: the exemplar query search engine. 901-904
- Mehdi Kargar, Aijun An, Nick Cercone, Parke Godfrey, Jaroslaw Szlichta, Xiaohui Yu:
MeanKS: meaningful keyword search in relational databases with complex schema. 905-908
- Nikolaos Papailiou, Dimitrios Tsoumakos, Ioannis Konstantinou, Panagiotis Karras, Nectarios Koziris:
H2RDF+: an efficient data management system for big RDF graphs. 909-912
- Carsten Binnig, Abdallah Salama, Erfan Zamanian:
DoomDB: kill the query. 913-916
Panel
- Bill Howe, Michael J. Franklin, Juliana Freire, James Frew, Tim Kraska, Raghu Ramakrishnan:
Should we all be teaching "intro to data science" instead of "intro to databases"? 917-918
Research session 16: distributed and parallel data management
- Theodoros Rekatsinas, Xin Luna Dong, Divesh Srivastava:
Characterizing and selecting fresh data sources. 919-930
- Alvin Cheung, Samuel Madden, Armando Solar-Lezama:
Sloth: being lazy is a virtue (when issuing database queries). 931-942
- Konstantinos Karanasos, Andrey Balmin, Marcel Kutsch, Fatma Ozcan, Vuk Ercegovac, Chunyang Xia, Jesse Jackson:
Dynamically optimizing queries over large scale data platforms. 943-954
- PengCheng Xiong, Hakan Hacigümüs, Jeffrey F. Naughton:
A software-defined networking based approach for performance management of analytical queries on distributed data stores. 955-966
Research session 17: graph analytics
- Panos Parchas, Francesco Gullo, Dimitris Papadias, Francesco Bonchi:
The pursuit of a good possible world: extracting representative instances of uncertain graphs. 967-978
- Nadathur Satish, Narayanan Sundaram, Md. Mostofa Ali Patwary, Jiwon Seo, Jongsoo Park, Muhammad Amber Hassaan, Shubho Sengupta, Zhaoming Yin, Pradeep Dubey:
Navigating the maze of graph analytics frameworks using massive graph datasets. 979-990
- Wanyun Cui, Yanghua Xiao, Haixun Wang, Wei Wang:
Local search of communities in large graphs. 991-1002
- Akhil Arora, Mayank Sachan, Arnab Bhattacharya:
Mining statistically significant connected subgraphs in vertex labeled graphs. 1003-1014
Research session 18: query processing and optimization 1
- Ioana Ileana, Bogdan Cautis, Alin Deutsch, Yannis Katsis:
Complete yet practical search for minimal query reformulations under constraints. 1015-1026
- James Cheney, Sam Lindley, Philip Wadler:
Query shredding: efficient relational evaluation of queries over nested multisets. 1027-1038
- Anshuman Dutt, Jayant R. Haritsa:
Plan bouquets: query processing without selectivity estimation. 1039-1050
- Fei Li, Tianyin Pan, H. V. Jagadish:
Schema-free SQL. 1051-1062
Demo C
- You Wu, Brett Walenz, Peggy Li, Andrew Shim, Emre Sonmez, Pankaj K. Agarwal, Chengkai Li, Jun Yang, Cong Yu:
iCheck: computationally combating "lies, d-ned lies, and statistics". 1063-1066
- Kai Zeng, Shi Gao, Jiaqi Gu, Barzan Mozafari, Carlo Zaniolo:
ABS: a system for scalable approximate queries with accuracy guarantees. 1067-1070
- Ahmed K. Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Si Yin:
NADEEF/ER: generic and interactive entity resolution. 1071-1074
- Alex Cheng, Mary Malit, Chuanxi Zhang, Nick Koudas:
SerpentTI: flexible analytics of users, boards and domains for pinterest. 1075-1078
- Esther Galbrun, Pauli Miettinen:
Interactive redescription mining. 1079-1082
- Carlos Garcia-Alvarado, Carlos Ordonez:
ONTOCUBO: cube-based ontology construction and exploration. 1083-1086
- Tobias Emrich, Maximilian Franzke, Hans-Peter Kriegel, Johannes Niedermayer, Matthias Renz, Andreas Züfle:
An extendable framework for managing uncertain spatio-temporal data. 1087-1090
- Fangbo Tao, George Brova, Jiawei Han, Heng Ji, Chi Wang, Brandon Norick, Ahmed El-Kishky, Jialu Liu, Xiang Ren, Yizhou Sun:
NewsNetExplorer: automatic construction and exploration of news information networks. 1091-1094
- Davide Mottin, Alice Marascu, Senjuti Basu Roy, Gautam Das, Themis Palpanas, Yannis Velegrakis:
IQR: an interactive query relaxation system for the empty-answer problem. 1095-1098
- Shiming Zhang, Yin Yang, Wei Fan, Liang Lan, Mingxuan Yuan:
OceanRT: real-time analytics over large temporal data. 1099-1102
Research session 19: storage and indexing
- Ioannis Alagiannis, Stratos Idreos, Anastasia Ailamaki:
H2O: a hands-free adaptive store. 1103-1114