BigData Conference 2016: Washington DC, USA
- James Joshi, George Karypis, Ling Liu, Xiaohua Hu, Ronay Ak, Yinglong Xia, Weijia Xu, Aki-Hiro Sato, Sudarsan Rachuri, Lyle H. Ungar, Philip S. Yu, Rama Govindaraju, Toyotaro Suzumura:
2016 IEEE International Conference on Big Data (IEEE BigData 2016), Washington DC, USA, December 5-8, 2016. IEEE Computer Society 2016, ISBN 978-1-4673-9005-7 - Chaitanya K. Baru:
Harnessing the data revolution: A perspective from the National Science Foundation. 2 - Elisa Bertino:
Big data security and privacy. 3 - Jiawei Han:
On the power of big data: Mining structures from massive, unstructured text data. 4 - Mark Johnson:
Leveraging high performance computing to drive advanced manufacturing R&D at the US Department of Energy. 5-6 - Michael Stonebraker, Dong Deng, Michael L. Brodie:
Database decay and how to avoid it. 7-16 - Christian Böhm, Martin Perdacher, Claudia Plant:
Cache-oblivious loops based on a novel space-filling curve. 17-26 - Jagat Sesh Challa, Poonam Goyal, S. Nikhil, Aditya Mangla, Sundar Balasubramaniam, Navneet Goyal:
DD-Rtree: A dynamic distributed data structure for efficient data distribution among cluster nodes for spatial data mining algorithms. 27-36 - Ravikant Dindokar, Neel Choudhury, Yogesh Simmhan:
A meta-graph approach to analyze subgraph-centric distributed programming models. 37-47 - Subhadeep Karan, Jaroslaw Zola:
Exact structure learning of Bayesian networks by optimal path extension. 48-55 - Walaa Eldin Moustafa, Vicky Papavasileiou, Ken Yocum, Alin Deutsch:
Datalography: Scaling Datalog graph analytics on graph processing systems. 56-65 - Yosuke Oyama, Akihiro Nomura, Ikuro Sato, Hiroki Nishimura, Yukimasa Tamatsu, Satoshi Matsuoka:
Predicting statistics of asynchronous SGD parameters for a large-scale distributed deep learning system on GPU supercomputers. 66-75 - Benjamin Sirb, Xiaojing Ye:
Consensus optimization with delayed and stochastic gradients on decentralized networks. 76-85 - Xiaoli Song, Yan Rui, Xiaohua Hu:
Pairwise topic model and its application to topic transition and evolution. 86-95 - Yuan Yuan, Sihong Xie, Chun-Ta Lu, Jie Tang, Philip S. Yu:
Interpretable and effective opinion spam detection via temporal patterns mining across websites. 96-105 - Fang Zhou, Mohamed F. Ghalwash, Zoran Obradovic:
A fast structured regression for large networks. 106-115 - Adiska Fardani Haryadi, Joris Hulstijn, Agung Wahyudi, Haiko Van Der Voort, Marijn Janssen:
Antecedents of big data quality: An empirical examination in financial service organizations. 116-121 - Joseph Jupin, Justin Y. Shi, Eduard C. Dragut:
PSH: A probabilistic signature hash method with hash neighborhood candidate generation for fast edit-distance string comparison on big data. 122-127 - Rocco Langone, Johan A. K. Suykens:
Efficient multiple scale kernel classifiers. 128-133 - Joaquim F. Silva, Carlos Gonçalves, José C. Cunha:
A theoretical model for n-gram distribution in big data corpora. 134-141 - Jonathan Stokes, Steven Weber:
The self-avoiding walk-jump (SAWJ) algorithm for finding maximum degree nodes in large graphs. 142-149 - Xiaoli Song, Xiaotong Wang, Xiaohua Hu:
Semantic pattern mining for text mining. 150-155 - Kenji Yamanishi, Kohei Miyaguchi:
Detecting gradual changes from data stream using MDL-change statistics. 156-163 - Rongda Zhu, Aston Zhang, Jian Peng, Chengxiang Zhai:
Exploiting temporal divergence of topic distributions for event detection. 164-171 - Timo Bingmann, Michael Axtmann, Emanuel Jöbstl, Sebastian Lamm, Huyen Chau Nguyen, Alexander Noe, Sebastian Schlag, Matthias Stumpp, Tobias Sturm, Peter Sanders:
Thrill: High-performance algorithmic distributed batch data processing with C++. 172-183 - Liuhua Chen, Haiying Shen:
Towards resource-efficient cloud systems: Avoiding over-provisioning in demand-prediction based resource provisioning. 184-193 - Katerina Doka, Nikolaos Papailiou, Victor Giannakouris, Dimitrios Tsoumakos, Nectarios Koziris:
Mix 'n' match multi-engine analytics. 194-203 - Alex Gittens, Aditya Devarakonda, Evan Racah, Michael F. Ringenburg, Lisa Gerhardt, Jey Kottalam, Jialin Liu, Kristyn J. Maschhoff, Shane Canon, Jatin Chhugani, Pramod Sharma, Jiyan Yang, James Demmel, Jim Harrell, Venkat Krishnamurthy, Michael W. Mahoney, Prabhat:
Matrix factorizations at scale: A comparison of scientific data analytics in Spark and C+MPI using three case studies. 204-213 - Yin Huang, Yelena Yesha, Milton Halem, Yaacov Yesha, Shujia Zhou:
YinMem: A distributed parallel indexed in-memory computation system for large scale data analytics. 214-222 - Nusrat Sharmin Islam, Md. Wasi-ur-Rahman, Xiaoyi Lu, Dhabaleswar K. Panda:
Efficient data access strategies for Hadoop and Spark on HPC cluster with heterogeneous storage. 223-232 - Zhuozhao Li, Haiying Shen, Jeffrey Denton, Walter B. Ligon III:
Comparing application performance on HPC-based Hadoop platforms with local storage and dedicated storage. 233-242 - Jinwei Liu, Haiying Shen, Husnu S. Narman:
CCRP: Customized cooperative resource provisioning for high resource utilization in clouds. 243-252 - Xiaoyi Lu, Dipti Shankar, Shashank Gugnani, Dhabaleswar K. Panda:
High-performance design of Apache Spark with RDMA and its benefits on various workloads. 253-262 - Tomoki Yoshihisa, Takahiro Hara:
A low-load stream processing scheme for IoT environments. 263-272 - Yuan Yuan, Meisam Fathi Salmi, Yin Huai, Kaibo Wang, Rubao Lee, Xiaodong Zhang:
Spark-GPU: An accelerated in-memory data processing engine on clusters. 273-283 - Angen Zheng, Alexandros Labrinidis, Panos K. Chrysanthis, Jack Lange:
Argo: Architecture-aware graph partitioning. 284-293 - Kareem S. Aggour, Bülent Yener:
Adapting to data sparsity for efficient parallel PARAFAC tensor decomposition in Hadoop. 294-301 - Yadu N. Babuji, Kyle Chard, Aaron Gerow, Eamon Duede:
Cloud Kotta: Enabling secure and scalable data analytics in the cloud. 302-310 - Chunkun Bo, Ke Wang, Jeffrey J. Fox, Kevin Skadron:
Entity resolution acceleration using the automata processor. 311-318 - Kyle Chard, Mike D'Arcy, Benjamin D. Heavner, Ian T. Foster, Carl Kesselman, Ravi K. Madduri, Alexis A. Rodriguez, Stian Soiland-Reyes, Carole A. Goble, Kristi Clark, Eric W. Deutsch, Ivo D. Dinov, Nathan D. Price, Arthur W. Toga:
I'll take that to go: Big data bags and minimal identifiers for exchange of large, complex datasets. 319-328 - Chun-Chieh Chen, Chih-Ya Shen, Ming-Syan Chen:
Massive parallelism for non-linear and non-stationary data analysis with GPGPU. 329-334 - Stratos Dimopoulos, Chandra Krintz, Rich Wolski:
Big data framework interference in restricted private cloud settings. 335-340 - Khoa D. Doan, Amidu O. Oloso, Kwo-Sen Kuo, Thomas L. Clune, Hongfeng Yu, Brian Nelson, Jian Zhang:
Evaluating the impact of data placement to Spark and SciDB with an Earth Science use case. 341-346 - Saliya Ekanayake, Supun Kamburugamuve, Pulasthi Wickramasinghe, Geoffrey C. Fox:
Java thread and process performance for parallel machine learning on multicore HPC clusters. 347-354 - Gheorghi Guzun, Josiah C. McClurg, Guadalupe Canahuate, Raghuraman Mudumbai:
Power efficient big data analytics algorithms through low-level operations. 355-361 - Satoshi Imamura, Keitaro Oka, Yuichiro Yasui, Yuichi Inadomi, Katsuki Fujisawa, Toshio Endo, Koji Ueno, Keiichiro Fukazawa, Nozomi Hata, Yuta Kakibuka, Koji Inoue, Takatsugu Ono:
Evaluating the impacts of code-level performance tunings on power efficiency. 362-369 - Fan Jiang, Claris Castillo, Charles Schmitt:
RADU: Bridging the divide between data and infrastructure management to support data-driven collaborations. 370-377 - Jinfeng Li, James Cheng, Yunjian Zhao, Fan Yang, Yuzhen Huang, Haipeng Chen, Ruihao Zhao:
A comparison of general-purpose distributed systems for data processing. 378-383 - Jinwei Liu, Haiying Shen:
A popularity-aware cost-effective replication scheme for high data durability in cloud storage. 384-389 - Luis Pineda-Morales, Ji Liu, Alexandru Costan, Esther Pacitti, Gabriel Antoniu, Patrick Valduriez, Marta Mattoso:
Managing hot metadata for scientific workflows on multisite clouds. 390-397 - Hitoshi Sato, Ryo Mizote, Satoshi Matsuoka, Hirotaka Ogawa:
I/O chunking and latency hiding approach for out-of-core sorting acceleration using GPU and flash NVM. 398-403 - Dipti Shankar, Xiaoyi Lu, Dhabaleswar K. Panda:
Boldio: A hybrid and resilient burst-buffer over Lustre for accelerating big data I/O. 404-409 - Christoforos Svingos, Theofilos Mailis, Herald Kllapi, Lefteris Stamatogiannakis, Yannis Kotidis, Yannis E. Ioannidis:
Real time processing of streaming and static information. 410-415 - Hans Vandierendonck, Karen L. Murphy, Mahwish Arif, Dimitrios S. Nikolopoulos:
HPTA: High-performance text analytics. 416-423 - Jorge Veiga, Roberto R. Expósito, Xoán C. Pardo, Guillermo L. Taboada, Juan Touriño:
Performance evaluation of big data frameworks for large-scale data analytics. 424-431 - Yali Zhao, Rodrigo N. Calheiros, James Bailey, Richard O. Sinnott:
SLA-based profit optimization for resource management of big data analytics-as-a-service platforms in cloud computing environments. 432-441 - Kaiji Chen, Yongluan Zhou:
Materialized view selection in feed following systems. 442-451 - Victor Giannakouris, Nikolaos Papailiou, Dimitrios Tsoumakos, Nectarios Koziris:
MuSQLE: Distributed SQL query execution over multiple engine environments. 452-461 - Ahsanul Haque, Zhuoyi Wang, Swarup Chandra, Yupeng Gao, Latifur Khan, Charu C. Aggarwal:
Sampling-based distributed kernel mean matching using Spark. 462-471 - Yudian Ji, Yuda Zang, Wuman Luo, Xibo Zhou, Ye Ding, Lionel M. Ni:
Clockwise compression for trajectory data under road network constraints. 472-481 - Karuna P. Joshi, Aditi Gupta, Sudip Mittal, Claudia Pearce, Anupam Joshi, Tim Finin:
Semantic approach to automating management of big data privacy policies. 482-491 - Eleazar Leal, Le Gruenwald, Jianting Zhang:
Handling uncertainty in trajectories of moving objects in unconstrained outdoor spaces. 492-501 - Cuong M. Nguyen, Philip J. Rhodes:
Accelerating range queries for large-scale unstructured meshes. 502-511 - Md. Shiblee Sadik, Le Gruenwald, Eleazar Leal:
In pursuit of outliers in multi-dimensional data streams. 512-521 - Jianpeng Xu, Jiayu Zhou, Pang-Ning Tan, Xi Liu, Lifeng Luo:
WISDOM: Weighted incremental spatio-temporal multi-task learning via tensor decomposition. 522-531 - Farrukh Ahmed, Michele Samorani, Colin Bellinger, Osmar R. Zaïane:
Advantage of integration in big data: Feature generation in multi-relational databases for imbalanced learning. 532-539 - Matthew Edwards, Stephen Wattam, Paul Rayson, Awais Rashid:
Sampling labelled profile data for identity resolution. 540-547 - Frank Pallas, Johannes Günther, David Bermbach:
Pick your choice in HBase: Security or performance. 548-554 - Rui Ren, Zhen Jia, Lei Wang, Jianfeng Zhan, Tianxu Yi:
BDTUne: Hierarchical correlation-based performance analysis and rule-based diagnosis for big data systems. 555-562 - Ramyar Saeedi, Hassan Ghasemzadeh, Assefaw Hadish Gebremedhin:
Transfer learning algorithms for autonomous reconfiguration of wearable systems. 563-569 - Mei Saouk, Christos Doulkeridis, Akrivi Vlachou, Kjetil Nørvåg:
Efficient processing of top-k joins in MapReduce. 570-577 - Ting Wu, Chen Jason Zhang, Lei Chen, Pan Hui, Siyuan Liu:
Object identification with Pay-As-You-Go crowdsourcing. 578-585 - Nesreen K. Ahmed, Theodore L. Willke, Ryan A. Rossi:
Estimation of local subgraph counts. 586-595 - Christian Beecks, Alexander Graß:
Multi-step threshold algorithm for efficient feature-based query processing in large-scale multimedia databases. 596-605 - Mansurul Alam Bhuiyan, Mohammad Al Hasan:
PRIIME: A generic framework for interactive personalized interesting pattern discovery. 606-615 - Ngot Bui, Thanh Le, Vasant G. Honavar:
Labeling actors in multi-view social networks by integrating information from within and across multiple views. 616-625 - Hariton Efstathiades, Demetris Antoniades, George Pallis, Marios D. Dikaiakos, Zoltán Szlávik, Robert-Jan Sips:
Online social network evolution: Revisiting the Twitter graph. 626-635 - Jianliang Gao, Bo Song, Ping Liu, Weimao Ke, Jianxin Wang, Xiaohua Hu:
Parallel top-k subgraph query in massive graphs: Computing from the perspective of single vertex. 636-645 - Xiaoyu Ge, Yanbing Xue, Zhipeng Luo, Mohamed A. Sharaf, Panos K. Chrysanthis:
REQUEST: A scalable framework for interactive construction of exploratory queries. 646-655 - Chun Guo, Xiaozhong Liu:
Dynamic feature generation and selection on heterogeneous graph for music recommendation. 656-665 - Nguyen Ho, Huy T. Vo, Mai Vu:
An adaptive information-theoretic approach for identifying temporal correlations in big data sets. 666-675 - Chao Huang, Dong Wang, Shenglong Zhu, Daniel Yue Zhang:
Towards unsupervised home location inference from online social media. 676-685 - Wei Jiang, Juan Rodriguez, Torsten Suel:
Improved methods for static index pruning. 686-695 - Wooyeol Kim, Younghoon Kim, Kyuseok Shim:
Parallel computation of k-nearest neighbor joins using MapReduce. 696-705 - Sarasi Lalithsena, Pavan Kapanipathi, Amit P. Sheth:
Harnessing relationships for domain-specific subgraph extraction: A recommendation use case. 706-715 - Panagiotis Liakos, Alexandros Ntoulas, Alex Delis:
Scalable link community detection: A local dispersion-aware approach. 716-725 - Hongfu Liu, Yuchao Zhang, Bo Deng, Yun Fu:
Outlier detection via sampling ensemble. 726-735 - Athanasios N. Nikolakopoulos, Antonia Korba, John D. Garofalakis:
Random surfing on multipartite graphs. 736-745 - Cheong Hee Park, Youngsoon Kang:
An active learning method for data streams with concept drift. 746-752 - Charles Siegel, Jeff Daily, Abhinav Vishnu:
Adaptive neuron apoptosis for accelerating deep learning on large scale systems. 753-762 - Ata Turk, Hao Chen, Anthony Byrne, John Knollmeyer, Sastry S. Duri, Canturk Isci, Ayse K. Coskun:
DeltaSherlock: Identifying changes in the cloud. 763-772 - Xiaokai Wei, Bokai Cao, Weixiang Shao, Chun-Ta Lu, Philip S. Yu:
Community detection with partially observable links and node attributes. 773-782 - Yongyi Xian, Yan Liu, Chuanfei Xu:
Parallel gathering discovery over big trajectory data. 783-792 - Hu Xu, Sihong Xie, Lei Shu, Philip S. Yu:
CER: Complementary entity recognition via knowledge expansion on large unlabeled product reviews. 793-802 - Jingyuan Zhang, Chun-Ta Lu, Mianwei Zhou, Sihong Xie, Yi Chang, Philip S. Yu:
HEER: Heterogeneous graph embedding for emerging relation detection from news. 803-812 - Hao Zhang, Yuanyuan Zhu, Lu Qin, Hong Cheng, Jeffrey Xu Yu:
Efficient triangle listing for billion-scale graphs. 813-822 - Yating Zhang, Adam Jatowt, Katsumi Tanaka:
Towards understanding word embeddings: Automatically explaining similarity of terms. 823-832 - Kai Zhao, Denis Khryashchev, Juliana Freire, Cláudio T. Silva, Huy T. Vo:
Predicting taxi demand at high spatial resolution: Approaching the limit of predictability. 833-842 - Yixian Zheng, Wenchao Wu, Haipeng Zeng, Nan Cao, Huamin Qu, Mingxuan Yuan, Jia Zeng, Lionel M. Ni:
TelcoFlow: Visual exploration of collective behaviors based on telco data. 843-852 - Morteza Zihayat, Zane Zhenhua Hu, Aijun An, Yonggang Hu:
Distributed and parallel high utility sequential pattern mining. 853-862 - Philip K. Chan, Ebad Ahmadzadeh:
Improving efficiency of maximizing spread in the flow authority model for large sparse networks. 863-868 - Wanying Ding, Yue Zhang, Chaomei Chen, Xiaohua Hu:
Semi-supervised Dirichlet-Hawkes process with applications of topic detection and tracking in Twitter. 869-874 - Ioanna Filippidou, Yannis Kotidis:
Effective and efficient graph augmentation in large graphs. 875-880 - Ville Hyvönen, Teemu Pitkänen, Sotiris K. Tasoulis, Elias Jaasaari, Risto Tuomainen, Liang Wang, Jukka Corander, Teemu Roos:
Fast nearest neighbor search through sparse random projections and voting. 881-888 - Saïd Jabbour, Nizar Mhadhbi, Abdesattar Mhadhbi, Badran Raddaoui, Lakhdar Sais:
Summarizing big graphs by means of pseudo-boolean constraints. 889-894 - Uwe Jugel, Zbigniew Jerzak, Volker Markl:
Big data on a few pixels. 895-900 - Mohammad Mahdi Kamani, Farshid Farhat, Stephen Wistar, James Z. Wang:
Shape matching using skeleton context for automated bow echo detection. 901-908 - Weimao Ke, Javed Mostafa:
Scalability analysis of distributed search in large peer-to-peer networks. 909-914 - Nicolas Kourtellis, Gianmarco De Francisci Morales, Albert Bifet, Arinto Murdopo:
VHT: Vertical Hoeffding tree. 915-922 - Yuh-Jye Lee, Hsing-Kuo Pao, Shueh-Han Shih, Jing-Yao Lin, Xin-Rong Chen:
Compressed learning for time series classification. 923-930 - Xiaopeng Li, Ming Cheung, James She:
Connection discovery using shared images by Gaussian relational topic model. 931-936 - Haofu Liao, Yucheng Li, Tianran Hu, Jiebo Luo:
Inferring restaurant styles by mining crowd sourced photos from user-review websites. 937-944 - Chang Liu, Bin Wu, Yi Yang, Zhihong Guo:
Multiple submodels parallel support vector machine on Spark. 945-950 - Xiang Liu, Torsten Suel:
What makes a group fail: Modeling social group behavior in event-based social networks. 951-956