


BigData Congress 2015: New York City, NY, USA
- Barbara Carminati, Latifur Khan:

2015 IEEE International Congress on Big Data, New York City, NY, USA, June 27 - July 2, 2015. IEEE Computer Society 2015, ISBN 978-1-4673-7278-7
Research Track
Research Session 1: Mining I
- N. Denizcan Vanli, Muhammed O. Sayin, Ibrahim Delibalta, Suleyman Serdar Kozat:
A Scalable Approach for Online Hierarchical Big Data Mining. 1-8
- Aris-Kyriakos Koliopoulos, Paraskevas Yiapanis, Firat Tekiner, Goran Nenadic, John A. Keane:
A Parallel Distributed Weka Framework for Big Data Mining Using Spark. 9-16
- José I. Rodrigues, Mauro J. G. Figueiredo, Ivo Silvestre, Cristina Veiga-Pires:
Geometrical and Topological Modelling: A Fast Computation of Spatial 3D TLS Data Selections. 17-24
Research Session 2: Mining II
- Elena Baralis, Luca Cagliero, Paolo Garza, Luigi Grimaudo:
PaWI: Parallel Weighted Itemset Mining by Means of MapReduce. 25-32
- Mulugeta Mammo, Srividya K. Bansal:
Distributed SPARQL over Big RDF Data: A Comparative Analysis Using Presto and MapReduce. 33-40
- Bo Yan, Yitian Ren, Zijiang Yang:
A GPU Based SVM Method with Accelerated Kernel Matrix Calculation. 41-46
Research Session 3: Big Data and Social Network
- Paolo Suppa, Eugenio Zimeo:
A Clustered Approach for Fast Computation of Betweenness Centrality in Social Networks. 47-54
- Stefano Faralli, Giovanni Stilo, Paola Velardi:
A Semantic Recommender for Micro-blog Users. 55-62
- Youliang Zhong, Jian Yang, Robertus Nugroho:
Incorporating Tie Strength in Robust Social Recommendation. 63-70
Research Session 4: Big Data and Social Network
- Robertus Nugroho, Youliang Zhong, Jian Yang, Cécile Paris, Surya Nepal:
Matrix Inter-joint Factorization - A New Approach for Topic Derivation in Twitter. 79-86
- Robertus Nugroho, Jian Yang, Youliang Zhong, Cécile Paris, Surya Nepal:
Deriving Topics in Twitter by Exploiting Tweet Interactions. 87-94
Research Session 5: Privacy
- Jingquan Li, Xueying Li:
Privacy Preserving Data Analysis in Mental Health Research. 95-101
- Christian Schaefer, P. M. Manoj:
Enabling Privacy Mechanisms in Apache Storm. 102-109
- Chao Han, Ke Wang:
Sensitive Disclosures under Differential Privacy Guarantees. 110-117
Research Session 6: Big Data and Learning
- Hussein Mohsen, Hasan Kurban, Kurt Zimmer, Mark Jenne, Mehmet M. Dalkiliç:
Red-RF: Reduced Random Forest for Big Data Using Priority Voting & Dynamic Data Reduction. 118-125
- Muhammed O. Sayin, N. Denizcan Vanli, Ibrahim Delibalta, Suleyman Serdar Kozat:
Optimal and Efficient Distributed Online Learning for Big Data. 126-133
- Huaming Chen, Hong Zhao, Jun Shen, Rui Zhou, Qingguo Zhou:
Supervised Machine Learning Model for High Dimensional Gene Data in Colon Cancer Detection. 134-141
Research Session 7: Query
- Phani Rohit Mullangi, Lakshmish Ramaswamy:
CoUPE: Continuous Query Processing Engine for Evolving Graphs. 142-149
- Jianting Zhang, Simin You, Le Gruenwald:
Lightweight Distributed Execution Engine for Large-Scale Spatial Join Query Processing. 150-157
- Duy-Hung Phan, Quang-Nhat Hoang-Xuan, Matteo Dell'Amico, Pietro Michiardi:
Efficient and Self-Balanced ROLLUP Aggregates for Large-Scale Data Summarization. 158-165
Research Session 8: Big Data Processing
- Parijat Shukla, Arun K. Somani:
Tree Matching Using Data Shaping. 166-173
- Yehia Elshater, Patrick Martin, Dan Rope, Mike McRoberts, Craig Statchuk:
A Study of Data Locality in YARN. 174-181
- Wilson A. Higashino, Miriam A. M. Capretz, Luiz F. Bittencourt:
CEPSim: A Simulator for Cloud-Based Complex Event Processing. 182-190
Research Session 9: Big Data Quality
- Ikbal Taleb, Rachida Dssouli, Mohamed Adel Serhani:
Big Data Pre-processing: A Quality Framework. 191-198
- Marianela García Lozano, Ulrik Franke, Magnus Rosell, Vladimir Vlassov:
Towards Automatic Veracity Assessment of Open Source Information. 199-206
- Daniel Joseph, Nikolay Mehandjiev, Babis Theodoulidis, John Davies, Ian Thurlow:
Identifying Relevant Formal Concepts through the Collapse Index. 207-214
Research Session 10: Big Data Platform/Framework
- Hong Liu, Ashwin Kumar T. K, Johnson P. Thomas:
Cleaning Framework for Big Data - Object Identification and Linkage. 215-221
- Chaochao Zhou, Saurabh Kumar Garg:
Performance Analysis of Scheduling Algorithms for Dynamic Workflow Applications. 222-229
- Rong Zhang, Yuanchao Shu, Zequ Yang, Peng Cheng, Jiming Chen:
Hybrid Traffic Speed Modeling and Prediction Using Real-World Data. 230-237
Research Session 11: Big Data Semantics
- Artem Chebotko, Andrey Kashlev, Shiyong Lu:
A Big Data Modeling Methodology for Apache Cassandra. 238-245
- Avrilia Floratou, Jignesh M. Patel:
Replica Placement in Multi-tenant Database Environments. 246-253
- Mustafa V. Nural, Michael E. Cotterell, John A. Miller:
Using Semantics in Predictive Big Data Analytics. 254-261
Research Session 12: Analysis on Big Data Research and Platforms
- Alan L. Porter, Ying Huang, Jannik Schuehle, Jan L. Youtie:
Meta Data: Big Data Research Evolving across Disciplines, Players, and Topics. 262-267
- Pedro Daniel Coimbra de Almeida, Jorge Bernardino:
Big Data Open Source Platforms. 268-275
Data Science Special Track
DS Session 1
- T. H. A. S. Siriweera, Incheon Paik, Banage T. G. S. Kumara, Koswatte R. C. Koswatta:
Intelligent Big Data Analysis Architecture Based on Automatic Service Composition. 276-280
DS Session 2
- Mohammed Aledhari, Fahad Saeed:
Design and Implementation of Network Transfer Protocol for Big Genomic Data. 281-288
- Ana Cristina Oliveira, Christof Fetzer, André Martin, Marco Spohn:
Optimizing Query Prices for Data-as-a-Service. 289-296
- Alp Oral, Bedir Tekinerdogan:
Supporting Performance Isolation in Software as a Service Systems with Rich Clients. 297-304
Big Data Research in Healthcare Special Track
BDRH Session 1
- Benjamin Yip, Hoyee W. Hirai, Yong-Hong Kuo, Helen M. Meng, Samuel Y. S. Wong, Kelvin Kam-fai Tsoi:
Blood Pressure Management with Data Capturing in the Cloud among Hypertensive Patients: A Monitoring Platform for Hypertensive Patients. 305-308
- Kin Fai Ho, Hoyee W. Hirai, Yong-Hong Kuo, Helen M. Meng, Kelvin Kam-fai Tsoi:
Indoor Air Monitoring Platform and Personal Health Reporting System: Big Data Analytics for Public Health Research. 309-312
- Yong-Hong Kuo, Janny M. Y. Leung, Kelvin Kam-fai Tsoi, Helen M. Meng, Colin A. Graham:
Embracing Big Data for Simulation Modelling of Emergency Department Processes and Activities. 313-316
BDRH Session 2
- Xin Lai, Liu Liu, Paul B. S. Lai, Kelvin Kam-fai Tsoi, Haitian Wang, Ka Chun Chong, Benny Zee:
Risk-Adjusted Monitoring Method for Surgical Data: Methodology for Data Analytics (Work in Progress). 317-319
- Marc Chong, Maggie Haitian Wang, Xin Lai, Benny Zee, Fung Hong, Ek Yeoh, Eliza Lai-Yi Wong, Carrie Yam, Patsy Chau, Kelvin Kam-fai Tsoi, Colin A. Graham:
Patient Flow Evaluation with System Dynamic Model in an Emergency Department: Data Analytics on Daily Hospital Records. 320-323
- Maggie Haitian Wang, Kelvin Kam-fai Tsoi, Xin Lai, Marc Chong, Benny Zee, Tian Zheng, Shaw-Hwa Lo, Inchi Hu:
Two Screening Methods for Genetic Association Study with Application to Psoriasis Microarray Data Sets. 324-326
Shenzhen Satellite Track
Shenzhen Satellite Session 1
- Chao Ma, Yinda Wang, Haowen Liu, Hao Gui, Weiping Zhu, Xiaochuan Shi, Xuhui Li:
An Approach to Social Relationship Ranking on Internet-Based Social Platforms by Tempo-spatial Data Mining Using Location Prediction Technique. 327-334
- Xiaolu Zhu, Jinglin Li, Zhihan Liu, Fangchun Yang:
Optimization Approach to Depot Location in Car Sharing Systems with Big Data. 335-342
- Dingsheng Wan, Yan Xiao, Pengcheng Zhang, Hareton Leung:
Hydrological Big Data Prediction Based on Similarity Search and Improved BP Neural Network. 343-350
- Liqin Yang, Guosheng Kang, Weigang Cai, Qiang Zhou:
An Effective Process Mining Approach against Diverse Logs Based on Case Classification. 351-358
Taipei Satellite Track
Taipei Satellite Session 1
- Victor W. Chu, Raymond K. Wong, Fang Chen, Chi-Hung Chi:
Web Service Recommendations Based on Time-Aware Bayesian Networks. 359-366
- Wei-Feng Tung, Guillaume Jordann:
Crowdsourcing Service Design for Social Enterprise Insight Innovation. 367-373
- Chuen-Min Huang, Cheng-Yi Wu:
Effects of Word Assignment in LDA for News Topic Discovery. 374-380
- Chieh-Hsin Liao, Yu-Heng Lei, Kai-Yu Liou, Jian-Shing Lin, Hsiao-Feng Yeh:
Using Big Data for Profiling Heavy Users in Top Video Apps. 381-385
- Chi-Ou Chen, Ye-Qi Zhuo, Chao-Chun Yeh, Che-Min Lin, Shih-Wei Liao:
Machine Learning-Based Configuration Parameter Tuning on Hadoop System. 386-392
- Yen-Hui Liang, Shiow-Yang Wu:
Sequence-Growth: A Scalable and Effective Frequent Itemset Mining Algorithm for Big Data Based on MapReduce Framework. 393-400
Application Track
Applications Session 1: Big Data and Health
- Brian Xu, Sathish Alampalayam Kumar:
Big Data Analytics Framework for System Health Monitoring. 401-408
- Muhammad Kamran Lodhi, Rashid Ansari, Yingwei Yao, Gail M. Keenan, Diana J. Wilkie, Ashfaq A. Khokhar:
Predictive Modeling for Comfortable Death Outcome Using Electronic Health Records. 409-415
Applications Session 2: Big Data and Network Management
- Hongyan Cui, Yuchen Zhang, Chenhang Ma, Wei Lai, Norman C. Beaulieu, Stanislav Sobolevsky, Yunjie Liu:
Design and Realization of Cognitive Routing Resources Using Big Data Analysis in SDN. 424-429
- MingXue Wang, Robin Grindrod, Jimmy O'Meara, Mikel Zuzuarregui, Eloy Martinez, Enda Fallon:
Enterprise Search with Development for Network Management System. 430-437
- José R. Ortiz-Ubarri, Humberto Ortiz-Zuazaga, Albert Maldonado, Eric Santos, Jhensen Grullon:
Toa: A Web Based Network Flow Data Monitoring System at Scale. 438-443
Applications Session 3: Distributed Processing
- Jeyhun Karimov, A. Murat Ozbayoglu, Erdogan Dogdu:
k-Means Performance Improvements with Centroid Calculation Heuristics Both for Serial and Parallel Environments. 444-451
- Daniel Presser, Lau Cheuk Lung, Miguel Correia:
Greft: Arbitrary Fault-Tolerant Distributed Graph Processing. 452-459
Applications Session 4: Social Network
- Bilal Abu-Salih, Pornpit Wongthongtham, Amin Beheshti, Dengya Zhu:
A Preliminary Approach to Domain-Based Evaluation of Users' Trustworthiness in Online Social Networks. 460-466
- Deepak Puthal, Surya Nepal, Cécile Paris, Rajiv Ranjan, Jinjun Chen:
Efficient Algorithms for Social Network Coverage and Reach. 467-474
- Rohit Parimi, Toma Trepka, Doina Caragea, Cody Bennett:
How to Choose a Recommender System: Insights and Experiences for Large-Scale User Personalization. 475-482
Applications Session 5: Social Media
- Fenno F. Terry Heath III, Richard Hull, Elham Khabiri, Matthew Riemer, Noi Sukaviriya, Roman Vaculín:
Alexandria: Extensible Framework for Rapid Exploration of Social Media. 483-490
- Roberto Saia, Ludovico Boratto, Salvatore Carta:
A Latent Semantic Pattern Recognition Strategy for an Untrivial Targeted Advertising. 491-498
Applications Session 6: Image Processing
- Fatema Rashid, Ali Miri, Isaac Woungang:
Proof of Storage for Video Deduplication in the Cloud. 499-505
- Sridhar Vemula, Christopher Crick:
Hadoop Image Processing Framework. 506-513
- Ranga Raju Vatsavai:
A Scalable Complex Pattern Mining Framework for Global Settlement Mapping. 514-521
Applications Session 7: Big Data Application
- Hyejung Moon, Jangho Park, Sung-Kyung Kim:
Study on Corporate Governance of Stock Market in Korea: Network Analysis with Relationship of Major Shareholders. 522-525
- John Klein, Ian Gorton, Neil A. Ernst, Patrick Donohoe, Kim Pham, Chrisjan Matser:
Application-Specific Evaluation of NoSQL Databases. 526-534
- Dennis Wei, Kush R. Varshney, Marcy Wagman:
Optigrow: People Analytics for Job Transfers. 535-542
Applications Session 8: Optimization
- Andrea Acquaviva, Daniele Apiletti, Antonio Attanasio, Elena Baralis, Lorenzo Bottaccioli, Federico Boni Castagnetti, Tania Cerquitelli, Silvia Chiusano, Enrico Macii, Dario Martellacci, Edoardo Patti:
Energy Signature Analysis: Knowledge at Your Fingertips. 543-550
- Kai-Fung Hong, Chien-Chih Chen, Yu-Ting Chiu, Kuo-Sen Chou:
Ctracer: Uncover C&C in Advanced Persistent Threats Based on Scalable Framework for Enterprise Log Data. 551-558
- Unekwu Idachaba, Frank Wang:
A Community-Based Cloud Computing Caching Service. 559-566
Applications Session 9: Evaluation
- Vinod Hegde, Milovan Krnjajic, Alexei Pozdnoukhov:
Unsupervised Event Detection with Infinite Poisson Mixture Model. 567-575
- Apostolos Papageorgiou, Bin Cheng, Ernö Kovacs:
Reconstructability-Aware Filtering and Forwarding of Time Series Data in Internet-of-Things Architectures. 576-583
- João Ricardo Lourenço, Veronika Abramova, Bruno Cabral, Jorge Bernardino, Paulo Carreiro, Marco Vieira:
NoSQL in Practice: A Write-Heavy Enterprise Application. 584-591
Applications Session 10: Big Data Framework
- Bin Cheng, Salvatore Longo, Flavio Cirillo, Martin Bauer, Ernö Kovacs:
Building a Big Data Platform for Smart Cities: Experience and Lessons from Santander. 592-599
- Stanislav Sobolevsky, Iva Bojic, Alexander Belyi, Izabela Sitko, Bartosz Hawelka, Juan Murillo Arias, Carlo Ratti:
Scaling of City Attractiveness for Foreign Visitors through Big Data of Human Economical and Social Media Activity. 600-607
- R. Bruce Wallace, Rafik A. Goubran, Frank Knoefel, Shawn Marshall, Michelle Porter, Madelaine Harlow, Akshay Puli:
Automation of the Validation, Anonymization, and Augmentation of Big Data from a Multi-year Driving Study. 608-614
Applications Session 11: Big Data Use Cases
- Vladimir Hahanov, Wajeb Gharibi, Eugenia Litvinova, Svetlana Chumachenko:
Big Data Driven Cyber Analytic System. 615-622
- Chien-An Lai, Jim Donahue, Aibek Musaev, Calton Pu:
Nimbus: Tuning Filters Service on Tweet Streams. 623-630
Short Paper Track
Session 1
- Hoi Ting Poon, Ali Miri:
Computation and Search over Encrypted XML Documents. 631-634
- Mohammed Nazim Feroz, Susan A. Mengel:
Phishing URL Detection Using URL Ranking. 635-638
- Yong-Hong Kuo, Janny M. Y. Leung, Helen M. Meng, Kelvin Kam-fai Tsoi:
A Real-Time Decision Support Tool for Disaster Response: A Mathematical Programming Approach. 639-642
Session 2
- Keren Ouaknine, Michael J. Carey, Scott Kirkpatrick:
The PigMix Benchmark on Pig, MapReduce, and HPCC Systems. 643-648
- Yi Shan, Yi Chen:
Scalable Query Optimization for Efficient Data Processing Using MapReduce. 649-652
- U. S. N. Raju, Irlanki Sandeep, Nattam Sai Karthik, Rayapudi Siva Praveen, Mayank Singh Sachan:
Weighted Finite Automata Based Image Compression on Hadoop MapReduce Framework. 653-656
- Sangwhan Cha, Monica Wachowicz:
Developing a Real-Time Data Analytics Framework Using Hadoop. 657-660
- U. S. N. Raju, Shibin George, V. Sairam Praneeth, Ranjeet Deo, Priyanka Jain:
Content Based Image Retrieval on Hadoop Framework. 661-664
Session 3
- Longzhuang Li, Douglas Boulware:
High-Order Tensor Decomposition for Large-Scale Data Analysis. 665-668
- Johann A. Bengua, Ho N. Phien, Hoang Duong Tuan:
Optimal Feature Extraction and Classification of Tensors via Matrix Product State Decomposition. 669-672
- Verena Kantere, Maxim Filatov:
A Workflow Model for Adaptive Analytics on Big Data. 673-676
- Muhammad Raza Khan, Joshua Manoj, Anikate Singh, Joshua Blumenstock:
Behavioral Modeling for Churn Prediction: Early Indicators and Accurate Predictors of Custom Defection and Loyalty. 677-680
- Mariusz Kamola:
Analytics of Industrial Operational Data Inspired by Natural Language Processing. 681-684
- N. Denizcan Vanli, Huseyin Ozkan, Ibrahim Delibalta, Suleyman Serdar Kozat:
Online Nonlinear Classification for High-Dimensional Data. 685-688
Session 4
- Yun Tian, Bojian Xu, Yanqing Ji, Jesse Scholer:
Cloud Tree: A Library to Extend Cloud Services for Trees. 689-693
- Soo-Hyong Kim, Yoon-Joon Lee, Jaehwan John Lee:
Matrix-Based XML Stream Processing Using a GPU. 694-697
- Kadjo Kouame, Naser Ezzati-Jivan, Michel R. Dagenais:
A Flexible Data-Driven Approach for Execution Trace Filtering. 698-703
- Weijia Xu, Wei Luo, Nicholas Woodward, Yan Zhang:
Supporting Data Driven Access through Automatic Keyword Extraction and Summarization. 704-707
- Ardi Imawan, Titus Irma Damaiyanti, Joonho Kwon:
Road Traffic Analytic Query Processing Based on a Timeline Modeling. 708-711
- Verena Kantere:
Approximate Queries on Big Heterogeneous Data. 712-715
Session 5
- Abdul Wasay, Manos Athanassoulis, Stratos Idreos:
Queriosity: Automated Data Exploration. 716-719
- Feng Yu, Eric S. Jones, Wen-Chi Hou:
Write Optimization Using Asynchronous Update on Out-of-Core Column-Store Databases in Map-Reduce. 720-723
- Hai Nguyen, Matthew S. Weber:
Internet Archives as a Tool for Research: Decay in Large Scale Archival Records. 724-727
- Armel Jacques Nzekon Nzeko'o, Matthieu Latapy, Maurice Tchuenté:
Social Network Analysis of Developers' and Users' Mailing Lists of Some Free Open Source Software. 728-732
- Purva Pruthi, Anu Yadav, Farheen Abbasi, Durga Toshniwal:
How Has Twitter Changed the Event Discussion Scenario? A Spatio-temporal Diffusion Analysis. 733-736
Session 6
- Kelvin Kam-fai Tsoi, Yong-Hong Kuo, Helen M. Meng:
A Data Capturing Platform in the Cloud for Behavioral Analysis among Smokers: An Application Platform for Public Health Research. 737-740
- Minh-Son Dao, Koji Zettsu:
Discovering Environmental Impacts on Public Health Using Heterogeneous Big Sensory Data. 741-744
- Raghava Rao Mukkamala, Jannie Iskou Sorensen, Abid Hussain, Ravi Vatrapu:
Detecting Corporate Social Media Crises on Facebook Using Social Set Analysis. 745-748
- Victor Lawson, Lakshmish Ramaswamy:
Data Quality and Energy Management Tradeoffs in Sensor Service Clouds. 749-752
- Jih-Jeng Huang:
Two Steps Genetic Programming for Big Data - Perspective of Distributed and High-Dimensional Data. 753-756
Visionary Track
Visionary Session
- Elisa Bertino:
Big Data - Security and Privacy. 757-761
- Philip S. Yu, Jiawei Zhang:
MCD: Mutual Clustering across Multiple Social Networks. 762-771
- Bhavani Thuraisingham:
Database Security: Past, Present, and Future. 772-774
- Hong Zhu, Ian Bayley, Muhammad Younas, David E. Lightfoot, Basel Yousef, Dongmei Liu:
Big SaaS: The Next Step Beyond Big Data. 775-784
- John A. Miller, Lakshmish Ramaswamy, Krys J. Kochut, Arash Fard:
Research Directions for Big Data Graph Analytics. 785-794
