


default search action
BigData Congress 2014: Anchorage, AK, USA
- 2014 IEEE International Congress on Big Data, Anchorage, AK, USA, June 27 - July 2, 2014. IEEE Computer Society 2014, ISBN 978-1-4799-5057-7

BigData Research Session 1 - BigData Analytics Techniques
- Yang Zhou, Sangeetha Seshadri, Lawrence Chiu, Ling Liu:

GraphLens: Mining Enterprise Storage Workloads Using Graph Analytics. 1-8 - Mansurul Bhuiyan, Mohammad Al Hasan:

FSM-H: Frequent Subgraph Mining Algorithm in Hadoop. 9-16 - Jia Wang, Ada Wai-Chee Fu, James Cheng:

Rectangle Counting in Large Bipartite Graphs. 17-24
BigData Research Session 2 - MapReduce Model
- Jin Soung Yoo, Douglas Boulware, David Kimmey:

A Parallel Spatial Co-location Mining Algorithm Based on MapReduce. 25-31 - Lena Mashayekhy, Mahyar Movahed Nejad, Daniel Grosu

, Dajun Lu, Weisong Shi:
Energy-Aware Scheduling of MapReduce Jobs. 32-39 - Huseyin Ulusoy, Murat Kantarcioglu, Erman Pattuk, Kevin W. Hamlen:

Vigiles: Fine-Grained Access Control for MapReduce Systems. 40-47
BigData Research Session 3 - BigData Security
- Jingwei Huang

, David M. Nicol, Roy H. Campbell:
Denial-of-Service Threat to Hadoop/YARN Clusters with Multi-tenancy. 48-55 - Samuel Marchal

, Xiuyan Jiang, Radu State, Thomas Engel:
A Big Data Architecture for Large Scale Security Monitoring. 56-63 - Michael A. Hayes, Miriam A. M. Capretz:

Contextual Anomaly Detection in Big Sensor Data. 64-71
BigData Research Session 4 - BigData Performance Issues
- Jianting Zhang, Simin You, Le Gruenwald:

High-Performance Spatial Query Processing on Big Taxi Trip Data Using GPGPUs. 72-79 - Liping Zhang, Qi Chen, Kai Miao:

A Compatible LZMA ORC-Based Optimization for High Performance Big Data Load. 80-87
BigData Research Session 5 - BigData Analytics
- Angelos Molfetas, Anthony Wirth, Justin Zobel:

Storing a Collection of Differentially Compressed Files Recursively. 88-95 - Carsten Binnig

, Abdallah Salama, Erfan Zamanian, Harald Kornmayer
, Sven Listing, Alexander C. Müller:
XDB - A Novel Database Architecture for Data Analytics as a Service. 96-103 - Peter Ivie, Douglas Thain

:
DeltaDB: A Scalable Database Design for Time-Varying Schema-Free Data. 104-111
BigData Research Session 6 - MapReduce Framework
- Yifan Chen, Xiang Zhao, Bin Ge, Chuan Xiao

, Chi-Hung Chi:
Practising Scalable Graph Similarity Joins in MapReduce. 112-119 - Jessica Hartog, Renan Delvalle, Madhusudhan Govindaraju, Michael J. Lewis:

Configuring a MapReduce Framework for Performance-Heterogeneous Clusters. 120-127 - Johannes Schildgen, Thomas Jörg, Manuel Hoffmann

, Stefan Dessloch:
Marimba: A Framework for Making MapReduce Jobs Incremental. 128-135
BigData Research Session 7 - BigData Services
- Stanislav Sobolevsky

, Izabela Sitko, Remi Tachet des Combes, Bartosz Hawelka, Juan Murillo Arias, Carlo Ratti:
Money on the Move: Big Data of Bank Card Transactions as the New Proxy for Human Mobility Patterns and Regional Delineation. The Case of Residents and Foreign Visitors in Spain. 136-143 - Xu Tan, Yuanchao Shu

, Xie Lu, Peng Cheng, Jiming Chen:
Characterizing and Modeling Package Dynamics in Express Shipping Service Network. 144-151 - Desheng Zhang, Tian He, Shan Lin, Sirajum Munir, John A. Stankovic:

Dmodel: Online Taxicab Demand Model from Big Sensor Data in a Roving Sensor Network. 152-159
BigData Research Session 8 - Distributed BigData Services
- Yanyan Xu, James Cheng, Ada Wai-Chee Fu, Yingyi Bu:

Distributed Maximal Clique Computation. 160-167 - Elif Dede, Bedri Sendir

, Pinar Kuzlu, J. Weachock, Madhusudhan Govindaraju, Lavanya Ramakrishnan:
A Processing Pipeline for Cassandra Datasets Based on Hadoop Streaming. 168-175 - Pedro Martins, Maryam Abbasi

, Pedro Furtado
:
AuDy: Automatic Dynamic Least-Weight Balancing for Stream Workloads Scalability. 176-183
BigData Research Session 9 - BigData Analytics Network
- Mark Thomas, Leigh Metcalf, Jonathan M. Spring, Paul Krystosek, Katherine Prevost:

SiLK: A Tool Suite for Unsampled Network Flow Analysis at Scale. 184-191 - Angelos Molfetas, Anthony Wirth, Justin Zobel:

Using Inter-file Similarity to Improve Intra-file Compression. 192-199
BigData Research Session 10 - BigData Management Model
- Chung-Chih Cheng, Fan-Chieh Cheng, Po-Hsiung Lin, Shih-Chia Huang:

A Cloud-Computing Local Histogram Construction Algorithm for Big Image Data. 200-203 - Tonglin Li, Ioan Raicu, Lavanya Ramakrishnan:

Scalable State Management for Scientific Applications in the Cloud. 204-211 - Eleanna Kafeza

, Andreas Kanavos, Christos Makris
, Pantelis Vikatos:
T-PICE: Twitter Personality Based Influential Communities Extraction System. 212-219 - Verena Kantere:

A Holistic Framework for Big Scientific Data Management. 220-226
BigData Research Session 11 - Mexico City Satellite Session
- Ángel Fernando Kuri Morales:

Data Base Analysis Using a Compact Data Set. 227-233 - Moisés Quezada Naquid, Ricardo Marcelín-Jiménez, José Luis González Compeán

:
The Babel File System. 234-241
BigData Research Session 12 - Coimbra Satellite Session
- Antonio M. Rinaldi

:
Using Multimedia Ontologies for Automatic Image Annotation and Classification. 242-249 - Diogo Anjos, Paulo Carreira

, Alexandre P. Francisco
:
Real-Time Integration of Building Energy Data. 250-257
BigData Research Session 13 - Taipei Satellite Session
- Rafat Hammad

, Ching-Seh Wu:
Provenance as a Service: A Data-centric Approach for Real-Time Monitoring. 258-265 - Ching-Han Chen, Ching-Yi Chen, Chih-Hsien Hsia

, Guan-Xin Wu:
Big Data Collection Gateway for Vision-Based Smart Meter Reading Network. 266-269 - Wei-Ho Tsai, Cin-Hao Ma:

Triangulation-Based Singer Identification for Duet Music Data Indexing. 270-275 - Wei-Ho Tsai, Cin-Hao Ma:

Speech and Singing Discrimination for Audio Data Indexing. 276-280 - Charles Chin-Ho Lin, Liang-Cheng Huang, Seng-cho Timothy Chou, Chih-Ho Liu, Han-Fang Cheng, I-Jen Chiang:

Temporal Event Tracing on Big Healthcare Data Analytics. 281-287 - Zhu Wang, Tiejian Luo, Guandong Xu, Xiang Wang:

The Application of Cartesian-Join of Bloom Filters to Supporting Membership Query of Multidimensional Data. 288-295 - Chunyu Wang

, Tzu-Li Tai, Jui-Shing Shu, Jyh-Biau Chang, Ce-Kuen Shieh:
Federated MapReduce to Transparently Run Applications on Multicluster Environment. 296-303 - Xiao Fu, Zhijian Wang, Hao Wu, Jia-qi Yang, Zizhao Wang:

How to Send a Self-Destructing Email: A Method of Self-Destructing Email System. 304-309 - Xiaoqing Yu, Huanhuan Liu, Jianhua Shi, Jenq-Neng Hwang, Wanggen Wan, Jing Lu:

Association Rule Mining of Personal Hobbies in Social Networks. 310-314
BigData Research Session 14 - Data Processing
- Carson Kai-Sang Leung

, Richard Kyle MacKinnon, Fan Jiang:
Reducing the Search Space for Big Data Mining for Interesting Patterns from Uncertain Data. 315-322 - Eirini C. Micheli, Giorgos Margaritis, Stergios V. Anastasiadis

:
Lethe: Cluster-Based Indexing for Secure Multi-user Search. 323-330 - Yifeng Geng, Xiaomeng Huang, Guangwen Yang:

Adaptive Indexing for Distributed Array Processing. 331-338
BigData Research Session 15 - Shenzhen Satellite Session
- Dingsheng Wan, Yan Xiao, Pengcheng Zhang, Jun Feng

, Yuelong Zhu, Qian Liu:
Hydrological Time Series Anomaly Mining Based on Symbolization and Distance Measure. 339-346 - Liqiang Wang, Shijun Liu, Li Pan, Lei Wu, Xiangxu Meng:

Enterprise Relationship Network: Build Foundation for Social Business. 347-354 - Yan Tang, Yu Wang, Kendra M. L. Cooper, Ling Li:

Towards Big Data Bayesian Network Learning - An Ensemble Learning Based Approach. 355-357 - Xue Bai, Fu Chen, Shaobin Zhan:

A Study on Sentiment Computing and Classification of Sina Weibo with Word2vec. 358-363 - Fu Chen, Shaobin Zhan, Guangjun Shi:

A Study on Trend Prediction in Sina Weibo Community. 364-365 - Changjian Wang, Yuxing Peng, Mingxing Tang, Dongsheng Li, Shanshan Li, Pengfei You:

MapCheckReduce: An Improved MapReduce Computing Model for Imprecise Applications. 366-373 - Saixia Lyu, Jianxun Liu, Mingdong Tang, Guosheng Kang, Buqing Cao, Yucong Duan:

Three-Level Views of the Web Service Network: An Empirical Study Based on ProgrammableWeb. 374-381 - Haisu Zhang, Sheng Zhang, Zhaolin Wu, Liwei Huang, Yutao Ma

:
Predicting Wikipedia Editor's Editing Interest Based on Factor Graph Model. 382-389 - Junming Zhang, Jinglin Li, Shangguang Wang

, Zhihan Liu, Quan Yuan, Fangchun Yang:
On Retrieving Moving Objects Gathering Patterns from Trajectory Data via Spatio-temporal Graph. 390-397
BigData Industry and Application Session 1 - Distributed
- Zhiyun Zheng, Zhimeng Du, Lun Li, Yike Guo

:
BigData Oriented Open Scalable Relational Data Model. 398-405 - Ivan Giangreco

, Ihab Al Kabary, Heiko Schuldt
:
ADAM - A Database and Information Retrieval System for Big Multimedia Collections. 406-413 - Tianjian Chen, Zhengrui Man, Hao Li, Xin Sun, Raymond K. Wong, Zhiwei Yu:

Building a Massive Stream Computing Platform for Flexible Applications. 414-421 - Alan Yu Shyang Tan

, Ryan Kok Leong Ko
, Grace P. Y. Ng:
OpenStack Café: A Novel Time-Based User-centric Resource Management Framework in the Cloud. 422-429 - Arpit Baheti, Durga Toshniwal:

Trend Analysis of Time Series Data Using Data Mining Techniques. 430-437 - Chongke Bi, Kenji Ono, Lu Yang:

Parallel POD Compression of Time-Varying Big Datasets Using m-Swap on the K Computer. 438-445
BigData Industry and Application Session 3 - BigData Management System
- Bo Hu, Yutao Ma

, Liang-Jie Zhang
, Jiake Shi, Jiayan Zhong:
A Key-Value Based Application Platform for Enterprise Big Data. 446-453 - Ahmed Abdeen Hamed, Xindong Wu:

Does Social Media Big Data Make the World Smaller? An Exploratory Analysis of Keyword-Hashtag Networks. 454-461 - Aniruddha Desai, Muaz Mian, David Hazel, Ankur Teredesai, Gregory Benner:

Data Visualization in Educational Datasets Using a Rule-Based Inference System. 462-469
BigData Industry and Application Session 4 - BigData Analytics Method
- Anirudh Thommandram, J. Mikael Eklund, Carolyn McGregor

, James Edward Pugh, Andrew G. James:
A Rule-Based Temporal Analysis Method for Online Health Analytics and Its Application for Real-Time Detection of Neonatal Spells. 470-477 - Shanqing Li, Lirong Song, Hui Zhao:

A Discriminant Framework for Detecting Similar Scientific Research Projects Based on Big Data Mining. 478-481
BigData Industry and Application Session 5 - BigData Analytics Algorithm
- Anil Kumar, Vikas Kapur, Apangshu Saha, Rajeev Kumar Gupta, Arun Singh, Santanu Chaudhury, Sumeet Agarwal

:
Distributed Implementation of Latent Rating Pattern Sharing Based Cross-domain Recommender System Approach. 482-489 - Apostolos Papageorgiou

, Manuel Zahn, Ernö Kovacs
:
Auto-configuration System and Algorithms for Big Data-Enabled Internet-of-Things Platforms. 490-497 - Matthew Saltz, Ayushi Jain, Abhishek Kothari, Arash Fard, John A. Miller

, Lakshmish Ramaswamy:
DualIso: An Algorithm for Subgraph Pattern Matching on Very Large Labeled Graphs. 498-505
BigData Industry and Application Session 6 - BigData Mining
- Philippe Lalanda, Catherine Hamon:

An Autonomic Mediation Framework for Complex Physical Environments. 506-513 - Leila Ismail

, Mohammad M. Masud, Latifur Khan
:
FSBD: A Framework for Scheduling of Big Data Mining in Cloud Computing. 514-521 - Srividya K. Bansal:

Towards a Semantic Extract-Transform-Load (ETL) Framework for Big Data Integration. 522-529
BigData Industry and Application Session 7 - NoSQL
- Julian Krumeich, Sven Jacobi, Dirk Werth, Peter Loos:

Big Data Analytics for Predictive Manufacturing Control - A Case Study from Process Industry. 530-537 - Yi-Cheng Huang, Wenwey Hseush, Yu-Chun Lai, Michael Fong:

BigObject Store: In-Place Computing for Interactive Analytics. 538-545 - Phani Rohit Mullangi, Gowtham Penematsa, Lakshmish Ramaswamy:

Scalable XPath Evaluation on Large-Scale Continuously Evolving XML Repositories. 546-553
BigData Industry and Application Session 8 - Big Spatial
- Wei Zhou, Chi-Hung Chi, Can Wang

, Raymond K. Wong, Chen Ding:
Bridging the Gap between Spatial Data Sources and Mashup Applications. 554-561 - Jorge Alencar, Tibérius O. Bonates, Carlile Lavor

:
A Combinatorial Approach to Multidimensional Scaling. 562-569
BigData Industry and Application Session 9 - Hadoop
- Oscar D. Lara Yejas, Weiqiang Zhuang, Adarsh Pannu:

Big R: Large-Scale Analytics on Hadoop Using R. 570-577 - Jun Fan, Xinhui Li, Chi Harold Liu

, Jeffrey Buell, Gavin Lu, Luke Lu:
Diagnosing Virtualized Hadoop Performance from Benchmark Results: An Exploratory Study. 578-585 - Sungyong Ahn, Sangkyu Park, Jae-Ki Hong, Wooseok Chang:

Performance Implications of SSDs in Virtualized Hadoop Clusters. 586-593
BigData Industry and Application Session 10 - BigData Privacy
- Abdulkareem Alsudais

, Gondy Leroy, Anthony Corso:
We Know Where You Are Tweeting From: Assigning a Type of Place to Tweets Using Natural Language Processing and Random Forests. 594-600 - Jeff Sedayao, Rahul Bhardwaj, Nakul Gorade:

Making Big Data, Privacy, and Anonymization Work Together in the Enterprise: Experiences and Issues. 601-607
BigData Industry and Application Session 11 - Cloud-Based BigData Storage
- Frank Zhigang Wang, Theo Dimitrakos

, Na Helian
, Sining Wu, Ling Li, Rodric Yates:
CloudJet4BigData: Streamlining Big Data via an Accelerated Socket Interface. 608-615 - Wei-Chih Huang, Chuan-Ming Liu, Chuan-Chi Lai

:
Resource Provisioning with QoS in Cloud Storage. 616-620 - Marwan Sabbouh, Kenneth McCracken, Geoff Cooney:

Data Sharing for Cloud Computing Platforms. 621-628
BigData Industry and Application Session 12 - BigData Analytics Architecture
- Raghava Rao Mukkamala

, Abid Hussain, Ravi K. Vatrapu
:
Towards a Set Theoretical Approach to Big Data Analytics. 629-636 - Aleksandr Drozd, Miquel Pericàs, Satoshi Matsuoka:

Efficient String Sorting on Multi - and Many-Core Architectures. 637-644 - Shantenu Jha

, Judy Qiu, André Luckow
, Pradeep Kumar Mantha, Geoffrey C. Fox:
A Tale of Two Data-Intensive Paradigms: Applications, Abstractions, and Architectures. 645-652
BigData Industry and Application Session 13 - NoSQL
- Rami Sellami, Sami Bhiri

, Bruno Defude
:
ODBAPI: A Unified REST API for Relational and NoSQL Data Stores. 653-660 - Richard K. Lomotey, Ralph Deters

:
Terms Mining in Document-Based NoSQL: Response to Unstructured Data. 661-668 - Shin'ichi Takeuchi, Yuhei Akahoshi, Bun Theang Ong, Komei Sugiura, Koji Zettsu:

Spatio-temporal Pseudo Relevance Feedback for Large-Scale and Heterogeneous Scientific Repositories. 669-676 - Hang Yang, Huajun Chen, Cai Yuan, Fang Lianhang:

An Intelligent System for Forecasting the Trend of Consumed Electricity. 677-682
BigData Industry and Application Session 14 - NoSQL
- Heather Champion

, Nick J. Pizzi, Raja Krishnamoorthy:
Tactical Clinical Text Mining for Improved Patient Characterization. 683-690
BigData Industry and Application Session 15 - Big Social
- Sanat Kumar Bista, Surya Nepal

, Cécile Paris:
Multifaceted Visualisation of Annotated Social Media Data. 699-706
BigData Industry and Application Session 16 - Graph Analytics
- Arko Provo Mukherjee, Srikanta Tirthapura

:
Enumerating Maximal Bicliques from a Large Graph Using MapReduce. 707-716 - Yue Zhao, Kenji Yoshigoe, Mengjun Xie

, Suijian Zhou, Remzi Seker, Jiang Bian:
LightGraph: Lighten Communication in Distributed Graph-Parallel Processing. 717-724 - Hao Lin, Shuo Yang, Samuel P. Midkiff

:
RABID: A Distributed Parallel R for Large Datasets. 725-732
BigData Industry and Application Session 17 - Cloud-Based BigData Service
- Shuang Chen, Mahboobeh Ghorbani, Yanzhi Wang, Paul Bogdan

, Massoud Pedram:
Trace-Based Analysis and Prediction of Cloud Computing User Behavior Using the Fractal Modeling Technique. 733-739 - Xiangdong Huang, Jianmin Wang

, Jian Bai, Guiguang Ding, Mingsheng Long
:
Inherent Replica Inconsistency in Cassandra. 740-747 - Miyuru Dayarathna, Toyotaro Suzumura:

Towards Emulation of Large Scale Complex Network Workloads on Graph Databases with XGDBench. 748-755
BigData Work-in-Progress Session 1 - BigData Processing
- Hyejung Moon, Hyun Suk Cho, Seo Hwa Jeong, Jangho Park:

Policy Design Based on Risk at Big Data Era: Case Study of Privacy Invasion in South Korea. 756-759 - Jingwei Huang

, Zbigniew T. Kalbarczyk, David M. Nicol:
Knowledge Discovery from Big Data for Intrusion Detection Using LDA. 760-761 - Harsh Kupwade Patil

, Ravi Seshadri:
Big Data Security and Privacy Issues in Healthcare. 762-765 - Xingcan Cui, Zhen Dong, Liwei Lin, Renyong Song, Xiaohui Yu

:
GrandLand Traffic Data Processing Platform. 766-767 - Cuiwen Xiong, Peng Zhang, Yan Li, Shipeng Zhang, Qingyun Liu, Jianlong Tan:

A Memory-Based Continuous Query Index for Stream Processing. 768-769
BigData Work-in-Progress Session 2 - BigData Analytics
- Jing Zhou, Xiaohui Yu

, Yang Liu, Ziqiang Yu:
Ranking Keyword Search Results with Query Logs. 770-771 - Namgyu Kim, William Wong Xiu Shun, Jieun Kim, Kee-Young Kwahk, Seung Ryul Jeong, Hyunchul Ahn

:
Constructing an Issue Network from the Perspective of Common R&D Keywords. 772-773 - Jarek Nabrzyski, Cheng Liu, Charles Vardeman, Sandra Gesing, Milan Budhatoki:

Agriculture Data for All - Integrated Tools for Agriculture Data Integration, Analytics, and Sharing. 774-775 - Zhen Zhao:

Asynchronous Service Analysis of Cloud DVR DataCenter. 776-777 - Muaz Mian, Ankur Teredesai, David Hazel, Sreenivasulu Pokuri, Krishna Uppala:

Work in Progress - In-Memory Analysis for Healthcare Big Data. 778-779 - Bo Liu, Liang Wu, Qiuxiang Dong, Yuanchun Zhou:

Large-Scale Heterogeneous Program Retrieval through Frequent Pattern Discovery and Feature Correlation Analysis. 780-781
BigData Work-in-Progress Session 3 - BigData Management Framework
- Chong Yang, Xiaohui Yu

, Yang Liu:
Towards Efficient KNN Joins on Data Streams. 782-783 - Fangzhou Yao, Roy H. Campbell:

CouchFS: A High-Performance File System for Large Data Sets. 784-785 - Daniel Lins da Silva

, Pedro Luiz Pizzigatti Corrêa
, Silvio Luiz Stanzani, Paulo Andre Filipak, Andreiwid Sheffer Corrêa
:
A Computational Framework for Integrating and Retrieving Biodiversity Data on a Large Scale. 786-787 - Matt MacDuff, Benno Lee, Sherman Beus:

Versioning Complex Data. 788-791 - Alexander Ditter, Dietmar Fey, Tobias Schön, Steven Oeckl:

On the Way to Big Data Applications in Industrial Computed Tomography. 792-793
BigData Work-in-Progress Session 4 - BigData Decision Support System
- Carlos R. Rivero

, Hasan M. Jamil
:
Towards a Novel Model for Distributed Big Data Service Composition Using Functional Graph Matching. 794-795 - Kerrie Holley, Gandhi Sivakumar, Kalapriya Kannan:

Enrichment Patterns for Big Data. 796-799 - Melyssa Barata, Jorge Bernardino

, Pedro Furtado
:
YCSB and TPC-H: Big Data and Decision Support Benchmarks. 800-801 - Jongbok Byun, Diane Rasmussen Pennington

, Jorge Cardenas, Srabasti Dutta, Jeral Kirwan:
Understanding Student Behaviors in Online Classroom: Data Scientific Approach. 802-803 - Katherine G. Herbert

, Emily Hill, Jerry Alan Fails, Joseph O. Ajala, Richard T. Boniface, Paul W. Cushman:
Scientific Data Infrastructure for Sustainability Science Mobile Applications. 804-805 - Andreiwid Sheffer Corrêa

, Pedro Luiz Pizzigatti Corrêa
, Daniel Lins da Silva
, Flávio Soares Corrêa da Silva:
Really Opened Government Data: A Collaborative Transparency at Sight. 806-807 - Rebecca Copeland, Noël Crespi:

Classifying and Aggregating Context Attributes for Business Service Requests - No 'One-Size-Fits-All'. 808-815 - Lídice García Ríos, José Alberto Incera Diéguez:

Big Data Infrastructure for analyzing data generated by Wireless Sensor Networks. 816-823 - Dymitr Ruta:

Automated Trading with Machine Learning on Big Data. 824-830

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














