Stop the war!
Остановите войну!
for scientists:
default search action
22nd KDD 2016: San Francisco, CA, USA
- Balaji Krishnapuram, Mohak Shah, Alexander J. Smola, Charu C. Aggarwal, Dou Shen, Rajeev Rastogi:
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, August 13-17, 2016. ACM 2016, ISBN 978-1-4503-4232-2
Keynote Talks
- Jennifer T. Chayes:
Graphons and Machine Learning: Modeling and Estimation of Sparse Massive Networks. 1 - Nando de Freitas:
Learning to Learn and Compositionality with Deep Recurrent Neural Networks: Learning to Learn and Compositionality. 3 - Whitfield Diffie:
The Evolving Meaning of Information Security. 5 - Joseph M. Hellerstein:
People, Computers, and The Hot Mess of Real Data. 7 - Greg Papadopoulos:
A VC View of Investing in ML. 9
Panel
- Evangelos Simoudis, Mark Gorenberg, Tim Guleri, Matt Ocko, Greg Sands:
Big Data Needs Big Dreamers: Lessons from Successful Big Data Investors. 11-12
Applied Data Science Track Full Papers
- Klaus Ackermann, Eduardo Blancas Reyes, Sue He, Thomas Anderson Keller, Paul van der Boor, Romana Khan, Rayid Ghani, José Carlos González:
Designing Policy Recommendations to Reduce Home Abandonment in Mexico. 13-20 - Samet Ayhan, Hanan Samet:
Aircraft Trajectory Prediction Made Easy with Predictive Analytics. 21-30 - Reza Bosagh Zadeh, Xiangrui Meng, Alexander Ulanov, Burak Yavuz, Li Pu, Shivaram Venkataraman, Evan Randall Sparks, Aaron Staple, Matei Zaharia:
Matrix Computations and Optimization in Apache Spark. 31-38 - Mirela Madalina Botezatu, Ioana Giurgiu, Jasmina Bogojeska, Dorothea Wiesmann:
Predicting Disk Replacement towards Reliable Data Centers. 39-48 - Joel Brooks, Matthew Kerr, John V. Guttag:
Developing a Data-Driven Player Ranking in Soccer Using Predictive Model Weights. 49-55 - Matthew Burgess, Eugenia Giraudy, Julian Katz-Samuels, Joe Walsh, Derek Willis, Lauren Haynes, Rayid Ghani:
The Legislative Influence Detector: Finding Text Reuse in State Legislation. 57-66 - Samuel Carton, Jennifer Helsby, Kenneth Joseph, Ayesha Mahmud, Youngsoo Park, Joe Walsh, Crystal Cody, C. P. T. Estella Patterson, Lauren Haynes, Rayid Ghani:
Identifying Police Officers at Risk of Adverse Events. 67-76 - Alex Deng, Xiaolin Shi:
Data-Driven Metric Development for Online Controlled Experiments: Seven Lessons Learned. 77-86 - Bowen Du, Chuanren Liu, Wenjun Zhou, Zhenshan Hou, Hui Xiong:
Catch Me If You Can: Detecting Pickpocket Suspects from Large-Scale Transit Records. 87-96 - Rupesh Gupta, Guanfeng Liang, Hsiao-Ping Tseng, Ravi Kiran Holur Vijay, Xiaoyu Chen, Rómer Rosales:
Email Volume Optimization at LinkedIn. 97-106 - JungWoo Ha, Hyuna Pyo, Jeonghee Kim:
Large-Scale Item Categorization in e-Commerce Using Multiple Recurrent Neural Networks. 107-115 - Jim C. Huang, Rodolphe Jenatton, Cédric Archambeau:
Online Dual Decomposition for Performance and Delivery-Based Distributed Ad Allocation. 117-126 - Bo Jin, Chao Che, Kuifei Yu, Yue Qu, Li Guo, Cuili Yao, Ruiyun Yu, Qiang Zhang:
Minimizing Legal Exposure of High-Tech Companies through Collaborative Filtering Methods. 127-136 - Navneet Kapur, Nikita I. Lytkin, Bee-Chung Chen, Deepak Agarwal, Igor Perisic:
Ranking Universities Based on Career Outcomes of Graduates. 137-144 - Muhammad Raza Khan, Joshua E. Blumenstock:
Predictors without Borders: Behavioral Modeling of Product Adoption in Three Developing Countries. 145-154 - Guimei Liu, Tam T. Nguyen, Gang Zhao, Wei Zha, Jianbo Yang, Jianneng Cao, Min Wu, Peilin Zhao, Wei Chen:
Repeat Buyer Prediction for E-Commerce. 155-164 - Haishan Liu, David Pardoe, Kun Liu, Manoj Thakur, Frank Cao, Chongzhe Li:
Audience Expansion for Online Social Network Advertising. 165-174 - Ping Luo, Su Yan, Zhiqiang Liu, Zhiyong Shen, Shengwen Yang, Qing He:
From Online Behaviors to Offline Retailing. 175-184 - Michael A. Madaio, Shang-Tse Chen, Oliver L. Haimson, Wenwen Zhang, Xiang Cheng, Matthew Hinds-Aldrich, Duen Horng Chau, Bistra Dilkina:
Firebird: Predicting Fire Risk and Prioritizing Fire Inspections in Atlanta. 185-194 - Eric Malmi, Pyry Takala, Hannu Toivonen, Tapani Raiko, Aristides Gionis:
DopeLearning: A Computational Approach to Rap Lyrics Generation. 195-204 - Sathappan Muthiah, Patrick Butler, Rupinder Paul Khandpur, Parang Saraf, Nathan Self, Alla Rozovskaya, Liang Zhao, Jose Cadena, Chang-Tien Lu, Anil Vullikanti, Achla Marathe, Kristen Maria Summers, Graham Katz, Andy Doyle, Jaime Arredondo, Dipak K. Gupta, David Mares, Naren Ramakrishnan:
EMBERS at 4 years: Experiences operating an Open Source Indicators Forecasting System. 205-214 - Animesh Nandi, Atri Mandal, Shubham Atreja, Gargi Banerjee Dasgupta, Subhrajit Bhattacharya:
Anomaly Detection Using Program Control Flow Graph Mining From Execution Logs. 215-224 - Alexander G. Nikolaev, Shounak Gore, Venu Govindaraju:
Engagement Capacity and Engaging Team Formation for Reach Maximization of Online Social Media Platforms. 225-234 - Alexey Poyarkov, Alexey Drutsa, Andrey Khalyavin, Gleb Gusev, Pavel Serdyukov:
Boosted Decision Tree Regression Adjustment for Variance Reduction in Online Controlled Experiments. 235-244 - Mahsa Salehi, Laura Irina Rusu, Timothy M. Lynar, Anna Phan:
Dynamic and Robust Wildfire Risk Prediction System: An Unsupervised Approach. 245-254 - Ying Shan, T. Ryan Hoens, Jian Jiao, Haijing Wang, Dong Yu, J. C. Mao:
Deep Crossing: Web-Scale Modeling without Manually Crafted Combinatorial Features. 255-262 - Gursimran Singh, Shashank Srikant, Varun Aggarwal:
Question Independent Grading using Machine Learning: The Case of Computer Program Grading. 263-272 - Yu Sun, Nicholas Jing Yuan, Yingzi Wang, Xing Xie, Kieran McDonald, Rui Zhang:
Contextual Intent Tracking for Personal Assistants. 273-282 - Liang Tang, Bo Long, Bee-Chung Chen, Deepak Agarwal:
An Empirical Study on Recommendation with Multiple Types of Feedback. 283-292 - Ali Vanderveld, Addhyan Pandey, Angela Han, Rajesh Parekh:
An Engagement-Based Customer Lifetime Value System for E-commerce. 293-302 - Ellery Wulczyn, Madian Khabsa, Vrushank Vora, Matthew Heston, Joe Walsh, Christopher Berry, Rayid Ghani:
Identifying Earmarks in Congressional Bills. 303-311 - Ya Xu, Nanyu Chen:
Evaluating Mobile Apps with A/B and Quasi A/B Tests. 313-322 - Dawei Yin, Yuening Hu, Jiliang Tang, Tim Daly Jr., Mianwei Zhou, Hua Ouyang, Jianhui Chen, Changsung Kang, Hongbo Deng, Chikashi Nobata, Jean-Marc Langlois, Yi Chang:
Ranking Relevance in Yahoo Search. 323-332 - Shipeng Yu, Evangelia Christakopoulou, Abhishek Gupta:
Identifying Decision Makers from Professional Social Networks. 333-342 - Qingqi Yue, Ao Yuan, Xuan Che, Minh Huynh, Chunxiao Zhou:
Batch Model for Batched Timestamps Data Analysis with Application to the SSA Disability Program. 343-352 - Fuzheng Zhang, Nicholas Jing Yuan, Defu Lian, Xing Xie, Wei-Ying Ma:
Collaborative Knowledge Base Embedding for Recommender Systems. 353-362 - XianXing Zhang, Yitong Zhou, Yiming Ma, Bee-Chung Chen, Liang Zhang, Deepak Agarwal:
GLMix: Generalized Linear Mixed Models For Large-Scale Response Prediction. 363-372 - Yijun Zhao, Bilal Ahmed, Thomas Thesen, Karen E. Blackmon, Jennifer G. Dy, Carla E. Brodley, Ruben Kuzniecky, Orrin Devinsky:
A Non-parametric Approach to Detect Epileptogenic Lesions using Restricted Boltzmann Machines. 373-382 - Chen Zhu, Hengshu Zhu, Hui Xiong, Pengliang Ding, Fang Xie:
Recruitment Market Trend Analysis with Sequential Latent Variable Models. 383-392 - Hengshu Zhu, Hui Xiong, Fangshuang Tang, Qi Liu, Yong Ge, Enhong Chen, Yanjie Fu:
Days on Market: Measuring Liquidity in Real Estate Markets. 393-402
Applied Data Science Track Invited Talks
- Jonathan D. Becher:
Can You Teach the Elephant to Dance? AKA: Culture Eats Data Science for Breakfast. 403 - Oliver Downs:
How Machine Learning has Finally Solved Wanamaker's Dilemma. 405 - Ralf Herbrich:
Learning Sparse Models at Scale. 407 - Ching Law:
Profiling Users from Online Social Behaviors with Applications for Tencent Social Ads. 409 - Ingo Mierswa:
The Wisdom of Crowds: Best Practices for Data Prep & Machine Learning Derived from Millions of Data Science Workflows. 411 - Jeff Schneider:
Bayesian Optimization and Embedded Learning Systems. 413 - Danny Shapiro:
Accelerating the Race to Autonomous Cars. 415 - Ashok Srivastava:
Large-Scale Machine Learning at Verizon: Theory and Applications. 417 - Duncan J. Watts:
Computational Social Science: Exciting Progress and Future Challenges. 419
Applied Data Science Track Posters
- Bo An, Haipeng Chen, Noseong Park, V. S. Subrahmanian:
MAP: Frequency-Based Maximization of Airline Profits based on an Ensemble Forecasting Approach. 421-430 - Nipun Batra, Amarjeet Singh, Kamin Whitehouse:
Gemello: Creating a Detailed Energy Breakdown from Just the Monthly Electricity Bill. 431-440 - Fedor Borisyuk, Krishnaram Kenthapadi, David Stein, Bo Zhao:
CaSMoS: A Framework for Learning Candidate Selection Models over Structured Queries and Documents. 441-450 - Boris Chidlovskii, Stéphane Clinchant, Gabriela Csurka:
Domain Adaptation in the Absence of Source Domain Data. 451-460 - Steven H. H. Ding, Benjamin C. M. Fung, Philippe Charland:
Kam1n0: MapReduce-based Assembly Clone Search for Reverse Engineering. 461-470 - Sahin Cem Geyik, Sergey Faleev, Jianqiang Shen, Sean O'Donnell, Santanu Kolay:
Joint Optimization of Multiple Performance Metrics in Online Video Advertising. 471-480 - Xiaoxiao Guo, Wei Li, Francesco Iorio:
Convolutional Neural Networks for Steady Flow Approximation. 481-490 - Zhaobin Kuang, James A. Thomson, Michael Caldwell, Peggy L. Peissig, Ron M. Stewart, David Page:
Computational Drug Repositioning Using Continuous Self-Controlled Case Series. 491-500 - Jia Li, Dhruv Arya, Viet Ha-Thuc, Shakti Sinha:
How to Get Them a Dream Job?: Entity-Aware Features for Personalized Job Search Ranking. 501-510 - Xiang Li, Milad Makkie, Binbin Lin, Mojtaba Sedigh Fazli, Ian Davidson, Jieping Ye, Tianming Liu, Shannon Quinn:
Scalable Fast Rank-1 Dictionary Learning for fMRI Big Data Analysis. 511-519 - Qiaoling Liu, Faizan Javed, Matt McNair:
CompanyDepot: Employer Name Normalization in the Online Recruitment Industry. 521-530 - Caroline Lo, Dan Frankowski, Jure Leskovec:
Understanding Behaviors that Lead to Purchasing: A Case Study of Pinterest. 531-540 - Corey Lynch, Kamelia Aryafar, Josh Attenberg:
Images Don't Lie: Transferring Deep Visual Semantic Features to Large-Scale Multimodal Learning to Rank. 541-548 - Hoang Nguyen, Jon D. Patrick:
Text Mining in Clinical Domain: Dealing with Noise. 549-558 - John Paparrizos, Ryen W. White, Eric Horvitz:
Detecting Devastating Diseases in Search Logs. 559-568 - Bryan Perozzi, Michael Schueppert, Jack Saalweachter, Mayur Thakur:
When Recommendation Goes Wrong: Anomalous Link Discovery in Recommendation Networks. 569-578 - Jim Pivarski, Collin Bennett, Robert L. Grossman:
Deploying Analytics with the Portable Format for Analytics (PFA). 579-588 - Hasan Poonawala, Vinay Kolar, Sebastien Blandin, Laura Wynter, Sambit Sahu:
Singapore in Motion: Insights on Public Transport Service Level Through Farecard and Mobile Data Analytics. 589-598 - Parang Saraf, Naren Ramakrishnan:
EMBERS AutoGSR: Automated Coding of Civil Unrest Events. 599-608 - Taraneh Taghavi, Maria Lupetini, Yaron Kretchmer:
Compute Job Memory Recommender System Using Machine Learning. 609-616 - Yinyan Tan, Zhe Fan, Guilin Li, Fangshan Wang, Zhengbing Li, Shikai Liu, Qiuling Pan, Eric P. Xing, Qirong Ho:
Scalable Time-Decaying Adaptive Prediction Algorithm. 617-626 - Jan Van Haaren, Horesh Ben Shitrit, Jesse Davis, Pascal Fua:
Analyzing Volleyball Match Data from the 2014 World Championships Using Machine Learning Techniques. 627-634 - Hongjian Wang, Daniel Kifer, Corina Graif, Zhenhui Li:
Crime Rate Inference with Big Data. 635-644 - Huizhi Xie, Juliette Aurisset:
Improving the Sensitivity of Online Controlled Experiments: Case Studies at Netflix. 645-654 - Huang Xu, Zhiwen Yu, Jingyuan Yang, Hui Xiong, Hengshu Zhu:
Talent Circle Detection in Job Transition Networks. 655-664 - Weinan Zhang, Tianxiong Zhou, Jun Wang, Jian Xu:
Bid-aware Gradient Descent for Unbiased Learning with Censored Data in Display Advertising. 665-674
Research Track Full Papers
- Takuya Akiba, Yosuke Yano:
Compact and Scalable Graph Neighborhood Sketching. 685-694 - Hesam Amoualian, Marianne Clausel, Éric Gaussier, Massih-Reza Amini:
Streaming-LDA: A Copula-based Approach to Modeling Topic Dependencies in Document Streams. 695-704 - Ashton Anderson, Jon M. Kleinberg, Sendhil Mullainathan:
Assessing Human Error Against a Benchmark of Perfection. 705-714 - David T. Arbour, Dan Garant, David D. Jensen:
Inferring Network Effects from Observational Data. 715-724 - Maria-Florina Balcan, Yingyu Liang, Le Song, David P. Woodruff, Bo Xie:
Communication Efficient Distributed Kernel Principal Component Analysis. 725-734 - Roel Bertens, Jilles Vreeken, Arno Siebes:
Keeping it Short and Simple: Summarising Complex Event Sequences with Multivariate Patterns. 735-744 - Marco Bressan, Stefano Leucci, Alessandro Panconesi, Prabhakar Raghavan, Erisa Terolli:
The Limits of Popularity-Based Recommendations, and the Role of Social Ties. 745-754 - Shiyu Chang, Yang Zhang, Jiliang Tang, Dawei Yin, Yi Chang, Mark A. Hasegawa-Johnson, Thomas S. Huang:
Positive-Unlabeled Learning in Streaming Networks. 755-764 - Chen Chen, Hanghang Tong, Lei Xie, Lei Ying, Qing He:
FASCINATE: Fast Cross-Layer Dependency Inference on Multi-layered Networks. 765-774 - Shuo Chen, Thorsten Joachims:
Predicting Matchups and Preferences in Context. 775-784 - Tianqi Chen, Carlos Guestrin:
XGBoost: A Scalable Tree Boosting System. 785-794 - Wei Chen, Tian Lin, Zihan Tan, Mingfei Zhao, Xuren Zhou:
Robust Influence Maximization. 795-804 - Wei Cheng, Kai Zhang, Haifeng Chen, Guofei Jiang, Zhengzhang Chen, Wei Wang:
Ranking Causal Anomalies via Temporal and Dynamical Analysis on Vanishing Correlations. 805-814 - Konstantina Christakopoulou, Filip Radlinski, Katja Hofmann:
Towards Conversational Recommender Systems. 815-824 - Lorenzo De Stefani, Alessandro Epasto, Matteo Riondato, Eli Upfal:
TRIÈST: Counting Local and Global Triangles in Fully-Dynamic Streams with Fixed Memory Size. 825-834 - Jaroslav M. Fowkes, Charles Sutton:
A Subsequence Interleaving Model for Sequential Pattern Mining. 835-844 - Mina Ghashami, Edo Liberty, Jeff M. Phillips:
Efficient Frequent Directions Algorithm for Sparse Matrices. 845-854 - Aditya Grover, Jure Leskovec:
node2vec: Scalable Feature Learning for Networks. 855-864 - Lei Han, Yu Zhang, Xiu-Feng Wan, Tong Zhang:
Generalized Hierarchical Sparse Model for Arbitrary-Order Interactive Antigenic Sites Identification in Flu Virus Data. 865-874 - Lifang He, Chun-Ta Lu, Jiaqi Ma, Jianping Cao, Linlin Shen, Philip S. Yu:
Joint Community and Structural Hole Spanner Detection via Harmonic Modularity. 875-884 - Xinran He, David Kempe:
Robust Influence Maximization. 885-894 - Bryan Hooi, Hyun Ah Song, Alex Beutel, Neil Shah, Kijung Shin, Christos Faloutsos:
FRAUDAR: Bounding Graph Fraud in the Face of Camouflage. 895-904 - Hao Hu, Joey Velez-Ginorio, Guo-Jun Qi:
Temporal Order-based First-Take-All Hashing for Fast Attention-Deficit-Hyperactive-Disorder Detection. 905-914 - Hui-Ju Hung, Hong-Han Shuai, De-Nian Yang, Liang-Hao Huang, Wang-Chien Lee, Jian Pei, Ming-Syan Chen:
When Social Influence Meets Item Inference. 915-924 - Arun Shankar Iyer, J. Saketha Nath, Sunita Sarawagi:
Privacy-preserving Class Ratio Estimation. 925-934 - Himanshu Jain, Yashoteja Prabhu, Manik Varma:
Extreme Multi-label Loss Functions for Recommendation, Tagging, Ranking & Other Missing Label Applications. 935-944 - Meng Jiang, Christos Faloutsos, Jiawei Han:
CatchTartan: Representing and Summarizing Dynamic Multicontextual Behaviors. 945-954 - Anjuli Kannan, Karol Kurach, Sujith Ravi, Tobias Kaufmann, Andrew Tomkins, Balint Miklos, Greg Corrado, László Lukács, Marina Ganea, Peter Young, Vivek Ramavajjala:
Smart Reply: Automated Response Suggestion for Email. 955-964 - Florian Lemmerich, Martin Becker, Philipp Singer, Denis Helic, Andreas Hotho, Markus Strohmaier:
Mining Subgroups with Exceptional Transition Behavior. 965-974 - Huayu Li, Yong Ge, Richang Hong, Hengshu Zhu:
Point-of-Interest Recommendations: Learning Potential Check-ins from Friends. 975-984 - Liangyue Li, Yuan Yao, Jie Tang, Wei Fan, Hanghang Tong:
QUINT: On Query-Specific Optimal Networks. 985-994 - Shangsong Liang, Emine Yilmaz, Evangelos Kanoulas:
Dynamic Clustering of Streaming Short Documents. 995-1004 - Junming Liu, Leilei Sun, Weiwei Chen, Hui Xiong:
Rebalancing Bike Sharing Systems: A Multi-source Data Smart Optimization. 1005-1014 - Yanchi Liu, Chuanren Liu, Bin Liu, Meng Qu, Hui Xiong:
Unified Point-of-Interest Recommendation with Temporal Interval Assessment. 1015-1024 - Son T. Mai, Ira Assent, Martin Storgaard:
AnyDBC: An Efficient Anytime Density-based Clustering Algorithm for Very Large Complex Datasets. 1025-1034 - Emaad A. Manzoor, Sadegh M. Milajerdi, Leman Akoglu:
Fast Memory-efficient Anomaly Detection in Streaming Heterogeneous Graphs. 1035-1044 - Yasuko Matsubara, Yasushi Sakurai:
Regime Shifts in Streams: Real-time Forecasting of Co-evolving Time Sequences. 1045-1054