


13th KDD 2007: San Jose, California, USA
- Pavel Berkhin, Rich Caruana, Xindong Wu:
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Jose, California, USA, August 12-15, 2007. ACM 2007, ISBN 978-1-59593-609-7
- Chris Anderson:
Calculating latent demand in the long tail. 1
- Usama M. Fayyad:
From mining the web to inventing the new sciences underlying the internet. 2-3
- Jon M. Kleinberg:
Challenges in mining social network data: processes, privacy, and paradoxes. 4-5
Research track papers
- Deepak Agarwal, Dhiman Barman, Dimitrios Gunopulos, Neal E. Young, Flip Korn, Divesh Srivastava:
Efficient and effective explanation of change in hierarchical summaries. 6-15
- Deepak Agarwal, Andrei Z. Broder, Deepayan Chakrabarti, Dejan Diklic, Vanja Josifovski, Mayssam Sayyadian:
Estimating rates of rare events at multiple resolutions. 16-25
- Deepak Agarwal, Srujana Merugu:
Predictive discrete latent factor models for large scale dyadic data. 26-35
- Charu C. Aggarwal, Philip S. Yu:
On string classification in data streams. 36-45
- Charu C. Aggarwal, Na Ta, Jianyong Wang, Jianhua Feng, Mohammed Javeed Zaki:
Xproj: a framework for projected structural clustering of XML documents. 46-55
- Nikolay Archak, Anindya Ghose, Panagiotis G. Ipeirotis:
Show me the money!: deriving the pricing power of product features by mining consumer reviews. 56-65
- Andrew Arnold, Yan Liu, Naoki Abe:
Temporal causal modeling with graphical Granger methods. 66-75
- Ricardo A. Baeza-Yates, Alessandro Tiberi:
Extracting semantic relations from query logs. 76-85
- Hila Becker, Marta Arias:
Real-time ranking with concept drift using expert advice. 86-94
- Robert M. Bell, Yehuda Koren, Chris Volinsky:
Modeling relationships at multiple scales to improve accuracy of large recommender systems. 95-104
- Deepavali Bhagwat, Kave Eshghi, Pankaj Mehra:
Content-based document routing and index partitioning for scalable similarity-based searches in a large corpus. 105-112
- Wanpracha Art Chaovalitwongse, Ya-Ju Fan, Rajesh C. Sachdeo:
Support feature machine for classification of abnormal brain activity. 113-122
- Jianhui Chen, Zheng Zhao, Jieping Ye, Huan Liu:
Nonlinear adaptive distance metric learning for clustering. 123-132
- Yixin Chen, Li Tu:
Density-based clustering for real-time stream data. 133-142
- Peter A. Chew, Brett W. Bader, Tamara G. Kolda, Ahmed Abdelali:
Cross-language information retrieval using PARAFAC2. 143-152
- Yun Chi, Xiaodan Song, Dengyong Zhou, Koji Hino, Belle L. Tseng:
Evolutionary spectral clustering by incorporating temporal smoothness. 153-162
- Yun Chi, Shenghuo Zhu, Xiaodan Song, Jun'ichi Tatemura, Belle L. Tseng:
Structural and temporal analysis of the blogosphere through community factorization. 163-172
- Sumit Chopra, Trivikraman Thampy, John Leahy, Andrew Caplin, Yann LeCun:
Discovering the hidden structure of house prices with a non-parametric latent manifold model. 173-182
- Paul Cotofrei, Kilian Stoffel:
Stochastic processes and temporal data mining. 183-190
- Daniel Crabtree, Peter Andreae, Xiaoying Gao:
Exploiting underrepresented query aspects for automatic query expansion. 191-200
- Aron Culotta, Michael L. Wick, Robert J. Hall, Matthew Marzilli, Andrew McCallum:
Canonicalization of database records using adaptive similarity measures. 201-209
- Wenyuan Dai, Gui-Rong Xue, Qiang Yang, Yong Yu:
Co-clustering based classification for out-of-domain documents. 210-219
- Kaustav Das, Jeff G. Schneider:
Detecting anomalous records in categorical datasets. 220-229
- Anirban Dasgupta, Petros Drineas, Boulos Harb, Vanja Josifovski, Michael W. Mahoney:
Feature selection methods for text classification. 230-239
- Ian Davidson, S. S. Ravi, Martin Ester:
Efficient incremental constrained clustering. 240-249
- Meghana Deodhar, Joydeep Ghosh:
A framework for simultaneous co-clustering and learning from complex data. 250-259
- Chris H. Q. Ding, Rong Jin, Tao Li, Horst D. Simon:
A learning framework using Green's function and kernel regularization with application to recommender system. 260-269
- Dejing Dou, Gwen A. Frishkoff, Jiawei Rong, Robert M. Frank, Allen D. Malony, Don M. Tucker:
Development of NeuroElectroMagnetic ontologies (NEMO): a framework for mining brainwave ontologies. 270-279
- Gregory Druck, Chris Pal, Andrew McCallum, Xiaojin Zhu:
Semi-supervised classification with hybrid generative/discriminative methods. 280-289
- Lisa Friedland, David D. Jensen:
Finding tribes: identifying close-knit individuals from employment patterns. 290-299
- Gabriel Pui Cheong Fung, Jeffrey Xu Yu, Huan Liu, Philip S. Yu:
Time-dependent event hierarchy construction. 300-309
- Byron J. Gao, Martin Ester, Jin-yi Cai, Oliver Schulte, Hui Xiong:
The minimum consistent subset cover problem and its applications in data mining. 310-319
- Rong Ge, Martin Ester, Wen Jin, Ian Davidson:
Constraint-driven clustering. 320-329
- Fosca Giannotti, Mirco Nanni, Fabio Pinelli, Dino Pedreschi:
Trajectory pattern mining. 330-339
- Zhen Guo, Zhongfei Zhang, Eric P. Xing, Christos Faloutsos:
Enhanced max margin learning on multimodal data mining in a multimedia database. 340-349
- Hannes Heikinheimo, Jouni K. Seppänen, Eino Hinkkanen, Heikki Mannila, Taneli Mielikäinen:
Finding low-entropy sets and trees from binary data. 350-359
- Frizo A. L. Janssens, Wolfgang Glänzel, Bart De Moor:
Dynamic hybrid clustering of bioinformatics by incorporating text mining and citation analysis. 360-369
- Yookyung Jo, Carl Lagoze, C. Lee Giles:
Detecting research topics via the correlation between graphs and texts. 370-379
- Panagiotis Karras, Dimitris Sacharidis, Nikos Mamoulis:
Exploiting duality in summarization with deterministic guarantees. 380-389
- Yiping Ke, James Cheng, Wilfred Ng:
Correlation search in graph databases. 390-399
- Aleksander Kolcz, Wen-tau Yih:
Raising the baseline for high-precision text classifiers. 400-409
- Srivatsan Laxman, P. S. Sastry, K. P. Unnikrishnan:
A fast algorithm for finding frequent episodes in event streams. 410-419
- Jure Leskovec, Andreas Krause, Carlos Guestrin, Christos Faloutsos, Jeanne M. VanBriesen, Natalie S. Glance:
Cost-effective outbreak detection in networks. 420-429
- Jinyan Li, Guimei Liu, Limsoon Wong:
Mining statistically important equivalence classes and delta-discriminative emerging patterns. 430-439
- Ping Li:
Very sparse stable random projections for dimension reduction in l_alpha (0 < alpha <= 2) norm. 440-449
- Yi Liu, Rong Jin, Anil K. Jain:
BoostCluster: boosting clustering by pairwise constraints. 450-459
- David Lo, Siau-Cheng Khoo, Chao Liu:
Efficient mining of iterative patterns for software specification discovery. 460-469
- Bo Long, Zhongfei (Mark) Zhang, Philip S. Yu:
A probabilistic framework for relational clustering. 470-479
- Heikki Mannila, Evimaria Terzi:
Nestedness and segmented nestedness. 480-489
- Qiaozhu Mei, Xuehua Shen, ChengXiang Zhai:
Automatic labeling of multinomial topic models. 490-499
- David M. Mimno, Andrew McCallum:
Expertise modeling for matching papers with reviewers. 500-509
- Flavia Moser, Rong Ge, Martin Ester:
Joint cluster analysis of attribute and relationship data without a-priori specification of the number of clusters. 510-519
- Ramesh Nallapati, Susan Ditmore, John D. Lafferty, Kin Ung:
Multiscale topic tomography. 520-529
- Siegfried Nijssen, Élisa Fromont:
Mining optimal decision trees from itemset lattices. 530-539
- Gaurav Pandey, Michael S. Steinbach, Rohit Gupta, Tushar Garg, Vipin Kumar:
Association analysis-based transformations for protein interaction networks: a function prediction case study. 540-549
- Seung-Taek Park, David M. Pennock:
Applying collaborative filtering techniques to movie search for better ranking and browsing. 550-559
- Raymond K. Pon, Alfonso F. Cardenas, David Buttler, Terence Critchlow:
Tracking multiple topics for finding interesting articles. 560-569
- Filip Radlinski, Thorsten Joachims:
Active exploration for learning rankings from clickthrough data. 570-579
- Mark Sandler:
Hierarchical mixture models: a probabilistic analysis. 580-589
- Issei Sato, Hiroshi Nakagawa:
Knowledge discovery of multiple-topic document using parametric mixture model with Dirichlet prior. 590-598
- Vincent Schickel-Zuber, Boi Faltings:
Using hierarchical clustering for learning the ontologies used in recommendation systems. 599-608
- D. Sculley:
Practical learning from one-sided feedback. 609-618
- Benyah Shaparenko, Thorsten Joachims:
Information genealogy: uncovering the flow of ideas in non-hyperlinked document databases. 619-628
- Shady Shehata, Fakhri Karray, Mohamed Kamel:
A concept-based model for enhancing text categorization. 629-637
- Victor S. Sheng, Charles X. Ling:
Partial example acquisition in cost-sensitive learning. 638-646
- Motoki Shiga, Ichigaku Takigawa, Hiroshi Mamitsuka:
A spectral clustering approach to optimally combining numerical vectors with a modular network. 647-656
- Andrew T. Smith, Charles Elkan:
Making generative classifiers robust to selection bias. 657-666
- Xiuyao Song, Mingxi Wu, Christopher M. Jermaine, Sanjay Ranka:
Statistical change detection for multi-dimensional data. 667-676
- Rohini K. Srihari, Li Xu, Tushar Saxena:
Use of ranked cross document evidence trails for hypothesis generation. 677-686
- Jimeng Sun, Christos Faloutsos, Spiros Papadimitriou, Philip S. Yu:
GraphScope: parameter-free mining of large time-evolving graphs. 687-696
- Gaurav Tandon, Philip K. Chan:
Weighting versus pruning in rule validation for detecting network and host anomalies. 697-706
- Wei Tang, Hui Xiong, Shi Zhong, Jie Wu:
Enhancing semi-supervised clustering: a feature projection perspective. 707-716
- Chayant Tantipathananandh, Tanya Y. Berger-Wolf, David Kempe:
A framework for community identification in dynamic social networks. 717-726
- Choon Hui Teo, Alexander J. Smola, S. V. N. Vishwanathan, Quoc V. Le:
A scalable modular convex solver for regularized risk minimization. 727-736
- Hanghang Tong, Christos Faloutsos, Brian Gallagher, Tina Eliassi-Rad:
Fast best-effort pattern matching in large attributed graphs. 737-746
- Hanghang Tong, Christos Faloutsos, Yehuda Koren:
Fast direction-aware proximity for graph mining. 747-756
- David S. Vogel, Ognian Asparouhov, Tobias Scheffer:
Scalable look-ahead linear regression trees. 757-764
- Jilles Vreeken, Matthijs van Leeuwen, Arno Siebes:
Characterising the difference. 765-774
- Li Wan, Wee Keong Ng, Shuguo Han, Vincent C. S. Lee:
Privacy-preservation for gradient descent methods. 775-783
- Xuanhui Wang, ChengXiang Zhai, Xiao Hu, Richard Sproat:
Mining correlated bursty topic patterns from coordinated text streams. 784-793
- Xuerui Wang, Chris Pal, Andrew McCallum:
Generalized component analysis for text with heterogeneous attributes. 794-803
- Raymond Chi-Wing Wong, Jian Pei, Ada Wai-Chee Fu, Ke Wang:
Mining favorable facets. 804-813
- Junjie Wu, Hui Xiong, Peng Wu, Jian Chen:
Local decomposition for rare class analysis. 814-823
- Xiaowei Xu, Nurcan Yuruk, Zhidan Feng, Thomas A. J. Schweiger:
SCAN: a structural clustering algorithm for networks. 824-833
- Rong Yan, Jelena Tesic, John R. Smith:
Model-shared subspace boosting for multi-label classification. 834-843
- Dragomir Yankov, Eamonn J. Keogh, Jose Medina, Bill Yuan-chi Chiu, Victor B. Zordan:
Detecting time series motifs under uniform scaling. 844-853
- Jieping Ye, Shuiwang Ji, Jianhui Chen:
Learning the kernel matrix in discriminant analysis via quadratically constrained quadratic programming. 854-863
- Junsong Yuan, Ying Wu, Ming Yang:
From frequent itemsets to semantically meaningful visual patterns. 864-873
- Xian Zhang, Yu Hao, Xiaoyan Zhu, Ming Li, David R. Cheriton:
Information distance from a question to an answer. 874-883
- Hongkun Zhao, Weiyi Meng, Clement T. Yu:
Mining templates from search result records of search engines. 884-893
- Shuyi Zheng, Ruihua Song, Ji-Rong Wen, Di Wu:
Joint optimization of wrapper generation and template detection. 894-902
- Jun Zhu, Bo Zhang, Zaiqing Nie, Ji-Rong Wen, Hsiao-Wuen Hon:
Webpage understanding: an integrated approach. 903-912
Industrial and government track papers
- Sitaram Asur, Srinivasan Parthasarathy, Duygu Ucar:
An event-based framework for characterizing the evolutionary behavior of interaction graphs. 913-921
- Rebecca Castaño, Kiri Wagstaff, Steve A. Chien, Timothy M. Stough, Benyang Tang:
On-board analysis of uncalibrated data for a spacecraft at Mars. 922-930
- Andrew S. Fast, Lisa Friedland, Marc E. Maier, Brian J. Taylor, David D. Jensen, Henry G. Goldberg, John Komoroske:
Relational data pre-processing techniques for improved securities fraud detection. 941-949
- Ming Hua, Jian Pei:
Cleaning disguised missing data: a heuristic approach. 950-958
- Ron Kohavi, Randal M. Henne, Dan Sommerfield:
Practical guide to controlled experiments on the web: listen to your customers not to the hippo. 959-967
- Ping Luo, Hui Xiong, Kevin Lü, Zhongzhi Shi:
Distributed classification in peer-to-peer networks. 968-976
- Claudia Perlich, Saharon Rosset, Richard D. Lawrence, Bianca Zadrozny:
High-quantile modeling for customer wallet estimation and other applications. 977-985
- Junhua Zhao, Zhao Yang Dong, Pei Zhang:
Mining complex power networks for blackout prevention. 986-994
- Shubin Zhao, Jonathan Betz:
Corroborate and learn facts from the web. 995-1003
- Guangyu Zhu, Timothy J. Bethea, Vikas Krishna:
Extracting relevant named entities for automated expense reimbursement. 1004-1012
Industrial and government track short papers
- Charu C. Aggarwal:
A framework for classification and segmentation of massive audio data streams. 1013-1017
- Chris Curry, Robert L. Grossman, David Locke, Steve Vejcik, Joseph Bugajski:
Detecting changes in large data sets of payment card data: a case study. 1018-1022
- Rong Pan, Junhui Zhao, Vincent Wenchen Zheng, Jeffrey Junfeng Pan, Dou Shen, Sinno Jialin Pan, Qiang Yang:
Domain-constrained semi-supervised mining of tracking models in sensor networks. 1023-1027
- Wei Peng, Charles Perng, Tao Li, Haixun Wang:
Event summarization for system management. 1028-1032
- R. Bharat Rao, Jinbo Bi, Glenn Fung, Marcos Salganicoff, Nancy Obuchowski, David P. Naidich:
LungCAD: a clinically approved, machine learning system for lung cancer detection. 1033-1037
- Robert J. Yan, Charles X. Ling:
Machine learning for stock selection. 1038-1042
- Yanfang Ye, Dingding Wang, Tao Li, Dongyi Ye:
IMDS: intelligent malware detection system. 1043-1047
- Xiaoxin Yin, Jiawei Han, Philip S. Yu:
Truth discovery with multiple conflicting information providers on the web. 1048-1052
Panel
- Srinivasan Parthasarathy:
Data mining at the crossroads: successes, failures and learning from them. 1053-1055















