


default search action
9th KDD 2003: Washington, DC, USA
- Lise Getoor, Ted E. Senator, Pedro M. Domingos, Christos Faloutsos:

Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24 - 27, 2003. ACM 2003, ISBN 1-58113-737-0
Invited talks
- Jim Gray:

On-line science: the world-wide telescope as a prototype for the new computational science. 3 - Daphne Koller:

Statistical learning from relational data. 4 - Andreas S. Weigend:

Analyzing customer behavior at Amazon.com. 5
Research track
- Charu C. Aggarwal:

Towards systematic design of distance functions for data mining applications. 9-18 - Arindam Banerjee, Inderjit S. Dhillon, Joydeep Ghosh, Suvrit Sra

:
Generative model-based clustering of directional data. 19-28 - Stephen D. Bay, Mark Schwabacher:

Mining distance-based outliers in near linear time with randomization and a simple pruning rule. 29-38 - Mikhail Bilenko, Raymond J. Mooney:

Adaptive duplicate detection using learnable string similarity measures. 39-48 - Richard J. Bolton, Niall M. Adams:

An iterative hypothesis-testing strategy for pattern discovery. 49-58 - Hervé Brönnimann, Bin Chen, Manoranjan Dash, Peter J. Haas, Peter Scheuermann:

Efficient data reduction with EASE. 59-68 - Alain Casali, Rosine Cicchetti, Lotfi Lakhal:

Extracting semantics from data cubes using cube transversals and closures. 69-78 - Darya Chudova, Scott Gaffney, Eric Mjolsness, Padhraic Smyth

:
Translation-invariant mixture models for curve clustering. 79-88 - Inderjit S. Dhillon, Subramanyam Mallela, Dharmendra S. Modha:

Information-theoretic co-clustering. 89-98 - Magdalini Eirinaki

, Michalis Vazirgiannis, Iraklis Varlamis
:
SEWeP: using site semantics and a taxonomy to enhance the Web personalization process. 99-108 - Mohammad El-Hajj, Osmar R. Zaïane:

Inverted matrix: efficient discovery of frequent items in large datasets in the context of interactive mining. 109-118 - Oren Etzioni, Rattapoom Tuchinda, Craig A. Knoblock, Alexander Yates:

To buy or not to buy: mining airfare data to minimize ticket purchase price. 119-128 - Aristides Gionis, Teija Kujala, Heikki Mannila:

Fragments of order. 129-136 - David Kempe, Jon M. Kleinberg, Éva Tardos:

Maximizing the spread of influence through a social network. 137-146 - Mehmet Koyutürk

, Ananth Grama:
PROXIMUS: a framework for analyzing very high dimensional discrete-attributed datasets. 147-156 - Elias Pampalk, Werner Goebl

, Gerhard Widmer
:
Visualizing changes in the structure of data for exploratory feature selection. 157-166 - Claudia Perlich, Foster J. Provost:

Aggregation-based feature invention and relational concept classes. 167-176 - Sunita Sarawagi, Soumen Chakrabarti, Shantanu Godbole:

Cross-training: learning probabilistic mappings between topics. 177-186 - Somayajulu Sripada

, Ehud Reiter, Jim Hunter, Jin Yu:
Generating English summaries of time series data using the Gricean maxims. 187-196 - Jeremy Tantrum, Alejandro Murua, Werner Stuetzle:

Assessment and pruning of hierarchical model based clustering. 197-205 - Jaideep Vaidya, Chris Clifton:

Privacy-preserving k-means clustering over vertically partitioned data. 206-215 - Michail Vlachos

, Marios Hadjieleftheriou, Dimitrios Gunopulos
, Eamonn J. Keogh:
Indexing multi-dimensional time-series with support for multiple distance measures. 216-225 - Haixun Wang, Wei Fan, Philip S. Yu, Jiawei Han:

Mining concept-drifting data streams using ensemble classifiers. 226-235 - Jianyong Wang, Jiawei Han, Jian Pei

:
CLOSET+: searching for the best strategies for mining frequent closed itemsets. 236-245 - Ke Wang, Yuelong Jiang, Laks V. S. Lakshmanan:

Mining unexpected rules by pushing user dynamics. 246-255 - Geoffrey I. Webb

, Shane M. Butler, Douglas A. Newlands:
On detecting differences between groups. 256-265 - Scott White, Padhraic Smyth

:
Algorithms for estimating relative importance in networks. 266-275 - Xintao Wu

, Daniel Barbará, Yong Ye:
Screening and interpreting multi-item associations based on log-linear modeling. 276-285 - Xifeng Yan, Jiawei Han:

CloseGraph: mining closed frequent graph patterns. 286-295 - Lan Yi, Bing Liu, Xiaoli Li

:
Eliminating noisy information in Web pages for data mining. 296-305 - Hwanjo Yu, Jiong Yang, Jiawei Han:

Classifying large data sets using SVMs with hierarchical clusters. 306-315 - Mohammed Javeed Zaki

, Charu C. Aggarwal:
XRules: an effective structural classifier for XML data. 316-325 - Mohammed Javeed Zaki

, Karam Gouda:
Fast vertical mining using diffsets. 326-335 - Yunyue Zhu, Dennis E. Shasha:

Efficient elastic burst detection in data streams. 336-345
Industrial/government track
- Kamal Ali, Steven P. Ketchpel:

Golden Path Analyzer: using divide-and-conquer to cluster Web clickstreams. 349-358 - David M. Fram, June S. Almenoff, William DuMouchel:

Empirical Bayesian data mining for discovering patterns in post-marketing drug safety. 359-368 - Tu Bao Ho, Trong Dung Nguyen, Saori Kawasaki, Si Quang Le

, DucDung Nguyen, Hideto Yokoi, Katsuhiko Takabayashi:
Mining hepatitis data with temporal abstraction. 369-377 - David D. Jensen, Matthew J. Rattigan, Hannah Blau:

Information awareness: a prospective technical assessment. 378-387 - Mark Last, Menahem Friedman, Abraham Kandel:

The data mining approach to automated software testing. 388-396 - Richard D. Lawrence, Se June Hong, Jacques Cherrier:

Passenger-based predictive modeling of airline no-show rates. 397-406 - Gregory Piatetsky-Shapiro, Tom Khabaza, Sridhar Ramaswamy:

Capturing best practice for microarray gene expression data analysis. 407-415 - R. Bharat Rao, Sathyakama Sandilya, Radu Stefan Niculescu, Colin Germond, Harsha Rao:

Clinical and financial outcomes analysis with existing hospital patient records. 416-425 - Ramendra K. Sahoo, Adam J. Oliner, Irina Rish, Manish Gupta, José E. Moreira, Sheng Ma, Ricardo Vilalta, Anand Sivasubramaniam:

Critical event prediction for proactive management in large-scale computer clusters. 426-435 - Rong She, Fei Chen, Ke Wang, Martin Ester, Jennifer L. Gardy, Fiona S. L. Brinkman:

Frequent-subsequence-based prediction of outer membrane proteins. 436-445 - Michael S. Steinbach

, Pang-Ning Tan
, Vipin Kumar, Steven A. Klooster, Christopher Potter:
Discovery of climate indices using clustering. 446-455 - Sholom M. Weiss, Stephen J. Buckley, Shubir Kapoor, Søren Damgaard:

Knowledge-based data mining. 456-461 - Yi-Leh Wu, Kingshy Goh, Beitao Li, Huaxin You, Edward Y. Chang:

The anatomy of a multimodal information filter. 462-471
Research track
- Shlomo Argamon, Marin Saric, Sterling Stuart Stein:

Style mining of electronic messages for multiple authorship discrimination: first results. 475-480 - Raj Bhatnagar, Goutham Kurra, Wen Niu:

Mining high dimensional data for classifier knowledge. 481-486 - Joong Hyuk Chang, Won Suk Lee:

Finding recent frequent itemsets adaptively over online data streams. 487-492 - Bill Yuan-chi Chiu, Eamonn J. Keogh, Stefano Lonardi

:
Probabilistic discovery of time series motifs. 493-498 - William W. Cohen, Richard C. Wang, Robert F. Murphy:

Understanding captions in biomedical publications. 499-504 - Wenliang Du, Justin Zhijun Zhan:

Using randomized response techniques for privacy-preserving data mining. 505-510 - William DuMouchel, Deepak K. Agarwal:

Applications of sampling and fractional factorial designs to model-free data squashing. 511-516 - Dmitriy Fradkin, David Madigan:

Experiments with random projections for machine learning. 517-522 - João Gama

, Ricardo Rocha, Pedro Medas:
Accurate decision trees for mining high-speed data streams. 523-528 - Sudipto Guha, Dimitrios Gunopulos

, Nick Koudas:
Correlating synchronous and asynchronous data streams. 529-534 - Sule Gündüz, M. Tamer Özsu

:
A Web page prediction model based on click-stream tree representation of user behavior. 535-540 - John E. Hopcroft, Omar Khan, Brian Kulis, Bart Selman:

Natural communities in large linked networks. 541-546 - Michael E. Houle:

Navigating massive data sets via local clustering. 547-552 - Wynne Hsu

, Jing Dai, Mong-Li Lee
:
Mining viewpoint patterns in image databases. 553-558 - Chris Jermaine:

Playing hide-and-seek with correlations. 559-564 - Daxin Jiang

, Jian Pei
, Aidong Zhang:
Interactive exploration of coherent patterns in time-series gene expression data. 565-570 - Ruoming Jin, Gagan Agrawal:

Efficient decision tree construction on streaming data. 571-576 - Sachindra Joshi, Neeraj Agrawal, Raghu Krishnapuram, Sumit Negi:

A bag of paths model for measuring structural similarity in Web documents. 577-582 - Toshihiro Kamishima:

Nantonac collaborative filtering: recommendation based on order responses. 583-588 - Yehuda Koren, David Harel:

A two-way visualization method for clustered data. 589-594 - Kelvin T. Leung, Douglas Stott Parker Jr.:

Empirical comparisons of various voting methods in bagging. 595-600 - Bing Liu, Robert L. Grossman, Yanhong Zhai:

Mining data records in Web pages. 601-606 - Guimei Liu

, Hongjun Lu, Wenwu Lou, Jeffrey Xu Yu:
On computing, storing and querying frequent patterns. 607-612 - Junshui Ma, Simon Perkins:

Online novelty detection on temporal sequences. 613-618 - Satoshi Morinaga, Kenji Yamanishi

, Jun'ichi Takeuchi:
Distributed cooperative mining for information consortia. 619-624 - Jennifer Neville, David D. Jensen, Lisa Friedland, Michael Hay:

Learning relational probability trees. 625-630 - Caleb C. Noble, Diane J. Cook:

Graph-based anomaly detection. 631-636 - Feng Pan, Gao Cong

, Anthony K. H. Tung
, Jiong Yang, Mohammed Javeed Zaki
:
Carpenter: finding closed patterns in long biological datasets. 637-642 - William Peter

, John Chiochetti, Clare Giardina:
New unsupervised clustering algorithm for large datasets. 643-648 - Karlton Sequeira, Mohammed Javeed Zaki

, Boleslaw K. Szymanski
, Christopher D. Carothers:
Improving spatial locality of programs via data mining. 649-654 - Chun Tang, Aidong Zhang, Jian Pei

:
Mining phenotypes and informative genes from gene expression data. 655-660 - Feng Tao, Fionn Murtagh, Mohsen M. Farid:

Weighted Association Rule Mining using weighted support and significance framework. 661-666 - Soon Tee Teoh, Kwan-Liu Ma:

PaintingClass: interactive construction, visualization and exploration of decision trees. 667-672 - Ioannis Tsamardinos, Constantin F. Aliferis, Alexander R. Statnikov:

Time and sample efficient discovery of Markov blankets and direct causal relations. 673-678 - Hang Yu, Ee-Chien Chang

:
Distributed multivariate regression based on influential observations. 679-684 - Lei Yu, Huan Liu:

Efficiently handling feature redundancy in high-dimensional data. 685-690
Industrial/government track
- Rafael Alonso, Jeffrey A. Bloom, Hua Li, Chumki Basu:

An adaptive nearest neighbor search for a parts acquisition ePortal. 693-698 - Philip S. Barry, Jianping Zhang, Mary McDonald:

Architecting a knowledge discovery engine for military commanders utilizing massive runs of simulations. 699-704 - Tamraparni Dasu, Gregg T. Vesonder, Jon R. Wright:

Data quality through knowledge engineering. 705-710 - Gloria T. Lau, Kincho H. Law, Gio Wiederhold:

Similarity analysis on government regulations. 711-716 - Uwe F. Mayer, Armand Sarkissian:

Experimental design for solicitation campaigns. 717-722 - Matthew Eric Otey, Srinivasan Parthasarathy

, Amol Ghoting, G. Li, Sundeep Narravula, Dhabaleswar K. Panda:
Towards NIC-based intrusion detection. 723-728 - Chang-Shing Perng, David Thoenen, Genady Grabarnik, Sheng Ma, Joseph L. Hellerstein:

Data-driven validation, completion and construction of event relationship networks. 729-734 - Kevin B. Pratt, Gleb Tschapek:

Visualizing concept drift. 735-740 - Keiko Shimazu, Atsuhito Momma, Koichi Furukawa:

Experimental study of discovering essential information from customer inquiry. 741-746 - Zhongfei (Mark) Zhang, John J. Salerno, Philip S. Yu:

Applying data mining in investigating money laundering crimes. 747-752

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














