


default search action
5th SDM 2005: Newport Beach, California, USA
- Hillol Kargupta, Jaideep Srivastava, Chandrika Kamath, Arnold Goodman:

Proceedings of the 2005 SIAM International Conference on Data Mining, SDM 2005, Newport Beach, CA, USA, April 21-23, 2005. SIAM 2005, ISBN 978-0-89871-593-4
Statistics in Data Mining
- Sijin Liu, Xiaotong Shen, Wing Hung Wong:

Computational Developments of ψ-learning. 1-11 - Matthew Brand:

A Random Walks Perspective on Maximizing Satisfaction and Profit. 12-19 - Ronald K. Pearson:

Surveying Data for Patchy Structure. 20-31 - Chris H. Q. Ding, Jieping Ye:

2-Dimensional Singular Value Decomposition for 2D Maps and Images. 32-43
Stream Data Mining
- Graham Cormode

, S. Muthukrishnan:
Summarizing and Mining Skewed Data Streams. 44-55 - Charu C. Aggarwal, Philip S. Yu:

Online Analysis of Community Evolution in Data Streams. 56-67 - Chih-Hsiang Lin, Ding-Ying Chiu, Yi-Hung Wu, Arbee L. P. Chen:

Mining Frequent Itemsets from Data Streams with a Time-Sensitive Sliding Window. 68-79 - Charu C. Aggarwal:

On Abnormality Detection in Spuriously Populated Data Streams. 80-91
Privacy Preserving Data Mining
- Zhiqiang Yang, Sheng Zhong, Rebecca N. Wright:

Privacy-Preserving Classification of Customer Data without Loss of Accuracy. 92-102 - Xintao Wu

, Ying Wu, Yongge Wang
, Yingjiu Li:
Privacy Aware Market Basket Data Set Generation: A Feasible Approach for Inverse Frequent Set Mining. 103-114 - Charu C. Aggarwal, Philip S. Yu:

On Variable Constraints in Privacy Preserving Data Mining. 115-125
Clustering
- David Gondek, Shivakumar Vaithyanathan, Ashutosh Garg:

Clustering with Model-level Constraints. 126-137 - Ian Davidson, S. S. Ravi:

Clustering with Constraints: Feasibility Issues and the k-Means Algorithm. 138-149 - Yu Xia, Jiming Peng:

A Cutting Algorithm for the Minimum Sum-of-Squared Error Clustering. 150-160
Scientific Data Mining
- Sameep Mehta, Steve Barr, Tat-Sang Choy, Hui Yang, Srinivasan Parthasarathy

, Raghu Machiraju, John Wilkins:
Dynamic Classification of Defect Structures in Molecular Dynamics Simulation Data. 161-172 - Bavani Arunasalam, Sanjay Chawla, Pei Sun:

Striking Two Birds With One Stone: Simultaneous Mining of Positive and Negative Spatial Patterns. 173-182 - Ata Kabán, Louisa Nolan, Somak Raychaudhury:

Finding Young Stellar Populations in Elliptical Galaxies from Independent Components of Optical Spectra. 183-194
Classifiers and Ensembles
- Qinghua Hu, Daren Yu, Zongxia Xie:

Hybrid Attribute Reduction for Classification Based on A Fuzzy Rough Set Technique. 195-204 - Jianyong Wang, George Karypis

:
HARMONY: Efficiently Mining the Best Rules for Classification. 205-216 - Carlotta Domeniconi, Bojun Yan:

On Error Correlation and Accuracy of Nearest Neighbor Ensemble Classifiers. 217-226
Association Rules and Database Issues
- Yiqiu Han, Wai Lam:

Lazy Learning for Classification Based on Query Projections. 227-238 - Bart Goethals

, Juho Muhonen, Hannu Toivonen:
Mining Non-Derivable Association Rules. 239-249 - Toon Calders, Bart Goethals

:
Depth-First Non-Derivable Itemset Mining. 250-261 - Dmitri V. Kalashnikov, Sharad Mehrotra, Zhaoqi Chen:

Exploiting Relationships for Domain-Independent Data Cleaning. 262-273
Graphs and Graphical Models
- Scott White, Padhraic Smyth

:
A Spectral Clustering Approach To Finding Communities in Graph. 274-285 - Chao Liu, Xifeng Yan, Hwanjo Yu, Jiawei Han, Philip S. Yu:

Mining Behavior Graphs for "Backtrace" of Noncrashing Bugs. 286-297 - Tak-Lam Wong, Wai Lam:

Learning to Refine Ontology for a New Web Site Using a Bayesian Approach. 298-309 - Radu Stefan Niculescu, Tom M. Mitchell, R. Bharat Rao:

Exploiting Parameter Related Domain Knowledge for Learning in Graphical Models. 310-321
SVM and Classification
- Navneet Panda, Edward Y. Chang:

Exploiting Geometry for Support Vector Machine Indexing. 322-333 - Shibin Qiu, Terran Lane:

Parallel Computation of RBF Kernels for Support Vector Classifiers. 334-345 - Yun Chi, Philip S. Yu, Haixun Wang, Richard R. Muntz:

Loadstar: A Load Shedding Scheme for Classifying Data Streams. 346-357
Complex Data Types: Text, Images, and Sequences
- Ying Zhao, George Karypis

:
Topic-driven Clustering for Document Datasets. 358-369 - Tomás Singliar, Milos Hauskrecht:

Variational Learning for Noisy-OR Component Analysis. 370-379 - Gemma Casas-Garriga:

Summarizing Sequential Data with Closed Partial Orders. 380-391
Statistics in Data Mining
- Kwok Pan Pang:

SUMSRM: A New Statistic for the Structural Break Detection in Time Series. 392-403 - Robert Gwadera, Mikhail J. Atallah, Wojciech Szpankowski:

Markov Models for Identification of Significant Episodes. 404-414 - Congnan Luo, Soon Myoung Chung:

Efficient Mining of Maximal Sequential Patterns Using Multiple Samples. 415-426
Scientific Data Mining
- Naren Ramakrishnan

, Chris Bailey-Kellogg, Satish Tadepalli, Varun Pandey:
Gaussian Processes for Active Data Mining of Spatial Aggregates. 427-438 - Xiaoli Zhang Fern, Carla E. Brodley, Mark A. Friedl:

Correlation Clustering for Learning Mixtures of Canonical Correlation Models. 439-448 - Michail Vlachos, Philip S. Yu, Vittorio Castelli:

On Periodicity Detection and Structural Periodic Similarity. 449-460
Poster Papers
- Jian Pei

, Moonjung Cho, David Wai-Lok Cheung:
Cross Table Cubing: Mining Iceberg Cubes from Data Warehouses. 461-465 - Amir Bar-Or, Ran Wolff, Assaf Schuster, Daniel Keren:

Decision Tree Induction in High Dimensional, Hierarchically Distributed Databases. 466-470 - Daniel Lemire, Anna Maclachlan:

Slope One Predictors for Online Rating-Based Collaborative Filtering. 471-475 - Murat Dundar, Glenn Fung, Jinbo Bi, Sathyakama Sandilya, R. Bharat Rao:

Sparse Fisher Discriminant Analysis for Computer Aided Detection. 476-480 - Hamad Alhammady, Kotagiri Ramamohanarao:

Expanding the Training Data Space Using Emerging Patterns and Genetic Methods. 481-485 - Wei Fan, Janek Mathuria, Chang-Tien Lu

:
Making Data Mining Models Useful to Model Non-paying Customers of Exchange Carriers. 486-490 - Shuting Xu, Jun Zhang:

Matrix Condition Number Prediction with SVM Regression and Feature Selection. 491-495 - Reda Alhajj:

Cluster Validity Analysis of Alternative Results from Multi-Objective Optimization. 496-500 - Kuo-Yu Huang, Chia-Hui Chang

, Kuo-Zui Lin:
ClosedPROWL: Efficient Mining of Closed Frequent Continuities by Projected Window List Technology. 501-505 - Chotirat (Ann) Ratanamahatana, Eamonn J. Keogh:

Three Myths about Dynamic Time Warping Data Mining. 506-510 - Effrosini Kokiopoulou, Yousef Saad

:
PCA without eigenvalue calculations: a case study on face recognition. 511-515 - Raymond Chi-Wing Wong, Ada Wai-Chee Fu:

Mining Top-K Itemsets over a Sliding Window Based on Zipfian Distribution. 516-520 - Tao Li:

Hierarchical Document Classification Using Automatically Generated Hierarchy. 521-525 - Tao Li:

On Clustering Binary Data. 526-530 - Nitin Kumar, Venkata Nishanth Lolla, Eamonn J. Keogh, Stefano Lonardi, Chotirat (Ann) Ratanamahatana:

Time-series Bitmaps: a Practical Visualization Tool for Working with Large Time Series Databases. 531-535 - Rong She, Ke Wang, Yabo Xu, Philip S. Yu:

Pushing Feature Selection Ahead Of Join. 536-540 - Shiying Huang, Geoffrey I. Webb:

Discarding Insignificant Rules during Impact Rule Discovery in Large, Dense Databases. 541-545 - Himika Biswas, Somnath Pal:

SPID4.7: Discretization Using Successive Pseudo Deletion at Maximum Information Gain Boundary Points. 546-550 - Zheng Sun, Philip S. Yu, Xiang-Yang Li:

Iterative Mining for Rules with Constrained Antecedents. 551-555 - Al Mamunur Rashid, George Karypis

, John Riedl:
Influence in Ratings-Based Recommender Systems: An Algorithm-Independent Approach. 556-560 - Gianluigi Greco, Antonella Guzzo, Giuseppe Manco, Domenico Saccà:

Mining Unconnected Patterns in Workflows. 561-565 - Bharath Kumar Mohan:

The Best Nurturers in Computer Science Research. 566-570 - Tsuyoshi Idé

, Keisuke Inoue:
Knowledge Discovery from Heterogeneous Dynamic Systems using Change-Point Correlations. 571-575 - Ke Wang, Yabo Xu, Philip S. Yu, Rong She:

Building Decision Trees on Records Linked through Key References. 576-580 - Giuliano Tirenni, Abderrahim Labbi, André Elisseeff, Cesar Berrospi:

Efficient Allocation of Marketing Resources using Dynamic Programming. 581-585 - Haixun Wang, Chang-Shing Perng, Philip S. Yu:

Near-Neighbor Search in Pattern Distance Spaces. 586-590 - Haiyun Bian, Raj Bhatnagar:

An Algorithm for Well Structured Subspace Clusters. 591-595 - Vincent Shin-Mu Tseng, Chao-Hui Lee:

CBS: A New Classification Method by Using Sequential Patterns. 596-600 - Hong Cheng, Xifeng Yan, Jiawei Han:

SeqIndex: Indexing Sequences by Sequential Pattern Analysis. 601-605 - Chris H. Q. Ding, Xiaofeng He:

On the Equivalence of Nonnegative Matrix Factorization and Spectral Clustering. 606-610 - Gang Wu, Zhihua Zhang, Edward Y. Chang:

Kronecker Factorization for Speeding up Kernel Machines. 611-615 - Feng Kang, Rong Jin:

Symmetric Statistical Translation Models for Automatic Image Annotation. 616-620 - Kang Peng, Slobodan Vucetic, Zoran Obradovic:

Correcting Sampling Bias in Structural Genomics through Iterative Selection of Underrepresented Targets. 621-625 - Alina Beygelzimer, Emre Erdogan, Sheng Ma, Irina Rish:

Statictical Models for Unequally Spaced Time Series. 626-630 - Efstratios Gallopoulos

, Dimitrios Zeimpekis:
CLSI: A Flexible Approximation Scheme from Clustered Term-Document Matrices. 631-635 - Unil Yun, John J. Leggett:

WFIM: Weighted Frequent Itemset Mining with a weight range and a minimum weight. 636-640 - Martin H. C. Law, Alexander P. Topchy, Anil K. Jain:

Model-based Clustering With Probabilistic Constraints. 641-645

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














