The 7th Paci?c Asia Conference on Knowledge Discovery and Data Mining (PAKDD) was held from April 30 to May 2, 2003 in the Convention and Ex- bition Center (COEX), Seoul, Korea. The PAKDD conference is a major forum for academic researchers and industry practitioners in the Paci?c Asia region to share original research results and development experiences from di?erent KDD-related areas such as data mining, data warehousing, machine learning, databases, statistics, knowledge acquisition and discovery, data visualization, and knowledge-based systems. The conference was organized by the Advanced Information Technology Research Center (AITrc) at KAIST and the Statistical Research Center for Complex Systems (SRCCS) at Seoul National University. It was sponsored by the Korean Datamining Society (KDMS), the Korea Inf- mation Science Society (KISS), the United States Air Force O?ce of Scienti?c Research, the Asian O?ce of Aerospace Research & Development, and KAIST. It was held with cooperation from ACM’s Special Group on Knowledge Dis- very and Data Mining (SIGKDD).
Author(s): P. S. Bradley (auth.), Kyu-Young Whang, Jongwoo Jeon, Kyuseok Shim, Jaideep Srivastava (eds.)
Series: Lecture Notes in Computer Science 2637 : Lecture Notes in Artificial Intelligence
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2003
Language: English
Pages: 614
Tags: Artificial Intelligence (incl. Robotics); Probability and Statistics in Computer Science; Database Management; Information Storage and Retrieval; Information Systems Applications (incl.Internet); Business Information Systems
Data Mining as an Automated Service....Pages 1-13
Trends and Challenges in the Industrial Applications of KDD....Pages 14-14
Finding Event-Oriented Patterns in Long Temporal Sequences....Pages 15-26
Mining Frequent Episodes for Relating Financial Events and Stock Trends....Pages 27-39
An Efficient Algorithm of Frequent Connected Subgraph Extraction....Pages 40-51
Classifier Construction by Graph-Based Induction for Graph-Structured Data....Pages 52-62
Comparison of the Performance of Center-Based Clustering Algorithms....Pages 63-74
Automatic Extraction of Clusters from Hierarchical Clustering Representations....Pages 75-87
Large Scale Unstructured Document Classification Using Unlabeled Data and Syntactic Information....Pages 88-99
Extracting Shared Topics of Multiple Documents....Pages 100-110
An Empirical Study on Dimensionality Optimization in Text Mining for Linguistic Knowledge Acquisition....Pages 111-116
A Semi-supervised Algorithm for Pattern Discovery in Information Extraction from Textual Data....Pages 117-123
Mining Patterns of Dyspepsia Symptoms Across Time Points Using Constraint Association Rules....Pages 124-135
Predicting Protein Structural Class from Closed Protein Sequences....Pages 136-147
Learning Rules to Extract Protein Interactions from Biomedical Text....Pages 148-158
Predicting Protein Interactions in Human by Homologous Interactions in Yeast....Pages 159-165
Mining the Customer’s Up-To-Moment Preferences for E-commerce Recommendation....Pages 166-177
A Graph-Based Optimization Algorithm for Website Topology Using Interesting Association Rules....Pages 178-190
A Markovian Approach for Web User Profiling and Clustering....Pages 191-202
Extracting User Interests from Bookmarks on the Web....Pages 203-208
Mining Frequent Instances on Workflows....Pages 209-221
Real Time Video Data Mining for Surveillance Video Streams....Pages 222-233
Distinguishing Causal and Acausal Temporal Relations....Pages 234-240
Online Bayes Point Machines....Pages 241-252
Exploiting Hierarchical Domain Values for Bayesian Learning....Pages 253-264
A New Restricted Bayesian Network Classifier....Pages 265-270
AGRID: An Efficient Algorithm for Clustering Large High-Dimensional Datasets....Pages 271-282
Multi-level Clustering and Reasoning about Its Clusters Using Region Connection Calculus....Pages 283-294
An Efficient Cell-Based Clustering Method for Handling Large, High-Dimensional Data....Pages 295-300
Enhancing SWF for Incremental Association Mining by Itemset Maintenance....Pages 301-312
Reducing Rule Covers with Deterministic Error Bounds....Pages 313-324
Evolutionary Approach for Mining Association Rules on Dynamic Databases....Pages 325-336
Position Coded Pre-order Linked WAP-Tree for Web Log Sequential Pattern Mining....Pages 337-349
An Integrated System of Mining HTML Texts and Filtering Structured Documents....Pages 350-355
A New Sequential Mining Approach to XML Document Similarity Computation....Pages 356-362
Optimization of Fuzzy Rules for Classification Using Genetic Algorithm....Pages 363-375
Fast Pattern Selection for Support Vector Classifiers....Pages 376-387
Averaged Boosting: A Noise-Robust Ensemble Method....Pages 388-393
Improving Performance of Decision Tree Algorithms with Multi-edited Nearest Neighbor Rule....Pages 394-398
HOT: Hypergraph-Based Outlier Test for Categorical Data....Pages 399-410
A Method for Aggregating Partitions, Applications in K.D.D.....Pages 411-422
Efficiently Computing Iceberg Cubes with Complex Constraints through Bounding....Pages 423-429
Extraction of Tag Tree Patterns with Contractible Variables from Irregular Semistructured Data....Pages 430-436
Step-by-Step Regression: A More Efficient Alternative for Polynomial Multiple Linear Regression in Stream Cube....Pages 437-448
Progressive Weighted Miner: An Efficient Method for Time-Constraint Mining....Pages 449-460
Mining Open Source Software (OSS) Data Using Association Rules Network....Pages 461-466
Parallel FP-Growth on PC Cluster....Pages 467-473
Active Feature Selection Using Classes....Pages 474-485
Electricity Based External Similarity of Categorical Attributes....Pages 486-500
Weighted Proportional k-Interval Discretization for Naive-Bayes Classifiers....Pages 501-512
Dealing with Relative Similarity in Clustering: An Indiscernibility Based Approach....Pages 513-518
Considering Correlation between Variables to Improve Spatiotemporal Forecasting....Pages 519-531
Correlation Analysis of Spatial Time Series Datasets: A Filter-and-Refine Approach....Pages 532-544
When to Update the Sequential Patterns of Stream Data?....Pages 545-550
A New Clustering Algorithm for Transaction Data via Caucus....Pages 551-562
DBRS: A Density-Based Spatial Clustering Method with Random Sampling....Pages 563-575
Optimized Clustering for Anomaly Intrusion Detection....Pages 576-581
Finding Frequent Subgraphs from Graph Structured Data with Geometric Information and Its Application to Lossless Compression....Pages 582-594
Upgrading ILP Rules to First-Order Bayesian Networks....Pages 595-601
A Clustering Validity Assessment Index....Pages 602-608