This book constitutes the refereed proceedings of the 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2001, held in Hong Kong, China, in April 2001.
The 38 revised full papers and 22 short papers presented were carefully reviewed and selected from a total of 152 submissions. The book offers topical sections on Web mining, text mining, applications and tools, concept hierarchies, feature selection, interestingness, sequence mining, spatial and temporal mining, association mining, classification and rule induction, clustering, and advanced topics and new methods.
Author(s): Hosagrahar Visvesvaraya Jagadish (auth.), David Cheung, Graham J. Williams, Qing Li (eds.)
Series: Lecture Notes in Computer Science 2035: Lecture Notes in Artificial Intelligence
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2001
Language: English
Pages: 599
Tags: Artificial Intelligence (incl. Robotics); Information Storage and Retrieval; Business Information Systems; Information Systems Applications (incl. Internet); Probability and Statistics in Computer Science; Computers and Society
Incompleteness in Data Mining....Pages 1-1
Mining E-Commerce Data: The Good, the Bad, and the Ugly....Pages 2-2
Seamless Integration of Data Mining with DBMS and Applications....Pages 3-3
Applying Pattern Mining to Web Information Extraction....Pages 4-15
Empirical Study of Recommender Systems Using Linear Classifiers....Pages 16-27
iJADE eMiner - A Web-Based Mining Agent Based on Intelligent Java Agent Development Environment (iJADE) on Internet Shopping....Pages 28-40
A Characterized Rating Recommend System....Pages 41-46
Discovery of Frequent Tree Structured Patterns in Semistructured Web Documents....Pages 47-52
Text Categorization Using Weight Adjusted k-Nearest Neighbor Classification....Pages 53-65
Predictive Self-Organizing Networks for Text Categorization....Pages 66-77
Meta-learning Models for Automatic Textual Document Categorization....Pages 78-89
Efficient Algorithms for Concept Space Construction....Pages 90-101
Topic Detection, Tracking, and Trend Analysis Using Self-Organizing Neural Networks....Pages 102-107
Automatic Hypertext Construction through a Text Mining Approach by Self-Organizing Maps....Pages 108-113
Semantic Expectation-Based Causation Knowledge Extraction: A Study on Hong Kong Stock Movement Analysis....Pages 114-123
A Toolbox Approach to Flexible and Efficient Data Mining....Pages 124-135
Determining Progression in Glaucoma Using Visual Fields....Pages 136-147
Seabreeze Prediction Using Bayesian Networks....Pages 148-153
Semi-supervised Learning in Medical Image Database....Pages 154-160
On Application of Rough Data Mining Methods to Automatic Construction of Student Models....Pages 161-166
Concept Approximation in Concept Lattice....Pages 167-173
Generating Concept Hierarchies/Networks: Mining Additional Semantics in Relational Data....Pages 174-185
Representing Large Concept Hierarchies Using Lattice Data Structure....Pages 186-197
Feature Selection for Temporal Health Records....Pages 198-209
Boosting the Performance of Nearest Neighbour Methods with Feature Selection....Pages 210-221
Feature Selection for Meta-learning....Pages 222-233
Efficient Mining of Niches and Set Routines....Pages 234-246
Evaluation of Interestingness Measures for Ranking Discovered Knowledge....Pages 247-259
Peculiarity Oriented Mining and Its Application for Knowledge Discovery in Amino-Acid Data....Pages 260-269
Mining Sequence Patterns from Wind Tunnel Experimental Data for Flight Control....Pages 270-281
Scalable Hierarchical Clustering Method for Sequences of Categorical Values....Pages 282-293
FFS - An I/O-Efficient Algorithm for Mining Frequent Sequences....Pages 294-305
Sequential Index Structure for Content-Based Retrieval....Pages 306-311
The S²-Tree: An Index Structure for Subsequence Matching of Spatial Objects....Pages 312-323
Temporal Data Mining Using Hidden Markov-Local Polynomial Models....Pages 324-335
Patterns Discovery Based on Time-Series Decomposition....Pages 336-347
Criteria on Proximity Graphs for Boundary Extraction and Spatial Clustering....Pages 348-357
Micro Similarity Queries in Time Series Database....Pages 358-363
Mining Optimal Class Association Rule Set....Pages 364-375
Generating Frequent Patterns with the Frequent Pattern List....Pages 376-386
User-Defined Association Mining....Pages 387-399
Direct and Incremental Computing of Maximal Covering Rules....Pages 400-405
Towards Efficient Data Re-mining (DRM)....Pages 406-412
Data Allocation Algorithm for Parallel Association Rule Discovery....Pages 413-420
Direct Domain Knowledge Inclusion in the PA3 Rule Induction Algorithm....Pages 421-432
Hierarchical Classification of Documents with Error Control....Pages 433-443
An Efficient Data Compression Approach to the Classification Task....Pages 444-454
Combining the Strength of Pattern Frequency and Distance for Classification....Pages 455-466
A Scalable Algorithm for Rule Post-pruning of Large Decision Trees....Pages 467-476
Optimizing the Induction of Alternating Decision Trees....Pages 477-487
Building Behaviour Knowledge Space to Make Classification Decision....Pages 488-494
Efficient Hierarchical Clustering Algorithms Using Partially Overlapping Partitions....Pages 495-506
A Rough Set-Based Clustering Method with Modification of Equivalence Relations....Pages 507-512
Importance of Individual Variables in the k-Means Algorithm....Pages 513-518
A Hybrid Approach to Clustering in Very Large Databases....Pages 519-524
A Similarity Indexing Method for the Data Warehousing - Bit-Wise Indexing Method....Pages 525-537
Rule Reduction over Numerical Attributes in Decision Trees Using Multilayer Perceptron....Pages 538-549
Knowledge Acquisition from Both Human Expert and Data....Pages 550-561
Neighborhood Dependencies for Prediction....Pages 562-567
Learning Bayesian Networks with Hidden Variables Using the Combination of EM and Evolutionary Algorithms....Pages 568-574
Interactive Construction of Decision Trees....Pages 575-580
An Improved Learning Algorithm for Augmented Naive Bayes....Pages 581-586
Generalised RBF Networks Trained Using an IBL Algorithm for Mining Symbolic Data....Pages 587-593