Principles of Data Mining and Knowledge Discovery: 6th European Conference, PKDD 2002 Helsinki, Finland, August 19–23, 2002 Proceedings

This document was uploaded by one of our users. The uploader already confirmed that they had the permission to publish it. If you are author/publisher or own the copyright of this documents, please report to us by using this DMCA report form.

Simply click on the Download Book button.

Yes, Book downloads on Ebookily are 100% Free.

Sometimes the book is free on Amazon As well, so go ahead and hit "Search on Amazon"

This book constitutes the refereed proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery, PKDD 2002, held in Helsinki, Finland in August 2002.
The 39 revised full papers presented together with 4 invited contributions were carefully reviewed and selected from numerous submissions. Among the topics covered are kernel methods, probabilistic methods, association rule mining, rough sets, sampling algorithms, pattern discovery, web text mining, meta data clustering, rule induction, information extraction, dependency detection, rare class prediction, classifier systems, text classification, temporal sequence analysis, unsupervised learning, time series analysis, medical data mining, etc.

Author(s): Kenji Abe, Shinji Kawasoe, Tatsuya Asai, Hiroki Arimura, Setsuo Arikawa (auth.), Tapio Elomaa, Heikki Mannila, Hannu Toivonen (eds.)
Series: Lecture Notes in Computer Science 2431 : Lecture Notes in Artificial Intelligence
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2002

Language: English
Pages: 514
Tags: Artificial Intelligence (incl. Robotics); Database Management; Mathematical Logic and Formal Languages; Probability and Statistics in Computer Science; Document Preparation and Text Processing; Information Storage and Retrieval

Optimized Substructure Discovery for Semi-structured Data....Pages 1-14
Fast Outlier Detection in High Dimensional Spaces....Pages 15-27
Data Mining in Schizophrenia Research — Preliminary Analysis....Pages 27-38
Fast Algorithms for Mining Emerging Patterns....Pages 39-50
On the Discovery of Weak Periodicities in Large Time Series....Pages 51-61
The Need for Low Bias Algorithms in Classification Learning from Large Data Sets....Pages 62-73
Mining All Non-derivable Frequent Itemsets....Pages 74-86
Iterative Data Squashing for Boosting Based on a Distribution-Sensitive Distance....Pages 86-98
Finding Association Rules with Some Very Frequent Attributes....Pages 99-111
Unsupervised Learning: Self-aggregation in Scaled Principal Component Space * ....Pages 112-124
A Classification Approach for Prediction of Target Events in Temporal Sequences....Pages 125-137
Privacy-Oriented Data Mining by Proof Checking....Pages 138-149
Choose Your Words Carefully: An Empirical Study of Feature Selection Metrics for Text Classification....Pages 150-162
Generating Actionable Knowledge by Expert-Guided Subgroup Discovery....Pages 163-175
Clustering Transactional Data....Pages 175-187
Multiscale Comparison of Temporal Patterns in Time-Series Medical Databases....Pages 188-199
Association Rules for Expressing Gradual Dependencies....Pages 200-211
Support Approximations Using Bonferroni-Type Inequalities....Pages 212-224
Using Condensed Representations for Interactive Association Rule Mining....Pages 225-236
Predicting Rare Classes: Comparing Two-Phase Rule Induction to Cost-Sensitive Boosting....Pages 237-249
Dependency Detection in MobiMine and Random Matrices....Pages 250-262
Long-Term Learning for Web Search Engines....Pages 263-274
Spatial Subgroup Mining Integrated in an Object-Relational Spatial Database....Pages 275-286
Involving Aggregate Functions in Multi-relational Search....Pages 287-298
Information Extraction in Structured Documents Using Tree Automata Induction....Pages 299-311
Algebraic Techniques for Analysis of Large Discrete-Valued Datasets....Pages 311-324
Geography of Di.erences between Two Classes of Data....Pages 325-337
Rule Induction for Classification of Gene Expression Array Data....Pages 338-347
Clustering Ontology-Based Metadata in the Semantic Web....Pages 348-360
Iteratively Selecting Feature Subsets for Mining from High-Dimensional Databases....Pages 361-372
SVM Classification Using Sequences of Phonemes and Syllables....Pages 373-384
A Novel Web Text Mining Method Using the Discrete Cosine Transform....Pages 385-397
A Scalable Constant-Memory Sampling Algorithm for Pattern Discovery in Large Databases....Pages 397-409
Answering the Most Correlated N Association Rules Efficiently....Pages 410-422
Mining Hierarchical Decision Rules from Clinical Databases Using Rough Sets and Medical Diagnostic Model....Pages 423-435
Efficiently Mining Approximate Models of Associations in Evolving Databases....Pages 435-448
Explaining Predictions from a Neural Network Ensemble One at a Time....Pages 449-460
Structuring Domain-Specific Text Archives by Deriving a Probabilistic XML DTD....Pages 461-474
Separability Index in Supervised Learning....Pages 475-487
Finding Hidden Factors Using Independent Component Analysis....Pages 488-488
Reasoning with Classifiers * ....Pages 489-493
A Kernel Approach for Learning from Almost Orthogonal Patterns....Pages 494-511
Learning with Mixture Models: Concepts and Applications....Pages 512-512