This book constitutes the refereed proceedings of the 7th International Conference on Discovery Science, DS 2004, held in Padova, Italy in October 2004.The 20 revised long papers and the 19 revised regular papers presented were carefully reviewed and selected from 80 submissions. The papers are organized in topical sections on pattern mining, classification, outlier detection, clustering, feature construction and generation, knowledge acquisition, discovery science in reality, machine learning algorithms, Web mining, applications of predictive methods, and interdisciplinary approaches.
Author(s): Suzuki E. (Ed), Arikawa S. (Ed)
Year: 2004
Language: English
Pages: 448
Table of Contents......Page 12
Predictive Graph Mining......Page 16
An Efficient Algorithm for Enumerating Closed Patterns in Transaction Databases......Page 31
Finding Optimal Pairs of Cooperative and Competing Patterns with Bounded Distance......Page 47
Mining Noisy Data Streams via a Discriminative Model......Page 62
CorClass: Correlated Association Rule Mining for Classification......Page 75
Maximum a Posteriori Tree Augmented Naive Bayes Classifiers......Page 88
Improving Prediction of Distance-Based Outliers......Page 104
Detecting Outliers via Logical Theories and Its Data Complexity......Page 116
Fast Hierarchical Clustering Algorithm Using Locality-Sensitive Hashing......Page 129
Measuring the Similarity for Heterogenous Data: An Ordered Probability-Based Approach......Page 144
Constructive Inductive Learning Based on Meta-attributes......Page 157
Resemblance Coefficient and a Quantum Genetic Algorithm for Feature Selection......Page 170
Extracting Positive Attributions from Scientific Papers......Page 184
Enhancing SVM with Visualization......Page 198
An Associative Information Retrieval Based on the Dependency of Term Co-occurrence......Page 210
On the Convergence of Incremental Knowledge Base Construction......Page 222
Privacy Problems with Anonymized Transaction Databases......Page 234
A Methodology for Biologically Relevant Pattern Discovery from Gene Expression Data......Page 245
Using the Computer to Study the Dynamics of the Handwriting Processes......Page 257
Product Recommendation in e-Commerce Using Direct and Indirect Confidence for Historical User Sessions......Page 270
Optimal Discovery of Subword Associations in Strings......Page 285
Tiling Databases......Page 293
A Clustering of Interestingness Measures......Page 305
Extracting Minimal and Closed Monotone DNF Formulas......Page 313
Characterizations of Multivalued Dependencies and Related Expressions......Page 321
Outlier Handling in the Neighbourhood-Based Learning of a Continuous Class......Page 329
A New Clustering Algorithm Based On Cluster Validity Indices......Page 337
An Efficient Rules Induction Algorithm for Rough Set Classification......Page 345
Analysing the Trade-Off Between Comprehensibility and Accuracy in Mimetic Models......Page 353
Generating AVTs Using GA for Learning Decision Tree Classifiers with Missing Data......Page 362
Using WWW-Distribution of Words in Detecting Peculiar Web Pages......Page 370
DHT Facilitated Web Service Discovery Incorporating Semantic Annotation......Page 378
Discovering Relationships Among Catalogs......Page 386
Reasoning-Based Knowledge Extraction for Text Classification......Page 395
A Useful System Prototype for Intrusion Detection – Architecture and Experiments......Page 403
Discovery of Hidden Similarity on Collaborative Filtering to Overcome Sparsity Problem......Page 411
Seamlessly Supporting Combined Knowledge Discovery and Query Answering: A Case Study......Page 418
A Structuralist Approach Towards Computational Scientific Discovery......Page 427
Extracting Modal Implications and Equivalences from Cognitive Minds......Page 435
P......Page 444
Z......Page 445