Within the last few years Data Warehousing and Knowledge Discovery technology has established itself as a key technology for enterprises that wish to improve the quality of the results obtained from data analysis, decision support, and the automatic extraction of knowledge from data. The Fourth International Conference on Data Warehousing and Knowledge Discovery (DaWaK 2002) continues a series of successful conferences dedicated to this topic. Its main objective is to bring together researchers and practitioners to discuss research issues and experience in developing and deploying data warehousing and knowledge discovery systems, applications, and solutions. The conference focuses on the logical and physical design of data warehousing and knowledge discovery systems. The scope of the papers covers the most recent and relevant topics in the areas of association rules, clustering, Web mining, security, data mining techniques, data cleansing, applications, data warehouse design and maintenance, and OLAP. These proceedings contain the technical papers selected for presentation at the conference. We received more than 100 papers from over 20 countries, and the program committee finally selected 32 papers. The conference program included one invited talk: “Text Mining Applications of a Shallow Parser” by Walter Daelemans, Univer- ty of Antwerp, Belgium. We would like to thank the DEXA 2002 Workshop General Chair (Roland Wagner) th and the organizing committee of the 13 International Conference on Database and Expert Systems Applications (DEXA 2002) for their support and their cooperation.
Author(s): Marco Botta, Jean-Francois Boulicaut, Cyrille Masson, Rosa Meo (auth.), Yahiko Kambayashi, Werner Winiwarter, Masatoshi Arikawa (eds.)
Series: Lecture Notes in Computer Science 2454
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2002
Language: English
Pages: 339
Tags: Database Management; Information Storage and Retrieval; Computer Communication Networks; Information Systems Applications (incl.Internet); Multimedia Information Systems; Business Information Systems
A Comparison between Query Languages for the Extraction of Association Rules....Pages 1-10
Learning from Dissociations * ....Pages 11-20
Mining Association Rules from XML Data....Pages 21-30
Estimating Joint Probabilities from Marginal Ones * ....Pages 31-41
Self-Tuning Clustering: An Adaptive Clustering Method for Transaction Data....Pages 42-51
CoFD : An Algorithm for Non-distance Based Clustering in High Dimensional Spaces * ....Pages 52-62
An Efficient K -Medoids-Based Algorithm Using Previous Medoid Index, Triangular Inequality Elimination Criteria, and Partial Distance Search....Pages 63-72
A Hybrid Approach to Web Usage Mining....Pages 73-82
Building and Exploiting Ad Hoc Concept Hierarchies for Web Log Analysis....Pages 83-93
Authorization Based on Evidence and Trust * ....Pages 94-103
An Algorithm for Building User-Role Profiles in a Trust Environment 1 ....Pages 104-113
Neural-Based Approaches for Improving the Accuracy of Decision Trees....Pages 114-123
Approximate k -Closest-Pairs with Space Filling Curves....Pages 124-134
Optimal Dimension Order: A Generic Technique for the Similarity Join....Pages 135-149
Fast Discovery of Sequential Patterns by Memory Indexing....Pages 150-160
Dynamic Similarity for Fields with NULL Values....Pages 161-169
Outlier Detection Using Replicator Neural Networks....Pages 170-180
The Closed Keys Base of Frequent Itemsets....Pages 181-190
New Representation and Algorithm for Drawing RNA Structure with Pseudoknots * ....Pages 191-201
Boosting Naive Bayes for Claim Fraud Diagnosis....Pages 202-211
Optimization of Association Word Knowledge Base through Genetic Algorithm....Pages 212-221
Mining Temporal Patterns from Health Care Data * ....Pages 222-231
Adding a Performance-Oriented Perspective to Data Warehouse Design....Pages 232-244
Cost Modeling and Estimation for OLAP-XML Federations....Pages 245-254
Constraint-Free Join Processing on Hyperlinked Web Data....Pages 255-264
Focusing on Data Distribution in the WebD 2 W System....Pages 265-274
A Decathlon in Multidimensional Modeling: Open Issues and Some Solutions....Pages 275-285
Modeling and Imputation of Large Incomplete Multidimensional Datasets....Pages 286-295
PartJoin:An Efficient Storage and Query Execution for Data Warehouses....Pages 296-306
A Transactional Approach to Parallel Data Warehouse Maintenance....Pages 307-316
Striving towards Near Real-Time Data Integration for Data Warehouses....Pages 317-326
Time-Interval Sampling for Improved Estimations in Data Warehouses....Pages 327-337