This book constitutes the refereed proceedings of the 5th International Conference on Data Warehousing and Knowledge Discovery, DaWaK 2003, held in Prague, Czech Republic in September 2003.
The 41 revised full papers presented were carefully reviewed and selected from more than 130 submissions. The papers are organized in topical sections on data cubes and queries, multidimensional data models, Web warehousing, change detection, Web mining and association rules, association rules and decision trees, clustering, association rule mining, data analysis and discovery, ontologies and improving data quality, queries and data patterns, improving database query engines, and sampling and vector classification.
Author(s): Peter Fankhauser, Thomas Klement (auth.), Yahiko Kambayashi, Mukesh Mohania, Wolfram Wöß (eds.)
Series: Lecture Notes in Computer Science 2737
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2003
Language: English
Pages: 438
Tags: Computer Science, general
Front Matter....Pages -
XML for Data Warehousing Chances and Challenges....Pages 1-3
CPM: A Cube Presentation Model for OLAP....Pages 4-13
Computation of Sparse Data Cubes with Constraints....Pages 14-23
Answering Joint Queries from Multiple Aggregate OLAP Databases....Pages 24-34
An Approach to Enabling Spatial OLAP by Aggregating on Spatial Hierarchy....Pages 35-44
A Multidimensional Aggregation Object (MAO) Framework for Computing Distributive Aggregations....Pages 45-54
The GMD Data Model for Multidimensional Information: A Brief Introduction....Pages 55-65
An Application of Case-Based Reasoning in Multidimensional Database Architecture....Pages 66-75
MetaCube XTM: A Multidimensional Metadata Approach for Semantic Web Warehousing Systems....Pages 76-88
Designing Web Warehouses from XML Schemas....Pages 89-98
Building XML Data Warehouse Based on Frequent Patterns in User Queries....Pages 99-108
A Temporal Study of Data Sources to Load a Corporate Data Warehouse....Pages 109-118
Automatic Detection of Structural Changes in Data Warehouses....Pages 119-128
Performance Tests in Data Warehousing ETLM Process for Detection of Changes in Data Origin....Pages 129-139
Recent Developments in Web Usage Mining Research....Pages 140-150
Parallel Vector Computing Technique for Discovering Communities on the Very Large Scale Web Graph....Pages 151-160
Ordinal Association Rules towards Association Rules....Pages 161-171
Rough Set Based Decision Tree Model for Classification....Pages 172-181
Inference Based Classifier: Efficient Construction of Decision Trees for Sparse Categorical Attributes....Pages 182-191
Generating Effective Classifiers with Supervised Learning of Genetic Programming....Pages 192-201
Clustering by Regression Analysis....Pages 202-211
Handling Large Workloads by Profiling and Clustering....Pages 212-223
Incremental OPTICS: Efficient Computation of Updates in a Hierarchical Cluster Ordering....Pages 224-233
On Complementarity of Cluster and Outlier Detection Schemes....Pages 234-243
Cluster Validity Using Support Vector Machines....Pages 244-256
FSSM: Fast Construction of the Optimized Segment Support Map....Pages 257-266
Using a Connectionist Approach for Enhancing Domain Ontologies: Self-Organizing Word Category Maps Revisited....Pages 267-277
Parameterless Data Compression and Noise Filtering Using Association Rule Mining....Pages 278-287
Performance Evaluation of SQL-OR Variants for Association Rule Mining....Pages 288-298
A Distance-Based Approach to Find Interesting Patterns....Pages 299-308
Similarity Search in Structured Data....Pages 309-319
Using an Interest Ontology for Improved Support in Rule Mining....Pages 320-329
Fraud Formalization and Detection....Pages 330-339
Combining Noise Correction with Feature Selection....Pages 340-349
Pre-computing Approximate Hierarchical Range Queries in a Tree-Like Histogram....Pages 350-359
Comprehensive Log Compression with Frequent Patterns....Pages 360-370
Non-recursive Generation of Frequent K-itemsets from Frequent Pattern Tree Representations....Pages 371-380
A New Computation Model for Rough Set Theory Based on Database Systems....Pages 381-390
Computing SQL Queries with Boolean Aggregates....Pages 391-400
Fighting Redundancy in SQL....Pages 401-411
“On-the-fly” VS Materialized Sampling and Heuristics....Pages 412-421
Incremental and Decremental Proximal Support Vector Classification using Decay Coefficients....Pages 422-429
Back Matter....Pages -