This book constitutes the refereed proceedings of the 6th International Conference on Data Warehousing and Knowledge Discovery, DaWaK 2004, held in Zaragoza, Spain, in September 2004.
The 40 revised full papers presented were carefully reviewed and selected from over 100 submissions. The papers are organized in topical sections on data warehouse design; knowledge discovery framework and XML data mining; data cubes and queries; multidimensional schema and data aggregation; inductive databases and temporal rules; industrial applications; data clustering; data visualization and exploration; data classification, extraction, and interpretation; data semantics; association rule mining; event sequence mining; and pattern mining.
Author(s): Vicky Nassis, R. Rajugan, Tharam S. Dillon, Wenny Rahayu (auth.), Yahiko Kambayashi, Mukesh Mohania, Wolfram Wöß (eds.)
Series: Lecture Notes in Computer Science 3181
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2004
Language: English
Pages: 412
Tags: Database Management; Information Storage and Retrieval; Information Systems Applications (incl. Internet); Computer Communication Networks; Artificial Intelligence (incl. Robotics); Business Information Systems
Front Matter
Conceptual Design of XML Document Warehouses....Pages 1-14
Bringing Together Partitioning, Materialized Views and Indexes to Optimize Performance of Relational Data Warehouses....Pages 15-25
GeoDWFrame: A Framework for Guiding the Design of Geographical Dimensional Schemas....Pages 26-37
Workload-Based Placement and Join Processing in Node-Partitioned Data Warehouses....Pages 38-47
Novelty Framework for Knowledge Discovery in Databases....Pages 48-57
Revisiting Generic Bases of Association Rules....Pages 58-67
Mining Maximal Frequently Changing Subtree Patterns from XML Documents....Pages 68-76
Discovering Pattern-Based Dynamic Structures from Versions of Unordered XML Documents....Pages 77-86
Space-Efficient Range-Sum Queries in OLAP....Pages 87-96
Answering Approximate Range Aggregate Queries on OLAP Data Cubes with Probabilistic Guarantees....Pages 97-107
Computing Complex Iceberg Cubes by Multiway Aggregation and Bounding....Pages 108-117
An Aggregate-Aware Retargeting Algorithm for Multiple Fact Data Warehouses....Pages 118-128
A Partial Pre-aggregation Scheme for HOLAP Engines....Pages 129-137
Discovering Multidimensional Structure in Relational Data....Pages 138-148
Inductive Databases as Ranking....Pages 149-158
Inductive Databases of Polynomial Equations....Pages 159-168
From Temporal Rules to Temporal Meta-rules....Pages 169-178
How Is BI Used in Industry?: Report from a Knowledge Exchange Network....Pages 179-188
Towards an Adaptive Approach for Mining Data Streams in Resource Constrained Environments....Pages 189-198
Exploring Possible Adverse Drug Reactions by Clustering Event Sequences....Pages 199-208
SCLOPE: An Algorithm for Clustering Data Streams of Categorical Attributes....Pages 209-218
Novel Clustering Approach that Employs Genetic Algorithm with New Representation Scheme and Multiple Objectives....Pages 219-228
Categorical Data Visualization and Clustering Using Subjective Factors....Pages 229-238
Multidimensional Data Visual Exploration by Interactive Information Segments....Pages 239-248
Metadata to Support Transformations and Data & Metadata Lineage in a Warehousing Environment....Pages 249-258
Classification Based on Attribute Dependency....Pages 259-268
OWDEAH: Online Web Data Extraction Based on Access History....Pages 269-278
Data Mining Approaches to Diffuse Large B-Cell Lymphoma Gene Expression Data Interpretation....Pages 279-288
Deriving Multiple Topics to Label Small Document Regions....Pages 289-298
Deriving Efficient SQL Sequences via Read-Aheads....Pages 299-308
Diversity in Random Subspacing Ensembles....Pages 309-319
Partitioned Approach to Association Rule Mining over Multiple Databases....Pages 320-330
A Tree Partitioning Method for Memory Management in Association Rule Mining....Pages 331-340
Mining Interesting Association Rules for Prediction in the Software Project Management Area....Pages 341-350
PROWL: An Efficient Frequent Continuity Mining Algorithm on Event Sequences....Pages 351-360
Algorithms for Discovery of Frequent Superset, Rather Than Frequent Subset....Pages 361-370
Improving Direct Counting for Frequent Itemset Mining....Pages 371-380
Mining Sequential Patterns with Item Constraints....Pages 381-390
Mining Borders of the Difference of Two Datacubes....Pages 391-400
Mining Periodic Patterns in Sequence Data....Pages 401-410
Back Matter