For more than a decade, data warehousing and knowledge discovery technologies have been developing into key technologies for decision-making processes in com- nies. Since 1999, due to the relevant role of these technologies in academia and ind- try, the Data Warehousing and Knowledge Discovery (DaWaK) conference series have become an international forum where both practitioners and researchers share their findings, publish their relevant results and dispute in depth research issues and experiences on data warehousing and knowledge discovery systems and applications. The 7th International Conference on Data Warehousing and Knowledge Discovery (DaWaK 2005) continued series of successful conferences dedicated to these topics. In this edition, the conference tried to provide the right, logical balance between data warehousing and knowledge discovery. Regarding data warehousing, papers cover different relevant and still unsolved research problems, such as the modelling of ETL processes and integration problems, designing OLAP technologies from XML do- ments, modelling data warehouses and data mining applications together, impro- ments in query processing, partitioning and implementations. With regard to data mining, a variety of papers were presented on subjects including data mining te- niques, clustering, classification, text documents and classification, and patterns. These proceedings contain the technical papers that were selected for presentation at the conference. We received 196 abstracts, and finally received 162 papers from 38 countries, and the Program Committee eventually selected 51 papers, making an acceptance rate of 31.4 % of submitted papers.
Author(s): Johann Eder, Christian Koncilia, Karl Wiggisser (auth.), A Min Tjoa, Juan Trujillo (eds.)
Series: Lecture Notes in Computer Science 3589 : Information Systems and Applications, incl. Internet/Web, and HCI
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2005
Language: English
Pages: 544
Tags: Database Management; Information Storage and Retrieval; Information Systems Applications (incl.Internet); Computer Communication Networks; Artificial Intelligence (incl. Robotics); Business Information Systems
Front Matter....Pages -
A Tree Comparison Approach to Detect Changes in Data Warehouse Structures....Pages 1-10
Extending the UML for Designing Association Rule Mining Models for Data Warehouses....Pages 11-21
Event-Feeded Dimension Solution....Pages 22-31
XML-OLAP: A Multidimensional Analysis Framework for XML Warehouses....Pages 32-42
Graph-Based Modeling of ETL Activities with Multi-level Transformations and Updates....Pages 43-52
Extending UML 2 Activity Diagrams with Business Intelligence Objects....Pages 53-63
Automatic Selection of Bitmap Join Indexes in Data Warehouses....Pages 64-73
A Survey of Open Source Tools for Business Intelligence....Pages 74-84
DWEB: A Data Warehouse Engineering Benchmark....Pages 85-94
A Set of Quality Indicators and Their Corresponding Metrics for Conceptual Models of Data Warehouses....Pages 95-104
Design and Development of a Tool for Integrating Heterogeneous Data Warehouses....Pages 105-114
An Evolutionary Approach to Schema Partitioning Selection in a Data Warehouse....Pages 115-125
Using Schema Transformation Pathways for Incremental View Maintenance....Pages 126-135
Data Mapper: An Operator for Expressing One-to-Many Data Transformations....Pages 136-145
Parallel Consistency Maintenance of Materialized Views Using Referential Integrity Constraints in Data Warehouses....Pages 146-156
Selective View Materialization in a Spatial Data Warehouse....Pages 157-167
PMC: Select Materialized Cells in Data Cubes....Pages 168-178
Progressive Ranking of Range Aggregates....Pages 179-189
On Efficient Storing and Processing of Long Aggregate Lists....Pages 190-199
Ad Hoc Star Join Query Processing in Cluster Architectures....Pages 200-209
A Precise Blocking Method for Record Linkage....Pages 210-220
Flexible Query Answering in Data Cubes....Pages 221-232
An Extendible Array Based Implementation of Relational Tables for Multi Dimensional Databases....Pages 233-242
Nearest Neighbor Search on Vertically Partitioned High-Dimensional Data....Pages 243-253
A Machine Learning Approach to Identifying Database Sessions Using Unlabeled Data....Pages 254-264
Hybrid System of Case-Based Reasoning and Neural Network for Symbolic Features....Pages 265-274
Spatio–temporal Rule Mining: Issues and Techniques....Pages 275-284
Hybrid Approach to Web Content Outlier Mining Without Query Vector....Pages 285-294
Incremental Data Mining Using Concurrent Online Refresh of Materialized Data Mining Views....Pages 295-304
A Decremental Algorithm for Maintaining Frequent Itemsets in Dynamic Databases....Pages 305-314
Discovering Richer Temporal Association Rules from Interval-Based Data....Pages 315-325
Semantic Query Expansion Combining Association Rules with Ontologies and Information Retrieval Techniques....Pages 326-335
Maintenance of Generalized Association Rules Under Transaction Update and Taxonomy Evolution....Pages 336-345
Prince: An Algorithm for Generating Rule Bases Without Closure Computations....Pages 346-355
Efficient Compression of Text Attributes of Data Warehouse Dimensions....Pages 356-367
Effectiveness of Document Representation for Classification....Pages 368-377
2-PS Based Associative Text Classification....Pages 378-387
Intrusion Detection via Analysis and Modelling of User Commands....Pages 388-397
Dynamic Schema Navigation Using Formal Concept Analysis....Pages 398-407
FMC: An Approach for Privacy Preserving OLAP....Pages 408-417
Information Driven Evaluation of Data Hiding Algorithms....Pages 418-427
Essential Patterns: A Perfect Cover of Frequent Patterns....Pages 428-437
Processing Sequential Patterns in Relational Databases....Pages 438-447
Optimizing a Sequence of Frequent Pattern Queries....Pages 448-457
A General Effective Framework for Monotony and Tough Constraint Based Sequential Pattern Mining....Pages 458-467
Hiding Classification Rules for Data Sharing with Privacy Preservation....Pages 468-477
Clustering-Based Histograms for Multi-dimensional Data....Pages 478-487
Weighted K-Means for Density-Biased Clustering....Pages 488-497
A New Approach for Cluster Detection for Large Datasets with High Dimensionality....Pages 498-508
Gene Expression Biclustering Using Random Walk Strategies....Pages 509-519
Spectral Kernels for Classification....Pages 520-529
Data Warehousing and Knowledge Discovery: A Chronological View of Research Challenges....Pages 530-535
Back Matter....Pages -