We are glad to present the proceedings of the 5th biennial conference in the Intelligent Data Analysis series. The conference took place in Berlin, Germany, August 28–30, 2003. IDA has by now clearly grown up. Started as a small si- symposium of a larger conference in 1995 in Baden-Baden (Germany) it quickly attractedmoreinterest(bothsubmission-andattendance-wise),andmovedfrom London (1997) to Amsterdam (1999), and two years ago to Lisbon. Submission ratesalongwiththeeverimprovingqualityofpapershaveenabledtheor- nizers to assemble increasingly consistent and high-quality programs. This year we were again overwhelmed by yet another record-breaking submission rate of 180 papers. At the Program Chairs meeting we were – based on roughly 500 reviews – in the lucky position of carefully selecting 17 papers for oral and 42 for poster presentation. Poster presenters were given the opportunity to summarize their papers in 3-minute spotlight presentations. The oral, spotlight and poster presentations were then scheduled in a single-track, 2. 5-day conference program, summarized in this book. In accordance with the goal of IDA, “to bring together researchers from diverse disciplines,” we achieved a nice balance of presentations from the more theoreticalside(bothstatisticsandcomputerscience)aswellasmoreapplicati- oriented areas that illustrate how these techniques can be used in practice. Work presented in these proceedings ranges from theoretical contributions dealing, for example, with data cleaning and compression all the way to papers addressing practical problems in the areas of text classi?cation and sales-rate predictions. A considerable number of papers also center around the currently so popular applications in bioinformatics.
Author(s): Ad Feelders, Martijn Pardoel (auth.), Michael R. Berthold, Hans-Joachim Lenz, Elizabeth Bradley, Rudolf Kruse, Christian Borgelt (eds.)
Series: Lecture Notes in Computer Science 2810
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2003
Language: English
Pages: 632
Tags: Information Storage and Retrieval; Probability and Statistics in Computer Science; Artificial Intelligence (incl. Robotics); Pattern Recognition; Computer Appl. in Administrative Data Processing; Business Information Systems
Front Matter....Pages -
Pruning for Monotone Classification Trees....Pages 1-12
Regularized Learning with Flexible Constraints....Pages 13-24
Learning to Answer Emails....Pages 25-35
A Semi-supervised Method for Learning the Structure of Robot Environment Interactions....Pages 36-47
Using Domain Specific Knowledge for Automated Modeling....Pages 48-59
Resolving Rule Conflicts with Double Induction....Pages 60-67
A Novel Partial-Memory Learning Algorithm Based on Grey Relational Structure....Pages 68-75
Constructing Hierarchical Rule Systems....Pages 76-87
Text Categorization Using Hybrid Multiple Model Schemes....Pages 88-99
Learning Dynamic Bayesian Networks from Multivariate Time Series with Changing Dependencies....Pages 100-110
Topology and Intelligent Data Analysis....Pages 111-122
Coherent Conditional Probability as a Measure of Information of the Relevant Conditioning Events....Pages 123-133
Very Predictive Ngrams for Space-Limited Probabilistic Models....Pages 134-142
Interval Estimation Naïve Bayes....Pages 143-154
Mining Networks and Central Entities in Digital Libraries. A Graph Theoretic Approach Applied to Co-author Networks....Pages 155-166
Learning Linear Classifiers Sensitive to Example Dependent and Noisy Costs....Pages 167-178
An Effective Associative Memory for Pattern Recognition....Pages 179-186
Similarity Based Classification....Pages 187-197
Numerical Attributes in Decision Trees: A Hierarchical Approach....Pages 198-207
Similarity-Based Neural Networks for Applications in Computational Molecular Biology....Pages 208-218
Combining Pairwise Classifiers with Stacking....Pages 219-229
APRIORI-SD: Adapting Association Rule Learning to Subgroup Discovery....Pages 230-241
Solving Classification Problems Using Infix Form Genetic Programming....Pages 242-253
What Is Fuzzy about Fuzzy Clustering? Understanding and Improving the Concept of the Fuzzifier....Pages 254-264
A Mixture Model Approach for Binned Data Clustering....Pages 265-274
Fuzzy Clustering Based Segmentation of Time-Series....Pages 275-285
An Iterated Local Search Approach for Minimum Sum-of-Squares Clustering....Pages 286-296
Data Clustering in Tolerance Space....Pages 297-306
Refined Shared Nearest Neighbors Graph for Combining Multiple Data Clusterings....Pages 307-318
Clustering Mobile Trajectories for Resource Allocation in Mobile Environments....Pages 319-329
Fuzzy Clustering of Short Time-Series and Unevenly Distributed Sampling Points....Pages 330-340
Combining and Comparing Cluster Methods in a Receptor Database....Pages 341-351
Selective Sampling with a Hierarchical Latent Variable Model....Pages 352-363
Obtaining Quality Microarray Data via Image Reconstruction....Pages 364-375
Large Scale Mining of Molecular Fragments with Wildcards....Pages 376-385
Genome-Wide Prokaryotic Promoter Recognition Based on Sequence Alignment Kernel....Pages 386-396
Towards Automated Electrocardiac Map Interpretation: An Intelligent Contouring Tool Based on Spatial Aggregation....Pages 397-408
Study of Canada/US Dollar Exchange Rate Movements Using Recurrent Neural Network Model of FX-Market....Pages 409-417
Gaussian Mixture Density Estimation Applied to Microarray Data....Pages 418-429
Classification of Protein Localisation Patterns via Supervised Neural Network Learning....Pages 430-439
Applying Intelligent Data Analysis to Coupling Relationships in Object-Oriented Software....Pages 440-450
The Smaller the Better: Comparison of Two Approaches for Sales Rate Prediction....Pages 451-461
A Multiagent-Based Constructive Approach for Feedforward Neural Networks....Pages 462-473
Evolutionary System Identification via Descriptive Takagi Sugeno Fuzzy Systems....Pages 474-485
Minimum Message Length Criterion for Second-Order Polynomial Model Selection Applied to Tropical Cyclone Intensity Forecasting....Pages 486-496
On the Use of the GTM Algorithm for Mode Detection....Pages 497-508
Regularization Methods for Additive Models....Pages 509-520
Automated Detection of Influenza Epidemics with Hidden Markov Models....Pages 521-532
Guided Incremental Construction of Belief Networks....Pages 533-543
Distributed Regression for Heterogeneous Data Sets....Pages 544-553
A Logical Formalisation of the Fellegi-Holt Method of Data Cleaning....Pages 554-565
Compression Technique Preserving Correlations of a Multivariate Temporal Sequence....Pages 566-577
Condensed Representations in Presence of Missing Values....Pages 578-588
Measures of Rule Quality for Feature Selection in Text Categorization....Pages 589-598
Genetic Approach to Constructive Induction Based on Non-algebraic Feature Representation....Pages 599-610
Active Feature Selection Based on a Very Limited Number of Entities....Pages 611-622
Back Matter....Pages -