One of the superb characteristics of Intelligent Data Analysis (IDA) is that it is an interdisciplinary ?eld in which researchers and practitioners from a number of areas are involved in a typical project. This also creates a challenge in which the success of a team depends on the participation of users and domain experts who need to interact with researchers and developers of any IDA system. All this is usually re?ected in successful projects and of course on the papers that were evaluated by this year’s program committee from which the ?nal program has been developed. In our call for papers, we solicited papers on (i) applications and tools, (ii) theory and general principles, and (iii) algorithms and techniques. We received a total of 184 papers, reviewing these was a major challenge. Each paper was assigned to three reviewers. In the end 46 papers were accepted, which are all included in the proceedings and presented at the conference. This year’s papers re?ect the results of applied and theoretical researchfrom a number of disciplines all of which are related to the ?eld of Intelligent Data Analysis. To have the best combination of theoretical and applied research and also provide the best focus, we have divided this year’s IDA program into tu- rials, invited talks, panel discussions and technical sessions.
Author(s): Jean-Marc Andreoli, Guillaume Bouchard (auth.), A. Fazel Famili, Joost N. Kok, José M. Peña, Arno Siebes, Ad Feelders (eds.)
Series: Lecture Notes in Computer Science 3646 : Information Systems and Applications, incl. Internet/Web, and HCI
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2005
Language: English
Pages: 534
Tags: Information Storage and Retrieval; Artificial Intelligence (incl. Robotics); Probability and Statistics in Computer Science; Pattern Recognition; Business Information Systems
Front Matter....Pages -
Probabilistic Latent Clustering of Device Usage....Pages 1-11
Condensed Nearest Neighbor Data Domain Description....Pages 12-23
Balancing Strategies and Class Overlapping....Pages 24-35
Modeling Conditional Distributions of Continuous Variables in Bayesian Networks....Pages 36-45
Kernel K-Means for Categorical Data....Pages 46-56
Using Genetic Algorithms to Improve Accuracy of Economical Indexes Prediction....Pages 57-65
A Distance-Based Method for Preference Information Retrieval in Paired Comparisons....Pages 66-73
Knowledge Discovery in the Identification of Differentially Expressed Genes in Tumoricidal Macrophage....Pages 74-85
Searching for Meaningful Feature Interactions with Backward-Chaining Rule Induction....Pages 86-96
Exploring Hierarchical Rule Systems in Parallel Coordinates....Pages 97-108
Bayesian Networks Learning for Gene Expression Datasets....Pages 109-120
Pulse: Mining Customer Opinions from Free Text....Pages 121-132
Keystroke Analysis of Different Languages: A Case Study....Pages 133-144
Combining Bayesian Networks with Higher-Order Data Representations....Pages 145-156
Removing Statistical Biases in Unsupervised Sequence Learning....Pages 157-167
Learning from Ambiguously Labeled Examples....Pages 168-179
Learning Label Preferences: Ranking Error Versus Position Error....Pages 180-191
FCLib: A Library for Building Data Analysis and Data Discovery Tools....Pages 192-203
A Knowledge-Based Model for Analyzing GSM Network Performance....Pages 204-215
Sentiment Classification Using Information Extraction Technique....Pages 216-227
Extending the SOM Algorithm to Visualize Word Relationships....Pages 228-238
Towards Automatic and Optimal Filtering Levels for Feature Selection in Text Categorization....Pages 239-248
Block Clustering of Contingency Table and Mixture Model....Pages 249-259
Adaptive Classifier Combination for Visual Information Processing Using Data Context-Awareness....Pages 260-271
Self-poised Ensemble Learning....Pages 272-282
Discriminative Remote Homology Detection Using Maximal Unique Sequence Matches....Pages 283-292
From Local Pattern Mining to Relevant Bi-cluster Characterization....Pages 293-304
Machine-Learning with Cellular Automata....Pages 305-315
MDS polar : A New Approach for Dimension Reduction to Visualize High Dimensional Data....Pages 316-327
Miner Ants Colony: A New Approach to Solve a Mine Planning Problem....Pages 328-338
Extending the GA-EDA Hybrid Algorithm to Study Diversification and Intensification in GAs and EDAs....Pages 339-350
Spatial Approach to Pose Variations in Face Verification....Pages 351-361
Analysis of Feature Rankings for Classification....Pages 362-372
A Mixture Model-Based On-line CEM Algorithm....Pages 373-384
Reliable Hierarchical Clustering with the Self-organizing Map....Pages 385-396
Statistical Recognition of Noun Phrases in Unrestricted Text....Pages 397-408
Successive Restrictions Algorithm in Bayesian Networks....Pages 409-418
Modelling the Relationship Between Streamflow and Electrical Conductivity in Hollin Creek, Southeastern Australia....Pages 419-428
Biological Cluster Validity Indices Based on the Gene Ontology....Pages 429-439
An Evaluation of Filter and Wrapper Methods for Feature Selection in Categorical Clustering....Pages 440-451
Dealing with Data Corruption in Remote Sensing....Pages 452-463
Regularized Least-Squares for Parse Ranking....Pages 464-474
Bayesian Network Classifiers for Time-Series Microarray Data....Pages 475-485
Feature Discovery in Classification Problems....Pages 486-496
A New Hybrid NM Method and Particle Swarm Algorithm for Multimodal Function Optimization....Pages 497-508
Detecting Groups of Anomalously Similar Objects in Large Data Sets....Pages 509-519
Back Matter....Pages -