Automating the Design of Data Mining Algorithms: An Evolutionary Computation Approach

This document was uploaded by one of our users. The uploader already confirmed that they had the permission to publish it. If you are author/publisher or own the copyright of this documents, please report to us by using this DMCA report form.

Simply click on the Download Book button.

Yes, Book downloads on Ebookily are 100% Free.

Sometimes the book is free on Amazon As well, so go ahead and hit "Search on Amazon"

Data mining is a very active research area with many successful real-world app- cations. It consists of a set of concepts and methods used to extract interesting or useful knowledge (or patterns) from real-world datasets, providing valuable support for decision making in industry, business, government, and science. Although there are already many types of data mining algorithms available in the literature, it is still dif cult for users to choose the best possible data mining algorithm for their particular data mining problem. In addition, data mining al- rithms have been manually designed; therefore they incorporate human biases and preferences. This book proposes a new approach to the design of data mining algorithms. - stead of relying on the slow and ad hoc process of manual algorithm design, this book proposes systematically automating the design of data mining algorithms with an evolutionary computation approach. More precisely, we propose a genetic p- gramming system (a type of evolutionary computation method that evolves c- puter programs) to automate the design of rule induction algorithms, a type of cl- si cation method that discovers a set of classi cation rules from data. We focus on genetic programming in this book because it is the paradigmatic type of machine learning method for automating the generation of programs and because it has the advantage of performing a global search in the space of candidate solutions (data mining algorithms in our case), but in principle other types of search methods for this task could be investigated in the future.

Author(s): Gisele L. Pappa, Alex Freitas (auth.)
Series: Natural Computing Series
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2010

Language: English
Pages: 187
Tags: Data Mining and Knowledge Discovery; Artificial Intelligence (incl. Robotics)

Front Matter....Pages I-XIII
Introduction....Pages 1-16
Data Mining....Pages 17-46
Evolutionary Algorithms....Pages 47-84
Genetic Programming for Classification and Algorithm Design....Pages 85-108
Automating the Design of Rule Induction Algorithms....Pages 109-135
Computational Results on the Automatic Design of Full Rule Induction Algorithms....Pages 137-175
Directions for Future Research on the Automatic Design of Data Mining Algorithms....Pages 177-184
Back Matter....Pages 185-187