Principles and Theory for Data Mining and Machine Learning

This document was uploaded by one of our users. The uploader already confirmed that they had the permission to publish it. If you are author/publisher or own the copyright of this documents, please report to us by using this DMCA report form.

Simply click on the Download Book button.

Yes, Book downloads on Ebookily are 100% Free.

Sometimes the book is free on Amazon As well, so go ahead and hit "Search on Amazon"

This book is a thorough introduction to the most important topics in data mining and machine learning. It begins with a detailed review of classical function estimation and proceeds with chapters on nonlinear regression, classification, and ensemble methods. The final chapters focus on clustering, dimension reduction, variable selection, and multiple comparisons. All these topics have undergone extraordinarily rapid development in recent years and this treatment offers a modern perspective emphasizing the most recent contributions. The presentation of foundational results is detailed and includes many accessible proofs not readily available outside original sources. While the orientation is conceptual and theoretical, the main points are regularly reinforced by computational comparisons.

Intended primarily as a graduate level textbook for statistics, computer science, and electrical engineering students, this book assumes only a strong foundation in undergraduate statistics and mathematics, and facility with using R packages. The text has a wide variety of problems, many of an exploratory nature. There are numerous computed examples, complete with code, so that further computations can be carried out readily. The book also serves as a handbook for researchers who want a conceptual overview of the central topics in data mining and machine learning.

Bertrand Clarke is a Professor of Statistics in the Department of Medicine, Department of Epidemiology and Public Health, and the Center for Computational Sciences at the University of Miami. He has been on the Editorial Board of the Journal of the American Statistical Association, the Journal of Statistical Planning and Inference, and Statistical Papers. He is co-winner, with Andrew Barron, of the 1990 Browder J. Thompson Prize from the Institute of Electrical and Electronic Engineers.

Ernest Fokoue is an Assistant Professor of Statistics at Kettering University. He has also taught at Ohio State University and been a long term visitor at the Statistical and Mathematical Sciences Institute where he was a Post-doctoral Research Fellow in the Data Mining and Machine Learning Program. In 2000, he was the winner of the Young Researcher Award from the International Association for Statistical Computing.

Hao Helen Zhang is an Associate Professor of Statistics in the Department of Statistics at North Carolina State University. For 2003-2004, she was a Research Fellow at SAMSI and in 2007, she won a Faculty Early Career Development Award from the National Science Foundation. She is on the Editorial Board of the Journal of the American Statistical Association and Biometrics.

Author(s): Bertrand Clarke, Ernest Fokoue, Hao Helen Zhang (auth.)
Series: Springer Series in Statistics
Edition: 1
Publisher: Springer-Verlag New York
Year: 2009

Language: English
Pages: 786
Tags: Statistical Theory and Methods; Data Mining and Knowledge Discovery; Computational Biology/Bioinformatics; Pattern Recognition; Probability and Statistics in Computer Science; Signal, Image and Speech Processing

Front Matter....Pages i-xiv
Variability, Information, and Prediction....Pages 1-52
Local Smoothers....Pages 53-116
Spline Smoothing....Pages 117-170
New Wave Nonparametrics....Pages 171-230
Supervised Learning: Partition Methods....Pages 231-306
Alternative Nonparametrics....Pages 307-363
Computational Comparisons....Pages 365-404
Unsupervised Learning: Clustering....Pages 405-491
Learning in High Dimensions....Pages 493-568
Variable Selection....Pages 569-678
Multiple Testing....Pages 679-742
Back Matter....Pages 1-38