Machine Learning with PySpark: With Natural Language Processing and Recommender Systems

This document was uploaded by one of our users. The uploader already confirmed that they had the permission to publish it. If you are author/publisher or own the copyright of this documents, please report to us by using this DMCA report form.

Simply click on the Download Book button.

Yes, Book downloads on Ebookily are 100% Free.

Sometimes the book is free on Amazon As well, so go ahead and hit "Search on Amazon"

Build machine learning models, natural language processing applications, and recommender systems with PySpark to solve various business challenges. This book starts with the fundamentals of Spark and its evolution and then covers the entire spectrum of traditional machine learning algorithms along with natural language processing and recommender systems using PySpark. Machine Learning with PySpark shows you how to build supervised machine learning models such as linear regression, logistic regression, decision trees, and random forest. You’ll also see unsupervised machine learning models such as K-means and hierarchical clustering. A major portion of the book focuses on feature engineering to create useful features with PySpark to train the machine learning models. The natural language processing section covers text processing, text mining, and embedding for classification. After reading this book, you will understand how to use PySpark’s machine learning library to build and train various machine learning models. Additionally you’ll become comfortable with related PySpark components, such as data ingestion, data processing, and data analysis, that you can use to develop data-driven intelligent applications. What You Will Learn • Build a spectrum of supervised and unsupervised machine learning algorithms • Implement machine learning algorithms with Spark MLlib libraries • Develop a recommender system with Spark MLlib libraries • Handle issues related to feature engineering, class balance, bias and variance, and cross validation for building an optimal fit model Who This Book Is For Data science and machine learning professionals.

Author(s): Pramod Singh
Edition: 1
Publisher: Apress
Year: 2019

Language: English
Commentary: True PDF
Pages: 223
Tags: Machine Learning; Natural Language Processing; Unsupervised Learning; Reinforcement Learning; Supervised Learning; Python; Recommender Systems; Clustering; Apache Spark; Linear Regression; Logistic Regression; PySpark; Semi-supervised Learning; Random Forest

Front Matter ....Pages i-xviii
Evolution of Data (Pramod Singh)....Pages 1-10
Introduction to Machine Learning (Pramod Singh)....Pages 11-21
Data Processing (Pramod Singh)....Pages 23-42
Linear Regression (Pramod Singh)....Pages 43-64
Logistic Regression (Pramod Singh)....Pages 65-98
Random Forests (Pramod Singh)....Pages 99-122
Recommender Systems (Pramod Singh)....Pages 123-157
Clustering (Pramod Singh)....Pages 159-190
Natural Language Processing (Pramod Singh)....Pages 191-218
Back Matter ....Pages 219-223