Implementing Predictive Analytics with Spark in Azure HDInsight

Learn how to use Spark in Microsoft Azure HDInsight to create predictive analytics and machine learning solutions.

Implementing Predictive Analytics with Spark in Azure HDInsight Highlights

Course enrollment

Starts on: 01 January 2019
Enrollment closes on: 15 September 2019

Course duration

Total: 18 to 24 hours

Course fee

Free

Enrollment is Closed

About this course

This course is part of both the Microsoft Professional Program Certificate in Data Science and the Microsoft Professional Program Certificate in Big Data.

Are you ready for big data science? In this course, learn how to implement predictive analytics solutions for big data using Apache Spark in Microsoft Azure HDInsight. See how to work with Scala or Python to cleanse and transform data and build machine learning models with Spark ML (the machine learning library in Spark).

Note: To complete the hands-on elements in this course, you will need an Azure subscription and a Windows client computer. You can sign up for a free Azure trial subscription; a valid credit card is required for verification, but you will not be charged for Azure services. The free trial is not available in all regions.

What you'll learn

  • Use Spark to explore data and prepare it for modeling
  • Build supervised machine learning models
  • Evaluate and optimize models
  • Build recommenders and unsupervised machine learning models

Course Syllabus

Introduction to Data Science with Spark
Get started with Spark clusters in Azure HDInsight, and use Spark to run Python or Scala code to work with data.

Getting Started with Machine Learning
Learn how to build classification and regression models using the Spark ML library.

Evaluating Machine Learning Models
Learn how to evaluate supervised learning models, and how to optimize model parameters.

Recommenders and Unsupervised Models
Learn how to build recommenders and clustering models using Spark ML.

Meet the instructors

Graeme Malcolm

Senior Content Developer, Microsoft Learning Experiences

Graeme has been a trainer, consultant, and author for longer than he cares to remember, specializing in SQL Server and the Microsoft data platform. He is a Microsoft Certified Solutions Expert for the SQL Server Data Platform and Business Intelligence. After years of working with Microsoft as a partner and vendor, he now works in the Microsoft Learning Experiences team as a senior content developer, where he plans and creates content for developers and data professionals who want to get the best out of Microsoft technologies.

Course Outline

Welcome
Lab Setup Instructions
Introduction to Spark
Demo: Provisioning a Spark Cluster
Introduction to Machine Learning
Introduction to DataFrames
Demo: Getting Started with DataFrames
Exploring and Preparing Data
Demo: Data Exploration
Further Reading
Lab Instructions
Review Instructions
Question 1
Question 2
Question 3
Question 4
Classification
Demo: Building a Classification Model
Regression
Demo: Building a Regression Model
Introduction to Pipelines
Demo: Creating a Pipeline
Analyzing Text
Demo: Working with Text Features
Further Reading
Lab Instructions
Review Instructions
Question 1
Question 2
Question 3
Question 4
Evaluating a Classification Model
Demo: Evaluating a Classifier
Evaluating a Regression Model
Demo: Evaluating a Linear Regression Model
Introduction to Parameter Tuning
Demo: Using a Training / Validation Split
Cross-Validation
Demo: Cross-Validating Model Parameters
Further Reading
Lab Instructions
Review Instructions
Question 1
Question 2
Question 3
Question 4
Introduction to Recommenders
Collaborative Filtering
Demo: Creating a Recommender
Introduction to Clustering
K-Means Clustering
Demo: Creating a Clustering Model
Further Reading
Lab Instructions
Review Instructions
Question 1
Question 2
Question 3
Question 4
Exam Instructions
Question 1
Question 2
Question 3
Question 4
Question 5
Question 6
Question 7
Question 8
Question 9
Question 10
Course Certificate

Earn your certificate

Once you have completed this course, you will earn your certificate.
