Course HighlightsCOURSE
Spark Fundamentals I

Spark Fundamentals I

Learn the fundamentals of Spark. Gain hands-on experience through online labs using Hadoop, Mesos, and more.

Build your knowledge of this important tool and take a big step forward in your data science career.

Spark Fundamentals I Highlights

  Course duration

Duration

  • 2 weeks
    2-3 hours/week
  Course Fee

Fee

US$ 99 - US$ 199

Course duration

Duration

  • 2 weeks
    2-3 hours/week
Course Fee

Fee

US$ 99 - US$ 199

Apache Spark is a unified analytics engine utilized in big data analysis and machine learning. It is used to discover trends and real-time insights in many industries, including financial services, healthcare, manufacturing, and retail. This makes it an important competence to have for individuals keen to pursue a career in data science.

During this course, you will explore the fundamentals of Spark and become familiar with various core Spark tools. You will discover why and when Spark is used. You will explore the components of the Spark unified stack. You will learn the fundamentals of Spark's principal data abstraction, the Resilient Distributed Dataset. You will learn how to download and install Spark standalone. Plus, you will be introduced to Scala and Python.

Apache Spark is a popular general-purpose processor that is ideal for working with big data. If you are keen to build your experience through hands-on lab sessions, then this Spark Fundamentals course in an ideal step to take.

This course comprises five purposely designed modules that take you on a carefully defined learning journey.

It is a self-paced course, which does not run to a fixed schedule with regard to completing modules or submitting assignments. It is anticipated that if you work 2-3 hours per week, you will complete the course within 2-3 weeks. However, you can work at your own pace as long as the course is completed before the deadline.

The materials for each module are accessible from the start of the course and will remain available for the duration of your enrollment. Methods of learning and assessment will include reading material, hands-on labs and online exams questions.

As part of our mentoring service you will have access to valuable guidance and support throughout the course. We provide a dedicated discussion space where you can ask questions, chat with your peers, and resolve issues. Depending on the payment plan you have chosen, you may also have access to live classes and webinars, which are an excellent opportunity to discuss problems with your mentor and ask questions. Mentoring services may vary package wise.

Once you have successfully completed the course, you will earn your IBM Certificate.

After completing this course, you will be able to:

  • Perform fast iterative algorithms.
  • Carry out interactive data mining.
  • Perform in-memory cluster computing.
  • Support Java, Python, R, and Scala APIs for development.
  • Combine SQL, streaming, and complex analytics in the same application.
  • Run Spark applications on top of Hadoop, Mesos, standalone, or in the cloud.
  • Work with HDFS, Cassandra, HBase, or S3.
  • Individuals who need to understand data and data insights for their job.
  • Individuals who aspire to become data scientists or data engineers.

You should have a basic understanding of:

  • Apache Hadoop and big data.
  • The Linux operating system.
  • Scala, Python, R, or Java programming languages.

Course Outline

General Information
Learning objectives
Syllabus
Grading Scheme
Change Log
Copyright and Trademarks
Learning objectives
Resilient Distributed Dataset - Part 1
Resilient Distributed Dataset - Part 2
Resilient Distributed Dataset - Part 3
Lab - RDD and Dataframes
Python RDD Solution
Scala RDD Solution
DataFrames Solution
Graded Review Questions
Learning objectives
Spark Libraries - Part 1
Spark Libraries - Part 2
Spark Libraries - Part 3
Lab - Scala Libraries
Solution - Part 1
Solution - Part 2
Solution - Part 3
Solution - Part 4
Graded Review Questions
Learning objectives
Configuration, monitoring, and tuning - Part 1
Configuration, monitoring, and tuning - Part 2
Lab - Spark Fundamentals
Solution
Graded Review Questions
Course Certificate

Earn your certificate

Once you have completed this course, you will earn your certificate.

Preview digital certificate
Spark Fundamentals I