Spark Fundamentals II
Learn the fundamentals of Spark architecture. Discover how to test, and debug Spark applications using SBT, Eclipse, and IntelliJ.
Further your knowledge of this important tool and take an important step forward in your data science career.
Earn your certificate
Once you have completed this course, you will earn your certificate.Preview digital certificate
Spark Fundamentals II is provided 100% online. You will therefore need access to the internet to be able to use the course materials. When you enroll for this course, you be able to access the course materials from the course link in your dashboard immediately. Please note, this course has been designed to be taken with Spark Fundamentals I, we therefore recommend that you complete this first course and then enroll for Spark Fundamentals II when you are ready. This will ensure you have covered the required topics for this subject.
Spark Fundamentals II is intended to enable you to develop critical Spark skills, including distributed datasets and DataFrame operations. You will use Scala, Java, and Python to create and run a Spark application. Plus, you will create applications using Spark SQL, and configure and tune Spark. We therefore recommend that you have a basic understanding of Apache Hadoop and big data, basic knowledge of Linux, and basic skills in using Scala, Python, and Java programming languages.
Yes, once you have successfully completed the course, you will earn a Certificate of Completion. However, remember you will also have gained valuable skills that you can refer to in interviews and in your profile on LinkedIn!
Yes. Spark Fundamentals II is totally online. You do not need to turn up to any classes in person. This means, however, that you need to have access to the internet, and also the necessary technology to access the course materials.
The great thing is that this means you can take this course wherever you live. And though you’ll be sitting in your room alone, you won’t be learning alone, for you will be encouraged to communicate and chat with your peers through the discussion space.
Apache Spark is fantastic data processing framework that can process large datasets quickly. It can also distribute processing tasks across many computers. Having the capacity to do both these things makes Apache Spark an important tool for processing big data and developing machine learning. It also has an API that is easy to use and can reduce the burden on developers. It’s therefore a great skillset to have on your resumé and LinkedIn profile.