Data Science Hands-On with Open Source Tools

Course Enrollment

Starts on

12 February 2020

Enrollment closes on
31 December 2021

Course Fee

Fee

US$99 - US$199

About this course

What popular tools do data scientists use and need to know? In this course, you'll learn about Jupyter Notebooks with JupyterLab, Apache Zeppelin, and RStudio IDE. Across these three tools, you'll be able to use a variety of programming languages, including Python, R, Scala, and SQL. This is not a programming-oriented course. If you're an aspiring data scientist or a data science manager who wants to learn what these tools are and understand the gist of how they work, then this is the course for you! You are not expected to know statistics or machine learning, and the course is friendly to new coders. Happy learning!

Course Syllabus

  • Module 1 - Introducing Skills Network Labs
    • What is Skills Network Labs?
    • Skills Network Labs account features
    • Creating a Skills Network Labs account
  • Module 2 - Introducing Jupyter Notebooks
    • What are Jupyter notebooks?
    • Getting started with Jupyter
    • Data and Notebooks in Jupyter
    • Sharing your Jupyter Notebooks and data
    • Apache Spark in Jupyter Notebooks
  • Module 3 - Introducing Zeppelin Notebooks
    • What are Zeppelin Notebooks?
    • Zeppelin for Scala
    • Getting started with Zeppelin
    • Managing your Interpreters in Zeppelin
    • Apache Spark in Zeppelin Notebooks
  • Module 4 - Introducing RStudio IDE
    • What is RStudio IDE?
    • Uploading files, installing packages, and loading libraries in RStudio IDE
    • Getting started with RStudio IDE
    • RStudio Environment and History
    • Apache Spark in RStudio IDE

General Information

  • It is self-paced.
  • It can be taken at any time.
  • It can be taken as many times as you wish.

Recommended skills prior to taking this course

  • None

Requirements

  • None

Course Staff

Polong Lin

Polong Lin is a Data Scientist at IBM in Canada. Under the Emerging Technologies division, Polong is responsible for educating the next generation of data scientists. Polong is a regular speaker at conferences and meetups, and holds an M.Sc. in Cognitive Psychology.

Saeed Aghabozorgi

Saeed Aghabozorgi, PhD, is a Data Scientist at IBM with a track record of developing enterprise-level applications that substantially increase clients’ ability to turn data into actionable knowledge. He is a researcher in the data mining field and an expert in developing advanced analytic methods, such as machine learning and statistical modelling, on large datasets.

Course Certificate

Earn your certificate

Once you have completed this course, you will earn your certificate.
