Skip to main content

About the Program

They say that it is not the destination but the journey that got you there that matters. When you have completed this learning path, concepts such as how parallelism is performed on a cluster will be second nature. Your awareness of how to program, either using high-level or low-level languages will be highlights along the way. This learning path incorporates MapReduce and YARN, an introduction to Apache Pig, and simplifying data pipelines with Apache Kafka.

Hadoop Programming - Course Outline

Course 1: MapReduce and YARN

Effort: 5 hours Level: Intermediate

String together your understanding of Yet Another Resource Negotiator (YARN) by gaining exposure to MapReduce1, the tool-sets that start the processing of Big Data.

Course 2: Simplifying Data Pipelines with Apache Kafka

Effort: 3 hours Level: Intermediate

When you hear the terms, producer, consumer, topic category, broker, and cluster used together to describe a messaging system, something is brewing in the pipelines. Get connected and learn what that is, and what it means!

Learning Path Courses

  • /asset-v1:IBM+BD0115EN+v1+type@asset+block@MapReduce-and-YARN.jpg

    MapReduce and YARN

    • Details
  • /asset-v1:IBM+BD0123EN+v1+type@asset+block@Simplifying-data-pipelines-with-Apache-Kafka.jpg

    Simplifying data pipelines with Apache Kafka

    • Details