Hadoop 101

Loading...
icon

icon
Loading...
course-icon

Course

org-logo

Hadoop 101

Build your knowledge of Hadoop's architecture and core components. Explore using MapReduce and the Hadoop Distributed File System (HDFS). Learn how to modify configuration parameters.

Take an important step towards building critical skills for the fast-growing fields of big data and data science.

Self-Paced

Mentored

Beginner

time-icon

Duration

2 weeks, online
3-4 hours/week
fee-icon

Fee

$140

Loading...

Apache Hadoop is a popular framework for enabling applications to collect data from various locations and in various formats. Businesses use it to facilitate the execution of distributed processes against large amounts of data. It achieves this by allowing the clustering of multiple computers to analyze massive datasets quickly.
During this course, you will be introduced to the basics of Apache Hadoop. You will explore its architecture and core components, including MapReduce and the Hadoop Distributed File System (HDFS). You will learn how to add and remove nodes from Hadoop clusters, how to check available disk space on each node, and how to modify configuration parameters. Plus, you will learn about other Apache projects that are part of the Hadoop ecosystem, including Pig, Hive, HBase, ZooKeeper, Oozie, Sqoop, and Flume.
For individuals keen to build core skills for the big data domain, therefore, Hadoop 101 is an excellent place to start for this leg of your data science journey.

This IBM certified course comprises four purposely designed modules that take you on a carefully defined learning journey.
It is a self-paced course, which means it is not run to a fixed schedule with regard to completing modules or submitting assignments. To give you an idea of how long the course takes to complete, it is anticipated that if you work 3-4 hours per week, you will complete the course in 2 weeks. However, as long as the course is completed by the end of your enrollment, you can work at your own pace. And don't worry, you're not alone! You will be encouraged to stay connected with your learning community and mentors through the course discussion space.
The materials for each module are accessible from the start of the course and will remain available for the duration of your enrollment. Methods of learning and assessment will include videos, reading material, and online exam questions.
Once you have successfully completed the course, you will earn your IBM Certificate.

You will be able to:
  • Understand Hadoop architecture including MapReduce and HDFS.
  • Use the Hadoop file system shell and the Ambari Console to work with HDFS.
  • Start and stop Hadoop components.
  • Add/remove a node to/from a Hadoop cluster.
  • Modify Hadoop configuration parameters.

  • Individuals keen to learn Hadoop concepts and components.
  • College graduates who want to start their career in big data and Hadoop.
  • Experienced developers in big data seeking to upskill in Hadoop.

  • Knowledge of big data concepts.
  • Have taken the Introduction to Big Data course.
  • Know some basic Linux administration and commands.