Hadoop Administration Certification Training (Self-Paced Learning)

> The Hadoop market is expected to reach $99.31B by 2022, growing at a CAGR of 42.1% from 2015 - Forbes

> Average salary for a Hadoop Administrator ranges from approximately $104,528 to $141,391 per annum – Indeed.com

> Hadoop and NoSQL software and services are the fastest growing technologies in the market - Technology Research Organization

USD 299 USD 399

Course Overview

Hadoop Administration Certification Training helps you build expertise in maintaining complex Hadoop clusters. You will learn key Hadoop administration activities such as cluster planning, installation, configuration, monitoring and tuning. You will also learn about Cloudera Hadoop 2.0 and master security implementation and Hadoop v2 through industry-level case studies.

Key Highlights

  • 24 Hours of Online Self-Paced Learning
  • Real-life Case Studies
  • Assignments
  • Lifetime access to the Learning Management System (LMS)
  • 24 x 7 Expert Support
  • Certification
  • Community forum for all our customers
  • No exam included

What You'll Learn

  • Hadoop Cluster and its Architecture
  • Hadoop Cluster Setup and Working
  • Hadoop Cluster Administration and Maintenance
  • Computational Frameworks, Managing Resources and Scheduling
  • Hadoop 2.x Cluster: Planning and Management
  • Hadoop Security and Cluster Monitoring
  • Cloudera Hadoop 2.x and its Features
  • Pig, Hive Installation and Working
  • HBase, Zookeeper Installation and Working
  • Understanding Oozie
  • Data Ingestion using Sqoop and Flume

Career Benefits

  • Expanding market
  • Better career opportunities
  • Great future prospects
  • Higher paycheck

Who Can Attend

  • Linux Administrators
  • Architects
  • System Administrators
  • IT Managers
  • Support Engineers
  • Database Administrators
  • Data Analytics Administrators
  • Cloud Systems Administrators
  • Windows Administrators
  • Infrastructure Administrators
  • Hadoop Developers
  • QA Professionals

Exam Formats

No Exam Included.

Course Delivery

This course is available in the following formats:

  • Self-Paced Learning Duration: 24 Hrs

Course Syllabus


Understanding Big Data and Hadoop

  • Learning Objectives: Understand Big Data and analyse the limitations of traditional solutions. You will learn about Hadoop and its core components, and the differences between Hadoop 1.0 and Hadoop 2.x.

Topics:

  • Introduction to big data
  • Common big data domain scenarios
  • Limitations of traditional solutions
  • What is Hadoop?
  • Hadoop 1.0 ecosystem and its Core Components
  • Hadoop 2.x ecosystem and its Core Components
  • Application submission in YARN

Hadoop Cluster and its Architecture

  • Learning Objectives: In this module, you will learn about the Hadoop Distributed File System (HDFS), Hadoop configuration files and Hadoop cluster architecture. You will also get to know the roles and responsibilities of a Hadoop administrator.

Topics:

  • Distributed File System
  • Hadoop Cluster Architecture
  • Replication rules
  • Hadoop Cluster Modes
  • Rack awareness theory
  • Hadoop cluster administrator responsibilities
  • Understand working of HDFS
  • NTP server
  • Initial configuration required before installing Hadoop
  • Deploying Hadoop in a pseudo-distributed mode

Hadoop Cluster Setup and Working

  • Learning Objectives: Learn how to build a Hadoop multi-node cluster and understand the various properties of Namenode, Datanode and Secondary Namenode.

Topics:

  • OS Tuning for Hadoop Performance
  • Pre-requisite for installing Hadoop
  • Hadoop Configuration Files
  • Stale Configuration
  • RPC and HTTP Server Properties
  • Properties of Namenode, Datanode and Secondary Namenode
  • Log Files in Hadoop
  • Deploying a multi-node Hadoop cluster

Hadoop Cluster Administration and Maintenance

  • Learning Objectives: In this module, you will learn how to add nodes to or remove nodes from your cluster, both ad hoc and in the recommended way. You will also understand day-to-day cluster administration tasks such as balancing data in the cluster, protecting data by enabling trash, attempting a manual failover, and creating backups within or across clusters.

Topics:

  • Commissioning and Decommissioning of Node
  • HDFS Balancer
  • Namenode Federation in Hadoop
  • High Availability in Hadoop
  • .Trash Functionality
  • Checkpointing in Hadoop
  • Distcp
  • Disk balancer

Computational Frameworks, Managing Resources and Scheduling

  • Learning Objectives: Get to know the various processing frameworks in Hadoop and understand the YARN job execution flow. You will also learn about the various schedulers and the MapReduce programming model from a Hadoop administrator's perspective.

Topics:

  • Different Processing Frameworks
  • Different phases in MapReduce
  • Spark and its Features
  • Application Workflow in YARN
  • YARN Metrics
  • YARN Capacity Scheduler and Fair Scheduler
  • Service Level Authorization (SLA)

Hadoop 2.x Cluster: Planning and Management

  • Learning Objectives: In this module, you will learn about cluster planning and management, including the aspects to consider when planning the setup of a new cluster.

Topics:

  • Planning a Hadoop 2.x cluster
  • Cluster sizing
  • Hardware, Network and Software considerations
  • Popular Hadoop distributions
  • Workload and usage patterns
  • Industry recommendations

Hadoop Security and Cluster Monitoring

  • Learning Objectives: Get to know Hadoop cluster monitoring and security concepts. You will also learn how to secure a Hadoop cluster with Kerberos.

Topics:

  • Monitoring Hadoop Clusters
  • Hadoop Security System Concepts
  • Securing a Hadoop Cluster With Kerberos
  • Common Misconfigurations
  • Overview on Kerberos
  • Checking log files to understand Hadoop clusters for troubleshooting

Cloudera Hadoop 2.x and its Features

  • Learning Objectives: In this module, you will learn about Cloudera Hadoop 2.x and its features.

Topics:

  • Visualize Cloudera Manager
  • Features of Cloudera Manager
  • Build Cloudera Hadoop cluster using CDH
  • Installation choices in Cloudera
  • Cloudera Manager Vocabulary
  • Cloudera terminologies
  • Different tabs in Cloudera Manager
  • What is HUE?
  • Hue Architecture
  • Hue Interface
  • Hue Features

Pig, Hive Installation and Working (Self-paced)

  • Learning Objectives: Learn how to install and work with Hadoop ecosystem components such as Pig and Hive.

Topics:

  • Explain Hive
  • Hive Setup
  • Hive Configuration
  • Working with Hive
  • Setting Hive in local and remote metastore mode
  • Pig setup
  • Working with Pig

HBase, Zookeeper Installation and Working (Self-paced)

  • Learning Objectives: In this module, you will learn how to install and work with HBase and Zookeeper.

Topics:

  • What is a NoSQL database?
  • HBase data model
  • HBase Architecture
  • MemStore, WAL, BlockCache
  • HBase HFile
  • Compactions
  • HBase Read and Write
  • HBase balancer and hbck
  • HBase setup
  • Working with HBase
  • Installing Zookeeper

Understanding Oozie (Self-paced)

  • Learning Objectives: In this module, you will get to know Apache Oozie, a server-based workflow scheduling system for managing Hadoop jobs.

Topics:

  • Oozie overview
  • Oozie Features
  • Oozie workflow, coordinator and bundle
  • Start, End and Error Node
  • Action Node
  • Join and Fork
  • Decision Node
  • Oozie CLI
  • Install Oozie

Data Ingestion using Sqoop and Flume (Self-paced)

  • Learning Objectives: Learn about the different data ingestion tools such as Sqoop and Flume.

Topics:

  • Types of Data Ingestion
  • HDFS data loading commands
  • Purpose and features of Sqoop
  • Perform operations such as Sqoop Import, Export and Hive Import
  • Sqoop 2
  • Install Sqoop
  • Import data from RDBMS into HDFS
  • Flume features and architecture
  • Types of flow
  • Install Flume
  • Ingest Data From External Sources With Flume
  • Best Practices for Importing Data

FAQs


What if I miss a class?

You will never miss a lecture at Upskill Yourself! You can choose either of two options:

  • View the recorded session of the class, available in your LMS.
  • Attend the missed session in any other live batch.

What if I have queries after I complete this course?

You have lifetime, 24/7 access to the Support Team, which will help you resolve queries during and after the course.

How soon after signing up will I get access to the learning content?

After enrolment, LMS access is provided instantly and remains available for life. You will be able to access the complete set of previous class recordings, PPTs, PDFs and assignments. Access to our 24x7 support team is also granted instantly, so you can start learning right away.

Is the course material accessible to students even after the training is over?

Yes, once you have enrolled in the course, you have lifetime access to the course material.
