HADOOP ADMINISTRATION

Training Mode: offline

4.5/5

Short Description:
Apache Hadoop is a data management software which is open source that helps organizations analyze large volumes of structured and unstructured data, may be a extremely popular topic across the tech industry. It are often quickly learn to require advantage of the MapReduce framework through technical sessions and hands on labs.

Hadoop Developer/Admin Training – Course Content
Training Objectives of Hadoop Developer/Admin:
Hadoop Admin Course will provide the essential concepts of MapReduce applications developed using Hadoop, including an in depth check out framework components, use of Hadoop for a spread of knowledge analysis tasks, and various samples of Hadoop in action. This course will further examine related technologies like Hive, Pig, and Apache Accumulo.
Target Students / Prerequisites:
Students must be belonging thereto Background and conversant in Concepts in Java and Linux
Hadoop Architecture:
Introduction to

  • Parallel Computer vs. Distributed Computing
  • How to install Hadoop on your system
  • How to install Hadoop cluster on multiple
  • Hadoop Daemons introduction: NameNode, DataNode, JobTracker, TaskTracker
  • Explore HDFS (Hadoop Distributed File System) Explore the HDFS Apache Web UI,
  • NameNode architectures (EditLog, FsImage, location of replica) Secondary NameNode architectures
  • DataNode architecture

MapReduce Architecture:

  • Exploring JobTracker/TaskTracker
  • How a client submits a Map-Reduce job
  • Exploring Mapper/Reducer/Combiner-Shuffle: Sort & Partition
  • Input/output formats
  • Job Scheduling for: (FIFO, Fair Scheduler, Capacity Scheduler) Exploring the Apache MapReduce Web UI

Hadoop Developer Tasks:

  • Writing a map-reduce programme
  • Reading and writing data using
  • Java Hadoop Eclipse integration
  • Mapper in details
  • Reducer in details
  • Using Combiners
  • Reducing Intermediate Data with Combiners
  • Writing Partitioners for Better Load
  • Balancing Sorting in HDFS
  • Searching in HDFS
  • Indexing in HDFS
  • Hands-On Exercise


Hadoop Administrative Tasks:

  • Routine Administrative Procedures,
  • Understanding dfsadmin and mradmin Block Scanner, Balancer
  • Health Check & Safe mode,
  • DataNode commissioning/decommissioning,
  • Monitoring and Debugging on a production
  • cluster NameNode copying and Recovery,
  • ACL (Access control list) Upgrading Hadoop,


HBase Architecture:

  • Introduction to HBase
  • HBase vs. RDBMS
  • Exploring HBase Master & region server
  • Column Families and Regions
  • Basic HBase shell commands.


Hive Architecture:

  • Introduction to Hive
  • HBase vs Hive
  • Installation of Hive
  • HQL (Hive query language)
  • Basic Hive commands


Pig Architecture:

  • Introduction to Pig
  • Installation of Pig on your system
  • Basic Pig commands
  • Hands-On Exercise


Sqoop Architecture:

  • Introduction to Sqoop
  • Installation of Sqoop on your system
  • Import/Export data from RDBMS to HDFS
  • Import/Export data from RDBMS to HBase
  • Import/Export data from RDBMS to Hive
  • Hands-On Exercise


Mini Project / POC ( Proof of Concept ):

  • Facebook-Hive POC
  • Usages of Hadoop/Hive @ Facebook
  • Static & dynamic partitioning
  • UDF ( User defined functions )

As per your requirement course we wil schedule the timings.Our support is 24×7. The one solution to beat hectic schedule and travelling time and achieve these goals is by joining online courses. Most experienced and specialized instructors are going to be assigned to you.

Your Q? Here