Call +65 6100 0613

Instructor-Led Classroom Adult Training in Singapore - Learn New Skills to Enhance Your Employability from our SkillsFuture Courses

Hadoop Fundamental Training

This 3 days Cloudera Manager Analyst training explains Pig, Hive,Impala, metdata data management and Sqoop. You will learn The basic syntax of Pig Latin, How to load and store data using pig ,How to sort and filter data in Pig, How to use many of Pig’s built-in functions for data processing, How Pig uses bags ,tuples, maps to represent complex data,

Topics include:

  • Aggregate functions in Pig Latin
  • iterate through records in complex data structures
  • Grouping to combine data from multiple sources
  • Join operations in Pig, How to split a single data set into multiple relations
  • Hive and Impala different from a relation database
  • Databases and tables with Hive and Impala,
  • HiveQL,Impala SQL,SQL syntax compare,
  • Data types HiveQL and Impala SQL supports,
  • Create databases,tables,views,
  • Load data into tables
  • Alter and remove tables
  • Aave query results into tables and files


SkillsFuture Credit Applicable for Individual

WDA Training Grant Applicable for Company

Course Code: CRS-N-0040689

Course Booking

$850.00

Course Date

Course Time

* Required Fields

Course Cancellation/Reschedule Policy

We reserve the right to cancel or re-schedule the course due to unforeseen circumstances. If the course is cancelled, we will refund 100% to participants.
Note the venue of the training is subject to changes due to class size and availability of the classroom.
Note the minimal class size to start a class is 3 Pax.

Course Details

Day1

Module 1: Get Started on Apache Hadoop

  • Why Hadoop?
  • Core Hadoop Components
  • Fundamental Concepts

Module 2:Hadoop cluster Installation using Cloudera manager

  • Rationale for Cluster Management Solution
  • Cloudera Manager Features
  • Cloudera Manager Installation using parcels
  • Hiveserver2, Pig, Impala, Hue, Sqoop Configuration
  • Impala load balancer

Module 3: MapReduce and Spark on YARN

  • Basic MapReduce Concepts
  •  Apache Spark concepts
  •  YARN Cluster Architecture
  • Optimize Spark jobs
  •  Resource Allocation
  •  Failure Recovery
  •  Using the YARN Web UI
  • YARN Application Logs

Day 2

Module 4: Hadoop Configuration and Daemon Logs

  • Cloudera Manager Constructs for Managing Configuration
  • Location Configuration and Applying Configuration changes
  • Managing Role Instances and Add Services
  • Configure the HDFS service
  • Configure the YARN Service
  • Configure Hadoop Daemon Logs

Module 5: Advanced Cluster Configuration

  • Configuring Hadoop Ports
  • Explicitly Including and Excluding Hosts
  • Configuring HDFS for Rack Awareness
  • Configuring HDFS High Availability using Cloudera Manager

Module 6: Managing Resources using Cloudera Manager

  • Configuring croups with Static Service Pools
  • The Fair Scheduler
  • Configuring Dynamic Resource Pools
  • Configure Static Resource Pools
  • YARN Memory and CPU Settings
  • Impala Query Scheduling

Day 3

Module 7: Cluster Maintenance using Cloudera Manager

  • Checking HDFS Status
  • Copying Data Between Clusters
  • Adding and Removing Cluster Nodes
  • Rebalancing the Cluster
  • Cluster Upgrading 

Module 8: Cluster Monitoring and Troubleshooting

  • Cloudera Manager Monitoring Features
  • Setup Alerts and Metrics using Cloudera Manager
  • Cloudera Manager Dash boards to Monitor Metrics
  • Monitor Hadoop Clusters using Cloudera Manager
  • Troubleshooting Hadoop Clusters
  • Common Misconfigurations

 

Who Should Attend

  • Data Scientists
  • Data Analyts
  • Hadoop Administrator
  • Big Data Analysts

Prerequisite

Nil

Trainers

Big Data TrainerMohan is certified cloud ear administrator and working as Big data architect in one of telecom Company. He has more than 12 years experience in the IT industry. His skill sets are Expertise in CDH,Hadoop, Spark, Scala, Hive/Hive-server2, Sqoop/Sqoop-Server2, Impala, Sentry, Kafka, solr, Hbase, Pig, Impala, Oozie, Fair scheduler,AWS,Map R. Provide the Big data consultancy services on Planning and sizing of Hadoop Cluster. Backup and recovery strategies, HA, Configure Hadoop security and authorization rules for Hive/Impala Jobs. Performance tunning of Map reduce Jobs, Spark jobs, Solr. Best Recommendations of Hardware and software for Solr,Hbase, Kafka,Zookeeper service ,Hadoop storage formats. Data Analysis using Impala, Impyla, spark. He graduated from S.V.University from India with a B.tech

Write Your Own Review

You're reviewing: Hadoop Fundamental Training

How do you rate this product? *

  1 star 2 stars 3 stars 4 stars 5 stars
1. Do you find the course meet your expectation?
2. Do you find the trainer knowledgeable in this subject?
3. How do you find the training environment
  • Reload captcha

Tags

Use spaces to separate Subjects. Use single quotes (') for phrases.

You May Be Interested In These Courses