Call +65 6100 0613 Email:

Instructor-led Classroom Adult Training in Singapore - Modular Fast Track Skill-Based Trainings

Big Data Analysis with Apache Hive

Apache Hive is a tool of choice for many data scientists because it allows them to work with SQL, a familiar syntax, to derive insights from Hadoop, reflecting the information that businesses seek to plan effectively.This course shows how to use Hive to process data, structure and optimize your data. The course will also show how to use  HUE, the Hadoop user interface, to leverage HiveQL when analyzing data..

Topics include:

  • Defining data structures in Hive
  • Selecting data
  • Joining tables
  • Manipulating data
  • Filtering results
  • Aggregating data
  • Using built-in aggregate functions
  • Mastering built-in table-generating functions
  • Using CUBE and ROLLUP
  • Using clauses: WHERE and HAVING
  • Using LIKE, JOIN, and SEMI JOIN
  • Using functions: String, math, date, and conditional

After your registered for the course, you can apply SSG grants below

Click here to apply SkillsFuture Credit for Individual

Click here to apply WSG Absentee Payroll for Company
Course Code: CRS-N-0034637

Course Booking

$298.00 (GST-exclusive)

Course Date

Course Time

* Required Fields

Course Cancellation/Reschedule Policy

We reserve the right to cancel or re-schedule the course due to unforeseen circumstances. If the course is cancelled, we will refund 100% to participants.
Note the venue of the training is subject to changes due to class size and availability of the classroom.
Note the minimal class size to start a class is 3 Pax.

Training Grant and Subsidy

All Singaporeans aged 25 and above can use their $500 SkillsFuture Credit from the government to pay for a wide range of approved skills-related courses. Visit the SkillsFuture Credit website to choose from the courses available on the SkillsFuture Credit course directory

Course Details

Module 1: Get Started on Apache Hive

  • Why Hive?
  • Hive Concepts and Setup
  • Setting Up Demo Environment

Module 2: Manipulating Data in Hive

  • Understanding Data Structures in Hive
  • Ceating Tables in Hive
  • Handling CSV files in Hive
  • Partitioning Tables

Module 3: Getting Data from Hive

  • Getting data with SELECT
  • Retrieving Data from Complex Structures

Module 4: Aggregating Data with Hive

  • Simple Aggregations
  • Grouping Sets
  • Using CUBE and ROLLUP

Module 5: Filtering Reults with Hive

  • Simple filter with WHERE
  • Filtering aggregates with HAVING
  • Finding similar values with LIKE

Module 6: Joining Tables 

  • Comibining tables with JOIN
  • Where to use SEMI JOIN
  • Joining multiple tables together

Module 7: Manipulating Data

  • Data Manipulating Functions
  • String Functions
  • Math Functions
  • Date Functions
  • Conditonal Functions

Who Should Attend

  • Data Scientists
  • Data Analysts
  • Big Data Analysts




Big Data TrainerSiva Kumar is a Bigdata solution architect with 10 years of IT experience in designing and architecture solutions for the Big Data domain and has been involved with several complex engagements. Technical blogger and owner of and 4+ years of experienced in big data related technologies training across USA, Canada, Singapore and India. Technical strengths include Apache Hadoop (Cloudera and Hortonworks), YARN, Mapreduce, Hive, Sqoop, Flume, Pig, HBase, Phoenix, Oozie, Kafka, Spark, MySQL, Postgres, Oracle, Netezza, Teradata, Java, Scala and Python, Apache Flink, Alluxio, Cassandra, MongoDB. Exposure in Banking, Insurance, Retail domain

Bid Data TrainerDr. Sarita Singh received her Ph.D. degree for her work done in the area of Information Security. She is the recipient of the prestigious Infosys fellowship for pursuing her Ph.D. Programme. She has more than twenty-five years of teaching and research experience in Singapore, Malaysia and India in the field of Programming, Information Security, Web-application Development, Computer Networks and Engineering related modules.

She has presented papers at several National and International Conferences and has written articles for magazines. She has authored text-books for Engineering courses as well.

Big Data TrainerKannan Gopal has a Masters in Computer Applications with over 20 years of experience in creating, delivering and supporting, products, projects and program in the area of enterprise data ops and analytics. Lately he has been involved in the creation of the digital banking app and working on IoT to create smart organizations. His expertise is in using data tools for analysis and visualization to solve challenges in organizations that rely on data for strategic and tactical decisions. He has experience in Design Thinking (HCD- human centered design) techniques and in leading organizations in the quest of digital transformation with data as its core for decisions and information.

Through his career, he has also developed teams and trained / coached many consultants and developers on SAP BI and related products and worked extensively in Europe and Asia. Through his career, he has developed solutions in the area of data modeling, data analysis and analytics – application of tools and process for solving customer issues. While at SAP, his team has won numerous patents for developing and applying data mining methods to enterprise management systems.

His keen interest lies in sharing and collaborating with people on new technologies and to create best practices in the area of analytics.

Big Data TrainerSunny Prakash is a Big Data Architect with Programming Background . He has around 7 years of IT experience. He has worked on Various Enterprise level solution. He is skilled in Cloud computing and Big Data Solution building in various Industrial sector and domain.

With an Strong background in Coding skills, he has a strong grips on Java,Python ,Scala ,HTML,JavaScript, Node Programming. Data Science is another specification where he has worked with larger Enterprises for finding their Data Insight and building Machine Learning mode

Big Data TrainerMohan is certified cloud ear administrator and working as Big data architect in one of telecom Company. He has more than 12 years experience in the IT industry. His skill sets are Expertise in CDH,Hadoop, Spark, Scala, Hive/Hive-server2, Sqoop/Sqoop-Server2, Impala, Sentry, Kafka, solr, Hbase, Pig, Impala, Oozie, Fair scheduler,AWS,Map R. Provide the Big data consultancy services on Planning and sizing of Hadoop Cluster. Backup and recovery strategies, HA, Configure Hadoop security and authorization rules for Hive/Impala Jobs. Performance tunning of Map reduce Jobs, Spark jobs, Solr. Best Recommendations of Hardware and software for Solr,Hbase, Kafka,Zookeeper service ,Hadoop storage formats. Data Analysis using Impala, Impyla, spark. He graduated from S.V.University from India with a

Write Your Own Review

You're reviewing: Big Data Analysis with Apache Hive

How do you rate this product? *

  1 star 2 stars 3 stars 4 stars 5 stars
1. Do you find the course meet your expectation?
2. Do you find the trainer knowledgeable in this subject?
3. How do you find the training environment
  • Reload captcha


Use spaces to separate Subjects. Use single quotes (') for phrases.

You May Be Interested In These Courses

Apache Spark Essential Training

Apache Spark Essential Training

8 Review(s)
$298.00 (GST-exclusive)
Apache Hbase Training

Apache Hbase Training

$298.00 (GST-exclusive)
Apache Solr Search Platform Training

Apache Solr Search Platform Training

4 Review(s)
$298.00 (GST-exclusive)
Apache Hadoop Big Data Training

Apache Hadoop Big Data Training

9 Review(s)
$498.00 (GST-exclusive)
Machine Learning with Apache Spark

Machine Learning with Apache Spark

1 Review(s)
$298.00 (GST-exclusive)