Big Data Analysis with Apache Hive

Apache Hive is a tool of choice for many data scientists because it lets them use SQL, a familiar syntax, to derive insights from data stored in Hadoop and surface the information that businesses need to plan effectively. This course shows how to use Hive to process, structure, and optimize your data. It also shows how to use HUE, the Hadoop user interface, to run HiveQL when analyzing data.

Topics include:

  • Defining data structures in Hive
  • Selecting data
  • Joining tables
  • Manipulating data
  • Filtering results
  • Aggregating data
  • Using built-in aggregate functions
  • Mastering built-in table-generating functions
  • Using CUBE and ROLLUP
  • Using clauses: WHERE and HAVING
  • Using LIKE, JOIN, and SEMI JOIN
  • Using functions: String, math, date, and conditional


Course Code: CRS-N-0034637

Course Booking

$298.00

Course Cancellation/Reschedule Policy

We reserve the right to cancel or re-schedule the course due to unforeseen circumstances. If the course is cancelled, we will refund 100% to participants.
Note that the training venue is subject to change depending on class size and classroom availability.
Note that the minimum class size to start a class is 3 pax.


Course Details

Module 1: Get Started on Apache Hive

  • Why Hive?
  • Hive Concepts and Setup
  • Setting Up Demo Environment

Module 2: Manipulating Data in Hive

  • Understanding Data Structures in Hive
  • Creating Tables in Hive
  • Handling CSV files in Hive
  • Partitioning Tables

Module 3: Getting Data from Hive

  • Getting data with SELECT
  • Retrieving Data from Complex Structures

Module 4: Aggregating Data with Hive

  • Simple Aggregations
  • Grouping Sets
  • Using CUBE and ROLLUP

Module 5: Filtering Results with Hive

  • Simple filter with WHERE
  • Filtering aggregates with HAVING
  • Finding similar values with LIKE

Module 6: Joining Tables 

  • Combining tables with JOIN
  • Where to use SEMI JOIN
  • Joining multiple tables together

Module 7: Manipulating Data

  • Data Manipulation Functions
  • String Functions
  • Math Functions
  • Date Functions
  • Conditional Functions

Who Should Attend

  • Data Scientists
  • Data Analysts
  • Big Data Analysts

Prerequisite

Nil

Trainers

Dr. Sarita Singh (Cyber Security Trainer) received her Ph.D. for her work in the area of information security. She is a recipient of the prestigious Infosys fellowship for pursuing her Ph.D. programme. She has more than twenty-five years of teaching and research experience in Singapore, Malaysia and India in programming, information security, web application development, computer networks and engineering-related modules.

She has presented papers at several national and international conferences and has written articles for magazines. She has also authored textbooks for engineering courses.

Dr. Asankhaya Sharma (Cyber Security Trainer) is a cyber security expert and technology leader with over a decade of experience in creating security products for industry, academia and the open-source community. He is passionate about building high-performing teams and taking innovative products to market. He is also an Adjunct Professor at the Singapore Institute of Technology.

He currently leads the R&D function at SourceClear, a software security startup focussed on building security tools for software developers. Before that, he was a PhD student affiliated with the Programming Languages and Systems Lab at the School of Computing, NUS. His doctoral thesis was on Certified Reasoning for Automated Verification.

Prior to starting his graduate studies, he worked at Microsoft. He was involved in the development of SQL Server 2008 and Visual Studio 2010. He was part of the MSIT Accelerated Professional Experiences program (APEX).

Kannan Gopal (Big Data Trainer) has a Master's in Computer Applications and over 20 years of experience in creating, delivering and supporting products, projects and programs in the area of enterprise data ops and analytics. Lately he has been involved in creating a digital banking app and working on IoT to create smart organizations. His expertise is in using data tools for analysis and visualization to solve challenges in organizations that rely on data for strategic and tactical decisions. He has experience in Design Thinking (HCD, human-centered design) techniques and in leading organizations in the quest for digital transformation with data at the core of decisions and information.

Through his career, he has developed teams and trained and coached many consultants and developers on SAP BI and related products, working extensively in Europe and Asia. He has also developed solutions in data modeling, data analysis and analytics, applying tools and processes to solve customer issues. While he was at SAP, his team was awarded numerous patents for developing and applying data mining methods to enterprise management systems.

His keen interest lies in sharing and collaborating with people on new technologies and in creating best practices in the area of analytics.

Sunny Prakash (Big Data Trainer) is a Big Data architect with a programming background. He has around 7 years of IT experience and has worked on various enterprise-level solutions. He is skilled in cloud computing and in building Big Data solutions across various industry sectors and domains.

With a strong background in coding, he has a firm grasp of Java, Python, Scala, HTML, JavaScript and Node programming. Data science is another specialization, in which he has worked with larger enterprises to find insights in their data and build machine learning models.

Mohan (Big Data Trainer) is a certified Cloudera administrator working as a Big Data architect at a telecom company. He has more than 12 years of experience in the IT industry. His expertise covers CDH, Hadoop, Spark, Scala, Hive/HiveServer2, Sqoop/Sqoop-Server2, Impala, Sentry, Kafka, Solr, HBase, Pig, Oozie, Fair Scheduler, AWS and MapR. He provides Big Data consultancy services on planning and sizing Hadoop clusters; backup and recovery strategies; high availability; configuring Hadoop security and authorization rules for Hive/Impala jobs; performance tuning of MapReduce jobs, Spark jobs and Solr; hardware and software recommendations for Solr, HBase, Kafka and ZooKeeper services; Hadoop storage formats; and data analysis using Impala, Impyla and Spark. He graduated from S.V. University in India with a B.Tech.
