Call +65 6100 0613 Email: enquiry@tertiaryinfotech.com

Instructor-led Classroom Adult Training in Singapore - Modular Fast Track Skill-Based Trainings

Machine Learning with Apache Spark

Machine learning is a type of artificial intelligence (AI) that provides computers with the ability to learn without being explicitly programmed. Machine Learning algorithms comb through data and identify patterns that are too complex to be discerned by the human mind. These patterns can then be used for decision making and action

Apache Spark is a powerful platform that for running Machine Learning. This course will how you how to perforrm various Machine Learning using Apache Spark built in MLib component.

Topics include:

  • Overview of Apache Spark
  • Clustering
  • Regression
  • Classification
  • Recommendation

After your registered for the course, you can apply SSG grants below

Click here to apply SkillsFuture Credit for Individual

Click here to apply WSG Absentee Payroll for Company
Course Code: CRS-N-0042900

Course Booking

$298.00 (GST-exclusive)

Course Date

Course Time

* Required Fields

Course Cancellation/Reschedule Policy

We reserve the right to cancel or re-schedule the course due to unforeseen circumstances. If the course is cancelled, we will refund 100% to participants.
Note the venue of the training is subject to changes due to class size and availability of the classroom.
Note the minimal class size to start a class is 3 Pax.


Training Grant and Subsidy

All Singaporeans aged 25 and above can use their $500 SkillsFuture Credit from the government to pay for a wide range of approved skills-related courses. Visit the SkillsFuture Credit website www.skillsfuture.sg/credit to choose from the courses available on the SkillsFuture Credit course directory

Course Details

Module 1: Apache Spark Basics

  • Recap of Apache Spark Basics 
  • Install Apache Spark on Local Computer
  • Read CSV Data
  • Manipulating Dataframe
  • ML Libraries

Module 2: Preprocessing

  • Normalizer
  • Standardizer
  • Tokenizer
  • TF-IDF

Module 3: Clustering

  • What is Clustering
  • Clustering Algorithms
  • KMeans Clustering
  • Hierarchical Clustering

Module 4: Classification

  • What is Classification
  • Naives Bayes Clasiifier
  • Decision Tree Classifer 
  • Multi Layer Perceptron

Module 5: Regression

  • What is Clustering
  • Clustering Algorithms
  • Linear Regression
  • Decision Tree Regression
  • Gradient Boosted Tree Regression

Module 6: ML Pipeline

  • What is Pipeline
  • Creating a Pipeline for Movie Review Classification

Module 7: Recommendation (Optional)

  • Recommendation Systems
  • Collaborative Filtering

Who Should Attend

  • Big Data Analysts
  • Data Scientists
  • Data Analysts

Prerequisite

Prerequisite

This is a intermediate course. Participants should have basic knowledge on the following subjects:

  • Python
  • Apache Spark

Software Requirement

Download and unzip Apache Spark https://spark.apache.org/downloads.html.

Trainers

Apache Spark TrainerSiva Kumar is a Bigdata solution architect with 10 years of IT experience in designing and architecture solutions for the Big Data domain and has been involved with several complex engagements. Technical blogger and owner of hadooptutorial.info and 4+ years of experienced in big data related technologies training across USA, Canada, Singapore and India. Technical strengths include Apache Hadoop (Cloudera and Hortonworks), YARN, Mapreduce, Hive, Sqoop, Flume, Pig, HBase, Phoenix, Oozie, Kafka, Spark, MySQL, Postgres, Oracle, Netezza, Teradata, Java, Scala and Python, Apache Flink, Alluxio, Cassandra, MongoDB. Exposure in Banking, Insurance, Retail domain

Apache Spark TrainerDr. Sarita Singh received her Ph.D. degree for her work done in the area of Information Security. She is the recipient of the prestigious Infosys fellowship for pursuing her Ph.D. Programme. She has more than twenty-five years of teaching and research experience in Singapore, Malaysia and India in the field of Programming, Information Security, Web-application Development, Computer Networks and Engineering related modules.

She has presented papers at several National and International Conferences and has written articles for magazines. She has authored text-books for Engineering courses as well.

Apache Spark TrainerDr. Alfred Ang is the founder of Tertiary Courses. He is a serial entrepreneur. He founded OSWeb2Design Singapore Pte Ltd in 2007 offering web development, e-commerce store development, graphics design, ebook publishing, mobile apps development, and digital marketing services. He established the first online gardening store in Singapore, Eco City Hydroponics Pte Ltd in 2010, offering a wide range of gardening products such as seeds, plant nutrients, hydroponics kits etc. Eco City Hydroponics has become the most popular and successful gardening store in Singapore. He founded Tertiary Infotech Pte Ltd in 2012 and transformed the business to a training platform, Tertiary Courses in 2014. Tertiary Courses offers a wide range of SkillsFuture courses for PMETs to upgrade their skills and knowledge. He also established Tertiary Courses Malaysia in 2016. He also founded Tertiary Robotics in 2015 offering Arduino, Raspberry Pi, Microbit and Robotics products

Dr. Alfred Ang earned his Ph.D. from National University of Singapore in 2000, majoring in Electrical and Electronics Engineering. He also completed an online MBA course with U21 Global based in Australia. He obtained his B.Sc (Hons) from National University of Singapore in 1992, majoring in Physics. He topped his Physics cohort for 3 consecutive years and funded his degree study with Book price, awards and tuition. He has worked in Defence, Electronics and Semiconductor Industries. His current interests include Machine Learning, Deep Learning, Artificial Intelligence, Internet of Things, Robotics and Programming.

Dr. Alfred Ang was Distinguished Toastmasters (DTM) and Senior Member of IEEE. He has published more than 20 peer reviewed papers and co-inventors for more than 20 inventions.

Apache Spark TrainerSunny Prakash is a Big Data Architect with Programming Background . He has around 7 years of IT experience. He has worked on Various Enterprise level solution. He is skilled in Cloud computing and Big Data Solution building in various Industrial sector and domain.

With an Strong background in Coding skills, he has a strong grips on Java,Python ,Scala ,HTML,JavaScript, Node Programming. Data Science is another specification where he has worked with larger Enterprises for finding their Data Insight and building Machine Learning mode

Customer Reviews (3)

Might RecommendReview by ST
1. Do you find the course meet your expectation?
2. Do you find the trainer knowledgeable in this subject?
3. How do you find the training environment
Maybe can try using more complex data, because real life data is not usually clean (Posted on 12/9/2018)
Will RecommendReview by Steve
1. Do you find the course meet your expectation?
2. Do you find the trainer knowledgeable in this subject?
3. How do you find the training environment
The module is excellent for industrial orientated applications. It is suggested to have the case study details in the optional module. (Posted on 12/9/2018)
NilReview by PS
1. Do you find the course meet your expectation?
2. Do you find the trainer knowledgeable in this subject?
3. How do you find the training environment
Nil (Posted on 8/7/2017)

Write Your Own Review

You're reviewing: Machine Learning with Apache Spark

How do you rate this product? *

  1 star 2 stars 3 stars 4 stars 5 stars
1. Do you find the course meet your expectation?
2. Do you find the trainer knowledgeable in this subject?
3. How do you find the training environment
  • Reload captcha

Tags

Use spaces to separate Subjects. Use single quotes (') for phrases.

You May Be Interested In These Courses

Big Data Analysis with Apache Hive

Big Data Analysis with Apache Hive

3 Review(s)
$298.00 (GST-exclusive)
Python Machine Learning with Scikit Learn Training

Python Machine Learning with Scikit Learn Training

22 Review(s)
$298.00 (GST-exclusive)
R Machine Learning Training

R Machine Learning Training

12 Review(s)
$298.00 (GST-exclusive)
Apache Spark Essential Training

Apache Spark Essential Training

8 Review(s)
$298.00 (GST-exclusive)
Solving Problems with Machine Learning

Solving Problems with Machine Learning

2 Review(s)
$298.00 (GST-exclusive)
Deep Learning and Machine Learning with TensorFlow

Deep Learning and Machine Learning with TensorFlow

35 Review(s)
$498.00 (GST-exclusive)
Apache Hbase Training

Apache Hbase Training

$298.00 (GST-exclusive)
Apache Solr Search Platform Training

Apache Solr Search Platform Training

5 Review(s)
$298.00 (GST-exclusive)
Machine Learning for Network Security

Machine Learning for Network Security

8 Review(s)
$298.00 (GST-exclusive)