
Instructor-Led Classroom Adult Training in Singapore - Learn New Skills to Enhance Your Employability from our SkillsFuture Courses

Text Mining with R

It is estimated that over 70% of potentially usable business information is unstructured, often in the form of text data. Text mining provides a collection of techniques for deriving actionable insights from this data.

This course covers the main tools and techniques for mining and analyzing text data to discover interesting patterns, extract useful knowledge, and support decision making, with an emphasis on statistical approaches to making sense of unstructured data. You will work through a live example of extracting data from the web and carry out each stage of text mining in R.

The topics include:

  • Sentiment analysis
  • Word clouds
  • N-grams
  • Topic modelling
  • Latent Dirichlet Allocation (LDA)
  • Extracting text from social media


Click here to submit SkillsFuture Credit for Individual

SSG WSG SkillsConnect WDA Absentee Payroll for Company

Course Code: CRS-N-0044088

Course Booking

$298.00

Course Cancellation/Reschedule Policy

We reserve the right to cancel or reschedule the course due to unforeseen circumstances. If the course is cancelled, participants will receive a 100% refund.
Note that the training venue is subject to change depending on class size and classroom availability.
Note that a minimum of 3 participants is required for a class to start.


Training Grant and Subsidy

All Singaporeans aged 25 and above can use their $500 SkillsFuture Credit from the government to pay for a wide range of approved skills-related courses. Visit the SkillsFuture Credit website, www.skillsfuture.sg/credit, to choose from the courses available in the SkillsFuture Credit course directory.

Course Details

Module 1: Introduction

  • What is text mining
  • Applications of text mining

Module 2: Basic Text Functions

  • Text manipulation functions
  • Working with strings
  • Working with gsub
  • Advanced methods
  • Convert to corpus

Module 3: Importing Data

  • Converting DOCX to a corpus
  • Converting PDF to a corpus
  • Converting HTML to a corpus
  • Web scraping

Module 4: Tidytext Package

  • Tidying text objects
  • Tidying document-term matrix objects
  • Tidying document-frequency matrix objects
  • Tidying corpus objects
  • Mining literary works

Module 5: Word Frequencies & Relationships

  • Pre-processing text
  • Word clouds
  • Frequency analysis
  • N-grams & bigrams
  • Bigrams for sentiment analysis
  • Visualizing bigram networks

Module 6: Sentiment Analysis

  • Sentiment libraries
  • Analyzing positive & negative words
  • Comparing 3 sentiment libraries
  • Common positive & negative words

Module 7: Topic Modelling

  • Latent Semantic Indexing (LSI)
  • Latent Dirichlet Allocation (LDA)
  • Word-topic probabilities
  • Document-topic probabilities
  • Chapter probabilities
  • Per-document classification

Module 8: Document Similarity & Classifier

  • Text alignment & pairwise comparison
  • Minhashing and locality-sensitive hashing
  • Extracting keywords
  • Classify by location, language, topic

Module 9: Working with Internet and Social Media Data (Optional)

  • Extracting data from Amazon
  • Extracting data from Twitter
  • Extracting YouTube comments
  • Extracting Facebook comments


Who Should Attend

  • Data Scientists
  • Data Analysts
  • Finance Analysts
  • Marketers

Prerequisite

Basic knowledge of R is assumed.

Trainers

R Trainer: Dwight Nuwan Fonseka has a degree in Biotechnology (from NUS), an Advanced Diploma in Pharmaceutical Management (from MDIS), and a Master's in Education (from NTU). He has 8 years of experience teaching biology at O level, A level, and IB level in international schools in Singapore and overseas.

R Programming Trainer: Ravi Kumar Tiwari received his PhD in Chemical Engineering from NUS in 2013. After graduation, he worked for 3 years as a research scientist at the Institute of High Performance Computing (IHPC). He is currently a big data analyst working with R at Rakuten. His core skills are R, big data, Hadoop, and machine learning.

Java Trainer: Siva Kumar is an experienced Solutions Architect with a demonstrated history of working in the computer software industry. He is skilled in J2EE Web Services, Oracle Database, Maven, C++, and Apache Kafka, and holds a Bachelor of Technology (B.Tech.) in Computer Science from JNTU University Hyderabad.

Customer Reviews (4)

Will Recommend: Review by Ken
1. Did the course meet your expectations?
2. Did you find the trainer knowledgeable in this subject?
3. How did you find the training environment?
Nil (Posted on 3/29/2018)
Will Recommend: Review by Kie Hian
The trainer is excellent. He was able to communicate the knowledge effectively at a fast pace, whilst still ensuring that all the trainees kept up with the progress. (Posted on 3/29/2018)
Will Recommend: Review by Wymen
Nil (Posted on 1/23/2018)
Will Recommend: Review by lim kah kheng
Do you conduct a Microsoft R Server course? (Posted on 10/27/2017)
