How to Troubleshoot Apps for the Modern Connected Worker
Data Mining- Big Data landscape
1. Data Mining & Big Data
Landscape
Thanh Luong
30th Aug 2016
2. Agenda
• Data Mining, Data Science
• Data Analytics in Business
• Big Data
• Data Analytics Techniques
oClassification (Decision tree, ANN, K-nearest neighbor, Bayesian)
oRegression
oCluster
oText Mining
oSVM
oMachine Learning
10. Data types vs mining methods
• Data types and models
oFlat data tables
oRelational databases
oTemporal & spatial data
oTransactional databases
oMultimedia data
oGenome databases
oMaterials science data
oTextual data
oWeb data
oEtc.
• Mining tasks and method
oClassification / Prediction
Decision trees
Bayesian classification
Neural networks
Rule induction
Support vector machine (SVM)
Hidden Markov Model
Etc
oDescription
Association analysis
Clustering
Summarization
Etc.
17. Study 2: Regression
• List all the variable available for making the model
• Establish a Dependent Variable (DV) of interest
• Examine visual (if possible) relationships between variables of interest
• Find a way to predict DV using other variables