SlideShare uma empresa Scribd logo
1 de 36
Baixar para ler offline
Team: NCCU
A Linear Ensemble of Classification Models
with Novel Backward Cumulative Features
for MOOC Dropout Prediction
Chih-Ming Chen, Man-Kwan Shan,

Ming-Feng Tsai, Yi-Hsuan Yang,

Hsin-Ping Chen, Pei-Wen Yeh,

and Sin-Ya Peng
Research Center for

Information Technology Innovation,

Academia Sinica
Department of Computer Science,
National Chengchi University
Team: NCCU
A Linear Ensemble of Classification Models
with Novel Backward Cumulative Features
for MOOC Dropout Prediction
Chih-Ming Chen, Man-Kwan Shan,

Ming-Feng Tsai, Yi-Hsuan Yang,

Hsin-Ping Chen, Pei-Wen Yeh,

and Sin-Ya Peng
Research Center for

Information Technology Innovation,

Academia Sinica
Department of Computer Science,
National Chengchi University
linearly combination of several models
the proposed data engineering method
it’s able to generate a bunch of distinct feature sets
Key Point Summary
Latent Space Representation

— Clustering Model

— Skip-Gram Model
Backward Cumulative Features

— Generate 30 distinct sets of features
Linear Model

+

Tree-based Model
alleviate the feature sparsity problem
alleviate the bias problem of statistical feature
good match
weakness when using sparse feature
Workflow
Train Data
(75%)
Validate Data
(25%)
Train Data
0"
50000"
100000"
150000"
200000"
10/27/2013"
11/27/2013"
12/27/2013"1/27/2014"2/27/2014"
3/27/2014"4/27/2014"
5/27/2014"6/27/2014"
7/27/2014"
Training'Date'Distribu.on
0"
20000"
40000"
60000"
80000"
100000"
120000"
140000"
10/27/2013"
11/27/2013"
12/27/2013"1/27/2014"2/27/2014"
3/27/2014"4/27/2014"
5/27/2014"6/27/2014"
7/27/2014"
Tes.ng'Date'Distribu.on
Split the training data based on

the time distribution.

stable results
— 2 settings
Workflow
Train Data
(75%)
Validate Data
(25%)
Learned

Model Offline
Evaluation
Test Data
Train Data
Submission
method 1
cross-validation
— 2 settings
check if it leads to better performance
Workflow
Train Data
(75%)
Validate Data
(25%)
Learned

Model Offline
Evaluation
Test Data
Train Data
Learned

Model
Submission
method 2
— 2 settings
check if it leads to better performance
Workflow
Train Data
(75%)
Validate Data
(25%)
Learned

Model Offline
Evaluation
Test Data
Train Data
Learned

Model
Submission
— 2 settings
Prediction Model Overview
Logistic Regression
Gradient Boosting
Classifier
Raw

Data
Support Vector

Classifier
Student
Course
Time
A classical approach to a general prediction task.
— 2 solutions
Features
Prediction Model Overview
Logistic Regression
Gradient Boosting
Classifier
Gradient Boosting
Decision Trees
Raw

Data
Support Vector

Classifier
Student
Course
Time
Backward
Cumulation
the feature engineering towards the MOOC dataset.
— 2 solutions
Features
Prediction Model Overview
Logistic Regression
Gradient Boosting
Classifier
Gradient Boosting
Decision Trees
Linear

Combination
Final

Prediction
Raw

Data
Support Vector

Classifier
Student
Course
Time
Backward
Cumulation
— 2 solutions
Features
Prediction Model Overview
Logistic Regression
Gradient Boosting
Classifier
Gradient Boosting
Decision Trees
Linear

Combination
Final

Prediction
Raw

Data
Support Vector

Classifier
Student
Course
Time
Backward
Cumulation
solution 1
solution 2
xgboost
scikit-learn
— 2 solutions
http://scikit-learn.org/stable/https://github.com/dmlc/xgboost
Prediction Model Overview
Logistic Regression
Gradient Boosting
Classifier
Gradient Boosting
Decision Trees
Linear

Combination
Final

Prediction
Raw

Data
Support Vector

Classifier
Student
Course
Time
Backward
Cumulation
Feature Extraction / Feature Engineering
— 2 solutions
Feature Extraction
Student
Course
Time
Enrolment ID
• Bag-of-words Feature

— Boolean (0/1)

— Term Frequency (TF)
• Probability Value

— Ratio

— Naive Bayes
• Latent Space Feature

— Clustering

— DeepWalk
Raw

Data
Feature Extraction
Student
Course
Time
Enrolment ID
• Bag-of-words Feature

— Boolean (0/1)

— Term Frequency (TF)
• Probability Value

— Ratio

— Naive Bayes
• Latent Space Feature

— Clustering

— DeepWalk
Raw

Data
describing the status
e.g.

the month of the course

the number of registration
video 5
problem 10
wiki 0
discussion 2
navigation 0
Feature Extraction
Student
Course
Time
Enrolment ID
• Bag-of-words Feature

— Boolean (0/1)

— Term Frequency (TF)
• Probability Value

— Ratio

— Naive Bayes
• Latent Space Feature

— Clustering

— DeepWalk
Raw

Data
dropout probability
P( dropout|containing objects )
:= P(O1|dropout) … P(Od|dropout)
O = {O1, O2, …, Od}objects:
e.g.

the dropout ratio of the course
estimate the probability from observed data
Feature Extraction
Student
Course
Time
Enrolment ID
• Bag-of-words Feature

— Boolean (0/1)

— Term Frequency (TF)
• Probability Value

— Ratio

— Naive Bayes
• Latent Space Feature

— Clustering

— DeepWalk
Raw

Data
Latent Topic
K-means Clustering on

1. registered courses

2. containing objects
some features are sparse
DeepWalk / Skip-Gram

for obtaining a dense

feature representation

DeepWalk
https://github.com/phanein/deepwalk
The Goal — Find the representation of each node of a graph.
It’s an extension work of word2vec’s Skip-Gram model.
DeepWalk
https://github.com/phanein/deepwalk
The Goal — Find the representation of each node of a graph.
It’s an extension work of word2vec’s Skip-Gram model.
The core is to model the context information.

(in practical, the node’s neighbours)
Similar objects are mapped into similar space.
From DeepWalk to the MOOC Problem
U1
U2
U3
Course A
Course B
Course C
Course D
Course E
https://github.com/phanein/deepwalk
From DeepWalk to the MOOC Problem
U1
U2
U3
Course A
Course B
Course C
Course D
U1
Course B
Course E
Course C
U2 U1
Random Walk
https://github.com/phanein/deepwalk
Treat Random Walks on heterogeneous graph

as the sentence.
From DeepWalk to the MOOC Problem
U1
U2
U3
Course A
Course B
Course C
Course D
U1
Course B
Course E
Course C
U2 U1
Random Walk
https://github.com/phanein/deepwalk
Treat Random Walks on heterogeneous graph

as the sentence.
U1
0.3 0.2 -0.1 0.5 -0.8 Course B 0.1 0.3 -0.5 1.2 -0.3
Performance
Bag-of-words Bag-of-words
Probability
Bag-of-words
Probability
Naive Bayes
Bag-of-words
Probability
Naive Bayes
Latent Space
> 0.890 > 0.901 > 0.902 > 0.903
Backward
Cumulation
Models
Combination
Backward Cumulative Features — Motivation
O X O X O O X O X X X X X X O
X X X X O X X X X X O O O X X
X X X X X X X X X X X O X O X
Logs Table
10/13 10/14 10/15 10/16 10/17 10/18 10/19 10/20 10/21 10/22 10/23 10/24 10/25 10/26 10/27
U1
U2
U3
Backward Cumulative Features — Motivation
O X O X O O X O O X X X X X O
X X X X O X X X X X O O O X X
X X X X X X X X X X X O X O X
different period
Logs Table
different number of logs
10/13 10/14 10/15 10/16 10/17 10/18 10/19 10/20 10/21 10/22 10/23 10/24 10/25 10/26 10/27
U1
U2
U3
Backward Cumulative Features
O X O X O O X O X X X X X X O
X X X X O X X X X X O O O X X
X X X X X X X X X X X O X O X
Consider only the logs in last N days. N=2
10/13 10/14 10/15 10/16 10/17 10/18 10/19 10/20 10/21 10/22 10/23 10/24 10/25 10/26 10/27
U1
U2
U3
Backward Cumulative Features
O X O X O O X O X X X X X X O
X X X X O X X X X X O O O X X
X X X X X X X X X X X O X O X
Consider only the logs in last N days. N=3
10/13 10/14 10/15 10/16 10/17 10/18 10/19 10/20 10/21 10/22 10/23 10/24 10/25 10/26 10/27
U1
U2
U3
Backward Cumulative Features
O X O X O O X O X X X X X X O
X X X X O X X X X X O O O X X
X X X X X X X X X X X O X O X
Consider only the logs in last N days. N=4
10/13 10/14 10/15 10/16 10/17 10/18 10/19 10/20 10/21 10/22 10/23 10/24 10/25 10/26 10/27
U1
U2
U3
Backward Cumulative Features
O X O X O O X O X X X X X X O
X X X X O X X X X X O O O X X
X X X X X X X X X X X O X O X
Consider only the logs in last N days. N=5
10/13 10/14 10/15 10/16 10/17 10/18 10/19 10/20 10/21 10/22 10/23 10/24 10/25 10/26 10/27
U1
U2
U3
Backward Cumulative Features
Raw

Data
Student
Course
Time
Backward
Cumulation
.
.
.
Feature Set 1
N=1
N=2
N=3
N=29
N=30
Feature Set 2
Feature Set 3
.
.
.
Feature Set 29
Feature Set 30
— 2 strategies
Backward Cumulative Features
Raw

Data
Student
Course
Time
Backward
Cumulation
.
.
.
Feature Set 1
N=1
N=2
N=3
N=29
N=30
Feature Set 2
Feature Set 3
.
.
.
Feature Set 29
Feature Set 30
Classifier
Strategy 1.

Concatenate all features.
— 2 strategies
Backward Cumulative Features
Raw

Data
Student
Course
Time
Backward
Cumulation
.
.
.
Feature Set 1
N=1
N=2
N=3
N=29
N=30
Feature Set 2
Feature Set 3
.
.
.
Feature Set 29
Feature Set 30
Classifier
Classifier
Classifier
Classifier
Classifier
Strategy 2.

Build 30 distinct models.
Average
— 2 strategies
Prediction Model Overview
Logistic Regression
Gradient Boosting
Classifier
Gradient Boosting
Decision Trees
Linear

Combination
Final

Prediction
Raw

Data
Support Vector

Classifier
Student
Course
Time
Backward
Cumulation
solution 1 * 0.5
solution 2 * 0.5
xgboost
scikit-learn
What We Learned from the Competition
• Team Work is important

— share ideas

— share solutions
• Model diversity & feature diversity

— diverse models / features can capture different characteristic of the data
• Realize the data

— the goal

— the evaluation metric

— the data structure
What We Learned from the Competition
• Team Work is important

— share ideas

— share solutions
• Model diversity & feature diversity

— diverse models / features can capture different characteristic of the data
• Realize the data

— the goal

— the evaluation metric

— the data structure
Start earlier …
What We Learned from the Competition
• Team Work is important

— share ideas

— share solutions
• Model diversity & feature diversity

— diverse models / features can capture different characteristic of the data
• Realize the data

— the goal

— the evaluation metric

— the data structure
Start earlier …
Feature Format
Data Partition
Feature Scale
several things to be discussed
e.g.
changecandy [at] gmail.com
Any Question?

Mais conteúdo relacionado

Mais procurados

[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...
[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...
[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...npinto
 
Loss Calibrated Variational Inference
Loss Calibrated Variational InferenceLoss Calibrated Variational Inference
Loss Calibrated Variational InferenceTomasz Kusmierczyk
 
H2O World - Generalized Low Rank Models - Madeleine Udell
H2O World - Generalized Low Rank Models - Madeleine UdellH2O World - Generalized Low Rank Models - Madeleine Udell
H2O World - Generalized Low Rank Models - Madeleine UdellSri Ambati
 
Automatic variational inference with latent categorical variables
Automatic variational inference with latent categorical variablesAutomatic variational inference with latent categorical variables
Automatic variational inference with latent categorical variablesTomasz Kusmierczyk
 
Towards typesafe deep learning in scala
Towards typesafe deep learning in scalaTowards typesafe deep learning in scala
Towards typesafe deep learning in scalaTongfei Chen
 
H2O World - Consensus Optimization and Machine Learning - Stephen Boyd
H2O World - Consensus Optimization and Machine Learning - Stephen BoydH2O World - Consensus Optimization and Machine Learning - Stephen Boyd
H2O World - Consensus Optimization and Machine Learning - Stephen BoydSri Ambati
 
Dynamic Feature Induction: The Last Gist to the State-of-the-Art
Dynamic Feature Induction: The Last Gist to the State-of-the-ArtDynamic Feature Induction: The Last Gist to the State-of-the-Art
Dynamic Feature Induction: The Last Gist to the State-of-the-ArtJinho Choi
 
Monte Carlo Tree Search in 2014 (MCMC days in Marseille)
Monte Carlo Tree Search in 2014 (MCMC days in Marseille)Monte Carlo Tree Search in 2014 (MCMC days in Marseille)
Monte Carlo Tree Search in 2014 (MCMC days in Marseille)Olivier Teytaud
 
A nonlinear approximation of the Bayesian Update formula
A nonlinear approximation of the Bayesian Update formulaA nonlinear approximation of the Bayesian Update formula
A nonlinear approximation of the Bayesian Update formulaAlexander Litvinenko
 
Actors for Behavioural Simulation
Actors for Behavioural SimulationActors for Behavioural Simulation
Actors for Behavioural SimulationClarkTony
 
Efficient Hill Climber for Multi-Objective Pseudo-Boolean Optimization
Efficient Hill Climber for Multi-Objective Pseudo-Boolean OptimizationEfficient Hill Climber for Multi-Objective Pseudo-Boolean Optimization
Efficient Hill Climber for Multi-Objective Pseudo-Boolean Optimizationjfrchicanog
 
Hyperparameter optimization with approximate gradient
Hyperparameter optimization with approximate gradientHyperparameter optimization with approximate gradient
Hyperparameter optimization with approximate gradientFabian Pedregosa
 
11 Machine Learning Important Issues in Machine Learning
11 Machine Learning Important Issues in Machine Learning11 Machine Learning Important Issues in Machine Learning
11 Machine Learning Important Issues in Machine LearningAndres Mendez-Vazquez
 
Aaex5 group2(中英夾雜)
Aaex5 group2(中英夾雜)Aaex5 group2(中英夾雜)
Aaex5 group2(中英夾雜)Shiang-Yun Yang
 

Mais procurados (20)

[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...
[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...
[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
 
Loss Calibrated Variational Inference
Loss Calibrated Variational InferenceLoss Calibrated Variational Inference
Loss Calibrated Variational Inference
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
 
H2O World - Generalized Low Rank Models - Madeleine Udell
H2O World - Generalized Low Rank Models - Madeleine UdellH2O World - Generalized Low Rank Models - Madeleine Udell
H2O World - Generalized Low Rank Models - Madeleine Udell
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
 
Automatic variational inference with latent categorical variables
Automatic variational inference with latent categorical variablesAutomatic variational inference with latent categorical variables
Automatic variational inference with latent categorical variables
 
Towards typesafe deep learning in scala
Towards typesafe deep learning in scalaTowards typesafe deep learning in scala
Towards typesafe deep learning in scala
 
H2O World - Consensus Optimization and Machine Learning - Stephen Boyd
H2O World - Consensus Optimization and Machine Learning - Stephen BoydH2O World - Consensus Optimization and Machine Learning - Stephen Boyd
H2O World - Consensus Optimization and Machine Learning - Stephen Boyd
 
Dynamic Feature Induction: The Last Gist to the State-of-the-Art
Dynamic Feature Induction: The Last Gist to the State-of-the-ArtDynamic Feature Induction: The Last Gist to the State-of-the-Art
Dynamic Feature Induction: The Last Gist to the State-of-the-Art
 
Monte Carlo Tree Search in 2014 (MCMC days in Marseille)
Monte Carlo Tree Search in 2014 (MCMC days in Marseille)Monte Carlo Tree Search in 2014 (MCMC days in Marseille)
Monte Carlo Tree Search in 2014 (MCMC days in Marseille)
 
Claire98
Claire98Claire98
Claire98
 
A nonlinear approximation of the Bayesian Update formula
A nonlinear approximation of the Bayesian Update formulaA nonlinear approximation of the Bayesian Update formula
A nonlinear approximation of the Bayesian Update formula
 
Actors for Behavioural Simulation
Actors for Behavioural SimulationActors for Behavioural Simulation
Actors for Behavioural Simulation
 
Efficient Hill Climber for Multi-Objective Pseudo-Boolean Optimization
Efficient Hill Climber for Multi-Objective Pseudo-Boolean OptimizationEfficient Hill Climber for Multi-Objective Pseudo-Boolean Optimization
Efficient Hill Climber for Multi-Objective Pseudo-Boolean Optimization
 
Hyperparameter optimization with approximate gradient
Hyperparameter optimization with approximate gradientHyperparameter optimization with approximate gradient
Hyperparameter optimization with approximate gradient
 
11 Machine Learning Important Issues in Machine Learning
11 Machine Learning Important Issues in Machine Learning11 Machine Learning Important Issues in Machine Learning
11 Machine Learning Important Issues in Machine Learning
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
 
ICPR 2016
ICPR 2016ICPR 2016
ICPR 2016
 
Aaex5 group2(中英夾雜)
Aaex5 group2(中英夾雜)Aaex5 group2(中英夾雜)
Aaex5 group2(中英夾雜)
 

Semelhante a NCCU Ensemble Models Predict MOOC Dropout

Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...MLconf
 
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...Universitat Politècnica de Catalunya
 
Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017SERC at Carleton College
 
Lecture 06 marco aurelio ranzato - deep learning
Lecture 06   marco aurelio ranzato - deep learningLecture 06   marco aurelio ranzato - deep learning
Lecture 06 marco aurelio ranzato - deep learningmustafa sarac
 
Accelerating Random Forests in Scikit-Learn
Accelerating Random Forests in Scikit-LearnAccelerating Random Forests in Scikit-Learn
Accelerating Random Forests in Scikit-LearnGilles Louppe
 
Surveillance scene classification using machine learning
Surveillance scene classification using machine learningSurveillance scene classification using machine learning
Surveillance scene classification using machine learningUtkarsh Contractor
 
Learning to Rank with Neural Networks
Learning to Rank with Neural NetworksLearning to Rank with Neural Networks
Learning to Rank with Neural NetworksBhaskar Mitra
 
Multimodal Residual Networks for Visual QA
Multimodal Residual Networks for Visual QAMultimodal Residual Networks for Visual QA
Multimodal Residual Networks for Visual QAJin-Hwa Kim
 
Quantum Gaussian Processes - Gawel Kus
Quantum Gaussian Processes - Gawel KusQuantum Gaussian Processes - Gawel Kus
Quantum Gaussian Processes - Gawel KusAdvanced-Concepts-Team
 
20 k-means, k-center, k-meoids and variations
20 k-means, k-center, k-meoids and variations20 k-means, k-center, k-meoids and variations
20 k-means, k-center, k-meoids and variationsAndres Mendez-Vazquez
 
Matrix Factorizations for Recommender Systems
Matrix Factorizations for Recommender SystemsMatrix Factorizations for Recommender Systems
Matrix Factorizations for Recommender SystemsDmitriy Selivanov
 
Ai and ml study group lecture 1 and 2
Ai and ml study group   lecture 1 and 2Ai and ml study group   lecture 1 and 2
Ai and ml study group lecture 1 and 2Ashley Davis
 
Bootstrap Custom Image Classification using Transfer Learning by Danielle Dea...
Bootstrap Custom Image Classification using Transfer Learning by Danielle Dea...Bootstrap Custom Image Classification using Transfer Learning by Danielle Dea...
Bootstrap Custom Image Classification using Transfer Learning by Danielle Dea...Wee Hyong Tok
 
Fast Object Recognition from 3D Depth Data with Extreme Learning Machine
Fast Object Recognition from 3D Depth Data with Extreme Learning MachineFast Object Recognition from 3D Depth Data with Extreme Learning Machine
Fast Object Recognition from 3D Depth Data with Extreme Learning MachineSoma Boubou
 
Learning with classification and clustering, neural networks
Learning with classification and clustering, neural networksLearning with classification and clustering, neural networks
Learning with classification and clustering, neural networksShaun D'Souza
 
TensorFlow and Deep Learning Tips and Tricks
TensorFlow and Deep Learning Tips and TricksTensorFlow and Deep Learning Tips and Tricks
TensorFlow and Deep Learning Tips and TricksBen Ball
 

Semelhante a NCCU Ensemble Models Predict MOOC Dropout (20)

Deep Learning for Computer Vision: Attention Models (UPC 2016)
Deep Learning for Computer Vision: Attention Models (UPC 2016)Deep Learning for Computer Vision: Attention Models (UPC 2016)
Deep Learning for Computer Vision: Attention Models (UPC 2016)
 
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
 
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
 
Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017
 
Lecture 06 marco aurelio ranzato - deep learning
Lecture 06   marco aurelio ranzato - deep learningLecture 06   marco aurelio ranzato - deep learning
Lecture 06 marco aurelio ranzato - deep learning
 
Accelerating Random Forests in Scikit-Learn
Accelerating Random Forests in Scikit-LearnAccelerating Random Forests in Scikit-Learn
Accelerating Random Forests in Scikit-Learn
 
Surveillance scene classification using machine learning
Surveillance scene classification using machine learningSurveillance scene classification using machine learning
Surveillance scene classification using machine learning
 
Learning to Rank with Neural Networks
Learning to Rank with Neural NetworksLearning to Rank with Neural Networks
Learning to Rank with Neural Networks
 
Multimodal Residual Networks for Visual QA
Multimodal Residual Networks for Visual QAMultimodal Residual Networks for Visual QA
Multimodal Residual Networks for Visual QA
 
Quantum Gaussian Processes - Gawel Kus
Quantum Gaussian Processes - Gawel KusQuantum Gaussian Processes - Gawel Kus
Quantum Gaussian Processes - Gawel Kus
 
Backpropagation for Deep Learning
Backpropagation for Deep LearningBackpropagation for Deep Learning
Backpropagation for Deep Learning
 
20 k-means, k-center, k-meoids and variations
20 k-means, k-center, k-meoids and variations20 k-means, k-center, k-meoids and variations
20 k-means, k-center, k-meoids and variations
 
Matrix Factorizations for Recommender Systems
Matrix Factorizations for Recommender SystemsMatrix Factorizations for Recommender Systems
Matrix Factorizations for Recommender Systems
 
Ai and ml study group lecture 1 and 2
Ai and ml study group   lecture 1 and 2Ai and ml study group   lecture 1 and 2
Ai and ml study group lecture 1 and 2
 
Bootstrap Custom Image Classification using Transfer Learning by Danielle Dea...
Bootstrap Custom Image Classification using Transfer Learning by Danielle Dea...Bootstrap Custom Image Classification using Transfer Learning by Danielle Dea...
Bootstrap Custom Image Classification using Transfer Learning by Danielle Dea...
 
Fast Object Recognition from 3D Depth Data with Extreme Learning Machine
Fast Object Recognition from 3D Depth Data with Extreme Learning MachineFast Object Recognition from 3D Depth Data with Extreme Learning Machine
Fast Object Recognition from 3D Depth Data with Extreme Learning Machine
 
Learning with classification and clustering, neural networks
Learning with classification and clustering, neural networksLearning with classification and clustering, neural networks
Learning with classification and clustering, neural networks
 
K-Means Algorithm
K-Means AlgorithmK-Means Algorithm
K-Means Algorithm
 
Lec11 object-re-id
Lec11 object-re-idLec11 object-re-id
Lec11 object-re-id
 
TensorFlow and Deep Learning Tips and Tricks
TensorFlow and Deep Learning Tips and TricksTensorFlow and Deep Learning Tips and Tricks
TensorFlow and Deep Learning Tips and Tricks
 

Mais de 志明 陳

RecSys'19 SMORe
RecSys'19 SMOReRecSys'19 SMORe
RecSys'19 SMORe志明 陳
 
ML Toolkit Share
ML Toolkit ShareML Toolkit Share
ML Toolkit Share志明 陳
 
CM NCCU Class2
CM NCCU Class2CM NCCU Class2
CM NCCU Class2志明 陳
 
CM NCCU Class1
CM NCCU Class1CM NCCU Class1
CM NCCU Class1志明 陳
 
CM UTaipei Kaggle Share
CM UTaipei Kaggle ShareCM UTaipei Kaggle Share
CM UTaipei Kaggle Share志明 陳
 
MLDM CM Kaggle Tips
MLDM CM Kaggle TipsMLDM CM Kaggle Tips
MLDM CM Kaggle Tips志明 陳
 
CM KaggleTW Share
CM KaggleTW ShareCM KaggleTW Share
CM KaggleTW Share志明 陳
 
KAMERA 2nd solution
KAMERA 2nd solutionKAMERA 2nd solution
KAMERA 2nd solution志明 陳
 
PIXNET Page Views 1st solution
PIXNET Page Views 1st solutionPIXNET Page Views 1st solution
PIXNET Page Views 1st solution志明 陳
 

Mais de 志明 陳 (10)

RecSys'19 SMORe
RecSys'19 SMOReRecSys'19 SMORe
RecSys'19 SMORe
 
Oop
OopOop
Oop
 
ML Toolkit Share
ML Toolkit ShareML Toolkit Share
ML Toolkit Share
 
CM NCCU Class2
CM NCCU Class2CM NCCU Class2
CM NCCU Class2
 
CM NCCU Class1
CM NCCU Class1CM NCCU Class1
CM NCCU Class1
 
CM UTaipei Kaggle Share
CM UTaipei Kaggle ShareCM UTaipei Kaggle Share
CM UTaipei Kaggle Share
 
MLDM CM Kaggle Tips
MLDM CM Kaggle TipsMLDM CM Kaggle Tips
MLDM CM Kaggle Tips
 
CM KaggleTW Share
CM KaggleTW ShareCM KaggleTW Share
CM KaggleTW Share
 
KAMERA 2nd solution
KAMERA 2nd solutionKAMERA 2nd solution
KAMERA 2nd solution
 
PIXNET Page Views 1st solution
PIXNET Page Views 1st solutionPIXNET Page Views 1st solution
PIXNET Page Views 1st solution
 

Último

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 

Último (20)

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 

NCCU Ensemble Models Predict MOOC Dropout

  • 1. Team: NCCU A Linear Ensemble of Classification Models with Novel Backward Cumulative Features for MOOC Dropout Prediction Chih-Ming Chen, Man-Kwan Shan,
 Ming-Feng Tsai, Yi-Hsuan Yang,
 Hsin-Ping Chen, Pei-Wen Yeh,
 and Sin-Ya Peng Research Center for
 Information Technology Innovation,
 Academia Sinica Department of Computer Science, National Chengchi University
  • 2. Team: NCCU A Linear Ensemble of Classification Models with Novel Backward Cumulative Features for MOOC Dropout Prediction Chih-Ming Chen, Man-Kwan Shan,
 Ming-Feng Tsai, Yi-Hsuan Yang,
 Hsin-Ping Chen, Pei-Wen Yeh,
 and Sin-Ya Peng Research Center for
 Information Technology Innovation,
 Academia Sinica Department of Computer Science, National Chengchi University linearly combination of several models the proposed data engineering method it’s able to generate a bunch of distinct feature sets
  • 3. Key Point Summary Latent Space Representation
 — Clustering Model
 — Skip-Gram Model Backward Cumulative Features
 — Generate 30 distinct sets of features Linear Model
 +
 Tree-based Model alleviate the feature sparsity problem alleviate the bias problem of statistical feature good match weakness when using sparse feature
  • 4. Workflow Train Data (75%) Validate Data (25%) Train Data 0" 50000" 100000" 150000" 200000" 10/27/2013" 11/27/2013" 12/27/2013"1/27/2014"2/27/2014" 3/27/2014"4/27/2014" 5/27/2014"6/27/2014" 7/27/2014" Training'Date'Distribu.on 0" 20000" 40000" 60000" 80000" 100000" 120000" 140000" 10/27/2013" 11/27/2013" 12/27/2013"1/27/2014"2/27/2014" 3/27/2014"4/27/2014" 5/27/2014"6/27/2014" 7/27/2014" Tes.ng'Date'Distribu.on Split the training data based on
 the time distribution.
 stable results — 2 settings
  • 5. Workflow Train Data (75%) Validate Data (25%) Learned
 Model Offline Evaluation Test Data Train Data Submission method 1 cross-validation — 2 settings check if it leads to better performance
  • 6. Workflow Train Data (75%) Validate Data (25%) Learned
 Model Offline Evaluation Test Data Train Data Learned
 Model Submission method 2 — 2 settings check if it leads to better performance
  • 7. Workflow Train Data (75%) Validate Data (25%) Learned
 Model Offline Evaluation Test Data Train Data Learned
 Model Submission — 2 settings
  • 8. Prediction Model Overview Logistic Regression Gradient Boosting Classifier Raw
 Data Support Vector
 Classifier Student Course Time A classical approach to a general prediction task. — 2 solutions Features
  • 9. Prediction Model Overview Logistic Regression Gradient Boosting Classifier Gradient Boosting Decision Trees Raw
 Data Support Vector
 Classifier Student Course Time Backward Cumulation the feature engineering towards the MOOC dataset. — 2 solutions Features
  • 10. Prediction Model Overview Logistic Regression Gradient Boosting Classifier Gradient Boosting Decision Trees Linear
 Combination Final
 Prediction Raw
 Data Support Vector
 Classifier Student Course Time Backward Cumulation — 2 solutions Features
  • 11. Prediction Model Overview Logistic Regression Gradient Boosting Classifier Gradient Boosting Decision Trees Linear
 Combination Final
 Prediction Raw
 Data Support Vector
 Classifier Student Course Time Backward Cumulation solution 1 solution 2 xgboost scikit-learn — 2 solutions http://scikit-learn.org/stable/https://github.com/dmlc/xgboost
  • 12. Prediction Model Overview Logistic Regression Gradient Boosting Classifier Gradient Boosting Decision Trees Linear
 Combination Final
 Prediction Raw
 Data Support Vector
 Classifier Student Course Time Backward Cumulation Feature Extraction / Feature Engineering — 2 solutions
  • 13. Feature Extraction Student Course Time Enrolment ID • Bag-of-words Feature
 — Boolean (0/1)
 — Term Frequency (TF) • Probability Value
 — Ratio
 — Naive Bayes • Latent Space Feature
 — Clustering
 — DeepWalk Raw
 Data
  • 14. Feature Extraction Student Course Time Enrolment ID • Bag-of-words Feature
 — Boolean (0/1)
 — Term Frequency (TF) • Probability Value
 — Ratio
 — Naive Bayes • Latent Space Feature
 — Clustering
 — DeepWalk Raw
 Data describing the status e.g.
 the month of the course
 the number of registration video 5 problem 10 wiki 0 discussion 2 navigation 0
  • 15. Feature Extraction Student Course Time Enrolment ID • Bag-of-words Feature
 — Boolean (0/1)
 — Term Frequency (TF) • Probability Value
 — Ratio
 — Naive Bayes • Latent Space Feature
 — Clustering
 — DeepWalk Raw
 Data dropout probability P( dropout|containing objects ) := P(O1|dropout) … P(Od|dropout) O = {O1, O2, …, Od}objects: e.g.
 the dropout ratio of the course estimate the probability from observed data
  • 16. Feature Extraction Student Course Time Enrolment ID • Bag-of-words Feature
 — Boolean (0/1)
 — Term Frequency (TF) • Probability Value
 — Ratio
 — Naive Bayes • Latent Space Feature
 — Clustering
 — DeepWalk Raw
 Data Latent Topic K-means Clustering on
 1. registered courses
 2. containing objects some features are sparse DeepWalk / Skip-Gram
 for obtaining a dense
 feature representation

  • 17. DeepWalk https://github.com/phanein/deepwalk The Goal — Find the representation of each node of a graph. It’s an extension work of word2vec’s Skip-Gram model.
  • 18. DeepWalk https://github.com/phanein/deepwalk The Goal — Find the representation of each node of a graph. It’s an extension work of word2vec’s Skip-Gram model. The core is to model the context information.
 (in practical, the node’s neighbours) Similar objects are mapped into similar space.
  • 19. From DeepWalk to the MOOC Problem U1 U2 U3 Course A Course B Course C Course D Course E https://github.com/phanein/deepwalk
  • 20. From DeepWalk to the MOOC Problem U1 U2 U3 Course A Course B Course C Course D U1 Course B Course E Course C U2 U1 Random Walk https://github.com/phanein/deepwalk Treat Random Walks on heterogeneous graph
 as the sentence.
  • 21. From DeepWalk to the MOOC Problem U1 U2 U3 Course A Course B Course C Course D U1 Course B Course E Course C U2 U1 Random Walk https://github.com/phanein/deepwalk Treat Random Walks on heterogeneous graph
 as the sentence. U1 0.3 0.2 -0.1 0.5 -0.8 Course B 0.1 0.3 -0.5 1.2 -0.3
  • 22. Performance Bag-of-words Bag-of-words Probability Bag-of-words Probability Naive Bayes Bag-of-words Probability Naive Bayes Latent Space > 0.890 > 0.901 > 0.902 > 0.903 Backward Cumulation Models Combination
  • 23. Backward Cumulative Features — Motivation O X O X O O X O X X X X X X O X X X X O X X X X X O O O X X X X X X X X X X X X X O X O X Logs Table 10/13 10/14 10/15 10/16 10/17 10/18 10/19 10/20 10/21 10/22 10/23 10/24 10/25 10/26 10/27 U1 U2 U3
  • 24. Backward Cumulative Features — Motivation O X O X O O X O O X X X X X O X X X X O X X X X X O O O X X X X X X X X X X X X X O X O X different period Logs Table different number of logs 10/13 10/14 10/15 10/16 10/17 10/18 10/19 10/20 10/21 10/22 10/23 10/24 10/25 10/26 10/27 U1 U2 U3
  • 25. Backward Cumulative Features O X O X O O X O X X X X X X O X X X X O X X X X X O O O X X X X X X X X X X X X X O X O X Consider only the logs in last N days. N=2 10/13 10/14 10/15 10/16 10/17 10/18 10/19 10/20 10/21 10/22 10/23 10/24 10/25 10/26 10/27 U1 U2 U3
  • 26. Backward Cumulative Features O X O X O O X O X X X X X X O X X X X O X X X X X O O O X X X X X X X X X X X X X O X O X Consider only the logs in last N days. N=3 10/13 10/14 10/15 10/16 10/17 10/18 10/19 10/20 10/21 10/22 10/23 10/24 10/25 10/26 10/27 U1 U2 U3
  • 27. Backward Cumulative Features O X O X O O X O X X X X X X O X X X X O X X X X X O O O X X X X X X X X X X X X X O X O X Consider only the logs in last N days. N=4 10/13 10/14 10/15 10/16 10/17 10/18 10/19 10/20 10/21 10/22 10/23 10/24 10/25 10/26 10/27 U1 U2 U3
  • 28. Backward Cumulative Features O X O X O O X O X X X X X X O X X X X O X X X X X O O O X X X X X X X X X X X X X O X O X Consider only the logs in last N days. N=5 10/13 10/14 10/15 10/16 10/17 10/18 10/19 10/20 10/21 10/22 10/23 10/24 10/25 10/26 10/27 U1 U2 U3
  • 29. Backward Cumulative Features Raw
 Data Student Course Time Backward Cumulation . . . Feature Set 1 N=1 N=2 N=3 N=29 N=30 Feature Set 2 Feature Set 3 . . . Feature Set 29 Feature Set 30 — 2 strategies
  • 30. Backward Cumulative Features Raw
 Data Student Course Time Backward Cumulation . . . Feature Set 1 N=1 N=2 N=3 N=29 N=30 Feature Set 2 Feature Set 3 . . . Feature Set 29 Feature Set 30 Classifier Strategy 1.
 Concatenate all features. — 2 strategies
  • 31. Backward Cumulative Features Raw
 Data Student Course Time Backward Cumulation . . . Feature Set 1 N=1 N=2 N=3 N=29 N=30 Feature Set 2 Feature Set 3 . . . Feature Set 29 Feature Set 30 Classifier Classifier Classifier Classifier Classifier Strategy 2.
 Build 30 distinct models. Average — 2 strategies
  • 32. Prediction Model Overview Logistic Regression Gradient Boosting Classifier Gradient Boosting Decision Trees Linear
 Combination Final
 Prediction Raw
 Data Support Vector
 Classifier Student Course Time Backward Cumulation solution 1 * 0.5 solution 2 * 0.5 xgboost scikit-learn
  • 33. What We Learned from the Competition • Team Work is important
 — share ideas
 — share solutions • Model diversity & feature diversity
 — diverse models / features can capture different characteristic of the data • Realize the data
 — the goal
 — the evaluation metric
 — the data structure
  • 34. What We Learned from the Competition • Team Work is important
 — share ideas
 — share solutions • Model diversity & feature diversity
 — diverse models / features can capture different characteristic of the data • Realize the data
 — the goal
 — the evaluation metric
 — the data structure Start earlier …
  • 35. What We Learned from the Competition • Team Work is important
 — share ideas
 — share solutions • Model diversity & feature diversity
 — diverse models / features can capture different characteristic of the data • Realize the data
 — the goal
 — the evaluation metric
 — the data structure Start earlier … Feature Format Data Partition Feature Scale several things to be discussed e.g.