SlideShare uma empresa Scribd logo
1 de 24
Baixar para ler offline
Deep Learning to help student’s
Deep Learning
Hak Kim | AI Team
The return of the MOOC
● Education is no longer a one-time event but
a lifelong experience
● Working lives are now so lengthy and
fast-changing
● People must be able to acquire new skill
throughout their careers
● MOOC is broadening access to cutting-edge
vocational subjects
Backgrounds
Hak Kim |
Challenges
● Significant increase in student numbers
● Impracticality to assess each student’s
level of engagement by human instructors
● Increasingly fast dev and update cycle of
course contents
● Diverse demographics of students in each
online classroom
In the World of MOOCs
Hak Kim |
● Challenging problem to forecast the future
performance of students as they interact
with online coursework
● Reliable early-stage prediction of a student’s
future performance is critical to facilitate
timely (educational) interventions during a
course
Only a few prior works have explored the problem
from a deep learning perspective!
Student Perf. Prediction
Hak Kim |
GritNet
Graduate (user 1122)
● First views four lecture videos, then gets a quiz correct
● In the subsequent 13 events, watches a series of videos, answers quizzes
and submits first project three times and finally get it passed
Drop-out (user 2214)
● Views four lectures sparsely and fails the first project three times in a row
With this raw student activity data, GritNet can make a prediction as to
whether or not student would graduate (or any other student outcomes)
Student Event Sequence
Hak Kim |
Input
● Feed time-stamped students’ raw event sequence
(w/o engineered features)
● Encode them by one-hot encoding of possible event tuple
{action, timestamp} - to reserve ordering information and to
capture student’s learning speed
Output
● Probability of a student performance
● E.g., graduation or any other student outcomes
Model
● GMP layer is able to capture most relevant part of the input
event sequence and to deal with imbalanced nature of data!
GritNet
Hak Kim |
Training objective is the negative log likelihood of the observed
event sequence of student activities
● Binary cross entropy is minimized using stochastic gradient descent on mini-batches (size of 32)
Model optimization
● BLSTM layers of 128 cell dimensions per direction
● Embedding layer dimension grid-searched
● Dropout applied to the BLSTM output
● Note: Baseline model (logistic regression based) was also optimized extensively
Trained a different model for different weeks based on student’s week-by-week event records
GritNet Training
Hak Kim |
On Udacity Data
● ND-A program: 777 graduated out of 1,853 students (41.9%)
● ND-B program: 1,005 graduated from 8,301 students (12.1%)
Results
● Compared student graduation prediction accuracy in terms of
mean area under the ROC curve (AUC) over 5-fold cross validation
● GritNet consistently outperforms the baseline and provides
superior performance by more than 5% abs on both ND-{A,B}
dataset
● On ND-B dataset, the baseline model requires a wait of two
months to reach the same performance that the GritNet shows
within three weeks
I.e., improvements of GritNet are substantially pronounced in the first few
weeks when accurate predictions are most challenging!
Prediction Performance
5.3% abs (7.7% rel)
5% abs
Hak Kim |
Intermission
GritNet is the current state of the art for student prediction problem
● P1: Does not need any feature engineering (learns from raw input)
● P2: Can operate on any student event data associated with a timestamp
(even when highly imbalanced)
Hak Kim |
Remaining challenges
● Significant increase in student numbers
● Impracticality to assess each student’s
level of engagement by human instructors
● Increasingly fast dev and update cycle of
course contents
● Diverse demographics of students in each
online classroom
In the World of MOOCs
Hak Kim |
What’s Next
● So far GritNet trained and tested on the same course
data
● Real-time prediction during on-going course
(before the course finishes)
Hak Kim |
Real-Time Prediction
Observation
● GMP layer output captures generalizable sequential
representation of input event sequence
● FC layer learns the course-specific features
GritNet can be viewed as a sequence-level feature (or
embedding) extractor plus a classifier
Intuition
Hak Kim |
Unsupervised DA framework
● Trained from previous (source) course
but to be deployed on another (target) course
● E.g., v1 → v2, existing → newly-launched ND program
Ordinal Input Encoding
● Group (raw) content IDs into subcategories
● Normalize them using natural ordering implicit
in the content paths within each category
Domain Adaptation
Hak Kim |
Domain Adaptation
Hak Kim |
Unsupervised DA framework
● Trained from previous (source) course
but to be deployed on another (target) course
● E.g., v1 → v2, existing → newly-launched ND program
Ordinal Input Encoding
● Group (raw) content IDs into subcategories
● Normalize them using natural ordering implicit
in the content paths within each category
Pseudo-Labels generation
● No target labels from the target course
● Use a trained GritNet to assign hard
one-hot labels (Step 4)
Domain Adaptation
(cont’d)
Hak Kim |
Transfer Learning
● Continue training the last FC layer on the
target course while keeping the GritNet
core fixed (Step 6 and Step 7)
● The last FC layer limits the number of
params to learn
Very practical solution for faster adaptation time and applicability for even a smaller size target course!
Domain Adaptation
(cont’d)
Hak Kim |
Udacity Data
Setup
● Vanilla baseline: LR trained on source course, evaluated on target course
● GritNet baseline: GritNet trained on source course, evaluated on target course
○ BLSTM layers of 256 cell dimensions per direction and embedding layer of 512 dimension
● Domain adaptation
● Oracle: use oracle target labels instead of pseudo-labels
Experimental Setup
Hak Kim |
Generalization Performance
Hak Kim |
70.60% ARR
58.06% ARR
Up to 84.88% ARR at week 3Up to 75.07% ARR at week 3
Results
● AUC Recovery Rate (ARR): AUC reduction that unsupervised domain adaptation provides divided by
the absolute reduction that oracle training produces
○ Clear wins of GritNet baseline vs. vanilla baseline
○ Domain adaptation enhances real-time predictions explicitly in the first few weeks!
Summary
GritNet is the current state of the art for student prediction problem
● P1: Does not need any feature engineering (learns from raw input)
● P2: Can operate on any student event data associated with a timestamp
(even when highly imbalanced)
● P3: Can be transferred to new courses without any (students’ outcome) label!
Full papers available on arXiv (GritNet, GritNet2)
Hak Kim |
Looking ahead
Is a student likely to benefit from (sequential) interventions?
Student, X Intervention, T
Learning outcomes, Y
? ... ...
... ...
Hak Kim |
Wrap Up
Questions?
hak@udacity.com
Hak Kim |
Thank You!

Mais conteúdo relacionado

Semelhante a [243] Deep Learning to help student’s Deep Learning

CAPP and Curriculum Management at HCT
CAPP and Curriculum Management at HCTCAPP and Curriculum Management at HCT
CAPP and Curriculum Management at HCTKhalid Tariq
 
face-recognition-attendance-system.ppt.pptx
face-recognition-attendance-system.ppt.pptxface-recognition-attendance-system.ppt.pptx
face-recognition-attendance-system.ppt.pptxjadisaipavan8
 
Technical Debt Management
Technical Debt ManagementTechnical Debt Management
Technical Debt ManagementMark Niebergall
 
final_proposal_defence.pptx
final_proposal_defence.pptxfinal_proposal_defence.pptx
final_proposal_defence.pptxAmanRegmi
 
Vjai paper reading201808-acl18-simple-and_effective multi-paragraph reading c...
Vjai paper reading201808-acl18-simple-and_effective multi-paragraph reading c...Vjai paper reading201808-acl18-simple-and_effective multi-paragraph reading c...
Vjai paper reading201808-acl18-simple-and_effective multi-paragraph reading c...Dat Nguyen
 
Webinar: How We Evaluated MongoDB as a Relational Database Replacement
Webinar: How We Evaluated MongoDB as a Relational Database ReplacementWebinar: How We Evaluated MongoDB as a Relational Database Replacement
Webinar: How We Evaluated MongoDB as a Relational Database ReplacementMongoDB
 
Online course registration system development software engineering project pr...
Online course registration system development software engineering project pr...Online course registration system development software engineering project pr...
Online course registration system development software engineering project pr...MD.HABIBUR Rahman
 
Fostering pre-university student participation in OSGeo through the Google Co...
Fostering pre-university student participation in OSGeo through the Google Co...Fostering pre-university student participation in OSGeo through the Google Co...
Fostering pre-university student participation in OSGeo through the Google Co...Jeff McKenna
 
NS-CUK Seminar: J.H.Lee, Review on "Task Relation-aware Continual User Repres...
NS-CUK Seminar: J.H.Lee, Review on "Task Relation-aware Continual User Repres...NS-CUK Seminar: J.H.Lee, Review on "Task Relation-aware Continual User Repres...
NS-CUK Seminar: J.H.Lee, Review on "Task Relation-aware Continual User Repres...ssuser4b1f48
 
Flipped-Hybrid Project Based Advanced Manufacturing Course design by Red Rock...
Flipped-Hybrid Project Based Advanced Manufacturing Course design by Red Rock...Flipped-Hybrid Project Based Advanced Manufacturing Course design by Red Rock...
Flipped-Hybrid Project Based Advanced Manufacturing Course design by Red Rock...cccschamp
 
On Using Cloud Computing to Support Online Courses
On Using Cloud Computing to Support Online CoursesOn Using Cloud Computing to Support Online Courses
On Using Cloud Computing to Support Online CoursesGermán Moltó
 
EduHPC-15 final paper
EduHPC-15 final paperEduHPC-15 final paper
EduHPC-15 final paperMaha Aziz
 
Stefan Mohacsi - 15 years of Evolving Model-Based Testing - EuroSTAR 2012
Stefan Mohacsi - 15 years of Evolving Model-Based Testing - EuroSTAR 2012Stefan Mohacsi - 15 years of Evolving Model-Based Testing - EuroSTAR 2012
Stefan Mohacsi - 15 years of Evolving Model-Based Testing - EuroSTAR 2012TEST Huddle
 
Relational knowledge distillation
Relational knowledge distillationRelational knowledge distillation
Relational knowledge distillationNAVER Engineering
 
Software development life cycles (sdlc)
Software development life cycles (sdlc)Software development life cycles (sdlc)
Software development life cycles (sdlc)Yuriy Kravchenko
 
Transitioning to Lightning: End User Training Programs
Transitioning to Lightning: End User Training ProgramsTransitioning to Lightning: End User Training Programs
Transitioning to Lightning: End User Training ProgramsTargetX
 

Semelhante a [243] Deep Learning to help student’s Deep Learning (20)

CAPP and Curriculum Management at HCT
CAPP and Curriculum Management at HCTCAPP and Curriculum Management at HCT
CAPP and Curriculum Management at HCT
 
face-recognition-attendance-system.ppt.pptx
face-recognition-attendance-system.ppt.pptxface-recognition-attendance-system.ppt.pptx
face-recognition-attendance-system.ppt.pptx
 
Technical Debt Management
Technical Debt ManagementTechnical Debt Management
Technical Debt Management
 
final_proposal_defence.pptx
final_proposal_defence.pptxfinal_proposal_defence.pptx
final_proposal_defence.pptx
 
Project template
Project templateProject template
Project template
 
ShobhaResume
ShobhaResumeShobhaResume
ShobhaResume
 
Vjai paper reading201808-acl18-simple-and_effective multi-paragraph reading c...
Vjai paper reading201808-acl18-simple-and_effective multi-paragraph reading c...Vjai paper reading201808-acl18-simple-and_effective multi-paragraph reading c...
Vjai paper reading201808-acl18-simple-and_effective multi-paragraph reading c...
 
Webinar: How We Evaluated MongoDB as a Relational Database Replacement
Webinar: How We Evaluated MongoDB as a Relational Database ReplacementWebinar: How We Evaluated MongoDB as a Relational Database Replacement
Webinar: How We Evaluated MongoDB as a Relational Database Replacement
 
Online course registration system development software engineering project pr...
Online course registration system development software engineering project pr...Online course registration system development software engineering project pr...
Online course registration system development software engineering project pr...
 
Fostering pre-university student participation in OSGeo through the Google Co...
Fostering pre-university student participation in OSGeo through the Google Co...Fostering pre-university student participation in OSGeo through the Google Co...
Fostering pre-university student participation in OSGeo through the Google Co...
 
NS-CUK Seminar: J.H.Lee, Review on "Task Relation-aware Continual User Repres...
NS-CUK Seminar: J.H.Lee, Review on "Task Relation-aware Continual User Repres...NS-CUK Seminar: J.H.Lee, Review on "Task Relation-aware Continual User Repres...
NS-CUK Seminar: J.H.Lee, Review on "Task Relation-aware Continual User Repres...
 
Flipped-Hybrid Project Based Advanced Manufacturing Course design by Red Rock...
Flipped-Hybrid Project Based Advanced Manufacturing Course design by Red Rock...Flipped-Hybrid Project Based Advanced Manufacturing Course design by Red Rock...
Flipped-Hybrid Project Based Advanced Manufacturing Course design by Red Rock...
 
On Using Cloud Computing to Support Online Courses
On Using Cloud Computing to Support Online CoursesOn Using Cloud Computing to Support Online Courses
On Using Cloud Computing to Support Online Courses
 
Using cosmic in agile projects
Using cosmic in agile projectsUsing cosmic in agile projects
Using cosmic in agile projects
 
Webcat
WebcatWebcat
Webcat
 
EduHPC-15 final paper
EduHPC-15 final paperEduHPC-15 final paper
EduHPC-15 final paper
 
Stefan Mohacsi - 15 years of Evolving Model-Based Testing - EuroSTAR 2012
Stefan Mohacsi - 15 years of Evolving Model-Based Testing - EuroSTAR 2012Stefan Mohacsi - 15 years of Evolving Model-Based Testing - EuroSTAR 2012
Stefan Mohacsi - 15 years of Evolving Model-Based Testing - EuroSTAR 2012
 
Relational knowledge distillation
Relational knowledge distillationRelational knowledge distillation
Relational knowledge distillation
 
Software development life cycles (sdlc)
Software development life cycles (sdlc)Software development life cycles (sdlc)
Software development life cycles (sdlc)
 
Transitioning to Lightning: End User Training Programs
Transitioning to Lightning: End User Training ProgramsTransitioning to Lightning: End User Training Programs
Transitioning to Lightning: End User Training Programs
 

Mais de NAVER D2

[211] 인공지능이 인공지능 챗봇을 만든다
[211] 인공지능이 인공지능 챗봇을 만든다[211] 인공지능이 인공지능 챗봇을 만든다
[211] 인공지능이 인공지능 챗봇을 만든다NAVER D2
 
[233] 대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing: Maglev Hashing Scheduler i...
[233] 대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing: Maglev Hashing Scheduler i...[233] 대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing: Maglev Hashing Scheduler i...
[233] 대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing: Maglev Hashing Scheduler i...NAVER D2
 
[215] Druid로 쉽고 빠르게 데이터 분석하기
[215] Druid로 쉽고 빠르게 데이터 분석하기[215] Druid로 쉽고 빠르게 데이터 분석하기
[215] Druid로 쉽고 빠르게 데이터 분석하기NAVER D2
 
[245]Papago Internals: 모델분석과 응용기술 개발
[245]Papago Internals: 모델분석과 응용기술 개발[245]Papago Internals: 모델분석과 응용기술 개발
[245]Papago Internals: 모델분석과 응용기술 개발NAVER D2
 
[236] 스트림 저장소 최적화 이야기: 아파치 드루이드로부터 얻은 교훈
[236] 스트림 저장소 최적화 이야기: 아파치 드루이드로부터 얻은 교훈[236] 스트림 저장소 최적화 이야기: 아파치 드루이드로부터 얻은 교훈
[236] 스트림 저장소 최적화 이야기: 아파치 드루이드로부터 얻은 교훈NAVER D2
 
[235]Wikipedia-scale Q&A
[235]Wikipedia-scale Q&A[235]Wikipedia-scale Q&A
[235]Wikipedia-scale Q&ANAVER D2
 
[244]로봇이 현실 세계에 대해 학습하도록 만들기
[244]로봇이 현실 세계에 대해 학습하도록 만들기[244]로봇이 현실 세계에 대해 학습하도록 만들기
[244]로봇이 현실 세계에 대해 학습하도록 만들기NAVER D2
 
[234]Fast & Accurate Data Annotation Pipeline for AI applications
[234]Fast & Accurate Data Annotation Pipeline for AI applications[234]Fast & Accurate Data Annotation Pipeline for AI applications
[234]Fast & Accurate Data Annotation Pipeline for AI applicationsNAVER D2
 
Old version: [233]대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing
Old version: [233]대형 컨테이너 클러스터에서의 고가용성 Network Load BalancingOld version: [233]대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing
Old version: [233]대형 컨테이너 클러스터에서의 고가용성 Network Load BalancingNAVER D2
 
[226]NAVER 광고 deep click prediction: 모델링부터 서빙까지
[226]NAVER 광고 deep click prediction: 모델링부터 서빙까지[226]NAVER 광고 deep click prediction: 모델링부터 서빙까지
[226]NAVER 광고 deep click prediction: 모델링부터 서빙까지NAVER D2
 
[225]NSML: 머신러닝 플랫폼 서비스하기 & 모델 튜닝 자동화하기
[225]NSML: 머신러닝 플랫폼 서비스하기 & 모델 튜닝 자동화하기[225]NSML: 머신러닝 플랫폼 서비스하기 & 모델 튜닝 자동화하기
[225]NSML: 머신러닝 플랫폼 서비스하기 & 모델 튜닝 자동화하기NAVER D2
 
[224]네이버 검색과 개인화
[224]네이버 검색과 개인화[224]네이버 검색과 개인화
[224]네이버 검색과 개인화NAVER D2
 
[216]Search Reliability Engineering (부제: 지진에도 흔들리지 않는 네이버 검색시스템)
[216]Search Reliability Engineering (부제: 지진에도 흔들리지 않는 네이버 검색시스템)[216]Search Reliability Engineering (부제: 지진에도 흔들리지 않는 네이버 검색시스템)
[216]Search Reliability Engineering (부제: 지진에도 흔들리지 않는 네이버 검색시스템)NAVER D2
 
[214] Ai Serving Platform: 하루 수 억 건의 인퍼런스를 처리하기 위한 고군분투기
[214] Ai Serving Platform: 하루 수 억 건의 인퍼런스를 처리하기 위한 고군분투기[214] Ai Serving Platform: 하루 수 억 건의 인퍼런스를 처리하기 위한 고군분투기
[214] Ai Serving Platform: 하루 수 억 건의 인퍼런스를 처리하기 위한 고군분투기NAVER D2
 
[213] Fashion Visual Search
[213] Fashion Visual Search[213] Fashion Visual Search
[213] Fashion Visual SearchNAVER D2
 
[232] TensorRT를 활용한 딥러닝 Inference 최적화
[232] TensorRT를 활용한 딥러닝 Inference 최적화[232] TensorRT를 활용한 딥러닝 Inference 최적화
[232] TensorRT를 활용한 딥러닝 Inference 최적화NAVER D2
 
[242]컴퓨터 비전을 이용한 실내 지도 자동 업데이트 방법: 딥러닝을 통한 POI 변화 탐지
[242]컴퓨터 비전을 이용한 실내 지도 자동 업데이트 방법: 딥러닝을 통한 POI 변화 탐지[242]컴퓨터 비전을 이용한 실내 지도 자동 업데이트 방법: 딥러닝을 통한 POI 변화 탐지
[242]컴퓨터 비전을 이용한 실내 지도 자동 업데이트 방법: 딥러닝을 통한 POI 변화 탐지NAVER D2
 
[212]C3, 데이터 처리에서 서빙까지 가능한 하둡 클러스터
[212]C3, 데이터 처리에서 서빙까지 가능한 하둡 클러스터[212]C3, 데이터 처리에서 서빙까지 가능한 하둡 클러스터
[212]C3, 데이터 처리에서 서빙까지 가능한 하둡 클러스터NAVER D2
 
[223]기계독해 QA: 검색인가, NLP인가?
[223]기계독해 QA: 검색인가, NLP인가?[223]기계독해 QA: 검색인가, NLP인가?
[223]기계독해 QA: 검색인가, NLP인가?NAVER D2
 
[231] Clova 화자인식
[231] Clova 화자인식[231] Clova 화자인식
[231] Clova 화자인식NAVER D2
 

Mais de NAVER D2 (20)

[211] 인공지능이 인공지능 챗봇을 만든다
[211] 인공지능이 인공지능 챗봇을 만든다[211] 인공지능이 인공지능 챗봇을 만든다
[211] 인공지능이 인공지능 챗봇을 만든다
 
[233] 대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing: Maglev Hashing Scheduler i...
[233] 대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing: Maglev Hashing Scheduler i...[233] 대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing: Maglev Hashing Scheduler i...
[233] 대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing: Maglev Hashing Scheduler i...
 
[215] Druid로 쉽고 빠르게 데이터 분석하기
[215] Druid로 쉽고 빠르게 데이터 분석하기[215] Druid로 쉽고 빠르게 데이터 분석하기
[215] Druid로 쉽고 빠르게 데이터 분석하기
 
[245]Papago Internals: 모델분석과 응용기술 개발
[245]Papago Internals: 모델분석과 응용기술 개발[245]Papago Internals: 모델분석과 응용기술 개발
[245]Papago Internals: 모델분석과 응용기술 개발
 
[236] 스트림 저장소 최적화 이야기: 아파치 드루이드로부터 얻은 교훈
[236] 스트림 저장소 최적화 이야기: 아파치 드루이드로부터 얻은 교훈[236] 스트림 저장소 최적화 이야기: 아파치 드루이드로부터 얻은 교훈
[236] 스트림 저장소 최적화 이야기: 아파치 드루이드로부터 얻은 교훈
 
[235]Wikipedia-scale Q&A
[235]Wikipedia-scale Q&A[235]Wikipedia-scale Q&A
[235]Wikipedia-scale Q&A
 
[244]로봇이 현실 세계에 대해 학습하도록 만들기
[244]로봇이 현실 세계에 대해 학습하도록 만들기[244]로봇이 현실 세계에 대해 학습하도록 만들기
[244]로봇이 현실 세계에 대해 학습하도록 만들기
 
[234]Fast & Accurate Data Annotation Pipeline for AI applications
[234]Fast & Accurate Data Annotation Pipeline for AI applications[234]Fast & Accurate Data Annotation Pipeline for AI applications
[234]Fast & Accurate Data Annotation Pipeline for AI applications
 
Old version: [233]대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing
Old version: [233]대형 컨테이너 클러스터에서의 고가용성 Network Load BalancingOld version: [233]대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing
Old version: [233]대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing
 
[226]NAVER 광고 deep click prediction: 모델링부터 서빙까지
[226]NAVER 광고 deep click prediction: 모델링부터 서빙까지[226]NAVER 광고 deep click prediction: 모델링부터 서빙까지
[226]NAVER 광고 deep click prediction: 모델링부터 서빙까지
 
[225]NSML: 머신러닝 플랫폼 서비스하기 & 모델 튜닝 자동화하기
[225]NSML: 머신러닝 플랫폼 서비스하기 & 모델 튜닝 자동화하기[225]NSML: 머신러닝 플랫폼 서비스하기 & 모델 튜닝 자동화하기
[225]NSML: 머신러닝 플랫폼 서비스하기 & 모델 튜닝 자동화하기
 
[224]네이버 검색과 개인화
[224]네이버 검색과 개인화[224]네이버 검색과 개인화
[224]네이버 검색과 개인화
 
[216]Search Reliability Engineering (부제: 지진에도 흔들리지 않는 네이버 검색시스템)
[216]Search Reliability Engineering (부제: 지진에도 흔들리지 않는 네이버 검색시스템)[216]Search Reliability Engineering (부제: 지진에도 흔들리지 않는 네이버 검색시스템)
[216]Search Reliability Engineering (부제: 지진에도 흔들리지 않는 네이버 검색시스템)
 
[214] Ai Serving Platform: 하루 수 억 건의 인퍼런스를 처리하기 위한 고군분투기
[214] Ai Serving Platform: 하루 수 억 건의 인퍼런스를 처리하기 위한 고군분투기[214] Ai Serving Platform: 하루 수 억 건의 인퍼런스를 처리하기 위한 고군분투기
[214] Ai Serving Platform: 하루 수 억 건의 인퍼런스를 처리하기 위한 고군분투기
 
[213] Fashion Visual Search
[213] Fashion Visual Search[213] Fashion Visual Search
[213] Fashion Visual Search
 
[232] TensorRT를 활용한 딥러닝 Inference 최적화
[232] TensorRT를 활용한 딥러닝 Inference 최적화[232] TensorRT를 활용한 딥러닝 Inference 최적화
[232] TensorRT를 활용한 딥러닝 Inference 최적화
 
[242]컴퓨터 비전을 이용한 실내 지도 자동 업데이트 방법: 딥러닝을 통한 POI 변화 탐지
[242]컴퓨터 비전을 이용한 실내 지도 자동 업데이트 방법: 딥러닝을 통한 POI 변화 탐지[242]컴퓨터 비전을 이용한 실내 지도 자동 업데이트 방법: 딥러닝을 통한 POI 변화 탐지
[242]컴퓨터 비전을 이용한 실내 지도 자동 업데이트 방법: 딥러닝을 통한 POI 변화 탐지
 
[212]C3, 데이터 처리에서 서빙까지 가능한 하둡 클러스터
[212]C3, 데이터 처리에서 서빙까지 가능한 하둡 클러스터[212]C3, 데이터 처리에서 서빙까지 가능한 하둡 클러스터
[212]C3, 데이터 처리에서 서빙까지 가능한 하둡 클러스터
 
[223]기계독해 QA: 검색인가, NLP인가?
[223]기계독해 QA: 검색인가, NLP인가?[223]기계독해 QA: 검색인가, NLP인가?
[223]기계독해 QA: 검색인가, NLP인가?
 
[231] Clova 화자인식
[231] Clova 화자인식[231] Clova 화자인식
[231] Clova 화자인식
 

Último

[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 

Último (20)

[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 

[243] Deep Learning to help student’s Deep Learning

  • 1. Deep Learning to help student’s Deep Learning Hak Kim | AI Team
  • 2. The return of the MOOC ● Education is no longer a one-time event but a lifelong experience ● Working lives are now so lengthy and fast-changing ● People must be able to acquire new skill throughout their careers ● MOOC is broadening access to cutting-edge vocational subjects Backgrounds Hak Kim |
  • 3. Challenges ● Significant increase in student numbers ● Impracticality to assess each student’s level of engagement by human instructors ● Increasingly fast dev and update cycle of course contents ● Diverse demographics of students in each online classroom In the World of MOOCs Hak Kim |
  • 4. ● Challenging problem to forecast the future performance of students as they interact with online coursework ● Reliable early-stage prediction of a student’s future performance is critical to facilitate timely (educational) interventions during a course Only a few prior works have explored the problem from a deep learning perspective! Student Perf. Prediction Hak Kim |
  • 6. Graduate (user 1122) ● First views four lecture videos, then gets a quiz correct ● In the subsequent 13 events, watches a series of videos, answers quizzes and submits first project three times and finally get it passed Drop-out (user 2214) ● Views four lectures sparsely and fails the first project three times in a row With this raw student activity data, GritNet can make a prediction as to whether or not student would graduate (or any other student outcomes) Student Event Sequence Hak Kim |
  • 7. Input ● Feed time-stamped students’ raw event sequence (w/o engineered features) ● Encode them by one-hot encoding of possible event tuple {action, timestamp} - to reserve ordering information and to capture student’s learning speed Output ● Probability of a student performance ● E.g., graduation or any other student outcomes Model ● GMP layer is able to capture most relevant part of the input event sequence and to deal with imbalanced nature of data! GritNet Hak Kim |
  • 8. Training objective is the negative log likelihood of the observed event sequence of student activities ● Binary cross entropy is minimized using stochastic gradient descent on mini-batches (size of 32) Model optimization ● BLSTM layers of 128 cell dimensions per direction ● Embedding layer dimension grid-searched ● Dropout applied to the BLSTM output ● Note: Baseline model (logistic regression based) was also optimized extensively Trained a different model for different weeks based on student’s week-by-week event records GritNet Training Hak Kim |
  • 9. On Udacity Data ● ND-A program: 777 graduated out of 1,853 students (41.9%) ● ND-B program: 1,005 graduated from 8,301 students (12.1%) Results ● Compared student graduation prediction accuracy in terms of mean area under the ROC curve (AUC) over 5-fold cross validation ● GritNet consistently outperforms the baseline and provides superior performance by more than 5% abs on both ND-{A,B} dataset ● On ND-B dataset, the baseline model requires a wait of two months to reach the same performance that the GritNet shows within three weeks I.e., improvements of GritNet are substantially pronounced in the first few weeks when accurate predictions are most challenging! Prediction Performance 5.3% abs (7.7% rel) 5% abs Hak Kim |
  • 10. Intermission GritNet is the current state of the art for student prediction problem ● P1: Does not need any feature engineering (learns from raw input) ● P2: Can operate on any student event data associated with a timestamp (even when highly imbalanced) Hak Kim |
  • 11. Remaining challenges ● Significant increase in student numbers ● Impracticality to assess each student’s level of engagement by human instructors ● Increasingly fast dev and update cycle of course contents ● Diverse demographics of students in each online classroom In the World of MOOCs Hak Kim |
  • 12. What’s Next ● So far GritNet trained and tested on the same course data ● Real-time prediction during on-going course (before the course finishes) Hak Kim |
  • 14. Observation ● GMP layer output captures generalizable sequential representation of input event sequence ● FC layer learns the course-specific features GritNet can be viewed as a sequence-level feature (or embedding) extractor plus a classifier Intuition Hak Kim |
  • 15. Unsupervised DA framework ● Trained from previous (source) course but to be deployed on another (target) course ● E.g., v1 → v2, existing → newly-launched ND program Ordinal Input Encoding ● Group (raw) content IDs into subcategories ● Normalize them using natural ordering implicit in the content paths within each category Domain Adaptation Hak Kim |
  • 16. Domain Adaptation Hak Kim | Unsupervised DA framework ● Trained from previous (source) course but to be deployed on another (target) course ● E.g., v1 → v2, existing → newly-launched ND program Ordinal Input Encoding ● Group (raw) content IDs into subcategories ● Normalize them using natural ordering implicit in the content paths within each category
  • 17. Pseudo-Labels generation ● No target labels from the target course ● Use a trained GritNet to assign hard one-hot labels (Step 4) Domain Adaptation (cont’d) Hak Kim |
  • 18. Transfer Learning ● Continue training the last FC layer on the target course while keeping the GritNet core fixed (Step 6 and Step 7) ● The last FC layer limits the number of params to learn Very practical solution for faster adaptation time and applicability for even a smaller size target course! Domain Adaptation (cont’d) Hak Kim |
  • 19. Udacity Data Setup ● Vanilla baseline: LR trained on source course, evaluated on target course ● GritNet baseline: GritNet trained on source course, evaluated on target course ○ BLSTM layers of 256 cell dimensions per direction and embedding layer of 512 dimension ● Domain adaptation ● Oracle: use oracle target labels instead of pseudo-labels Experimental Setup Hak Kim |
  • 20. Generalization Performance Hak Kim | 70.60% ARR 58.06% ARR Up to 84.88% ARR at week 3Up to 75.07% ARR at week 3 Results ● AUC Recovery Rate (ARR): AUC reduction that unsupervised domain adaptation provides divided by the absolute reduction that oracle training produces ○ Clear wins of GritNet baseline vs. vanilla baseline ○ Domain adaptation enhances real-time predictions explicitly in the first few weeks!
  • 21. Summary GritNet is the current state of the art for student prediction problem ● P1: Does not need any feature engineering (learns from raw input) ● P2: Can operate on any student event data associated with a timestamp (even when highly imbalanced) ● P3: Can be transferred to new courses without any (students’ outcome) label! Full papers available on arXiv (GritNet, GritNet2) Hak Kim |
  • 22. Looking ahead Is a student likely to benefit from (sequential) interventions? Student, X Intervention, T Learning outcomes, Y ? ... ... ... ... Hak Kim |