Automatic classification of students in online
courses using machine learning techniques
D. Monllao Olive
School of Computer Science and Software Engineering
The University of Western Australia
Crawley WA 6009, AUSTRALIA
Principal supervisor: Dr Du Huynh
Co-supervisor: Assoc/Prof Mark Reynolds
External supervisor: Dr Martin Dougiamas
Master of Philosophy - part-time student
Contents
1. Problem description
1.1. Online education and Moodle
1.2. Detection of students at risk of dropping out of courses
1.3. Students’ engagement in online courses
2. Literature review
3. Aim
4. Progress
5. Methodology
6. Timeline
Online education and Moodle
● Traditional education is better for social interaction
● Online education offers more flexibility but relies more on self-discipline
● Moodle stats (May 2017, https://moodle.net/stats/)
○ 103 million users worldwide
○ 12 million courses
○ 215 million forum posts
○ 589 million quiz questions
https://moodle.org/logo/
Students at risk of dropping out of courses
● Students who are not engaged in the course don’t participate
● Several stakeholders are interested in reducing dropout rates in online courses
○ Students
○ Teachers
○ Educational institutions
https://www.thehrdigest.com/wp-content/uploads/2016/03/college-degree.jpg
Students’ engagement in online courses
● Engaged students participate in the course activities
● Engagement is not as easy to detect in online courses as in face-to-face
education
● Some examples of engagement indicators:
○ Regular accesses to the course
○ Replies to other course participants’ forum posts
○ Quick replies to the teacher’s feedback
○ Percentage of accessed course resources
https://userscontent2.emaze.com/images/f5e5a8b9-e038-4620-b54f-b935902facd9/cdeb1f7d712f03715e4b0aed325ee967.jpg
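As an illustration of how two of the indicators listed above could be computed, here is a minimal sketch. The log format, the resource names and both function names are assumptions made for this example; they are not taken from Moodle or from the analytics API described in this presentation.

```python
from datetime import date

# Hypothetical, simplified activity log for one student in one course:
# (day of the access, identifier of the resource accessed).
log = [
    (date(2017, 5, 1), "quiz1"),
    (date(2017, 5, 1), "forum"),
    (date(2017, 5, 3), "page1"),
    (date(2017, 5, 8), "quiz1"),
]
course_resources = {"quiz1", "forum", "page1", "page2"}


def percent_resources_accessed(log, resources):
    """Percentage of the course resources the student opened at least once."""
    accessed = {res for _, res in log if res in resources}
    return 100.0 * len(accessed) / len(resources)


def active_weeks_ratio(log, start, weeks):
    """Fraction of course weeks with at least one access (a regularity proxy)."""
    active = {(day - start).days // 7 for day, _ in log}
    active = {week for week in active if 0 <= week < weeks}
    return len(active) / weeks


print(percent_resources_accessed(log, course_resources))  # 75.0
print(active_weeks_ratio(log, date(2017, 5, 1), 4))  # 0.5
```

Both values are normalised by course size or course length, which is one way to keep an indicator comparable across courses of different shapes.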
Literature review - Learning analytics
1. From an educational point of view
● Description of online students’ engagement indicators [1] [2]
● Factor analysis and correlations between indicators and student retention
● Limitations:
○ Not very empirically rigorous
○ Limited datasets studied, results biased towards a few courses
○ Indicators are correlated with retention individually, not in combination
[1] Katrina A. Meyer. Student engagement in online learning: What works and why. ASHE Higher Education Report, 40(6):1–114, 2014.
[2] Kate S. Hone and Ghada R. El Said. Exploring the factors affecting MOOC retention: A survey study. Computers & Education, 98:157–168, 2016.
Literature review - Educational data mining
2. From a data mining point of view
● Some recent studies use machine learning techniques such as decision trees, association rules or evolutionary algorithms [3] [4]
● Limitations:
○ Limited datasets studied
○ Basic student engagement indicators
[3] Carlos Marquez-Vera, Alberto Cano, Cristobal Romero, Amin Yousef Mohammad Noaman, Habib Mousa Fardoun, and Sebastian Ventura. Early dropout prediction using data mining: a case study with high school students. Expert Systems, 33(1):107–124, 2016.
[4] J. M. Luna, C. Castro, and C. Romero. MDM tool: A data mining framework integrated into Moodle. Computer Applications in Engineering Education, 25(1):90–102, 2017.
Aim
To find the model that best predicts students at risk of dropping out of
any ongoing Moodle course.
Aim - How to achieve it?
● By using multiple and different institutions’ datasets
○ To prevent the model from overfitting to courses of a particular institution or format
● By selecting a subset of the student engagement indicators described in the literature
○ To discard indicators that don’t correlate well
● By adding course information to the training dataset
○ To make the model adaptable to all sorts of courses
● By limiting the studied activity logs to the most relevant time range
○ To improve model accuracy
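One simple way to discard indicators that don’t correlate well is to compute the Pearson correlation between each indicator and the dropout label and keep only those above a cut-off. The sketch below assumes this approach; the indicator names, the values and the 0.5 threshold are all made up for illustration and are not results from this research.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / sqrt(var_x * var_y)


# Made-up values for six enrolments; 1 = dropped out, 0 = did not drop out.
dropped_out = [1, 1, 1, 0, 0, 0]
indicators = {
    "weekly_accesses": [0, 1, 1, 5, 6, 7],
    "profile_picture": [1, 0, 1, 0, 1, 0],
}

THRESHOLD = 0.5  # hypothetical cut-off, chosen only for this example
selected = [
    name
    for name, values in indicators.items()
    if abs(pearson(values, dropped_out)) >= THRESHOLD
]
print(selected)  # ['weekly_accesses']
```

The absolute value is used because a strong negative correlation (more accesses, fewer dropouts) is as informative as a positive one.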
Progress
● Student engagement indicators literature review
● Moodle analytics API developed (https://github.com/moodlehq/moodle-tool_inspire)
○ Machine learning backend plugins. Shipped with Python (TensorFlow) and PHP (php-ml)
○ Highly extensible
○ Prototype: http://prototype.moodle.net/inspirephase1/
○ Experimental model included: Students at risk of dropping out
● Contributions to the most popular PHP machine learning library
○ https://github.com/php-ai/php-ml/graphs/contributors
http://php-ml.readthedocs.io/en/latest/assets/php-ml-logo.png
Progress - Analytics API - Data flow chart
Progress - Prototype - Models list
Progress - Prototype - Predictions list
Progress - Prototype - Prediction details
Methodology - Overview
1. Training dataset preparation from raw Moodle site data
○ One sample for each student enrolment in each course of each Moodle site
○ Features: calculated student engagement indicators, course information and
the time range of the included activity logs
○ Label: Did the student drop out of the course?
○ Output: A .csv file
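The dataset layout described above can be sketched as follows. The column names and values are hypothetical; the real feature set depends on the indicators and course information that are finally selected.

```python
import csv
import io

# One row per student enrolment in each course of each site.
# All names and values below are invented for illustration.
FIELDS = [
    "site", "course", "student",           # identifies the enrolment
    "weekly_accesses", "forum_replies",    # engagement indicators
    "course_weeks",                        # course information
    "log_range_weeks",                     # time range of the logs used
    "dropped_out",                         # label: 1 = dropped out
]

samples = [
    {"site": "siteA", "course": 10, "student": 1, "weekly_accesses": 0.8,
     "forum_replies": 0.5, "course_weeks": 12, "log_range_weeks": 4,
     "dropped_out": 0},
    {"site": "siteB", "course": 3, "student": 7, "weekly_accesses": 0.1,
     "forum_replies": 0.0, "course_weeks": 8, "log_range_weeks": 4,
     "dropped_out": 1},
]


def to_csv(rows):
    """Serialise the samples as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


csv_text = to_csv(samples)
print(csv_text)
```

Writing to an in-memory buffer keeps the example self-contained; passing an open file object to `csv.DictWriter` instead would produce the .csv file itself.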
Methodology - Overview
2. Machine learning training and performance evaluation
a. Inputs: A .csv file
b. Cross-validation (hyperparameter tuning)
c. Prediction model performance evaluation
■ The process is repeated multiple times
d. Outputs: The average score (Matthews correlation coefficient) and the standard
deviation of all performance evaluations
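The evaluation loop above can be sketched with the standard library alone. The Matthews correlation coefficient is computed from the confusion matrix as (TP·TN − FP·FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)). The single-threshold "model", the synthetic data and the repeated hold-out split are assumptions made to keep the example short; they stand in for the real classifiers and cross-validation and are not the method used in this research.

```python
import random
from math import sqrt
from statistics import mean, stdev


def mcc(y_true, y_pred):
    """Matthews correlation coefficient for binary labels (0 or 1)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


def fit_threshold(xs, ys):
    """Toy model: predict dropout when activity < threshold; pick the
    threshold with the best MCC on the training samples."""
    candidates = sorted(set(xs))
    return max(candidates, key=lambda th: mcc(ys, [int(x < th) for x in xs]))


def repeated_evaluation(xs, ys, repeats=10, test_share=0.3, seed=0):
    """Repeat a random train/test split; return mean and stdev of test MCC."""
    rng = random.Random(seed)
    indices = list(range(len(xs)))
    scores = []
    for _ in range(repeats):
        rng.shuffle(indices)
        cut = int(len(indices) * test_share)
        test, train = indices[:cut], indices[cut:]
        th = fit_threshold([xs[i] for i in train], [ys[i] for i in train])
        preds = [int(xs[i] < th) for i in test]
        scores.append(mcc([ys[i] for i in test], preds))
    return mean(scores), stdev(scores)


# Synthetic data: activity level per enrolment; low activity means dropout,
# with two labels flipped to add noise.
activity = list(range(10)) * 2
dropout = [int(a < 4) for a in activity]
dropout[5], dropout[12] = 1, 0

avg, spread = repeated_evaluation(activity, dropout)
print(f"MCC mean={avg:.3f} stdev={spread:.3f}")
```

Unlike plain accuracy, MCC stays informative when the classes are imbalanced, since a model that always predicts the majority class scores 0 rather than a high percentage.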
Methodology - Parameters
● Repeat the described process with different parameters:
a. Using different subsets of student engagement indicators
b. Adding more course information when required
c. Limiting the student activity logs that are used
d. Using different machine learning algorithms
■ e.g. Neural networks, Support vector machines, Random forests...
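The combinations to evaluate can be enumerated as a grid, as in the sketch below. The indicator names and the log ranges are hypothetical, the algorithm names are plain labels rather than library identifiers, and option (b), adding course information, is left out for brevity.

```python
from itertools import combinations, product

# Hypothetical parameter values, for illustration only.
indicators = ["weekly_accesses", "forum_replies", "resources_accessed"]
log_ranges_weeks = [2, 4]
algorithms = ["neural_network", "support_vector_machine", "random_forest"]

# Every non-empty subset of the engagement indicators.
indicator_subsets = [
    subset
    for size in range(1, len(indicators) + 1)
    for subset in combinations(indicators, size)
]

# One run per (indicator subset, log range, algorithm) combination.
runs = list(product(indicator_subsets, log_ranges_weeks, algorithms))
print(len(runs))  # 7 subsets x 2 ranges x 3 algorithms = 42
```

The number of subsets doubles with each extra indicator, so an exhaustive grid like this one is only practical for a short indicator list.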
Timeline
Task / Milestone | Date
Training courses and literature review | September 2016 - May 2017
Thesis proposal seminar and proposal submission to the Graduate Research School at UWA | May 2017
Learning Analytics and Educational Data Mining survey | June 2017 - December 2017
Paper describing the analytics framework developed and used for this research | November 2017 - July 2018
Paper detailing different combinations of parameters and results | July 2018 - December 2019
Limit to nominate examiners and thesis submission | August 2020 - September 2020
Questions?
Thanks for coming.
Any questions?

Automatic classification of students in online courses using machine learning techniques

  • 1. Automatic classification of students in online courses using machine learning techniques D. Monllao Olive School of Computer Science and Software Engineering The University of Western Australia Crawley WA 6009, AUSTRALIA Principal supervisor: Dr Du Huynh Co-supervisor: Assoc/Prof Mark Reynolds External supervisor: Dr Martin Dougiamas Master of Philosophy - part time student
  • 2. Contents 1. Problem description 1.1. Online education and Moodle 1.2. Detection of students at risk of dropping out of courses 1.3. Students engagement in online courses 2. Literature review 3. Aim 4. Progress 5. Methodology 6. Timeline 1
  • 3. Online education and Moodle ● Traditional education better for social interactions ● Online education offers more flexibility but relies more in self discipline ● Moodle stats (May 2017 https://moodle.net/stats/) ○ 103 million users worldwide ○ 12 million courses ○ 215 million forum posts ○ 589 million quiz questions https://moodle.org/logo/ 2
  • 4. Students at risk of dropping out of courses ● Students that are not engaged in the course don’t participate ● Different stakeholders interested in reducing online courses drop out rates ○ Students ○ Teachers ○ Educational institutions https://www.thehrdigest.com/wp-content/upload s/2016/03/college-degree.jpg 3
  • 5. Students’ engagement in online courses ● Engaged students participate in the course activities ● Engagement is not as easy to detect in online courses as in face-to-face education ● Some examples of engagement indicators: ○ Regular accesses to the course ○ Replies to other course participants’ forum posts ○ Quick reply to teacher’s feedback ○ Percentage of accessed course resources https://userscontent2.emaze.com/images/f5e5 a8b9-e038-4620-b54f-b935902facd9/cdeb1f7 d712f03715e4b0aed325ee967.jpg 4
  • 6. Literature review - Learning analytics
      1. From an educational point of view
      ● Description of online students’ engagement indicators [1] [2]
      ● Factor analysis and correlations between indicators and students’ retention
      ● Limitations:
          ○ Not very empirically rigorous
          ○ Limited studied dataset, results biased to a few courses
          ○ Indicators correlate individually
      [1] Katrina A. Meyer. Student engagement in online learning: What works and why. ASHE Higher Education Report, 40(6):1–114, 2014.
      [2] Kate S. Hone and Ghada R. El Said. Exploring the factors affecting MOOC retention: A survey study. Computers & Education, 98:157–168, 2016.
  • 7. Literature review - Educational data mining
      2. From a data mining point of view
      ● Some recent studies use machine learning techniques such as decision trees, association rules or evolutionary algorithms [3] [4]
      ● Limitations:
          ○ Limited studied dataset
          ○ Basic student engagement indicators
      [3] Carlos Marquez-Vera, Alberto Cano, Cristobal Romero, Amin Yousef Mohammad Noaman, Habib Mousa Fardoun, and Sebastian Ventura. Early dropout prediction using data mining: a case study with high school students. Expert Systems, 33(1):107–124, 2016. EXSY-Dec-13-227.R3.
      [4] J. M. Luna, C. Castro, and C. Romero. MDM tool: A data mining framework integrated into Moodle. Computer Applications in Engineering Education, 25(1):90–102, 2017.
  • 8. Aim
      To find the model that best predicts students at risk of dropping out of any ongoing Moodle course.
  • 9. Aim - How to achieve it?
      ● By using multiple datasets from different institutions
          ○ To prevent the model from overfitting to the courses of a particular institution or format
      ● By selecting a subset of the student engagement indicators found in the literature
          ○ To discard indicators that don’t correlate well
      ● By adding course information to the training dataset
          ○ To make the model adaptable to all sorts of courses
      ● By limiting the studied activity logs to the most relevant time range
          ○ To improve the model accuracy
  • 10. Progress
      ● Student engagement indicators literature review
      ● Moodle analytics API developed (https://github.com/moodlehq/moodle-tool_inspire)
          ○ Machine learning backend plugins. Shipped with Python (TensorFlow) and PHP (php-ml)
          ○ Highly extensible
          ○ Prototype: http://prototype.moodle.net/inspirephase1/
          ○ Experimental model included: Students at risk of dropping out
      ● Contributions to the most popular PHP machine learning library
          ○ https://github.com/php-ai/php-ml/graphs/contributors
      Image: http://php-ml.readthedocs.io/en/latest/assets/php-ml-logo.png
  • 11. Progress - Analytics API - Data flow chart
  • 12. Progress - Prototype - Models list
  • 13. Progress - Prototype - Predictions list
  • 14. Progress - Prototype - Prediction details
  • 15. Methodology - Overview
      1. Training dataset preparation from raw Moodle sites data
          ○ One sample for each student enrolment in each course of each Moodle site
          ○ Features: calculated student engagement indicators, course information and information about the time range of the included activity logs
          ○ Label: Did the student drop out of the course?
          ○ Output: A .csv file
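The dataset layout described on this slide (one row per enrolment, feature columns, a dropout label, written to a .csv file) could look roughly like the sketch below. Every column name and every sample value is a hypothetical placeholder chosen for illustration; the slides do not list the actual features.

```python
import csv
import io

# Hypothetical columns: identifiers, engagement indicators, course
# information, log time range, and the label. None of these names come
# from the presentation.
FIELDS = [
    "site", "course", "student",
    "accessed_resources_ratio", "active_weeks_ratio",
    "course_activity_count", "log_range_days",
    "dropped_out",
]

# Made-up sample: one dict per student enrolment in a course of a site.
ENROLMENTS = [
    {"site": "siteA", "course": "c1", "student": "s1",
     "accessed_resources_ratio": 0.75, "active_weeks_ratio": 0.75,
     "course_activity_count": 12, "log_range_days": 28, "dropped_out": 0},
    {"site": "siteA", "course": "c1", "student": "s2",
     "accessed_resources_ratio": 0.25, "active_weeks_ratio": 0.25,
     "course_activity_count": 12, "log_range_days": 28, "dropped_out": 1},
]


def write_dataset(enrolments, out):
    """Write one CSV row per enrolment and return the number of rows written."""
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    count = 0
    for enrolment in enrolments:
        writer.writerow({name: enrolment[name] for name in FIELDS})
        count += 1
    return count


if __name__ == "__main__":
    buffer = io.StringIO()
    rows = write_dataset(ENROLMENTS, buffer)
    print(rows, "rows written")
    print(buffer.getvalue())
```

Writing to any file-like object (here an in-memory buffer) makes the same function usable with `open("dataset.csv", "w", newline="")` when a file on disk is wanted.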
  • 16. Methodology - Overview
      2. Machine learning training and performance evaluation
          a. Inputs: A .csv file
          b. Cross-validation (hyperparameter tuning)
          c. Prediction model performance evaluation
              ■ The process is repeated multiple times
          d. Outputs: The average accuracy (Matthews correlation coefficient) and the standard deviation of all performance evaluations
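The evaluation loop on this slide can be sketched in plain Python as follows. The Matthews correlation coefficient is computed from the binary confusion matrix as (TP·TN − FP·FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)). The "model" is a deliberately trivial stand-in (a single engagement threshold) so that the example stays self-contained; it is not the model used in the research, and all function names, the fold scheme and the default parameters are assumptions. Because the stand-in has no fitted parameters, the cross-validation step here only scores each candidate threshold on held-out folds of the training split.

```python
import math
import random
import statistics


def mcc(y_true, y_pred):
    """Matthews correlation coefficient for binary labels (0 or 1)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention the score is 0 when any marginal count is zero.
    return (tp * tn - fp * fn) / denom if denom else 0.0


def predict(xs, threshold):
    """Stand-in model: flag a student as at risk (1) when engagement is low."""
    return [1 if x < threshold else 0 for x in xs]


def tune_threshold(xs, ys, candidates, k=3):
    """Pick the candidate threshold with the best mean MCC over k folds."""
    folds = [list(range(i, len(xs), k)) for i in range(k)]

    def mean_fold_score(threshold):
        return statistics.mean(
            mcc([ys[i] for i in fold], predict([xs[i] for i in fold], threshold))
            for fold in folds
        )

    return max(candidates, key=mean_fold_score)


def repeated_evaluation(xs, ys, candidates, repeats=10, test_fraction=0.3, seed=0):
    """Repeat split, tune, and test; return mean and standard deviation of MCC."""
    rng = random.Random(seed)
    scores = []
    for _ in range(repeats):
        idx = list(range(len(xs)))
        rng.shuffle(idx)
        cut = int(len(idx) * (1 - test_fraction))
        train, test = idx[:cut], idx[cut:]
        threshold = tune_threshold(
            [xs[i] for i in train], [ys[i] for i in train], candidates
        )
        scores.append(
            mcc([ys[i] for i in test], predict([xs[i] for i in test], threshold))
        )
    return statistics.mean(scores), statistics.stdev(scores)


if __name__ == "__main__":
    # Made-up engagement scores; label 1 means the student dropped out.
    xs = [0.1, 0.9, 0.2, 0.8, 0.15, 0.7, 0.3, 0.95, 0.05, 0.85,
          0.25, 0.75, 0.35, 0.65, 0.4, 0.6, 0.12, 0.88, 0.22, 0.78]
    ys = [1 if x < 0.5 else 0 for x in xs]
    print(repeated_evaluation(xs, ys, candidates=[0.25, 0.5, 0.75]))
```

Reporting the standard deviation alongside the mean shows how much the score depends on the particular train and test split.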
  • 17. Methodology - Parameters
      ● Repeat the described process with different parameters:
          a. Using different subsets of student engagement indicators
          b. Adding more course information when required
          c. Limiting the student activity logs that are used
          d. Using different machine learning algorithms
              ■ e.g. Neural networks, Support vector machines, Random forests...
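One way to enumerate the parameter combinations from this slide is a simple grid over the four dimensions (a to d). The indicator names, the log range labels and the minimum subset size below are hypothetical values picked for the example; only the three algorithm families are named on the slide.

```python
from itertools import combinations, product

# Hypothetical indicator names, loosely based on the examples given earlier.
INDICATORS = [
    "regular_accesses",
    "forum_replies",
    "feedback_reply_time",
    "accessed_resources_ratio",
]
COURSE_INFO = [False, True]  # whether extra course information is added
LOG_RANGES = ["first_quarter", "first_half", "full_course"]  # hypothetical
ALGORITHMS = ["neural_network", "support_vector_machine", "random_forest"]


def indicator_subsets(indicators, min_size=2):
    """Yield every subset of the indicators with at least min_size members."""
    for size in range(min_size, len(indicators) + 1):
        yield from combinations(indicators, size)


def experiment_grid():
    """Return one configuration dict per combination of the four dimensions."""
    return [
        {
            "indicators": subset,
            "course_info": course_info,
            "log_range": log_range,
            "algorithm": algorithm,
        }
        for subset, course_info, log_range, algorithm in product(
            list(indicator_subsets(INDICATORS)),
            COURSE_INFO,
            LOG_RANGES,
            ALGORITHMS,
        )
    ]


if __name__ == "__main__":
    grid = experiment_grid()
    print(len(grid), "configurations")
    print(grid[0])
```

The grid grows multiplicatively with each dimension, so in practice the set of indicator subsets would likely need to be pruned before every configuration is trained and evaluated.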
  • 18. Timeline
      Task / Milestone | Date
      Training courses and literature review | September 2016 - May 2017
      Thesis proposal seminar and proposal submission to the Graduate Research School at UWA | May 2017
      Learning Analytics and Educational Data Mining survey | June 2017 - December 2017
      Paper describing the analytics framework developed and used for this research | November 2017 - July 2018
      Paper detailing different combinations of parameters and results | July 2018 - December 2019
      Limit to nominate examiners and thesis submission | August 2020 - September 2020