Data Mining in Education
Social Media + Text
Qiang Hao
neohao@uga.edu
http://tobeneo.com
Goals
• What is Data Mining?
• What tools / knowledge do you need to do
Data Mining?
• What is the basic process of Data Mining?
Questions Answered by Data Mining
• Can we predict whether an incoming email is
spam?
money
you
he
……
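One way to turn the words above into inputs for a spam predictor is to measure how often each one appears in an email. The sketch below is only an illustration: the function name `word_features`, the sample email, and the choice of relative frequency as the feature are assumptions, not something the slides specify.

```python
import re
from collections import Counter

# Indicator words taken from the slide's example list.
INDICATOR_WORDS = ("money", "you", "he")

def word_features(email_text):
    """Return the relative frequency of each indicator word in an email."""
    tokens = re.findall(r"[a-z']+", email_text.lower())
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero on empty input
    return {word: counts[word] / total for word in INDICATOR_WORDS}

# Made-up email text, used only to exercise the function.
features = word_features("You won money! Claim your money now.")
print(features)
```

A classifier would then be trained on these feature values for emails already labeled as spam or not spam.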
Questions Answered by Data Mining
• What is the attitude of people on Twitter
towards the presidential candidate Donald
Trump?
#Trump
#DonaldTrump
#GOPTrump
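Selecting the relevant tweets by hashtag could look roughly like this. The hashtags come from the slide; the helper names and the sample tweets are made up for illustration, and real tweets would come from a collection step not shown here.

```python
import re

# Hashtags from the slide, lowercased so matching is case-insensitive.
TARGET_HASHTAGS = {"#trump", "#donaldtrump", "#goptrump"}

def extract_hashtags(tweet):
    """Return the set of lowercased hashtags found in a tweet."""
    return {tag.lower() for tag in re.findall(r"#\w+", tweet)}

def mentions_target(tweet):
    """True if the tweet carries at least one target hashtag."""
    return bool(extract_hashtags(tweet) & TARGET_HASHTAGS)

# Made-up sample tweets.
sample_tweets = [
    "Watching the debate tonight #DonaldTrump",
    "Great weather for a run #fitness",
    "Rally downtown this weekend #Trump #GOPTrump",
]
selected = [t for t in sample_tweets if mentions_target(t)]
print(selected)
```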
a, an, the, is, are,
was, were, if …
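Words like these carry little meaning on their own and are usually dropped before analysis. A minimal sketch of that step, assuming a simple regular-expression tokenizer and only the stop words listed on the slide (a real list would be much longer):

```python
import re

# Stop words listed on the slide.
STOP_WORDS = {"a", "an", "the", "is", "are", "was", "were", "if"}

def remove_stop_words(text):
    """Lowercase the text, split it into words, and drop stop words."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]

# Made-up sentence, used only to exercise the function.
content_words = remove_stop_words("The rally was a disaster if you ask me")
print(content_words)
```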
Negative
Neutral
Positive
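Assigning one of these three labels to a tweet can be sketched with a word-list approach. This is only one possible technique and not necessarily the one used in the talk; the two tiny lexicons and the function name `classify_sentiment` are hypothetical.

```python
import re

# Tiny hypothetical lexicons, for illustration only.
POSITIVE_WORDS = {"great", "good", "love", "win"}
NEGATIVE_WORDS = {"bad", "terrible", "hate", "lose"}

def classify_sentiment(text):
    """Label text by counting positive words minus negative words."""
    tokens = re.findall(r"[a-z']+", text.lower())
    positive = sum(t in POSITIVE_WORDS for t in tokens)
    negative = sum(t in NEGATIVE_WORDS for t in tokens)
    score = positive - negative
    if score > 0:
        return "Positive"
    if score < 0:
        return "Negative"
    return "Neutral"

print(classify_sentiment("I love this great speech"))
```

Text with no lexicon words, or with equal counts on both sides, falls into the Neutral class.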
Educational Questions to Answer by
Data Mining
• What algorithm can score essays as teachers do?
• What courses should we recommend to students based on their online activities?
• Does the intervention improve students’ lexical variety in their writing?
• Are there different patterns in students’ questions; if so, are the patterns related to their academic performance?
• What sub-topics do students tend to cover when discussing this topic?
• What predictor is the most important one for whether college students seek help online in their learning?
Goals
• What is Data Mining?
Replicable
Reproducible
Automatic
Goals
• What is Data Mining?
• What tools / knowledge do you need to do
Data Mining?
Tools / Knowledge
Carmen Reinhart Kenneth Rogoff
Thomas Herndon
Goals
• What tools / knowledge do you need to do
Data Mining?
• Expert level of knowledge in statistics
• Intermediate level of knowledge in programming
• Familiarity with R/Python
R for SAS and SPSS Users
Robert A. Muenchen
Hands-On Programming
with R
Garrett Grolemund
Goals
• What is Data Mining?
• What tools / knowledge do you need to do
Data Mining?
• What is the basic process of Data Mining?
Research Pipeline
• Data Collection
• Data Cleaning
• Data Processing
• Data Analysis
• Sharing Data and Results
Data Collection
• XML
• JSON
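Collected data often arrives in one of these two formats. As a rough sketch, Python's standard library can read both; the record layout and the field names `user` and `text` below are invented for illustration and do not come from any particular service.

```python
import json
import xml.etree.ElementTree as ET

# The same made-up record in both formats.
json_record = '{"user": "student01", "text": "How do I install R?"}'
xml_record = "<post><user>student01</user><text>How do I install R?</text></post>"

def parse_json_post(raw):
    """Parse a JSON string into a plain dictionary."""
    data = json.loads(raw)
    return {"user": data["user"], "text": data["text"]}

def parse_xml_post(raw):
    """Parse an XML string into the same dictionary shape."""
    root = ET.fromstring(raw)
    return {"user": root.findtext("user"), "text": root.findtext("text")}

print(parse_json_post(json_record))
print(parse_xml_post(xml_record))
```

Converting both formats into the same dictionary shape lets the later cleaning and processing steps ignore where the data came from.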
Mining the Social Web, 2nd Edition
Matthew A. Russell
Python
Data Cleaning
Data Processing
Text Analysis with R for Students of Literature
Matthew L. Jockers
Data Analysis
• Lexical Variety
• Classification
• Clustering Analysis
• Latent Semantic Analysis
• Support Vector Machine
• Sentiment Analysis
• Topic Modeling
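As an example of the first item, lexical variety is often measured with the type-token ratio: the number of distinct words divided by the total number of words. The slides do not say which measure was used, so treat this as an assumption; the tokenizer and the sample sentence are also illustrative.

```python
import re

def type_token_ratio(text):
    """Return distinct words divided by total words (0.0 for empty text)."""
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)

# Six words, five of them distinct ("the" appears twice).
ttr = type_token_ratio("the cat sat on the mat")
print(ttr)
```

The ratio tends to fall as texts get longer, so comparisons are usually made between texts of similar length.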
Data Analysis
Renkl, A. (1997). Learning from worked-out examples: A
study on individual differences. Cognitive Science, 21(1), 1-29.
Data Analysis
An Introduction to Statistical Learning
Gareth James
Daniela Witten
Trevor Hastie
Robert Tibshirani
Sharing Data and Results
• R + knitr + RPubs: http://rpubs.com/neohao/online-help-seeking
• GitHub: https://github.com/Neo-Hao/TwitterHashtagR
Version Control with Git
Jon Loeliger
Thanks!

Data Mining and Text Mining in Educational Research