SlideShare a Scribd company logo
1 of 43
1© Copyright 2015 Pivotal. All rights reserved. 1© Copyright 2015 Pivotal. All rights reserved.
Srivatsan Ramanujam,
Regunathan Radhakrishnan
Principal Data Scientists
Pivotal Data Science
How Data Science is Preventing
College Dropouts and Advancing
Student Success
2© Copyright 2015 Pivotal. All rights reserved.
Agenda
 Justifying the cost of college education
 Institutional Data Lake
 Overview of typical data sources
– Structured
– Unstructured
 Models to predict student success
– Predict time-to-graduate
– Predict term GPA
– Predict course Grade
 Operationalizing student success models
3© Copyright 2015 Pivotal. All rights reserved.
Justifying the Cost of College
Education
4© Copyright 2015 Pivotal. All rights reserved.
Educators’ Concerns
 Cost of education has been steeply increasing and
as a consequence student debt has also been
rising steeply
 How do we justify the value of college education?
 How do we ensure students graduate on time,
don’t drop-out, get better jobs?
 What are factors which educators can influence to
improve student graduation rates?
5© Copyright 2015 Pivotal. All rights reserved.
Institutional Data Lake
6© Copyright 2015 Pivotal. All rights reserved.
Business Goals
 Educators would like to study
factors that affect student
success
 To introduce policies and tools
that positively impact student
success
Institutional Data Lake
Admissions
data
Registration
data
demographics
data
Online
forums data
Blackboard
data
Card swipes
at campus
facilities
Structured data sources Unstructured data sources
Behaviors
Student Cluster
Analysis
Outcomes
GPA Prediction
Retention Prediction
7© Copyright 2015 Pivotal. All rights reserved.
Advantages of Institutional Data Lake
 Derive insights from data that help drive institution policies
 Deploy data-driven apps that positively impacts student success
Institutional Data Lake
Admissions
data
Registration
data
demographics
data
Online
forums data
Blackboard
data
Card swipes
at campus
facilities
Structured data sources Unstructured data sources
Analytics
Education
research
Data-driven
apps
e.g Predict time-to-grad
Predict drop-outs
e.g course recommender
Term GPA predictor
8© Copyright 2015 Pivotal. All rights reserved.
Problem Statement
 Given data related to a student’s activity and profile, predict a student’s
success ( e.g. time to graduate)
– Identify key attributes that influence the time to graduate which
will assist institution take action on some of the identified attributes
 Our approach:
– Create 360-degree view of student’s activity and profile
– Apply machine learning methods to predict time to graduate
– Interrogate the developed models to understand key factors
9© Copyright 2015 Pivotal. All rights reserved.
Data Science Toolkit
KEY LANGUAGES
P L A T F O R M
KEY TOOLS
MLlib
PL/X
ModelingTools
VisualizationTools
Platform
10© Copyright 2015 Pivotal. All rights reserved.
Data Lake
Business Levers
Apps
Pipeline of a Data Science Driven App
MLlib
PL/X
Model Building
Model Tuning
Continuous Model
Improvement
Data Feeds
Ingest Filter Enrich Sink
SpringXD
Greenplum
11© Copyright 2015 Pivotal. All rights reserved.
• For embarrassingly parallel
tasks, we can use procedural
languages to easily
parallelize any stand-alone
library in Java, Python, R,
pgSQL or C/C++
• The interpreter/VM of the
language ‘X’ is installed on
each node of the MPP
environment
Standby
Master
…
Master
Host
SQL
Interconnect
Segment Host
Segment
Segment
Segment Host
Segment
Segment
Segment Host
Segment
Segment
Segment Host
Segment
Segment
Data Parallelism through PL/X: X in Python, R,
Java, C/C++ and pgSQL
• plpython and python are loaded as dynamic
libraries on the master and segment nodes
(libpython.so and plpython.so are under
$GPHOME/ext/python)
12© Copyright 2015 Pivotal. All rights reserved.
MADlib: Scalable, In-Database Machine Learning
http://vldb.org/pvldb/vol5/p1700_joehellerstein_vldb2012.pdf
13© Copyright 2015 Pivotal. All rights reserved.
Functions
Supervised Learning
Regression Models
• Cox Proportional Hazards Regression
• Elastic Net Regularization
• Generalized Linear Models
• Linear Regression
• Logistic Regression
• Marginal Effects
• Multinomial Regression
• Ordinal Regression
• Robust Variance, Clustered Variance
• Support Vector Machines
Tree Methods
• Decision Tree
• Random Forest
Other Methods
• Conditional Random Field
• Naïve Bayes
Unsupervised Learning
• Association Rules (Apriori)
• Clustering (K-means)
• Topic Modeling (LDA)
Statistics
Descriptive
• Cardinality Estimators
• Correlation
• Summary
Inferential
• Hypothesis Tests
Other Statistics
• Probability Functions
Other Modules
• Conjugate Gradient
• Linear Solvers
• PMML Export
• Random Sampling
• Term Frequency for Text
Time Series
• ARIMA
Data Types and Transformations
• Array Operations
• Dimensionality Reduction (PCA)
• Encoding Categorical Variables
• Matrix Operations
• Matrix Factorization (SVD, Low Rank)
• Norms and Distance Functions
• Sparse Vectors
Model Evaluation
• Cross Validation
Predictive Analytics Library
@ApacheMADlib
14© Copyright 2015 Pivotal. All rights reserved.
Overview of Typical Data Sources
15© Copyright 2015 Pivotal. All rights reserved.
Typical Data Sources
Student 360
application,
admission
academic
activity
(e.g courses, GPA)
previous
education
profile
demographic
profileBlackboard
Learn
activity
network
activity
Card swipes
activity
16© Copyright 2015 Pivotal. All rights reserved.
Student 360
application,
admission
academic
activity
(e.g courses, GPA)
previous
education
profile
demographic
profileBlackboard
learn
activity
network
activity
card swipes
activity
17© Copyright 2015 Pivotal. All rights reserved.
Admit Rate and Yield Rate
 Admit_rate = num admitted/ num applications Yield rate = num enrolled/ num admitted
18© Copyright 2015 Pivotal. All rights reserved.
Percentage of Foreign Applicants Increases, while Percentage
of Residents Decreases
19© Copyright 2015 Pivotal. All rights reserved.
Student 360
application,
admission
academic
activity
(e.g courses, GPA)
previous
education
profile
demographic
profileBlackboard
learn
activity
network
activity
card swipes
activity
20© Copyright 2015 Pivotal. All rights reserved.
When do Students Take Internships?
21© Copyright 2015 Pivotal. All rights reserved.
Changes in Enrollment Info: College, Major, Program
How many changes?
when?
• Late grads more likely to change college, major or program and
change later in their student career
22© Copyright 2015 Pivotal. All rights reserved.
Student 360
application,
admission
academic
activity
(e.g courses, GPA)
previous
education
profile
demographic
profileBlackboard
learn
activity
network
activity
card swipes
activity
23© Copyright 2015 Pivotal. All rights reserved.
High School GPA, Rank
24© Copyright 2015 Pivotal. All rights reserved.
Student 360
application,
admission
academic
activity
(e.g courses, GPA)
previous
education
profile
demographic
profileBlackboard
learn
activity
network
activity
card swipes
activity
25© Copyright 2015 Pivotal. All rights reserved.
Percentage of Assignments Submitted/Course/Term
• Number of assignments submitted per course/
Actual number of assignments for that course
• Average this ratio across courses taken in each
Academic period
• Students who drop out are less likely to submit all assignments in a course
26© Copyright 2015 Pivotal. All rights reserved.
How Quickly do Students Submit Assignments?
• Calculate time diff between
first_submission and student’s
submission
• Average across courses and
assignments per academic period
• Time converted in terms of hours
27© Copyright 2015 Pivotal. All rights reserved.
Participation Ratio
Participation ratio = number of posted messages/number of viewed messages
• Number of viewed messages always
greater than number of posted messages
• Normal students view more messages
than they post
28© Copyright 2015 Pivotal. All rights reserved.
Popularity of Messages by Posters
• Median viewer count largest for normal grads
• Msg_hit_count is not the highest for normal grads
29© Copyright 2015 Pivotal. All rights reserved.
Student 360
application,
admission
academic
activity
(e.g courses, GPA)
previous
education
profile
demographic
profileBlackboard
learn
activity
network
activity
card swipes
activity
30© Copyright 2015 Pivotal. All rights reserved.
Network Activity – Usage by Class Label
• Incomplete and late grads tend to use
the network more
31© Copyright 2015 Pivotal. All rights reserved.
Student 360
application,
admission
academic
activity
(e.g courses, GPA)
previous
education
profile
demographic
profileBlackboard
learn
activity
network
activity
card swipes
activity
32© Copyright 2015 Pivotal. All rights reserved.
Students’ network logs and card activity
Card Swipes – Laundry and Dining Facility Use
• Late grads are more likely to use laundry facilities while classes are in session
• Late grads also tend to have breakfast, lunch and dinner later than on-time grads
33© Copyright 2015 Pivotal. All rights reserved.
Models
Predict Time-to-Graduate
Predict Term GPA
Predict Course Grade
34© Copyright 2015 Pivotal. All rights reserved.
Modeling Approach: Two Types of Features
Fall 2008 Spring 2009 Summer 2009
Fall 2008
cohort joins
Extract
features
until end of
term
Perform
Modeling
&
Scoring
Whether or
not a student will
graduate on time
Extract
features
until end of
term
Perform
Modeling
&
Scoring
Whether or
not a student will
graduate on time
Static features
• Remain the same irrespective of time window
• E.g gender, admission attribute, sat score etc
Time sensitive features
• Depends on time window (w1,w2)
• E.g course activity, bblearn activity etc
w1
w2
35© Copyright 2015 Pivotal. All rights reserved.
Modeling – Algorithms and Libraries
• Logistic Regression (MADlib)
 XGBoost (https://github.com/dmlc/xgboost)
• AdaBoost (Scikit-Learn)
• RandomForest (Scikit-Learn)
• Extracted three categories of features from all data sources
- Static ~ 50 features, time sensitive ~ 115 features
- Card Swipes, Network Logs ~ 110 features
• Built models for three different problems
- Time-to-graduate
- Course grade
- Term GPA
Algorithms and Libraries
36© Copyright 2015 Pivotal. All rights reserved.
Operationalization
37© Copyright 2015 Pivotal. All rights reserved.
Operationalization Pipeline
Refreshed data (incoming
daily/weekly/monthly updates)
feature gen.
pipeline
Static features
Static + time-sensitive
LMS features
Static + time-sensitive LMS +
network + card logs features
In-database
parallel grid-search
(XGBoost)
MADlib Logistic
Regression
Sklearn
AdaBoost
Sklearn
RandomForest
Model
selection
Serialize to
disk
Structured, unstructured
data sources in data lake
scoring results
• Student ID
• Feature names, values, importance scores
• Predictions (late, normal, dropped)
User-notification on
smartphone app
Cleared by Data
Scientist
Modeling pipeline
38© Copyright 2015 Pivotal. All rights reserved.
In-Database Parallel Grid Search Using
https://github.com/vatsan/gp_xgboost_gridsearch
• XGBoost (eXtreme
Gradient Boosting) is a
popular library used in
many prize winning
Kaggle contests.
• Implemented in C++ with
Python and R bindings
• Supports multi-core
• Implemented in-database
parallel grid-search for
XGBoost using PL/Python
39© Copyright 2015 Pivotal. All rights reserved.
In-Database Grid Search – Approach
https://github.com/vatsan/gp_xgboost_gridsearch
Refreshed data (incoming
daily/weekly/monthly updates)
feature gen.
pipeline training dataset
(distributed table)
Model
selection
structured,
unstructured
data sources
scored results
grid search
params dict
Grid params table
(expanded)
master
segments
param-list-1 param-list-n. . .
training set(serialized) training set(serialized)
Driver function
(PL/Python)
pickle
and
distribute
mdl-1 mdl-n. . .
40© Copyright 2015 Pivotal. All rights reserved.
Model Training and Scoring: XGBoost
https://github.com/vatsan/gp_xgboost_gridsearch
Training Scoring
41© Copyright 2015 Pivotal. All rights reserved.
Conclusions
 We built an institutional data lake for our customer, to serve
as a platform for education research and data-driven apps
for student success
 Reviewed typical data sources in an institutional data lake
– Interesting features that make up student 360
 Using open source, scalable data science toolkits we built
three different models for predicting student success
 We set-up a pipeline for operationalizing our models, to be
consumed by a data driven smartphone app
42© Copyright 2015 Pivotal. All rights reserved.
Pivotal Data Science Blogs
1. Scaling native (C++) apps on Pivotal MPP
2. Predicting commodity futures through Tweets
3. A pipeline for distributed topic & sentiment analysis of tweets on Greenplum
4. Using data science to predict TV viewer behavior
5. Twitter NLP: Scaling part-of-speech tagging
6. Distributed deep learning on MPP and Hadoop
7. Multi-variate time series forecasting
8. Pivotal for good – Crisis Textline
http://blog.pivotal.io/data-science-pivotal
43© 2015 Pivotal Software, Inc. All rights reserved.

More Related Content

What's hot

Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...
Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...
Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...VMware Tanzu
 
Data Science At Scale for IoT on the Pivotal Platform
Data Science At Scale for IoT on the Pivotal PlatformData Science At Scale for IoT on the Pivotal Platform
Data Science At Scale for IoT on the Pivotal PlatformGautam S. Muralidhar
 
Pivotal Digital Transformation Forum: Becoming a Data Driven Enterprise
Pivotal Digital Transformation Forum: Becoming a Data Driven EnterprisePivotal Digital Transformation Forum: Becoming a Data Driven Enterprise
Pivotal Digital Transformation Forum: Becoming a Data Driven EnterpriseVMware Tanzu
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationInside Analysis
 
Your AI Transformation
Your AI Transformation Your AI Transformation
Your AI Transformation Sri Ambati
 
Data and its Role in Your Digital Transformation
Data and its Role in Your Digital TransformationData and its Role in Your Digital Transformation
Data and its Role in Your Digital TransformationVMware Tanzu
 
The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)Cloudera, Inc.
 
AI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneySri Ambati
 
seven steps to dataops @ dataops.rocks conference Oct 2019
seven steps to dataops @ dataops.rocks conference Oct 2019seven steps to dataops @ dataops.rocks conference Oct 2019
seven steps to dataops @ dataops.rocks conference Oct 2019DataKitchen
 
Optimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataOptimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti
 
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...StampedeCon
 
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...Precisely
 
Next Steps In Your Digital Transformation
Next Steps In Your Digital TransformationNext Steps In Your Digital Transformation
Next Steps In Your Digital TransformationVMware Tanzu
 
Ready for Fast Data: How Lightbend Enables Teams To Build Real-Time, Streamin...
Ready for Fast Data: How Lightbend Enables Teams To Build Real-Time, Streamin...Ready for Fast Data: How Lightbend Enables Teams To Build Real-Time, Streamin...
Ready for Fast Data: How Lightbend Enables Teams To Build Real-Time, Streamin...Lightbend
 
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...GetInData
 
How to create intelligent Business Processes thanks to Big Data (BPM, Apache ...
How to create intelligent Business Processes thanks to Big Data (BPM, Apache ...How to create intelligent Business Processes thanks to Big Data (BPM, Apache ...
How to create intelligent Business Processes thanks to Big Data (BPM, Apache ...Kai Wähner
 
Dell | Your Path – Our Platform & Great Partnerships
Dell | Your Path – Our Platform & Great PartnershipsDell | Your Path – Our Platform & Great Partnerships
Dell | Your Path – Our Platform & Great PartnershipsDataWorks Summit
 

What's hot (20)

Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...
Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...
Customer Spotlight: How WellCare Accelerated Big Data Delivery to Improve Ana...
 
Data Science At Scale for IoT on the Pivotal Platform
Data Science At Scale for IoT on the Pivotal PlatformData Science At Scale for IoT on the Pivotal Platform
Data Science At Scale for IoT on the Pivotal Platform
 
Pivotal Digital Transformation Forum: Becoming a Data Driven Enterprise
Pivotal Digital Transformation Forum: Becoming a Data Driven EnterprisePivotal Digital Transformation Forum: Becoming a Data Driven Enterprise
Pivotal Digital Transformation Forum: Becoming a Data Driven Enterprise
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter Integration
 
Your AI Transformation
Your AI Transformation Your AI Transformation
Your AI Transformation
 
Data and its Role in Your Digital Transformation
Data and its Role in Your Digital TransformationData and its Role in Your Digital Transformation
Data and its Role in Your Digital Transformation
 
The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)
 
AI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation Journey
 
Cloud Digital Transformation
Cloud Digital TransformationCloud Digital Transformation
Cloud Digital Transformation
 
seven steps to dataops @ dataops.rocks conference Oct 2019
seven steps to dataops @ dataops.rocks conference Oct 2019seven steps to dataops @ dataops.rocks conference Oct 2019
seven steps to dataops @ dataops.rocks conference Oct 2019
 
Optimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataOptimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big Data
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to Production
 
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...
 
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
 
Next Steps In Your Digital Transformation
Next Steps In Your Digital TransformationNext Steps In Your Digital Transformation
Next Steps In Your Digital Transformation
 
Ready for Fast Data: How Lightbend Enables Teams To Build Real-Time, Streamin...
Ready for Fast Data: How Lightbend Enables Teams To Build Real-Time, Streamin...Ready for Fast Data: How Lightbend Enables Teams To Build Real-Time, Streamin...
Ready for Fast Data: How Lightbend Enables Teams To Build Real-Time, Streamin...
 
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
 
How to create intelligent Business Processes thanks to Big Data (BPM, Apache ...
How to create intelligent Business Processes thanks to Big Data (BPM, Apache ...How to create intelligent Business Processes thanks to Big Data (BPM, Apache ...
How to create intelligent Business Processes thanks to Big Data (BPM, Apache ...
 
Dell | Your Path – Our Platform & Great Partnerships
Dell | Your Path – Our Platform & Great PartnershipsDell | Your Path – Our Platform & Great Partnerships
Dell | Your Path – Our Platform & Great Partnerships
 

Similar to How Data Science is Preventing College Dropouts and Advancing Student Success

Grip op studieresultaten
Grip op studieresultatenGrip op studieresultaten
Grip op studieresultatenSURF Events
 
Higher ed - wiztango active learning platform
Higher ed - wiztango active learning platformHigher ed - wiztango active learning platform
Higher ed - wiztango active learning platformWIZTANGO
 
ATD OER Degree Initiative | SRI-rpkGROUP Services
ATD OER Degree Initiative | SRI-rpkGROUP ServicesATD OER Degree Initiative | SRI-rpkGROUP Services
ATD OER Degree Initiative | SRI-rpkGROUP ServicesAchieving the Dream
 
OER Degee Initiative Kickoff | Data & Evaluation Services
OER Degee Initiative Kickoff |  Data & Evaluation ServicesOER Degee Initiative Kickoff |  Data & Evaluation Services
OER Degee Initiative Kickoff | Data & Evaluation ServicesAchieving the Dream
 
Digital Transformation in Higher Education - The Changing Student Relationship
Digital Transformation in Higher Education - The Changing Student RelationshipDigital Transformation in Higher Education - The Changing Student Relationship
Digital Transformation in Higher Education - The Changing Student RelationshipAndy Steer
 
Student Success Plan Learner Relationship Management Tech Review
Student Success Plan Learner Relationship Management Tech ReviewStudent Success Plan Learner Relationship Management Tech Review
Student Success Plan Learner Relationship Management Tech Reviewshawngormley
 
Digital apprenticeships community event
Digital apprenticeships community eventDigital apprenticeships community event
Digital apprenticeships community eventJames Clay
 
Spohrer darwin woi 20151119 v2
Spohrer darwin woi 20151119 v2Spohrer darwin woi 20151119 v2
Spohrer darwin woi 20151119 v2ISSIP
 
A comprehensive approach to digital transformation at the University of South...
A comprehensive approach to digital transformation at the University of South...A comprehensive approach to digital transformation at the University of South...
A comprehensive approach to digital transformation at the University of South...MuleSoft
 
Univate: Your University is your best Incubator
Univate: Your University is your best IncubatorUnivate: Your University is your best Incubator
Univate: Your University is your best IncubatorJide Adebayo
 
Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...Geoffrey Fox
 
Nelson Baker & Yakut Gazi: Workshop: Affordable Degrees at Scale: What Does i...
Nelson Baker & Yakut Gazi: Workshop: Affordable Degrees at Scale: What Does i...Nelson Baker & Yakut Gazi: Workshop: Affordable Degrees at Scale: What Does i...
Nelson Baker & Yakut Gazi: Workshop: Affordable Degrees at Scale: What Does i...Alexandra M. Pickett
 
Growth Strategies for Charter Schools: Start-up, Expansion, Maturity | Califo...
Growth Strategies for Charter Schools: Start-up, Expansion, Maturity | Califo...Growth Strategies for Charter Schools: Start-up, Expansion, Maturity | Califo...
Growth Strategies for Charter Schools: Start-up, Expansion, Maturity | Califo...Charter School Capital
 
Sap (higher) education&research s_lc_m_l1-2
Sap (higher) education&research s_lc_m_l1-2Sap (higher) education&research s_lc_m_l1-2
Sap (higher) education&research s_lc_m_l1-2Ahmed Eissa ☁
 
Prospective Student Web Content Team - University of Edinburgh intro session
Prospective Student Web Content Team - University of Edinburgh intro sessionProspective Student Web Content Team - University of Edinburgh intro session
Prospective Student Web Content Team - University of Edinburgh intro sessionNeil Allison
 

Similar to How Data Science is Preventing College Dropouts and Advancing Student Success (20)

Grip op studieresultaten
Grip op studieresultatenGrip op studieresultaten
Grip op studieresultaten
 
Higher ed - wiztango active learning platform
Higher ed - wiztango active learning platformHigher ed - wiztango active learning platform
Higher ed - wiztango active learning platform
 
ATD OER Degree Initiative | SRI-rpkGROUP Services
ATD OER Degree Initiative | SRI-rpkGROUP ServicesATD OER Degree Initiative | SRI-rpkGROUP Services
ATD OER Degree Initiative | SRI-rpkGROUP Services
 
OER Degee Initiative Kickoff | Data & Evaluation Services
OER Degee Initiative Kickoff |  Data & Evaluation ServicesOER Degee Initiative Kickoff |  Data & Evaluation Services
OER Degee Initiative Kickoff | Data & Evaluation Services
 
Digital Transformation in Higher Education - The Changing Student Relationship
Digital Transformation in Higher Education - The Changing Student RelationshipDigital Transformation in Higher Education - The Changing Student Relationship
Digital Transformation in Higher Education - The Changing Student Relationship
 
Student Success Plan: Helping students reach their goals!!
Student Success Plan: Helping students reach their goals!!Student Success Plan: Helping students reach their goals!!
Student Success Plan: Helping students reach their goals!!
 
Student Success Plan Learner Relationship Management Tech Review
Student Success Plan Learner Relationship Management Tech ReviewStudent Success Plan Learner Relationship Management Tech Review
Student Success Plan Learner Relationship Management Tech Review
 
Digital apprenticeships community event
Digital apprenticeships community eventDigital apprenticeships community event
Digital apprenticeships community event
 
Spohrer darwin woi 20151119 v2
Spohrer darwin woi 20151119 v2Spohrer darwin woi 20151119 v2
Spohrer darwin woi 20151119 v2
 
A comprehensive approach to digital transformation at the University of South...
A comprehensive approach to digital transformation at the University of South...A comprehensive approach to digital transformation at the University of South...
A comprehensive approach to digital transformation at the University of South...
 
Univate: Your University is your best Incubator
Univate: Your University is your best IncubatorUnivate: Your University is your best Incubator
Univate: Your University is your best Incubator
 
Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...
 
12120
1212012120
12120
 
Growth Strategies for Charter Schools
Growth Strategies for Charter SchoolsGrowth Strategies for Charter Schools
Growth Strategies for Charter Schools
 
Nelson Baker & Yakut Gazi: Workshop: Affordable Degrees at Scale: What Does i...
Nelson Baker & Yakut Gazi: Workshop: Affordable Degrees at Scale: What Does i...Nelson Baker & Yakut Gazi: Workshop: Affordable Degrees at Scale: What Does i...
Nelson Baker & Yakut Gazi: Workshop: Affordable Degrees at Scale: What Does i...
 
Growth Strategies for Charter Schools: Start-up, Expansion, Maturity | Califo...
Growth Strategies for Charter Schools: Start-up, Expansion, Maturity | Califo...Growth Strategies for Charter Schools: Start-up, Expansion, Maturity | Califo...
Growth Strategies for Charter Schools: Start-up, Expansion, Maturity | Califo...
 
Sap (higher) education&research s_lc_m_l1-2
Sap (higher) education&research s_lc_m_l1-2Sap (higher) education&research s_lc_m_l1-2
Sap (higher) education&research s_lc_m_l1-2
 
12118
1211812118
12118
 
Prospective Student Web Content Team - University of Edinburgh intro session
Prospective Student Web Content Team - University of Edinburgh intro sessionProspective Student Web Content Team - University of Edinburgh intro session
Prospective Student Web Content Team - University of Edinburgh intro session
 
RIFT@NTLT2013
RIFT@NTLT2013RIFT@NTLT2013
RIFT@NTLT2013
 

More from VMware Tanzu

What AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About ItWhat AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About ItVMware Tanzu
 
Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023VMware Tanzu
 
Enhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at ScaleEnhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at ScaleVMware Tanzu
 
Spring Update | July 2023
Spring Update | July 2023Spring Update | July 2023
Spring Update | July 2023VMware Tanzu
 
Platforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a ProductPlatforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a ProductVMware Tanzu
 
Building Cloud Ready Apps
Building Cloud Ready AppsBuilding Cloud Ready Apps
Building Cloud Ready AppsVMware Tanzu
 
Spring Boot 3 And Beyond
Spring Boot 3 And BeyondSpring Boot 3 And Beyond
Spring Boot 3 And BeyondVMware Tanzu
 
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdfSpring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdfVMware Tanzu
 
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023VMware Tanzu
 
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023VMware Tanzu
 
tanzu_developer_connect.pptx
tanzu_developer_connect.pptxtanzu_developer_connect.pptx
tanzu_developer_connect.pptxVMware Tanzu
 
Tanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - FrenchTanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - FrenchVMware Tanzu
 
Tanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - EnglishTanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - EnglishVMware Tanzu
 
Virtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - EnglishVirtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - EnglishVMware Tanzu
 
Tanzu Developer Connect - French
Tanzu Developer Connect - FrenchTanzu Developer Connect - French
Tanzu Developer Connect - FrenchVMware Tanzu
 
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023VMware Tanzu
 
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring BootSpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring BootVMware Tanzu
 
SpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software EngineerSpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software EngineerVMware Tanzu
 
SpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs PracticeSpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs PracticeVMware Tanzu
 
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense SolutionsSpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense SolutionsVMware Tanzu
 

More from VMware Tanzu (20)

What AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About ItWhat AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About It
 
Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023
 
Enhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at ScaleEnhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at Scale
 
Spring Update | July 2023
Spring Update | July 2023Spring Update | July 2023
Spring Update | July 2023
 
Platforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a ProductPlatforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a Product
 
Building Cloud Ready Apps
Building Cloud Ready AppsBuilding Cloud Ready Apps
Building Cloud Ready Apps
 
Spring Boot 3 And Beyond
Spring Boot 3 And BeyondSpring Boot 3 And Beyond
Spring Boot 3 And Beyond
 
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdfSpring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
 
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
 
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
 
tanzu_developer_connect.pptx
tanzu_developer_connect.pptxtanzu_developer_connect.pptx
tanzu_developer_connect.pptx
 
Tanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - FrenchTanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - French
 
Tanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - EnglishTanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - English
 
Virtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - EnglishVirtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - English
 
Tanzu Developer Connect - French
Tanzu Developer Connect - FrenchTanzu Developer Connect - French
Tanzu Developer Connect - French
 
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
 
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring BootSpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
 
SpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software EngineerSpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software Engineer
 
SpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs PracticeSpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs Practice
 
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense SolutionsSpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
 

Recently uploaded

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 

Recently uploaded (20)

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 

How Data Science is Preventing College Dropouts and Advancing Student Success

  • 1. 1© Copyright 2015 Pivotal. All rights reserved. 1© Copyright 2015 Pivotal. All rights reserved. Srivatsan Ramanujam, Regunathan Radhakrishnan Principal Data Scientists Pivotal Data Science How Data Science is Preventing College Dropouts and Advancing Student Success
  • 2. 2© Copyright 2015 Pivotal. All rights reserved. Agenda  Justifying the cost of college education  Institutional Data Lake  Overview of typical data sources – Structured – Unstructured  Models to predict student success – Predict time-to-graduate – Predict term GPA – Predict course Grade  Operationalizing student success models
  • 3. 3© Copyright 2015 Pivotal. All rights reserved. Justifying the Cost of College Education
  • 4. 4© Copyright 2015 Pivotal. All rights reserved. Educators’ Concerns  Cost of education has been steeply increasing and as a consequence student debt has also been rising steeply  How do we justify the value of college education?  How do we ensure students graduate on time, don’t drop-out, get better jobs?  What are factors which educators can influence to improve student graduation rates?
  • 5. 5© Copyright 2015 Pivotal. All rights reserved. Institutional Data Lake
  • 6. 6© Copyright 2015 Pivotal. All rights reserved. Business Goals  Educators would like to study factors that affect student success  To introduce policies and tools that positively impact student success Institutional Data Lake Admissions data Registration data demographics data Online forums data Blackboard data Card swipes at campus facilities Structured data sources Unstructured data sources Behaviors Student Cluster Analysis Outcomes GPA Prediction Retention Prediction
  • 7. 7© Copyright 2015 Pivotal. All rights reserved. Advantages of Institutional Data Lake  Derive insights from data that help drive institution policies  Deploy data-driven apps that positively impacts student success Institutional Data Lake Admissions data Registration data demographics data Online forums data Blackboard data Card swipes at campus facilities Structured data sources Unstructured data sources Analytics Education research Data-driven apps e.g Predict time-to-grad Predict drop-outs e.g course recommender Term GPA predictor
  • 8. 8© Copyright 2015 Pivotal. All rights reserved. Problem Statement  Given data related to a student’s activity and profile, predict a student’s success ( e.g. time to graduate) – Identify key attributes that influence the time to graduate which will assist institution take action on some of the identified attributes  Our approach: – Create 360-degree view of student’s activity and profile – Apply machine learning methods to predict time to graduate – Interrogate the developed models to understand key factors
  • 9. 9© Copyright 2015 Pivotal. All rights reserved. Data Science Toolkit KEY LANGUAGES P L A T F O R M KEY TOOLS MLlib PL/X ModelingTools VisualizationTools Platform
  • 10. 10© Copyright 2015 Pivotal. All rights reserved. Data Lake Business Levers Apps Pipeline of a Data Science Driven App MLlib PL/X Model Building Model Tuning Continuous Model Improvement Data Feeds Ingest Filter Enrich Sink SpringXD Greenplum
  • 11. 11© Copyright 2015 Pivotal. All rights reserved. • For embarrassingly parallel tasks, we can use procedural languages to easily parallelize any stand-alone library in Java, Python, R, pgSQL or C/C++ • The interpreter/VM of the language ‘X’ is installed on each node of the MPP environment Standby Master … Master Host SQL Interconnect Segment Host Segment Segment Segment Host Segment Segment Segment Host Segment Segment Segment Host Segment Segment Data Parallelism through PL/X: X in Python, R, Java, C/C++ and pgSQL • plpython and python are loaded as dynamic libraries on the master and segment nodes (libpython.so and plpython.so are under $GPHOME/ext/python)
  • 12. 12© Copyright 2015 Pivotal. All rights reserved. MADlib: Scalable, In-Database Machine Learning http://vldb.org/pvldb/vol5/p1700_joehellerstein_vldb2012.pdf
  • 13. 13© Copyright 2015 Pivotal. All rights reserved. Functions Supervised Learning Regression Models • Cox Proportional Hazards Regression • Elastic Net Regularization • Generalized Linear Models • Linear Regression • Logistic Regression • Marginal Effects • Multinomial Regression • Ordinal Regression • Robust Variance, Clustered Variance • Support Vector Machines Tree Methods • Decision Tree • Random Forest Other Methods • Conditional Random Field • Naïve Bayes Unsupervised Learning • Association Rules (Apriori) • Clustering (K-means) • Topic Modeling (LDA) Statistics Descriptive • Cardinality Estimators • Correlation • Summary Inferential • Hypothesis Tests Other Statistics • Probability Functions Other Modules • Conjugate Gradient • Linear Solvers • PMML Export • Random Sampling • Term Frequency for Text Time Series • ARIMA Data Types and Transformations • Array Operations • Dimensionality Reduction (PCA) • Encoding Categorical Variables • Matrix Operations • Matrix Factorization (SVD, Low Rank) • Norms and Distance Functions • Sparse Vectors Model Evaluation • Cross Validation Predictive Analytics Library @ApacheMADlib
  • 14. 14© Copyright 2015 Pivotal. All rights reserved. Overview of Typical Data Sources
  • 15. 15© Copyright 2015 Pivotal. All rights reserved. Typical Data Sources Student 360 application, admission academic activity (e.g courses, GPA) previous education profile demographic profileBlackboard Learn activity network activity Card swipes activity
  • 16. 16© Copyright 2015 Pivotal. All rights reserved. Student 360 application, admission academic activity (e.g courses, GPA) previous education profile demographic profileBlackboard learn activity network activity card swipes activity
  • 17. 17© Copyright 2015 Pivotal. All rights reserved. Admit Rate and Yield Rate  Admit_rate = num admitted/ num applications Yield rate = num enrolled/ num admitted
  • 18. 18© Copyright 2015 Pivotal. All rights reserved. Percentage of Foreign Applicants Increases, while Percentage of Residents Decreases
  • 19. 19© Copyright 2015 Pivotal. All rights reserved. Student 360 application, admission academic activity (e.g courses, GPA) previous education profile demographic profileBlackboard learn activity network activity card swipes activity
  • 20. 20© Copyright 2015 Pivotal. All rights reserved. When do Students Take Internships?
  • 21. 21© Copyright 2015 Pivotal. All rights reserved. Changes in Enrollment Info: College, Major, Program How many changes? when? • Late grads more likely to change college, major or program and change later in their student career
  • 22. 22© Copyright 2015 Pivotal. All rights reserved. Student 360 application, admission academic activity (e.g courses, GPA) previous education profile demographic profileBlackboard learn activity network activity card swipes activity
  • 23. 23© Copyright 2015 Pivotal. All rights reserved. High School GPA, Rank
  • 24. 24© Copyright 2015 Pivotal. All rights reserved. Student 360 application, admission academic activity (e.g courses, GPA) previous education profile demographic profileBlackboard learn activity network activity card swipes activity
  • 25. 25© Copyright 2015 Pivotal. All rights reserved. Percentage of Assignments Submitted/Course/Term • Number of assignments submitted per course/ Actual number of assignments for that course • Average this ratio across courses taken in each Academic period • Students who drop out are less likely to submit all assignments in a course
  • 26. 26© Copyright 2015 Pivotal. All rights reserved. How Quickly do Students Submit Assignments? • Calculate time diff between first_submission and student’s submission • Average across courses and assignments per academic period • Time converted in terms of hours
  • 27. 27© Copyright 2015 Pivotal. All rights reserved. Participation Ratio Participation ratio = number of posted messages/number of viewed messages • Number of viewed messages always greater than number of posted messages • Normal students view more messages than they post
  • 28. 28© Copyright 2015 Pivotal. All rights reserved. Popularity of Messages by Posters • Median viewer count largest for normal grads • Msg_hit_count is not the highest for normal grads
  • 29. 29© Copyright 2015 Pivotal. All rights reserved. Student 360 application, admission academic activity (e.g courses, GPA) previous education profile demographic profileBlackboard learn activity network activity card swipes activity
  • 30. 30© Copyright 2015 Pivotal. All rights reserved. Network Activity – Usage by Class Label • Incomplete and late grads tend to use the network more
  • 31. 31© Copyright 2015 Pivotal. All rights reserved. Student 360 application, admission academic activity (e.g courses, GPA) previous education profile demographic profileBlackboard learn activity network activity card swipes activity
  • 32. 32© Copyright 2015 Pivotal. All rights reserved. Students’ network logs and card activity Card Swipes – Laundry and Dining Facility Use • Late grads are more likely to use laundry facilities while classes are in session • Late grads also tend to have breakfast, lunch and dinner later than on-time grads
  • 33. 33© Copyright 2015 Pivotal. All rights reserved. Models Predict Time-to-Graduate Predict Term GPA Predict Course Grade
  • 34. 34© Copyright 2015 Pivotal. All rights reserved. Modeling Approach: Two Types of Features Fall 2008 Spring 2009 Summer 2009 Fall 2008 cohort joins Extract features until end of term Perform Modeling & Scoring Whether or not a student will graduate on time Extract features until end of term Perform Modeling & Scoring Whether or not a student will graduate on time Static features • Remain the same irrespective of time window • E.g gender, admission attribute, sat score etc Time sensitive features • Depends on time window (w1,w2) • E.g course activity, bblearn activity etc w1 w2
  • 35. 35© Copyright 2015 Pivotal. All rights reserved. Modeling – Algorithms and Libraries • Logistic Regression (MADlib)  XGBoost (https://github.com/dmlc/xgboost) • AdaBoost (Scikit-Learn) • RandomForest (Scikit-Learn) • Extracted three categories of features from all data sources - Static ~ 50 features, time sensitive ~ 115 features - Card Swipes, Network Logs ~ 110 features • Built models for three different problems - Time-to-graduate - Course grade - Term GPA Algorithms and Libraries
  • 36. 36© Copyright 2015 Pivotal. All rights reserved. Operationalization
  • 37. 37© Copyright 2015 Pivotal. All rights reserved. Operationalization Pipeline Refreshed data (incoming daily/weekly/monthly updates) feature gen. pipeline Static features Static + time-sensitive LMS features Static + time-sensitive LMS + network + card logs features In-database parallel grid-search (XGBoost) MADlib Logistic Regression Sklearn AdaBoost Sklearn RandomForest Model selection Serialize to disk Structured, unstructured data sources in data lake scoring results • Student ID • Feature names, values, importance scores • Predictions (late, normal, dropped) User-notification on smartphone app Cleared by Data Scientist Modeling pipeline
  • 38. 38© Copyright 2015 Pivotal. All rights reserved. In-Database Parallel Grid Search Using https://github.com/vatsan/gp_xgboost_gridsearch • XGBoost (eXtreme Gradient Boosting) is a popular library used in many prize winning Kaggle contests. • Implemented in C++ with Python and R bindings • Supports multi-core • Implemented in-database parallel grid-search for XGBoost using PL/Python
  • 39. 39© Copyright 2015 Pivotal. All rights reserved. In-Database Grid Search – Approach https://github.com/vatsan/gp_xgboost_gridsearch Refreshed data (incoming daily/weekly/monthly updates) feature gen. pipeline training dataset (distributed table) Model selection structured, unstructured data sources scored results grid search params dict Grid params table (expanded) master segments param-list-1 param-list-n. . . training set(serialized) training set(serialized) Driver function (PL/Python) pickle and distribute mdl-1 mdl-n. . .
  • 40. 40© Copyright 2015 Pivotal. All rights reserved. Model Training and Scoring: XGBoost https://github.com/vatsan/gp_xgboost_gridsearch Training Scoring
  • 41. 41© Copyright 2015 Pivotal. All rights reserved. Conclusions  We built an institutional data lake for our customer, to serve as a platform for education research and data-driven apps for student success  Reviewed typical data sources in an institutional data lake – Interesting features that make up student 360  Using open source, scalable data science toolkits we built three different models for predicting student success  We set-up a pipeline for operationalizing our models, to be consumed by a data driven smartphone app
  • 42. 42© Copyright 2015 Pivotal. All rights reserved. Pivotal Data Science Blogs 1. Scaling native (C++) apps on Pivotal MPP 2. Predicting commodity futures through Tweets 3. A pipeline for distributed topic & sentiment analysis of tweets on Greenplum 4. Using data science to predict TV viewer behavior 5. Twitter NLP: Scaling part-of-speech tagging 6. Distributed deep learning on MPP and Hadoop 7. Multi-variate time series forecasting 8. Pivotal for good – Crisis Textline http://blog.pivotal.io/data-science-pivotal
  • 43. 43© 2015 Pivotal Software, Inc. All rights reserved.