Microsoft Global AI Bootcamp
Best practices in building machine learning models in Azure ML
Zeydy Ortiz, Ph. D.
zortiz @ datacrunchlab.com
www.linkedin.com/in/zortiz
@DrZeydy @DataCrunch_Lab
DataCrunch
Lab
About me
Zeydy Ortiz
Data Scientist
Computer Engineer
Computer Science
Performance Engineer
#GlobalAIBootcamp
@DataCrunch_Lab
Founded in 2016, ICMM is a nonprofit research-driven agency based in Raleigh, NC
• Mission: To create a sustainable financial future for consumers
• CEO: Dr. Diane Chen
• Research Fellow: Patrick Royal
Research Project: Create a Machine Learning system to help credit counseling agencies (CCA) retain consumers enrolled in debt management plans (DMP)
Agenda
• AI & ML, what’s the relationship?
• About Azure ML
• ML Case Study (with examples from Gallery)
AI & ML, what’s the relationship?
Source: https://blogs.nvidia.com/blog/2016/07/29/whats-difference-artificial-intelligence-machine-learning-deep-learning-ai/
ML/AI is currently being used in many sectors and business functions
Sectors: Retail, Healthcare, Financial, Industrial, Education, Pharmaceutical, Real Estate, Transportation, Advertising, Manufacturing, Legal, Utilities
Business functions: Marketing, Sales, Customer Experience, Human Resources
Use cases of ML/AI in business
Search
Sales lead scoring
Demand forecasting
Predictive maintenance
Fraud detection & prevention
Advertisement placement
Capacity planning
Dynamic pricing
Route planning
Increased revenue
Increased efficiency
Reduced cost
Increased customer satisfaction
Microsoft Azure Machine Learning Studio
studio.azureml.net
- Easy drag-and-drop
- Extensible
- Multiple deployment options
Case Study
ML for customer retention in DMP programs
Debt Management Plans (DMP)
“A debt management plan sets up a payment schedule for you to repay your debts, with the goal of helping creditors receive the money owed to them and ultimately improving your financial and credit standing.”
“It usually takes 3-5 years to complete payments under a debt management program, after which you may be able to reestablish credit.”
From National Foundation for Credit Counseling – www.nfcc.org
Photo by Francisco T Santos on Unsplash
Why use Machine Learning?
Historical data is already being used by credit counseling agencies.
However, they are currently not able to provide personalized service.
Photo by chuttersnap on Unsplash
What problem are we solving?
Organization’s Challenge: Improve customer retention in the DMP program
ML Problem
• Clustering
• Classification
• Regression
• Recommender System
First step: Identify how long a new consumer is expected to stay in the DMP program
What data is available?
• Demographic
• Financial
• Program
What is known at enrollment time?
Photo by Mika Baumeister on Unsplash
Data is messy
• Errors in data entry
• Calculation errors
• Outliers
• Many categories
Examples:
Sex: F, W, female, california
Age: -1, 104
Debt: $1,765,234
Referral: Yahoo, Web, Organic
Consult with a subject matter expert to incorporate context and determine what is reasonable
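Problems like the examples above can be caught with simple validation rules before modeling. The following is a minimal sketch, not the project's actual cleaning logic. The field names (`sex`, `age`, `debt`), the category mapping, and the limits are all assumptions for illustration; a subject matter expert would decide the real ones.

```python
# Hypothetical mapping from raw entries to canonical categories (assumption).
SEX_MAP = {"f": "F", "female": "F", "m": "M", "male": "M"}
AGE_RANGE = (18, 100)   # assumed plausible age range
MAX_DEBT = 500_000      # assumed ceiling; larger values are flagged, not dropped


def clean_record(record):
    """Return (cleaned_record, issues) for one enrollment record."""
    cleaned = dict(record)
    issues = []

    # Normalize the category; anything unrecognized becomes missing.
    raw_sex = record.get("sex")
    key = str(raw_sex).strip().lower()
    if key in SEX_MAP:
        cleaned["sex"] = SEX_MAP[key]
    else:
        cleaned["sex"] = None
        issues.append("sex: unrecognized value %r" % (raw_sex,))

    # Out-of-range ages are treated as missing.
    age = record.get("age")
    if age is None or not (AGE_RANGE[0] <= age <= AGE_RANGE[1]):
        cleaned["age"] = None
        issues.append("age: out of range %r" % (age,))

    # Extreme debt values are kept but flagged for expert review.
    debt = record.get("debt")
    if debt is None or debt < 0 or debt > MAX_DEBT:
        issues.append("debt: needs review %r" % (debt,))

    return cleaned, issues
```

Ambiguous entries such as "W" are deliberately left unmapped here, so they surface as issues instead of being guessed.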
Cleaning Data Checklist
• Fields not known at enrollment time
• Missing values
• Fields with many zeros
• Fields with near-zero variance
• Highly correlated fields
• Outliers
• Categorical fields with many different values
• Data leakage
Identify and determine how to treat these fields or values
- Ignore
- Substitute
- Remove
- Transform
- Consolidate
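Two items on the checklist, near-zero variance and highly correlated fields, are easy to screen for automatically. The sketch below assumes numeric columns held in a plain dictionary. The function names and both thresholds are illustrative assumptions, and the variance threshold is scale dependent, so it would need tuning per field.

```python
from math import sqrt
from statistics import mean, pvariance


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        return 0.0  # a constant column has no defined correlation
    return cov / (sx * sy)


def screen_fields(columns, var_threshold=1e-8, corr_threshold=0.95):
    """Return (low_variance_fields, highly_correlated_pairs)."""
    low_variance = [name for name, values in columns.items()
                    if pvariance(values) <= var_threshold]
    remaining = [name for name in columns if name not in low_variance]
    correlated = []
    for i, a in enumerate(remaining):
        for b in remaining[i + 1:]:
            if abs(pearson(columns[a], columns[b])) >= corr_threshold:
                correlated.append((a, b))
    return low_variance, correlated
```

The output is only a list of candidates; whether to ignore, remove, or consolidate each one remains a judgment call.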
Incorporating best practices in ML
[Bar chart of Mean Absolute Error: raw data 32, processed data 12, best algorithm 7]
“The No Free Lunch (NFL) theorem states that there is no [machine learning] model that works best for every problem.”
- Eric Cai
Based on work by David H. Wolpert, “The Lack of A Priori Distinctions between Learning Algorithms”, 1996
Machine Learning Modules
Azure ML provides many built-in models
Can be extended with R & Python
Documentation
Experiment in Gallery
Predicting Median House Values
Understand the assumptions behind the algorithms
Linear regression: predict numeric target
• House sales price
• Energy use
• Taxi fare
Poisson regression: predict count data
• # calls received in a call center
• # patients arriving in ER
• # months in program
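The difference in assumptions can be written out explicitly. In their usual textbook formulations (the symbols below are standard notation, not taken from the slides), the two models are:

```latex
% Linear regression: real-valued target with additive Gaussian noise
y = \beta_0 + \beta^\top x + \varepsilon, \qquad \varepsilon \sim \mathcal{N}(0, \sigma^2)

% Poisson regression: non-negative count target with a log link
y \mid x \sim \mathrm{Poisson}\big(\lambda(x)\big), \qquad \log \lambda(x) = \beta_0 + \beta^\top x
```

Under the Poisson model the predicted mean $\lambda(x) = \exp(\beta_0 + \beta^\top x)$ is always positive and the variance equals the mean, which suits a count such as months in program. A linear model can predict negative values for the same target.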
Assessing performance of algorithms
Azure ML Studio provides modules to
• Split Data
• Partition and Sample
• Cross Validate Model
• Tune Model Hyperparameters
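To make the idea behind cross-validation concrete, here is a minimal sketch of how k-fold splitting partitions the rows. It only illustrates the general technique and does not show how the Studio modules are implemented; the function name and parameters are hypothetical.

```python
import random


def k_fold_indices(n_rows, k=5, seed=0):
    """Yield (train_indices, test_indices) for each of k folds.

    Every row lands in exactly one test fold, so each row is used
    for evaluation once and for training k - 1 times.
    """
    if not 2 <= k <= n_rows:
        raise ValueError("k must be between 2 and n_rows")
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)          # fixed seed for repeatability
    folds = [indices[i::k] for i in range(k)]     # k roughly equal slices
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test
```

Averaging the error over the k test folds gives a more stable estimate than a single split, at the cost of training the model k times.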
This is where Azure AutoML can help
Which model is best for this data set?
Use test data set to assess performance
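A small sketch of that comparison, scored with mean absolute error (the metric in the chart earlier in the deck). The two candidate models, a constant baseline and a one-feature least-squares line, are stand-ins chosen for brevity, and all function names are hypothetical.

```python
def fit_mean(xs, ys):
    """Baseline: always predict the training mean."""
    avg = sum(ys) / len(ys)
    return lambda x: avg


def fit_line(xs, ys):
    """Ordinary least squares with a single feature."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx if sxx else 0.0
    intercept = my - slope * mx
    return lambda x: intercept + slope * x


def mean_absolute_error(model, xs, ys):
    return sum(abs(model(x) - y) for x, y in zip(xs, ys)) / len(ys)


def compare_on_test(fitters, train, test):
    """Fit each candidate on train only, then score it on the held-out test set."""
    train_x, train_y = train
    test_x, test_y = test
    scores = {name: mean_absolute_error(fit(train_x, train_y), test_x, test_y)
              for name, fit in fitters.items()}
    best = min(scores, key=scores.get)
    return best, scores
```

The test rows must play no part in fitting or tuning; otherwise the reported error is optimistic.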
What is the model using to make predictions?
Does it make sense?
Should we use these fields?
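One common way to answer these questions is permutation importance: shuffle one field at a time and measure how much the error grows. The sketch below assumes rows are tuples of feature values and `predict` is any already trained model wrapped as a function; the names are hypothetical and this is not necessarily the method used in the case study.

```python
import random


def permutation_importance(predict, rows, targets, n_features, seed=0):
    """Return the increase in mean absolute error when each feature is shuffled."""
    def mae(data):
        return sum(abs(predict(r) - t) for r, t in zip(data, targets)) / len(targets)

    baseline = mae(rows)
    rng = random.Random(seed)
    importances = []
    for j in range(n_features):
        column = [r[j] for r in rows]
        rng.shuffle(column)  # break the link between feature j and the target
        shuffled = [r[:j] + (v,) + r[j + 1:] for r, v in zip(rows, column)]
        importances.append(mae(shuffled) - baseline)
    return importances
```

A field with a large importance that would not be known at enrollment time, or that encodes the outcome itself, is a sign of data leakage and should be questioned.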
“Start with the end in mind”
Deploying the algorithm requires coordination with the organization
Options: Web service (API), Batch, Local
Photo by Matt Lamers on Unsplash
Key takeaways
• Follow industry best practices
• The ML problem is not the organization’s problem
• Yes, clean your data
• Compare multiple algorithms
• Be skeptical of your models
• Consider your options for deployment
Team capabilities
• Data science consulting
• Custom software development
• Machine Learning, Artificial Intelligence, and Cognitive technologies
• Big data & IoT Solutions
Innovation Awards
Grand Prize Winner
Highest Potential Value to Manufacturers
Thank you!
Zeydy Ortiz, Ph. D.
zortiz @ datacrunchlab.com
www.linkedin.com/in/zortiz
@DrZeydy @DataCrunch_Lab
