Data Science and Business Analysis: A Look at Best Practices for Roles, Skills, and Processes
Bob E. Hayes, PhD
bob@appuri.com
@bobehayes
Bob E. Hayes, PhD
Chief Research Officer
Email: bob@appuri.com
Web: www.appuri.com
Twitter: @bobehayes
• Author of three books on customer experience management
and analytics
• PhD in industrial-organizational psychology
• #1 blogger overall on CustomerThink
(http://customerthink.com/author/bobehayes/)
• #1 blogger on the topic of customer analytics
(http://customerthink.com/top-authors-category/)
• Top expert in Big Data and Data Science
• https://www.maptive.com/the-top-100-big-data-experts/
• http://www.kdnuggets.com/2015/02/top-big-data-influencers-brands.html
3
What is Data Science?
Data science is a way of extracting insights from data using the
powers of computer science and statistics applied to data from a
specific field of study
Involves the collection, analysis and interpretation of data to
extract empirically-based insights that augment and enhance
human decisions and algorithms
4
Data Science Study
Invited data professionals via:
• AnalyticsWeek Newsletter
• Blog post
• Social media (Twitter, LinkedIn, Google+)
600+ completed surveys
• Self-assessment rating of proficiency of 25 skills across five skill areas:
• Business, Technology, Programming, Math & Modeling, Statistics
• 9 additional questions
• Overall satisfaction with outcome of analytics projects
5
Data Science Skills Assessed
Area Skills*
Business
1. Product design and development
2. Project management
3. Business development
4. Budgeting
5. Governance & Compliance (e.g., security)
Technology
6. Managing unstructured data (e.g., NoSQL)
7. Managing structured data (e.g., SQL, JSON, XML)
8. Natural Language Processing (NLP) and text mining
9. Machine Learning (e.g., decision trees, neural nets, Support Vector Machine, clustering)
10. Big and Distributed Data (e.g., Hadoop, Map/Reduce, Spark)
Math & Modeling
11. Optimization (e.g., linear, integer, convex, global)
12. Math (e.g., linear algebra, real analysis, calculus)
13. Graphical Models (e.g., social networks)
14. Algorithms (e.g., computational complexity, Computer Science theory) and Simulations (e.g., discrete, agent-based, continuous)
15. Bayesian Statistics (e.g., Markov Chain Monte Carlo)
Programming
16. Systems Administration (e.g., UNIX) and Design
17. Database Administration (e.g., MySQL, NoSQL)
18. Cloud Management
19. Back-End Programming (e.g., Java/Rails/Objective C)
20. Front-End Programming (e.g., JavaScript, HTML, CSS)
Statistics
21. Data Management (e.g., recoding, de-duplicating, integrating disparate data sources, web scraping)
22. Data Mining (e.g., R, Python, SPSS, SAS) and Visualization (e.g., graphics, mapping, web-based data visualization) tools
23. Statistics and statistical modeling (e.g., general linear model, ANOVA, MANOVA, Spatio-temporal, Geographical Information System (GIS))
24. Science/Scientific Method (e.g., experimental design, research design)
25. Communication (e.g., sharing results, writing/publishing, presentations, blogging)
* List of skills adapted from Analyzing the Analyzers by Harlan D. Harris, Sean Patrick Murphy and Marck Vaisman
6
Proficiency Ratings*
Proficiency Level (Scale Value): Description
Don't know (0): You possess no knowledge.
Fundamental Awareness (20): You have a common knowledge or an understanding of basic techniques and concepts.
Novice (40): You have the level of experience gained in a classroom and/or experimental scenarios or as a trainee on-the-job. You are expected to need help when performing this skill.
Intermediate (60): You are able to successfully complete tasks in this competency as requested. Help from an expert may be required from time to time, but you can usually perform the skill independently.
Advanced (80): You can perform the actions associated with this skill without assistance. You are certainly recognized within your immediate organization as "a person to ask" when difficult questions arise regarding this skill.
Expert (100): You are known as an expert in this area. You can provide guidance, troubleshoot and answer questions related to this area of expertise and the field where the skill is used.
* Rating scale is based on a proficiency rating scale used by NIH. Respondent instructions: You will be asked about your proficiency for a
variety of skills. Please use the following scale when indicating your level of proficiency for each skill.
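As a rough illustration of how ratings on this scale could be turned into numbers, the sketch below maps each proficiency label to its scale value and averages the values for one skill area. The respondent data is invented, the function names are hypothetical, and the slides do not state that area scores were computed as simple averages; that is an assumption made here.

```python
from statistics import mean

# Scale values taken from the proficiency table above.
PROFICIENCY_SCALE = {
    "Don't know": 0,
    "Fundamental Awareness": 20,
    "Novice": 40,
    "Intermediate": 60,
    "Advanced": 80,
    "Expert": 100,
}


def score_responses(responses):
    """Convert {skill: proficiency label} into {skill: numeric scale value}."""
    return {skill: PROFICIENCY_SCALE[level] for skill, level in responses.items()}


def area_average(scores, skills_in_area):
    """Average the numeric scores of the skills in one area (assumed scoring rule)."""
    return mean(scores[skill] for skill in skills_in_area)


# Hypothetical respondent, invented for illustration only.
example_responses = {
    "Project management": "Advanced",
    "Budgeting": "Novice",
    "Business development": "Intermediate",
}
example_scores = score_responses(example_responses)
business_average = area_average(example_scores, list(example_responses))
print(example_scores, business_average)
```

Keeping the label-to-value mapping in one dictionary makes it easy to swap in a different scale without touching the scoring functions.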
7
Sample
8
Proficiency varies across skills
Top 10 Data Science Skills
1. Communication
2. Managing structured data
3. Data mining and visualization tools
4. Science / Scientific method
5. Math
6. Project management
7. Data management
8. Statistics and statistical modeling
9. Product design and development
10. Business development
9
Job Roles in Data Science
*Researcher (e.g., researcher, scientist, statistician); Business Management (e.g., leader, business person, entrepreneur); Creative
(e.g., jack of all trades, artist, hacker); Developer (e.g., developer, engineer)
10
Proficiency in 25 skills varies by job role
• Different types of data scientists possess different skills
• Business Management – strong in business skills
• Developer – strong in technology/programming skills
• Researcher – strong in math/statistics skills
• Creative – average in all skills
11
Structure of Data Science Skills
* Factor analysis is based on proficiency ratings of 621 data professionals. Reliability estimates (Cronbach’s alpha) for each of the three skill areas (based on items that loaded on the respective factors) were: .87 (Business); .92 (Tech / Prog); .92 (Math / Stats)
Factor Analysis of Data Skills
• Data reduction technique
• Examines the statistical relationships (e.g.,
correlations) among a large set of variables and
tries to explain these correlations using a smaller
number of variables (factors)
• Elements (or factor loadings) of the factor pattern
matrix represent the strength of relationship
between the variables and each of the underlying
factors
• Tells us two things:
1. number of underlying factors that
describe the initial set of variables
2. which variables are best represented by
each factor
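The idea of factor loadings can be sketched in a few lines of plain Python. The example below is not the analysis used in the study: it uses unrotated principal-component extraction via power iteration as a simple stand-in for factor analysis, the five-skill correlation matrix is invented, and the number of factors is fixed at two by assumption. It shows how each variable ends up assigned to the factor on which it loads most strongly.

```python
import math

# Hypothetical correlation matrix for five skills, invented for illustration.
SKILLS = ["Project management", "Business development", "Budgeting",
          "Math", "Statistics and statistical modeling"]
R = [
    [1.0, 0.6, 0.6, 0.1, 0.1],
    [0.6, 1.0, 0.6, 0.1, 0.1],
    [0.6, 0.6, 1.0, 0.1, 0.1],
    [0.1, 0.1, 0.1, 1.0, 0.5],
    [0.1, 0.1, 0.1, 0.5, 1.0],
]


def mat_vec(matrix, vector):
    """Multiply a square matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def top_eigenpair(matrix, iterations=500):
    """Largest eigenvalue and its unit eigenvector via power iteration."""
    vector = [1.0] * len(matrix)
    for _ in range(iterations):
        product = mat_vec(matrix, vector)
        norm = math.sqrt(sum(x * x for x in product))
        vector = [x / norm for x in product]
    eigenvalue = sum(v * p for v, p in zip(vector, mat_vec(matrix, vector)))
    return eigenvalue, vector


def extract_loadings(corr, n_factors):
    """Unrotated principal-component loadings: eigenvector * sqrt(eigenvalue)."""
    size = len(corr)
    residual = [row[:] for row in corr]
    per_factor = []
    for _ in range(n_factors):
        eigenvalue, vector = top_eigenpair(residual)
        per_factor.append([v * math.sqrt(eigenvalue) for v in vector])
        # Deflate: subtract the part of the matrix explained by this factor.
        residual = [[residual[i][j] - eigenvalue * vector[i] * vector[j]
                     for j in range(size)] for i in range(size)]
    # Transpose so each row holds one variable's loadings on all factors.
    return [list(column) for column in zip(*per_factor)]


loadings = extract_loadings(R, n_factors=2)
# Assign each skill to the factor with the largest absolute loading.
assignment = [max(range(2), key=lambda f: abs(row[f])) for row in loadings]
for skill, row, factor in zip(SKILLS, loadings, assignment):
    print(f"{skill:38s} {row[0]:6.2f} {row[1]:6.2f} -> factor {factor + 1}")
```

With this made-up matrix the three business skills group on the first factor and the two quantitative skills on the second. A real analysis would typically also choose the number of factors from the data and rotate the solution before interpreting it.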
12
Structure of Data Science Skills
Plot the factor loadings
for the 25 data skills into
a 3-dimensional space
Three Distinct Skill Sets
• Business
• Technology / Programming
• Math / Statistics
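The reliability figures quoted for the three skill areas are Cronbach's alpha coefficients. As a minimal sketch of how such a coefficient is computed, the code below applies the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), to a small table of ratings. The ratings are invented for illustration and the function name is hypothetical; they are not data from the study.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a list of respondents, each a list of k item ratings."""
    k = len(item_scores[0])
    items = list(zip(*item_scores))  # one tuple of ratings per item
    sum_item_variances = sum(variance(item) for item in items)
    total_variance = variance([sum(person) for person in item_scores])
    return (k / (k - 1)) * (1 - sum_item_variances / total_variance)


# Hypothetical ratings on the 0-100 scale: five respondents, three business skills.
ratings = [
    [20, 40, 20],
    [40, 40, 60],
    [60, 60, 60],
    [80, 60, 80],
    [100, 80, 80],
]
alpha = cronbach_alpha(ratings)
print(round(alpha, 2))
```

Alpha rises when the items in a skill area move together across respondents, which is why it is used as a check that the items loading on one factor form a consistent scale.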
13
The Structure of Data Science Skills
14
Proficiency in general skill areas varies by job role
15
Business Skills: Proficiency varies by job role
*Researcher (e.g., researcher, scientist, statistician) n = 133; Business Management (e.g., leader, business person, entrepreneur) n = 86;
Creative (e.g., jack of all trades, artist, hacker) n = 30; Developer (e.g., developer, engineer) n = 54
16
Technology and Math/Statistics Skills: Proficiency varies by job role
*Researcher (e.g., researcher, scientist, statistician) n = 133; Business Management (e.g., leader, business person,
entrepreneur) n = 86; Creative (e.g., jack of all trades, artist, hacker) n = 30; Developer (e.g., developer, engineer) n = 54
17
Top Data Science Skills by Job Role
18
Satisfaction with Work Outcome
*Researcher (e.g., researcher, scientist, statistician); Business Management (e.g., leader, business person,
entrepreneur); Creative (e.g., jack of all trades, artist, hacker); Developer (e.g., developer, engineer)
19
In Search of the Data Scientist Unicorn
20
Data Science as a Team Sport
Impact of Business Expert
21
Data Science as a Team Sport
Impact of Technology / Programming Expert
22
Data Science as a Team Sport
Impact of Math & Modeling / Statistics Expert
23
Getting Insight from Data: The Scientific Method
1. Formulate questions
2. Generate hypothesis / hunch
3. Gather / generate data
4. Analyze data / test hypothesis
5. Take action / communicate results
• Start with a problem statement.
• What are your hunches / hypotheses?
• Be sure your hypotheses are testable.
• You can use an experimental or observational approach to analyzing data.
• Integrate your data silos to ask bigger questions; connect the dots and get a 360-degree view of your customers.
• Employ predictive analytics / inferential statistics to test hypotheses.
• Employ machine learning to quickly surface insights.
• Implement your findings.
• Use prescriptive analytics to guide your course of action.
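The "test hypothesis" step can be made concrete with a small inferential test. The sketch below runs an exact two-sided permutation test on the difference in mean satisfaction between two groups of projects. The hypothesis, the group labels, and every number are invented for illustration; the slides do not say which tests the study used.

```python
from itertools import combinations
from statistics import mean


def permutation_test(group_a, group_b):
    """Exact two-sided permutation test on the difference in group means."""
    pooled = group_a + group_b
    observed = abs(mean(group_a) - mean(group_b))
    extreme = 0
    total = 0
    # Enumerate every way of relabeling the pooled values into two groups.
    for indices in combinations(range(len(pooled)), len(group_a)):
        chosen = set(indices)
        first = [pooled[i] for i in chosen]
        second = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Small tolerance guards against floating-point rounding in the means.
        if abs(mean(first) - mean(second)) >= observed - 1e-9:
            extreme += 1
    return extreme / total


# Hypothetical satisfaction ratings (0-100) for projects with and without
# a business expert on the team.
with_expert = [80, 85, 90, 75, 95]
without_expert = [60, 65, 70, 55, 50]
p_value = permutation_test(with_expert, without_expert)
print(f"p = {p_value:.4f}")
```

A small p-value means the observed gap would be unusual if team composition made no difference. Exhaustive enumeration is only practical for small samples; larger ones would need random resampling or a parametric test.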
24
Scientific Method and Data Science Skills
25
What skills are linked to project success?
26
Importance of Data Science Skills by Job Role
27
Education and Data Science Skills
28
Lack of Gender Diversity
29
Lack of Gender Diversity – Other Science Roles
30
Job Roles in Data Science by Gender
31
Highest Level of Education Attained
32
Gender Comparison of Proficiency across Skills
33
Advice for Data Scientists
• Be specific when talking about “data scientists”
• There are different types – defined by what they do and the skills they possess
• Work with other data professionals who have complementary skills.
Teamwork is key to successful data science projects.
• Learn to use data mining and visualization tools
• R, Python, SPSS, SAS, graphics, mapping, web-based data visualization
• Be an advocate for women in the field of data science

Mais conteúdo relacionado

Mais procurados

Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptxSadhanaParameswaran
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientistVijayMohan Vasu
 
Intro to Data Science Big Data
Intro to Data Science Big DataIntro to Data Science Big Data
Intro to Data Science Big DataIndu Khemchandani
 
Using Machine Learning to Optimize COVID-19 Predictions
Using Machine Learning to Optimize COVID-19 PredictionsUsing Machine Learning to Optimize COVID-19 Predictions
Using Machine Learning to Optimize COVID-19 PredictionsDatabricks
 
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...Edureka!
 
1.8 discretization
1.8 discretization1.8 discretization
1.8 discretizationKrish_ver2
 
Cross validation
Cross validationCross validation
Cross validationRidhaAfrawe
 
Exploratory Data Analysis - A Comprehensive Guide to EDA.pdf
Exploratory Data Analysis - A Comprehensive Guide to EDA.pdfExploratory Data Analysis - A Comprehensive Guide to EDA.pdf
Exploratory Data Analysis - A Comprehensive Guide to EDA.pdfJamieDornan2
 
The Evolution of Data Science
The Evolution of Data ScienceThe Evolution of Data Science
The Evolution of Data ScienceKenny Daniel
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadhMithlesh Sadh
 
Workshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate LevelWorkshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate LevelHiram Ting
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning Gopal Sakarkar
 

Mais procurados (20)

Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptx
 
Lecture #01
Lecture #01Lecture #01
Lecture #01
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientist
 
Intro to Data Science Big Data
Intro to Data Science Big DataIntro to Data Science Big Data
Intro to Data Science Big Data
 
Data science
Data scienceData science
Data science
 
Using Machine Learning to Optimize COVID-19 Predictions
Using Machine Learning to Optimize COVID-19 PredictionsUsing Machine Learning to Optimize COVID-19 Predictions
Using Machine Learning to Optimize COVID-19 Predictions
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
 
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
 
1.8 discretization
1.8 discretization1.8 discretization
1.8 discretization
 
Big data
Big dataBig data
Big data
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Cross validation
Cross validationCross validation
Cross validation
 
Exploratory Data Analysis - A Comprehensive Guide to EDA.pdf
Exploratory Data Analysis - A Comprehensive Guide to EDA.pdfExploratory Data Analysis - A Comprehensive Guide to EDA.pdf
Exploratory Data Analysis - A Comprehensive Guide to EDA.pdf
 
The Evolution of Data Science
The Evolution of Data ScienceThe Evolution of Data Science
The Evolution of Data Science
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
 
Research methodology
Research methodology Research methodology
Research methodology
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
 
Workshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate LevelWorkshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate Level
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning
 

Destaque

Focus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL CodeFocus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL CodeDATAVERSITY
 
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DI&A Slides: Descriptive, Prescriptive, and Predictive AnalyticsDI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DI&A Slides: Descriptive, Prescriptive, and Predictive AnalyticsDATAVERSITY
 
RWDG Webinar: The New Non-Invasive Data Governance Framework
RWDG Webinar: The New Non-Invasive Data Governance FrameworkRWDG Webinar: The New Non-Invasive Data Governance Framework
RWDG Webinar: The New Non-Invasive Data Governance FrameworkDATAVERSITY
 
LDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceLDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceDATAVERSITY
 
The Importance of MDM - Eternal Management of the Data Mind
The Importance of MDM - Eternal Management of the Data MindThe Importance of MDM - Eternal Management of the Data Mind
The Importance of MDM - Eternal Management of the Data MindDATAVERSITY
 
RWDG Slides: Apply Data Governance to Agile Efforts
RWDG Slides: Apply Data Governance to Agile EffortsRWDG Slides: Apply Data Governance to Agile Efforts
RWDG Slides: Apply Data Governance to Agile EffortsDATAVERSITY
 
Smart Data Webinar: Artificial General Intelligence - When Can I Get It?
Smart Data Webinar: Artificial General Intelligence - When Can I Get It?Smart Data Webinar: Artificial General Intelligence - When Can I Get It?
Smart Data Webinar: Artificial General Intelligence - When Can I Get It?DATAVERSITY
 
RWDG Slides: Three Approaches to Data Stewardship
RWDG Slides: Three Approaches to Data StewardshipRWDG Slides: Three Approaches to Data Stewardship
RWDG Slides: Three Approaches to Data StewardshipDATAVERSITY
 
DI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDATAVERSITY
 
Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...
Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...
Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...DATAVERSITY
 
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data GardenData-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data GardenDATAVERSITY
 
Enterprise Data World Webinar: How to Get Your MDM Program Up & Running
Enterprise Data World Webinar: How to Get Your MDM Program Up & RunningEnterprise Data World Webinar: How to Get Your MDM Program Up & Running
Enterprise Data World Webinar: How to Get Your MDM Program Up & RunningDATAVERSITY
 
Visual Design with Data
Visual Design with DataVisual Design with Data
Visual Design with DataSeth Familian
 
LDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
LDM Slides: How Data Modeling Fits into an Overall Enterprise ArchitectureLDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
LDM Slides: How Data Modeling Fits into an Overall Enterprise ArchitectureDATAVERSITY
 
Yosemite Project - Part 3 - Transformations for Integrating VA data with FHIR...
Yosemite Project - Part 3 - Transformations for Integrating VA data with FHIR...Yosemite Project - Part 3 - Transformations for Integrating VA data with FHIR...
Yosemite Project - Part 3 - Transformations for Integrating VA data with FHIR...DATAVERSITY
 
Yosemite part-4 webinar-final
Yosemite part-4 webinar-finalYosemite part-4 webinar-final
Yosemite part-4 webinar-finalDATAVERSITY
 
Enterprise Data World 2016 and CDO Vision Mural Summary
Enterprise Data World 2016 and CDO Vision Mural SummaryEnterprise Data World 2016 and CDO Vision Mural Summary
Enterprise Data World 2016 and CDO Vision Mural SummaryDATAVERSITY
 
Introduction-and-RDF-Representation-of-FHIR-for-Clinical-Data
Introduction-and-RDF-Representation-of-FHIR-for-Clinical-DataIntroduction-and-RDF-Representation-of-FHIR-for-Clinical-Data
Introduction-and-RDF-Representation-of-FHIR-for-Clinical-DataDATAVERSITY
 
Using Semantic Technology to Drive Agile Analytics - SLIDES
Using Semantic Technology to Drive Agile Analytics - SLIDESUsing Semantic Technology to Drive Agile Analytics - SLIDES
Using Semantic Technology to Drive Agile Analytics - SLIDESDATAVERSITY
 

Destaque (19)

Focus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL CodeFocus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL Code
 
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DI&A Slides: Descriptive, Prescriptive, and Predictive AnalyticsDI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
 
RWDG Webinar: The New Non-Invasive Data Governance Framework
RWDG Webinar: The New Non-Invasive Data Governance FrameworkRWDG Webinar: The New Non-Invasive Data Governance Framework
RWDG Webinar: The New Non-Invasive Data Governance Framework
 
LDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceLDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business Intelligence
 
The Importance of MDM - Eternal Management of the Data Mind
The Importance of MDM - Eternal Management of the Data MindThe Importance of MDM - Eternal Management of the Data Mind
The Importance of MDM - Eternal Management of the Data Mind
 
RWDG Slides: Apply Data Governance to Agile Efforts
RWDG Slides: Apply Data Governance to Agile EffortsRWDG Slides: Apply Data Governance to Agile Efforts
RWDG Slides: Apply Data Governance to Agile Efforts
 
Smart Data Webinar: Artificial General Intelligence - When Can I Get It?
Smart Data Webinar: Artificial General Intelligence - When Can I Get It?Smart Data Webinar: Artificial General Intelligence - When Can I Get It?
Smart Data Webinar: Artificial General Intelligence - When Can I Get It?
 
RWDG Slides: Three Approaches to Data Stewardship
RWDG Slides: Three Approaches to Data StewardshipRWDG Slides: Three Approaches to Data Stewardship
RWDG Slides: Three Approaches to Data Stewardship
 
DI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data Warehouse
 
Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...
Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...
Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...
 
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data GardenData-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
 
Enterprise Data World Webinar: How to Get Your MDM Program Up & Running
Enterprise Data World Webinar: How to Get Your MDM Program Up & RunningEnterprise Data World Webinar: How to Get Your MDM Program Up & Running
Enterprise Data World Webinar: How to Get Your MDM Program Up & Running
 
Visual Design with Data
Visual Design with DataVisual Design with Data
Visual Design with Data
 
LDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
LDM Slides: How Data Modeling Fits into an Overall Enterprise ArchitectureLDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
LDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
 
Yosemite Project - Part 3 - Transformations for Integrating VA data with FHIR...
Yosemite Project - Part 3 - Transformations for Integrating VA data with FHIR...Yosemite Project - Part 3 - Transformations for Integrating VA data with FHIR...
Yosemite Project - Part 3 - Transformations for Integrating VA data with FHIR...
 
Yosemite part-4 webinar-final
Yosemite part-4 webinar-finalYosemite part-4 webinar-final
Yosemite part-4 webinar-final
 
Enterprise Data World 2016 and CDO Vision Mural Summary
Enterprise Data World 2016 and CDO Vision Mural SummaryEnterprise Data World 2016 and CDO Vision Mural Summary
Enterprise Data World 2016 and CDO Vision Mural Summary
 
Introduction-and-RDF-Representation-of-FHIR-for-Clinical-Data
Introduction-and-RDF-Representation-of-FHIR-for-Clinical-DataIntroduction-and-RDF-Representation-of-FHIR-for-Clinical-Data
Introduction-and-RDF-Representation-of-FHIR-for-Clinical-Data
 
Using Semantic Technology to Drive Agile Analytics - SLIDES
Using Semantic Technology to Drive Agile Analytics - SLIDESUsing Semantic Technology to Drive Agile Analytics - SLIDES
Using Semantic Technology to Drive Agile Analytics - SLIDES
 

Semelhante a Smart Data Slides: Data Science and Business Analysis - A Look at Best Practices for Roles, Skills, and Processes

DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION Elvis Muyanja
 
Data Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxData Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxCarolineRebeccaD
 
Data Science Training in Chennai-January
Data Science Training in Chennai-JanuaryData Science Training in Chennai-January
Data Science Training in Chennai-JanuaryDataMites
 
Data Science Course in Chennai-January-1
Data Science Course in Chennai-January-1Data Science Course in Chennai-January-1
Data Science Course in Chennai-January-1DataMites
 
Data Science Certification in Pune-January
Data Science Certification in Pune-JanuaryData Science Certification in Pune-January
Data Science Certification in Pune-JanuaryDataMites
 
Data Science Certification in Pune-January
Data Science Certification in Pune-JanuaryData Science Certification in Pune-January
Data Science Certification in Pune-JanuaryDataMites
 
JavaZone 2018 - A Practical(ish) Introduction to Data Science
JavaZone 2018 - A Practical(ish) Introduction to Data ScienceJavaZone 2018 - A Practical(ish) Introduction to Data Science
JavaZone 2018 - A Practical(ish) Introduction to Data ScienceMark West
 
Data Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxData Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxsumitkumar600840
 
Data analytics career path
Data analytics career pathData analytics career path
Data analytics career pathRubikal
 
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...Rohit Dubey
 
A Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data ScienceA Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data ScienceMark West
 
What is the difference between Data Science and Data Analytics.pdf
What is the difference between Data Science and Data Analytics.pdfWhat is the difference between Data Science and Data Analytics.pdf
What is the difference between Data Science and Data Analytics.pdfRoshni Sharma
 
GeeCon Prague 2018 - A Practical-ish Introduction to Data Science
GeeCon Prague 2018 - A Practical-ish Introduction to Data ScienceGeeCon Prague 2018 - A Practical-ish Introduction to Data Science
GeeCon Prague 2018 - A Practical-ish Introduction to Data ScienceMark West
 
Training in Analytics and Data Science
Training in Analytics and Data ScienceTraining in Analytics and Data Science
Training in Analytics and Data ScienceAjay Ohri
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxNagarajanG35
 
Tips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistTips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistLisa Cohen
 
Data Analyst Beginner Guide for 2023
Data Analyst Beginner Guide for 2023Data Analyst Beginner Guide for 2023
Data Analyst Beginner Guide for 2023Careervira
 

Semelhante a Smart Data Slides: Data Science and Business Analysis - A Look at Best Practices for Roles, Skills, and Processes (20)

Investigating data scientists
Investigating data scientistsInvestigating data scientists
Investigating data scientists
 
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
 
Data Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxData Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptx
 
Data Science Training in Chennai-January
Data Science Training in Chennai-JanuaryData Science Training in Chennai-January
Data Science Training in Chennai-January
 
Data Science Course in Chennai-January-1
Data Science Course in Chennai-January-1Data Science Course in Chennai-January-1
Data Science Course in Chennai-January-1
 
Data Science Certification in Pune-January
Data Science Certification in Pune-JanuaryData Science Certification in Pune-January
Data Science Certification in Pune-January
 
Data Science Certification in Pune-January
Data Science Certification in Pune-JanuaryData Science Certification in Pune-January
Data Science Certification in Pune-January
 
JavaZone 2018 - A Practical(ish) Introduction to Data Science
JavaZone 2018 - A Practical(ish) Introduction to Data ScienceJavaZone 2018 - A Practical(ish) Introduction to Data Science
JavaZone 2018 - A Practical(ish) Introduction to Data Science
 
Data Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxData Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptx
 
Data Analytics Career Paths
Data Analytics Career PathsData Analytics Career Paths
Data Analytics Career Paths
 
Data analytics career path
Data analytics career pathData analytics career path
Data analytics career path
 
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
 
A Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data ScienceA Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data Science
 
What is the difference between Data Science and Data Analytics.pdf
What is the difference between Data Science and Data Analytics.pdfWhat is the difference between Data Science and Data Analytics.pdf
What is the difference between Data Science and Data Analytics.pdf
 
GeeCon Prague 2018 - A Practical-ish Introduction to Data Science
GeeCon Prague 2018 - A Practical-ish Introduction to Data ScienceGeeCon Prague 2018 - A Practical-ish Introduction to Data Science
GeeCon Prague 2018 - A Practical-ish Introduction to Data Science
 
Training in Analytics and Data Science
Training in Analytics and Data ScienceTraining in Analytics and Data Science
Training in Analytics and Data Science
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptx
 
Tips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistTips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data Scientist
 
DataScience_RoadMap_2023.pdf
DataScience_RoadMap_2023.pdfDataScience_RoadMap_2023.pdf
DataScience_RoadMap_2023.pdf
 
Data Analyst Beginner Guide for 2023
Data Analyst Beginner Guide for 2023Data Analyst Beginner Guide for 2023
Data Analyst Beginner Guide for 2023
 

Smart Data Slides: Data Science and Business Analysis - A Look at Best Practices for Roles, Skills, and Processes

  • 1. Data Science and Business Analysis: A Look at Best Practices for Roles, Skills, and Processes. Bob E. Hayes, PhD. bob@appuri.com, @bobehayes
  • 2. Bob E. Hayes, PhD, Chief Research Officer
      Email: bob@appuri.com
      Web: www.appuri.com
      Twitter: @bobehayes
      • Author of three books on customer experience management and analytics
      • PhD in industrial-organizational psychology
      • #1 blogger overall on CustomerThink (http://customerthink.com/author/bobehayes/)
      • #1 blogger on the topic of customer analytics (http://customerthink.com/top-authors-category/)
      • Top expert in Big Data and Data Science
        • https://www.maptive.com/the-top-100-big-data-experts/
        • http://www.kdnuggets.com/2015/02/top-big-data-influencers-brands.html
  • 3. What is Data Science? Data science is a way of extracting insights from data using the powers of computer science and statistics applied to data from a specific field of study. It involves the collection, analysis and interpretation of data to extract empirically based insights that augment and enhance human decisions and algorithms.
  • 4. Data Science Study
      Invited data professionals via:
      • AnalyticsWeek Newsletter
      • Blog post
      • Social media (Twitter, LinkedIn, Google+)
      600+ completed surveys
      • Self-assessment rating of proficiency in 25 skills across five skill areas:
        • Business, Technology, Programming, Math & Modeling, Statistics
      • 9 additional questions
      • Overall satisfaction with outcome of analytics projects
  • 5. Data Science Skills Assessed (Area: Skills*)
      Business
        1. Product design and development
        2. Project management
        3. Business development
        4. Budgeting
        5. Governance & Compliance (e.g., security)
      Technology
        6. Managing unstructured data (e.g., NoSQL)
        7. Managing structured data (e.g., SQL, JSON, XML)
        8. Natural Language Processing (NLP) and text mining
        9. Machine Learning (e.g., decision trees, neural nets, Support Vector Machine, clustering)
        10. Big and Distributed Data (e.g., Hadoop, Map/Reduce, Spark)
      Math & Modeling
        11. Optimization (e.g., linear, integer, convex, global)
        12. Math (e.g., linear algebra, real analysis, calculus)
        13. Graphical Models (e.g., social networks)
        14. Algorithms (e.g., computational complexity, Computer Science theory) and Simulations (e.g., discrete, agent-based, continuous)
        15. Bayesian Statistics (e.g., Markov Chain Monte Carlo)
      Programming
        16. Systems Administration (e.g., UNIX) and Design
        17. Database Administration (e.g., MySQL, NoSQL)
        18. Cloud Management
        19. Back-End Programming (e.g., Java/Rails/Objective C)
        20. Front-End Programming (e.g., JavaScript, HTML, CSS)
      Statistics
        21. Data Management (e.g., recoding, de-duplicating, integrating disparate data sources, Web scraping)
        22. Data Mining (e.g., R, Python, SPSS, SAS) and Visualization (e.g., graphics, mapping, web-based data visualization) tools
        23. Statistics and statistical modeling (e.g., general linear model, ANOVA, MANOVA, Spatio-temporal, Geographical Information System (GIS))
        24. Science/Scientific Method (e.g., experimental design, research design)
        25. Communication (e.g., sharing results, writing/publishing, presentations, blogging)
      * List of skills adapted from Analyzing the Analyzers by Harlan D. Harris, Sean Patrick Murphy and Marck Vaisman
  • 6. Proficiency Ratings*
      Respondent instructions: You will be asked about your proficiency for a variety of skills. Please use the following scale when indicating your level of proficiency for each skill.
      Proficiency level (scale value): description
      • Don't know (0): You possess no knowledge.
      • Fundamental Awareness (20): You have a common knowledge or an understanding of basic techniques and concepts.
      • Novice (40): You have the level of experience gained in a classroom and/or experimental scenarios or as a trainee on-the-job. You are expected to need help when performing this skill.
      • Intermediate (60): You are able to successfully complete tasks in this competency as requested. Help from an expert may be required from time to time, but you can usually perform the skill independently.
      • Advanced (80): You can perform the actions associated with this skill without assistance. You are certainly recognized within your immediate organization as "a person to ask" when difficult questions arise regarding this skill.
      • Expert (100): You are known as an expert in this area. You can provide guidance, troubleshoot and answer questions related to this area of expertise and the field where the skill is used.
      * Rating scale is based on a proficiency rating scale used by NIH.
  • 8. Proficiency varies across skills
      Top 10 Data Science Skills
        1. Communication
        2. Managing structured data
        3. Data mining and visualization tools
        4. Science / Scientific method
        5. Math
        6. Project management
        7. Data management
        8. Statistics and statistical modeling
        9. Product design and development
        10. Business development
  • 9. Job Roles in Data Science
      * Researcher (e.g., researcher, scientist, statistician); Business Management (e.g., leader, business person, entrepreneur); Creative (e.g., jack of all trades, artist, hacker); Developer (e.g., developer, engineer)
  • 10. Proficiency in 25 skills varies by job role
      • Different types of data scientists possess different skills
        • Biz Management – strong in business skills
        • Developer – strong in technology/programming skills
        • Researcher – strong in math/statistics skills
        • Creatives – average in all skills
  • 11. Structure of Data Science Skills
      Factor Analysis of Data Skills
      • Data reduction technique
      • Examines the statistical relationships (e.g., correlations) among a large set of variables and tries to explain these correlations using a smaller number of variables (factors)
      • Elements (or factor loadings) of the factor pattern matrix represent the strength of the relationship between the variables and each of the underlying factors
      • Tells us two things:
        1. the number of underlying factors that describe the initial set of variables
        2. which variables are best represented by each factor
      * Factor analysis is based on proficiency ratings of 621 data professionals. Reliabilities (Cronbach's alpha) for the three skill areas, based on the items that loaded on the respective factors, were: .87 (Business); .92 (Tech / Prog); .92 (Math / Stats)
  • 12. Structure of Data Science Skills
      Plot the factor loadings for the 25 data skills in a 3-dimensional space
      Three Distinct Skill Sets
      • Business
      • Technology / Programming
      • Math / Statistics
      * Factor analysis is based on proficiency ratings of 621 data professionals. Reliabilities (Cronbach's alpha) for the three skill areas, based on the items that loaded on the respective factors, were: .87 (Business); .92 (Tech / Prog); .92 (Math / Stats)
  • 13. The Structure of Data Science Skills
  • 14. Proficiency in general skill areas varies by job role
  • 15. Business Skills: Proficiency varies by job role
      * Researcher (e.g., researcher, scientist, statistician) n = 133; Business Management (e.g., leader, business person, entrepreneur) n = 86; Creative (e.g., jack of all trades, artist, hacker) n = 30; Developer (e.g., developer, engineer) n = 54
  • 16. Technology and Math/Statistics Skills: Proficiency varies by job role
      * Researcher (e.g., researcher, scientist, statistician) n = 133; Business Management (e.g., leader, business person, entrepreneur) n = 86; Creative (e.g., jack of all trades, artist, hacker) n = 30; Developer (e.g., developer, engineer) n = 54
  • 17. Top Data Science Skills by Job Role
  • 18. Satisfaction with Work Outcome
      * Researcher (e.g., researcher, scientist, statistician); Business Management (e.g., leader, business person, entrepreneur); Creative (e.g., jack of all trades, artist, hacker); Developer (e.g., developer, engineer)
  • 19. In Search of the Data Scientist Unicorn
  • 20. Data Science as a Team Sport: Impact of Business Expert
  • 21. Data Science as a Team Sport: Impact of Technology / Programming Expert
  • 22. Data Science as a Team Sport: Impact of Math & Modeling / Statistics Expert
  • 23. Getting Insight from Data: The Scientific Method
      1. Formulate questions
      2. Generate hypothesis / hunch
      3. Gather / generate data
      4. Analyze data / test hypothesis
      5. Take action / communicate results
      • Start with a problem statement.
      • What are your hunches / hypotheses?
      • Be sure your hypotheses are testable.
      • You can use an experimental or observational approach to analyzing data.
      • Integrate your data silos to ask bigger questions; connect the dots and get a 360-degree view of your customers.
      • Employ predictive analytics / inferential statistics to test hypotheses.
      • Employ machine learning to quickly surface insights.
      • Implement your findings.
      • Use prescriptive analytics to guide your course of action.
  • 24. Scientific Method and Data Science Skills
  • 25. What skills are linked to project success?
  • 26. Importance of Data Science Skills by Job Role
  • 27. Education and Data Science Skills
  • 28. Lack of Gender Diversity
  • 29. Lack of Gender Diversity – Other Science Roles
  • 30. Job Roles in Data Science by Gender
  • 31. Highest Level of Education Attained
  • 32. Gender Comparison of Proficiency across Skills
  • 33. Advice for Data Scientists
      • Be specific when talking about “data scientists”
        • There are different types, defined by what they do and the skills they possess
      • Work with other data professionals who have complementary skills. Teamwork is key to successful data science projects.
      • Learn to use data mining and visualization tools
        • R, Python, SPSS, SAS, graphics, mapping, web-based data visualization
      • Be an advocate for women in the field of data science
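
The slides report a 0 to 100 proficiency scale and Cronbach's alpha reliabilities for each skill area (.87, .92, .92) without showing how such a figure is obtained. The following is a minimal sketch, not the study's actual analysis: it converts rating labels to the scale values from the proficiency slide and applies the standard Cronbach's alpha formula. The names `SCALE`, `to_scores` and `cronbach_alpha`, and the five sample respondents, are hypothetical and chosen only for illustration.

```python
from statistics import pvariance

# Scale values as listed on the proficiency ratings slide.
SCALE = {
    "Don't know": 0,
    "Fundamental Awareness": 20,
    "Novice": 40,
    "Intermediate": 60,
    "Advanced": 80,
    "Expert": 100,
}


def to_scores(labels):
    """Convert one respondent's rating labels to numeric scale values."""
    return [SCALE[label] for label in labels]


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents.

    rows: one list of numeric item scores per respondent; every
    respondent must have rated the same k >= 2 items.
    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score))
    """
    k = len(rows[0])
    if k < 2:
        raise ValueError("need at least two items")
    if any(len(r) != k for r in rows):
        raise ValueError("all respondents must rate the same items")
    # Variance of each item across respondents.
    item_vars = [pvariance([r[i] for r in rows]) for i in range(k)]
    # Variance of each respondent's summed score.
    total_var = pvariance([sum(r) for r in rows])
    if total_var == 0:
        raise ValueError("total scores have zero variance")
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical ratings of three skills by five respondents (made-up data).
responses = [
    ["Novice", "Intermediate", "Novice"],
    ["Advanced", "Advanced", "Expert"],
    ["Fundamental Awareness", "Novice", "Fundamental Awareness"],
    ["Intermediate", "Intermediate", "Advanced"],
    ["Expert", "Advanced", "Expert"],
]
scores = [to_scores(r) for r in responses]
alpha = cronbach_alpha(scores)
print(f"alpha = {alpha:.2f}")
```

Population variance is used for both the item and total-score terms; using sample variance throughout would give the same ratio, so the choice does not affect alpha. Values near 1 indicate that the items in a skill area move together across respondents.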