Organizing for Data Science 
Dan Mallinger 
Data Science Practice Manager 
September 2014
CONFIDENTIAL | Dan Mallinger 
• Data Science Practice Manager 
− Think Big Analytics 
• Working with clients across 
− Financial Services 
− Advertising 
− Manufacturing 
− Social 
− Network Providers 
CONFIDENTIAL 2

Today
• Define Data Science in the Organization
• Look at Current Perspectives on Organization
• Discuss Shortcomings
• Review a Real World Solution
Ÿ Use Data to Improve Our 
Business 
Ÿ Better Understand Customers 
Ÿ Act Proactively, Not Reactively 
CONFIDENTIAL | What Do We Hope to Do? 
CONFIDENTIAL 4
CONFIDENTIAL | Ÿ Scale 
Ÿ Robustness 
Ÿ Repeatability 
Why Organize? 
CONFIDENTIAL 5
Ÿ Revolutionizing Ad Targeting 
Ÿ Automating Deals and 
Recommendations 
Ÿ Alerting Admins to New Network 
Attacks 
CONFIDENTIAL | Perception: What Does Data Science Do? 
CONFIDENTIAL 6
CONFIDENTIAL | Ÿ Specific Data Expertise 
Ÿ Exploratory Analysis 
Ÿ Modeling 
Ÿ Creativity 
Ÿ Programming 
Ÿ Big Data 
Ÿ Communication 
Ÿ Ability to Target Impact 
Ÿ Unstructured Analysis 
Ÿ Organizational Politics 
Ÿ Visualization 
Ÿ … 
What Does It Take? 
CONFIDENTIAL 7

The New Toy: A Center of Excellence
• Centralized
− Brings data, analysis, and processing together
− Data scientists support one another
• Distributed
− Data scientists close to business
− Multiple models for rotating data scientists into lines of business
[Diagram labels: CoE; Line of Business A; Line of Business B; Line of Business C]
CONFIDENTIAL | Ÿ Specific Data Expertise 
Ÿ Exploratory Analysis 
Ÿ Modeling 
Ÿ Creativity 
Ÿ Programming 
Ÿ Big Data 
Ÿ Communication 
Ÿ Ability to Target Impact 
Ÿ Unstructured Analysis 
Ÿ Organizational Politics 
Ÿ Visualization 
Ÿ … 
What Does It Still Take? 
CONFIDENTIAL 9
CONFIDENTIAL | Ÿ Designed a great home for unicorns 
Ÿ But they are still unicorns 
CONFIDENTIAL 10 
If You Build It, They Will Come?
Ÿ Unravel Capability 
Ÿ Map Activities to Functional Roles 
Ÿ Align Functions with Process, 
Not Individuals 
Ÿ Don’t Forget to Scale 
CONFIDENTIAL | Working with Horses, Not Unicorns 
CONFIDENTIAL 11
Ÿ Identify Fraudulent Sessions 
Ÿ Cross Channel Analysis 
Ÿ Next Best Action 
Ÿ Optimize Pathways 
Ÿ Determine Session Interest 
Ÿ Customizing Experience 
Ÿ Proactive Outreach 
Ÿ Search Analysis 
Ÿ Content Optimization 
CONFIDENTIAL | CLIENT EXAMPLE 
Clickstream Data in Action 
CONFIDENTIAL 12
Ÿ Billions of clicks 
Ÿ Unstructured data 
Ÿ How do we model it?! 
CONFIDENTIAL | Ÿ Model the SIGNAL 
Ÿ Not the data 
CLIENT EXAMPLE 
Scaling Data Science 
CONFIDENTIAL 13
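
One way to read "model the SIGNAL, not the data" is to reduce raw click events to a handful of per-session features before any modeling happens. The sketch below is only an illustration of that idea, not the client's pipeline: the event layout `(user_id, timestamp, url)`, the 30-minute inactivity timeout, and the feature names are all assumptions.

```python
from collections import defaultdict

# Hypothetical raw click events: (user_id, unix_timestamp_seconds, url).
# The field layout and the 30-minute timeout are assumptions for illustration.
SESSION_TIMEOUT = 30 * 60


def sessionize(clicks, timeout=SESSION_TIMEOUT):
    """Group each user's clicks into sessions split by inactivity gaps."""
    by_user = defaultdict(list)
    for user, ts, url in clicks:
        by_user[user].append((ts, url))

    sessions = []
    for user, events in by_user.items():
        events.sort()
        current = [events[0]]
        for prev, event in zip(events, events[1:]):
            if event[0] - prev[0] > timeout:
                # Gap too long: close the current session, start a new one.
                sessions.append((user, current))
                current = []
            current.append(event)
        sessions.append((user, current))
    return sessions


def session_signal(user, events):
    """Reduce one session to a small, model-ready feature record."""
    timestamps = [ts for ts, _ in events]
    urls = [url for _, url in events]
    return {
        "user": user,
        "n_clicks": len(events),
        "duration_s": timestamps[-1] - timestamps[0],
        "n_distinct_pages": len(set(urls)),
        "used_search": any(url.startswith("/search") for url in urls),
    }


clicks = [
    ("u1", 0, "/home"),
    ("u1", 40, "/search?q=loans"),
    ("u1", 95, "/products/loans"),
    ("u1", 4000, "/home"),  # more than 30 minutes later: new session
    ("u2", 10, "/home"),
    ("u2", 12, "/home"),
]

signals = [session_signal(user, events) for user, events in sessionize(clicks)]
for row in signals:
    print(row)
```

Six raw clicks become three session records. Downstream analysts would work with records like these rather than with the click log itself.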

CLIENT EXAMPLE
Clickstream Data Science in Action
[Diagram labels: MPP Web; Hadoop 1.0; Feature Selection & Dimensionality Reduction]
CONFIDENTIAL | Ÿ Feature Selection 
- Forests 
- Clustering 
Ÿ Dimensionality Reduction 
- SVM 
Ÿ Challenges 
- Job Latency 
- Limited Iterations 
CLIENT EXAMPLE 
Extracting Signal: Hadoop 1.0 
CONFIDENTIAL 15
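
The slide does not say how clustering was used for feature selection. As a rough, single-machine sketch of one common approach, the code below groups features that are highly correlated with one another and keeps one representative per group. The greedy grouping rule, the 0.9 threshold, and the feature names and values are all assumptions made for illustration.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no correlation information
    return cov / sqrt(var_x * var_y)


def cluster_features(columns, threshold=0.9):
    """Greedily group features whose |correlation| with a cluster's
    first member is at least `threshold`."""
    clusters = []
    for name, values in columns.items():
        for cluster in clusters:
            representative = cluster[0]
            if abs(pearson(values, columns[representative])) >= threshold:
                cluster.append(name)
                break
        else:
            # No existing cluster is close enough: start a new one.
            clusters.append([name])
    return clusters


# Hypothetical session-level features (names and values are made up).
columns = {
    "n_clicks": [3, 10, 7, 1, 12, 5],
    "n_page_views": [3, 11, 7, 1, 13, 5],  # nearly duplicates n_clicks
    "duration_s": [95, 30, 400, 0, 60, 250],
    "n_searches": [0, 1, 0, 0, 2, 1],
}

clusters = cluster_features(columns)
selected = [cluster[0] for cluster in clusters]  # one feature per cluster
print(clusters)
print(selected)
```

Here `n_page_views` is folded into the `n_clicks` cluster, so only three of the four features are kept. At clickstream scale the correlation pass would itself be a distributed job, which is where the job-latency challenge listed above comes in.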

CLIENT EXAMPLE
Extracting Signal: Hadoop 2.0
• Spark
− Faster response in exploration
− Better Support for Iterative Models
• Genetic Algorithms
• Neural Networks
• Challenges
− In memory: costly and limiting
− MapReduce does not go away
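
A genetic algorithm shows why support for iterative models matters: every generation needs a full scoring pass over the population before the next one can start, so any per-iteration latency is paid once per generation. The sketch below is a minimal single-machine example, not the client's implementation. The toy fitness function, the `useful` feature set, and every parameter value are assumptions. In practice the fitness would be a model score computed over the data.

```python
import random


def fitness(mask, useful):
    """Toy objective: reward selecting useful features, penalize the rest."""
    return sum(1 if i in useful else -1 for i, bit in enumerate(mask) if bit)


def evolve(n_features, useful, generations=30, pop_size=20,
           mutation_rate=0.05, seed=0):
    """Evolve a 0/1 feature mask; returns (best_mask, best_score_history)."""
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(n_features)]
                  for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        # One iteration: score everyone, keep the better half as parents.
        population.sort(key=lambda m: fitness(m, useful), reverse=True)
        history.append(fitness(population[0], useful))
        parents = population[: pop_size // 2]
        children = [parents[0][:]]  # elitism: best mask survives unchanged
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_features)
            child = a[:cut] + b[cut:]  # one-point crossover
            # Flip each bit with probability mutation_rate.
            child = [bit ^ (rng.random() < mutation_rate) for bit in child]
            children.append(child)
        population = children
    best = max(population, key=lambda m: fitness(m, useful))
    return best, history


useful = {0, 3, 4, 7}  # hypothetical indices of informative features
best, history = evolve(n_features=10, useful=useful)
print(best, history[0], history[-1])
```

Because the best mask is carried over unchanged each generation, the recorded best score never decreases. How quickly it improves depends on the seed and the parameters.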
Ÿ Focus on Technical Skills 
- EDA 
- Modeling 
- Programming / Big Data 
Ÿ Communication Skills 
- Capturing signal needs 
- Iterating with stakeholders 
CONFIDENTIAL | CLIENT EXAMPLE 
Horses, Not Unicorns 
CONFIDENTIAL 17 
Hadoop 1.0

CLIENT EXAMPLE
CoE Next Steps
• Continue to make signal available to analysts
− Next up: Extracting signal from text
• Act as a capability search party
− Sprints of new insights and tools
• Finalize operating model
− Funding structure
− Engagement model with lines of business

Discussion Over Drinks

Dan Mallinger, Data Science Practice Manager, Think Big Analytics, at MLconf ATL

Big Analytics: Building Lasting Value
Big Analytics: Building Lasting ValueBig Analytics: Building Lasting Value
Big Analytics: Building Lasting ValueDan Mallinger
 
Customer segmentation for business success with knime
Customer segmentation for business success with knimeCustomer segmentation for business success with knime
Customer segmentation for business success with knimeKnoldus Inc.
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieSunil Ranka
 
Competitive Advantage from the Data Lake
Competitive Advantage from the Data LakeCompetitive Advantage from the Data Lake
Competitive Advantage from the Data LakeArgyle Executive Forum
 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...BigDataEverywhere
 
Denodo DataFest 2016: Enterprise View of Data with Semantic Data Layer
Denodo DataFest 2016: Enterprise View of Data with Semantic Data LayerDenodo DataFest 2016: Enterprise View of Data with Semantic Data Layer
Denodo DataFest 2016: Enterprise View of Data with Semantic Data LayerDenodo
 
No REST till Production – Building and Deploying 9 Models to Production in 3 ...
No REST till Production – Building and Deploying 9 Models to Production in 3 ...No REST till Production – Building and Deploying 9 Models to Production in 3 ...
No REST till Production – Building and Deploying 9 Models to Production in 3 ...Databricks
 
Build it…will they come by Shawn Trainer
 Build it…will they come by Shawn Trainer Build it…will they come by Shawn Trainer
Build it…will they come by Shawn TrainerData Con LA
 
The new patterns of innovation
The new patterns of innovationThe new patterns of innovation
The new patterns of innovationkrushali98
 
Transforming Business with Smarter Analytics
Transforming Business with Smarter AnalyticsTransforming Business with Smarter Analytics
Transforming Business with Smarter AnalyticsCTI Group
 
Think Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial IntelligenceThink Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial IntelligenceData Science Milan
 
Enabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
Enabling a Bimodal IT Framework for Advanced Analytics with Data VirtualizationEnabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
Enabling a Bimodal IT Framework for Advanced Analytics with Data VirtualizationDenodo
 
How to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics CloudHow to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics CloudPerficient, Inc.
 
Making Money Out of Data
Making Money Out of DataMaking Money Out of Data
Making Money Out of DataDigital Vidya
 
Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...
Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...
Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...Innovation Enterprise
 
ISDI MDA Master Class November_2015
ISDI MDA Master Class November_2015ISDI MDA Master Class November_2015
ISDI MDA Master Class November_2015Jacques Warren
 
Data Strategy - Executive MBA Class, IE Business School
Data Strategy - Executive MBA Class, IE Business SchoolData Strategy - Executive MBA Class, IE Business School
Data Strategy - Executive MBA Class, IE Business SchoolGam Dias
 
Data strategy demistifying data
Data strategy demistifying dataData strategy demistifying data
Data strategy demistifying dataHans Verstraeten
 

Semelhante a Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL (20)

Big Analytics: Building Lasting Value
Big Analytics: Building Lasting ValueBig Analytics: Building Lasting Value
Big Analytics: Building Lasting Value
 
Customer segmentation for business success with knime
Customer segmentation for business success with knimeCustomer segmentation for business success with knime
Customer segmentation for business success with knime
 
Get your data analytics strategy right!
Get your data analytics strategy right!Get your data analytics strategy right!
Get your data analytics strategy right!
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
Competitive Advantage from the Data Lake
Competitive Advantage from the Data LakeCompetitive Advantage from the Data Lake
Competitive Advantage from the Data Lake
 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
 
Denodo DataFest 2016: Enterprise View of Data with Semantic Data Layer
Denodo DataFest 2016: Enterprise View of Data with Semantic Data LayerDenodo DataFest 2016: Enterprise View of Data with Semantic Data Layer
Denodo DataFest 2016: Enterprise View of Data with Semantic Data Layer
 
BI and Big Data DeepDive - Pressmart
BI and Big Data DeepDive - PressmartBI and Big Data DeepDive - Pressmart
BI and Big Data DeepDive - Pressmart
 
No REST till Production – Building and Deploying 9 Models to Production in 3 ...
No REST till Production – Building and Deploying 9 Models to Production in 3 ...No REST till Production – Building and Deploying 9 Models to Production in 3 ...
No REST till Production – Building and Deploying 9 Models to Production in 3 ...
 
Build it…will they come by Shawn Trainer
 Build it…will they come by Shawn Trainer Build it…will they come by Shawn Trainer
Build it…will they come by Shawn Trainer
 
The new patterns of innovation
The new patterns of innovationThe new patterns of innovation
The new patterns of innovation
 
Transforming Business with Smarter Analytics
Transforming Business with Smarter AnalyticsTransforming Business with Smarter Analytics
Transforming Business with Smarter Analytics
 
Think Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial IntelligenceThink Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial Intelligence
 
Enabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
Enabling a Bimodal IT Framework for Advanced Analytics with Data VirtualizationEnabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
Enabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
 
How to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics CloudHow to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics Cloud
 
Making Money Out of Data
Making Money Out of DataMaking Money Out of Data
Making Money Out of Data
 
Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...
Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...
Actionable Analytics - Solving Real World Problems With Big Data, Xerox Innov...
 
ISDI MDA Master Class November_2015
ISDI MDA Master Class November_2015ISDI MDA Master Class November_2015
ISDI MDA Master Class November_2015
 
Data Strategy - Executive MBA Class, IE Business School
Data Strategy - Executive MBA Class, IE Business SchoolData Strategy - Executive MBA Class, IE Business School
Data Strategy - Executive MBA Class, IE Business School
 
Data strategy demistifying data
Data strategy demistifying dataData strategy demistifying data
Data strategy demistifying data
 

Mais de MLconf

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...MLconf
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingMLconf
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...MLconf
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushMLconf
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceMLconf
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...MLconf
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMLconf
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionMLconf
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLMLconf
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksMLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldMLconf
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...MLconf
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...MLconf
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...MLconf
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeMLconf
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...MLconf
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareMLconf
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesMLconf
 

Mais de MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL

  • 1. Organizing for Data Science
      • Dan Mallinger, Data Science Practice Manager
      • September 2014
  • 2. Dan Mallinger
      • Data Science Practice Manager
        − Think Big Analytics
      • Working with clients across
        − Financial Services
        − Advertising
        − Manufacturing
        − Social
        − Network Providers
  • 3. Today
      • Define Data Science in the Organization
      • Look at Current Perspectives on Organization
      • Discuss Shortcomings
      • Review a Real World Solution
  • 4. What Do We Hope to Do?
      • Use Data to Improve Our Business
      • Better Understand Customers
      • Act Proactively, Not Reactively
  • 5. Why Organize?
      • Scale
      • Robustness
      • Repeatability
  • 6. Perception: What Does Data Science Do?
      • Revolutionizing Ad Targeting
      • Automating Deals and Recommendations
      • Alerting Admins to New Network Attacks
  • 7. What Does It Take?
      • Specific Data Expertise
      • Exploratory Analysis
      • Modeling
      • Creativity
      • Programming
      • Big Data
      • Communication
      • Ability to Target Impact
      • Unstructured Analysis
      • Organizational Politics
      • Visualization
      • …
  • 8. The New Toy: A Center of Excellence
      • Centralized
        − Brings data, analysis, and processing together
        − Data scientists support one another
      • Distributed
        − Data scientists close to business
        − Multiple models for rotating data scientists into lines of business
      • Diagram labels: Line of Business A, CoE, Line of Business B, Line of Business C
  • 9. What Does It Still Take?
      • Specific Data Expertise
      • Exploratory Analysis
      • Modeling
      • Creativity
      • Programming
      • Big Data
      • Communication
      • Ability to Target Impact
      • Unstructured Analysis
      • Organizational Politics
      • Visualization
      • …
  • 10. If You Build It, They Will Come?
      • Designed a great home for unicorns
      • But they are still unicorns
  • 11. Working with Horses, Not Unicorns
      • Unravel Capability
      • Map Activities to Functional Roles
      • Align Functions with Process, Not Individuals
      • Don’t Forget to Scale
  • 12. CLIENT EXAMPLE: Clickstream Data in Action
      • Identify Fraudulent Sessions
      • Cross Channel Analysis
      • Next Best Action
      • Optimize Pathways
      • Determine Session Interest
      • Customizing Experience
      • Proactive Outreach
      • Search Analysis
      • Content Optimization
  • 13. CLIENT EXAMPLE: Scaling Data Science
      • Billions of clicks
      • Unstructured data
      • How do we model it?!
      • Model the SIGNAL
      • Not the data
  • 14. CLIENT EXAMPLE: Clickstream Data Science in Action
      • Diagram labels: MPP Web; Hadoop 1.0; Feature Selection & Dimensionality Reduction
  • 15. CLIENT EXAMPLE: Extracting Signal: Hadoop 1.0
      • Feature Selection
        − Forests
        − Clustering
      • Dimensionality Reduction
        − SVM
      • Challenges
        − Job Latency
        − Limited Iterations
  • 16. CLIENT EXAMPLE: Extracting Signal: Hadoop 2.0
      • Spark
        − Faster response in exploration
        − Better Support for Iterative Models
      • Genetic Algorithms
      • Neural Networks
      • Challenges
        − In memory: costly and limiting
        − MapReduce does not go away
  • 17. CLIENT EXAMPLE: Horses, Not Unicorns
      • Focus on Technical Skills
        − EDA
        − Modeling
        − Programming / Big Data
      • Communication Skills
        − Capturing signal needs
        − Iterating with stakeholders
      • Diagram label: Hadoop 1.0
  • 18. CLIENT EXAMPLE: CoE Next Steps
      • Continue to make signal available to analysts
        − Next up: Extracting signal from text
      • Act as a capability search party
        − Sprints of new insights and tools
      • Finalize operating model
        − Funding structure
        − Engagement model with lines of business
  • 19. Discussion Over Drinks
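
Slide 13 advises modeling the signal rather than the raw clicks. One possible reading of that advice, offered only as a minimal sketch and not as the approach the client actually used, is to collapse raw click events into a small per-session feature record that analysts can model directly. The event layout, the field names, and the `session_signals` helper below are all hypothetical and chosen for illustration.

```python
from collections import defaultdict

# Hypothetical raw click events: (session_id, timestamp_seconds, page, is_search).
# This layout is an assumption for illustration, not the client's schema.
clicks = [
    ("s1", 0, "/home", False),
    ("s1", 12, "/search", True),
    ("s1", 40, "/product/42", False),
    ("s2", 5, "/home", False),
    ("s2", 9, "/home", False),
]


def session_signals(events):
    """Collapse raw click events into one small feature record per session."""
    # Group the events by session so each session can be summarized on its own.
    grouped = defaultdict(list)
    for session_id, ts, page, is_search in events:
        grouped[session_id].append((ts, page, is_search))

    signals = {}
    for session_id, rows in grouped.items():
        timestamps = [ts for ts, _, _ in rows]
        signals[session_id] = {
            "clicks": len(rows),
            "duration_s": max(timestamps) - min(timestamps),
            "distinct_pages": len({page for _, page, _ in rows}),
            "searches": sum(1 for _, _, is_search in rows if is_search),
        }
    return signals


signals = session_signals(clicks)
```

Each session is reduced to four numbers regardless of how many clicks it contains, which is the kind of compact signal that downstream steps such as feature selection could then work with.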