SlideShare uma empresa Scribd logo
1 de 21
Baixar para ler offline
Executive Briefing
Why managing machines is harder than you think
Peter Skomoroch - @peteskomoroch
Strata Data Conference, London - May 1, 2019
Background: Machine Learning & Data Products
Peter Skomoroch
@peteskomoroch
• Co-Founder and CEO of SkipFlag, Enterprise AI
startup acquired in 2018 by Workday
• 18+ years building machine learning products
• Principal Data Scientist, ran Data Products team at
LinkedIn. ML & Search at MIT, AOL, ProfitLogic
• Co-Host of O’Reilly AI Bots Podcast, Startup Advisor
Better, Faster Decisions at Scale
• Machine learning drove massive growth
at consumer internet companies over the
last decade
• A wave of AI startups and vertical
machine learning applications have
emerged across other industries
• For many problems, machine learning
makes better, faster, and more
repeatable decisions at scale
• Amazon, Google, and Microsoft are now
re-organizing themselves around AI
Data Products
Automated systems that collect and learn from data to
make user facing decisions with machine learning
Machine Learning Projects are Hard
• The transition to machine learning will be about 100x harder than the
transition to mobile
• Companies that adopt an experimental culture can still succeed
• Some of the biggest challenges are organizational, not technical
• Data driven companies like Google and Facebook have a strategic
advantage building ML products based on their data & compute assets,
large user population, tracking & instrumentation, and AI talent
If you only do things where you know
the answer in advance, your company
goes away.
Jeff Bezos
Founder, Chairman & CEO of Amazon.com
• Machine Learning shifts
engineering from a deterministic
process to a probabilistic one
• Take intelligent risks
• Most successful ML products are
experiments at massive scale
• Companies driven by analytics
and experimental insights are
more likely to succeed
Experimental Culture
Data Pipelines & Analytics Before AI
Credit: @mrogati
ML Algorithms Need Lots of Labelled Data
Common Crawl: ~4B pages monthly
Combined Pools of Data Give Better Results
https://www.flickr.com/photos/nakrnsm/3814916578
• Learning patterns across large
numbers of customers is the
power behind recommendations
from companies like Amazon and
Netflix
• The more precise or nuanced a
prediction, the more data will
need to be pooled
• You need large amounts of
labelled training data
• Transfer learning may help push
these limits further
Democratize Data Access
• Allow teams across your company to combine real data to improve
their product areas, design with data, and discover new insights
• Share derived data and input features for ML models across teams
• At LinkedIn we had a rich repository of signals like connection
strength, inferred skills, and other datasets that greatly accelerated
new product development
• Empower small teams to build things
quickly and compound returns on
feature engineering & derived data
See https://www.confluent.io/ebook/i-heart-logs-event-data-stream-processing-and-data-integration/
Product Management for Machine Learning
Image source: Martin Eriksson https://www.mindtheproduct.com/2011/10/what-exactly-is-a-product-manager/
• A Data Product Manager (PM)
has core product skills (strategy,
roadmaps, prioritization, etc.)
along with an intuitive grasp of
ML
• They help identify and prioritize
the highest value applications for
machine learning and do what it
takes to make them successful
Good ML Product Managers Have Data Expertise
• Know the difference between easy, hard, and impossible machine
learning problems
• Even if something is feasible from a machine learning perspective,
the level of effort may not justify building the feature
• Know your company’s data inside and out including quality issues,
limitations, biases, and gaps that need to be addressed
• Develop an intuitive understanding of your company’s data and how
it can be used to solve customer problems
Apply ML to a Metric the Business Cares About
Machine Learning Product Development
1. Verify you are solving the right problem
2. Theory + model design (in parallel with UI design)
3. Data collection, labelling, and cleaning
4. Feature engineering, model training, offline validation
5. Model deployment, monitoring & large scale training
• Iterate: repeat process, refine live model & improve
• 80% of effort and gains come from iterations after shipping v 1.0
• Use derived data from the system to build new products
ML Adds Uncertainty to Product Roadmaps
• PMs are often uncomfortable with expensive ideas that have an
uncertain probability of success
• Many organizations will struggle to justify the expense of projects that
require significant research investment upfront
• Some ML products may need to be split into time boxed projects that
get to market in a shorter time frame
• What can you productize now vs. much later on?
• Keep track of dependencies on other teams and have a “Plan B”
Every single company I've worked at
and talked to has the same problem
without a single exception so far —
poor data quality, especially tracking
data
Ruslan Belkin
VP of Engineering, Salesforce.com
• Guide user input when you can
• Use auto suggest fields
• Validate user inputs, emails
• Collect user tags, votes, ratings
• Track impressions, queries, clicks
• Sessionize logs
• Disambiguate and annotate
entities (company names,
locations, etc.)
Data Quality & Standardization
Testing Machine Learning Products
• Algorithm work that drags on without integration in the product where it can
be seen and tested by real users is risky
• Ship a complete MVP in production ASAP, benchmark, and iterate
• Beware unintended consequences from seemingly small product changes
• Remember the prototype is not the product - see what happens when you
use a more realistic data set or scale up your inputs
• Real world data changes over time, ensure your model tests and
benchmarks keep up with changes in underlying data
• Machine learning systems tend to fail in unexpected ways
Look at Your Input Data & Prediction Errors
Flywheel Effects & Data Products
• Users generate data as a side effect of
using most software products
• That data in turn, can improve the
product’s algorithms and enable new
types of recommendations, leading to
more data
• These “Flywheels” get better the more
customers use them leading to unique
competitive moats
• This works well in platforms, networks or
marketplaces where value compounds
* https://medium.freecodecamp.org/the-business-implications-of-machine-learning-11480b99184d
Final Thoughts
• Machine learning products are hard to
build, but within reach of teams who
invest in data infrastructure
• Some of the biggest challenges are
organizational, not technical
• Good product leaders are a key factor in
shipping successful ML products
• Find a machine learning application with
a direct connection to a metric your
organization values and ship it
Send me questions! @peteskomoroch
Q&A / Discussion

Mais conteúdo relacionado

Mais procurados

The Creative Method v2
The Creative Method v2The Creative Method v2
The Creative Method v2Jason Theodor
 
Introduction of Artificial Intelligence and Machine Learning
Introduction of Artificial Intelligence and Machine Learning Introduction of Artificial Intelligence and Machine Learning
Introduction of Artificial Intelligence and Machine Learning bigdata trunk
 
Building AI Product using AI Product Thinking
Building AI Product using AI Product Thinking Building AI Product using AI Product Thinking
Building AI Product using AI Product Thinking Saurabh Kaushik
 
Build and Modernize Intelligent Apps​
Build and Modernize Intelligent Apps​Build and Modernize Intelligent Apps​
Build and Modernize Intelligent Apps​Lorenzo Barbieri
 
Responsible AI in Industry (ICML 2021 Tutorial)
Responsible AI in Industry (ICML 2021 Tutorial)Responsible AI in Industry (ICML 2021 Tutorial)
Responsible AI in Industry (ICML 2021 Tutorial)Krishnaram Kenthapadi
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence PresentationAdarsh Pathak
 
AI in Manufacturing & the Proposed EU Artificial Intelligence Act
AI in Manufacturing & the Proposed EU Artificial Intelligence ActAI in Manufacturing & the Proposed EU Artificial Intelligence Act
AI in Manufacturing & the Proposed EU Artificial Intelligence ActBarry O'Sullivan
 
10 Key Considerations for AI/ML Model Governance
10 Key Considerations for AI/ML Model Governance10 Key Considerations for AI/ML Model Governance
10 Key Considerations for AI/ML Model GovernanceQuantUniversity
 
GenAI in Research with Responsible AI
GenAI in Researchwith Responsible AIGenAI in Researchwith Responsible AI
GenAI in Research with Responsible AILiming Zhu
 
Predictive Analytics: Advanced techniques in data mining
Predictive Analytics: Advanced techniques in data miningPredictive Analytics: Advanced techniques in data mining
Predictive Analytics: Advanced techniques in data miningSAS Asia Pacific
 
Responsible Data Use in AI - core tech pillars
Responsible Data Use in AI - core tech pillarsResponsible Data Use in AI - core tech pillars
Responsible Data Use in AI - core tech pillarsSofus Macskássy
 
How to win data science competitions with Deep Learning
How to win data science competitions with Deep LearningHow to win data science competitions with Deep Learning
How to win data science competitions with Deep LearningSri Ambati
 
Data Driven Growth
Data Driven GrowthData Driven Growth
Data Driven Growthjwegan
 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data ScientistDaniel Tunkelang
 
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)byteLAKE
 
What Are The Latest Trends in Data Science?
What Are The Latest Trends in Data Science?What Are The Latest Trends in Data Science?
What Are The Latest Trends in Data Science?Bernard Marr
 
State of AI Report 2022 - ONLINE.pptx
State of AI Report 2022 - ONLINE.pptxState of AI Report 2022 - ONLINE.pptx
State of AI Report 2022 - ONLINE.pptxEithuThutun
 
7 Steps to Design, Build, and Scale an AI Product by Allie Miller at #AgileIn...
7 Steps to Design, Build, and Scale an AI Product by Allie Miller at #AgileIn...7 Steps to Design, Build, and Scale an AI Product by Allie Miller at #AgileIn...
7 Steps to Design, Build, and Scale an AI Product by Allie Miller at #AgileIn...Agile India
 
“Autonomous Driving AI Workloads: Technology Trends and Optimization Strategi...
“Autonomous Driving AI Workloads: Technology Trends and Optimization Strategi...“Autonomous Driving AI Workloads: Technology Trends and Optimization Strategi...
“Autonomous Driving AI Workloads: Technology Trends and Optimization Strategi...Edge AI and Vision Alliance
 

Mais procurados (20)

The Creative Method v2
The Creative Method v2The Creative Method v2
The Creative Method v2
 
Agile Data Science
Agile Data ScienceAgile Data Science
Agile Data Science
 
Introduction of Artificial Intelligence and Machine Learning
Introduction of Artificial Intelligence and Machine Learning Introduction of Artificial Intelligence and Machine Learning
Introduction of Artificial Intelligence and Machine Learning
 
Building AI Product using AI Product Thinking
Building AI Product using AI Product Thinking Building AI Product using AI Product Thinking
Building AI Product using AI Product Thinking
 
Build and Modernize Intelligent Apps​
Build and Modernize Intelligent Apps​Build and Modernize Intelligent Apps​
Build and Modernize Intelligent Apps​
 
Responsible AI in Industry (ICML 2021 Tutorial)
Responsible AI in Industry (ICML 2021 Tutorial)Responsible AI in Industry (ICML 2021 Tutorial)
Responsible AI in Industry (ICML 2021 Tutorial)
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentation
 
AI in Manufacturing & the Proposed EU Artificial Intelligence Act
AI in Manufacturing & the Proposed EU Artificial Intelligence ActAI in Manufacturing & the Proposed EU Artificial Intelligence Act
AI in Manufacturing & the Proposed EU Artificial Intelligence Act
 
10 Key Considerations for AI/ML Model Governance
10 Key Considerations for AI/ML Model Governance10 Key Considerations for AI/ML Model Governance
10 Key Considerations for AI/ML Model Governance
 
GenAI in Research with Responsible AI
GenAI in Researchwith Responsible AIGenAI in Researchwith Responsible AI
GenAI in Research with Responsible AI
 
Predictive Analytics: Advanced techniques in data mining
Predictive Analytics: Advanced techniques in data miningPredictive Analytics: Advanced techniques in data mining
Predictive Analytics: Advanced techniques in data mining
 
Responsible Data Use in AI - core tech pillars
Responsible Data Use in AI - core tech pillarsResponsible Data Use in AI - core tech pillars
Responsible Data Use in AI - core tech pillars
 
How to win data science competitions with Deep Learning
How to win data science competitions with Deep LearningHow to win data science competitions with Deep Learning
How to win data science competitions with Deep Learning
 
Data Driven Growth
Data Driven GrowthData Driven Growth
Data Driven Growth
 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data Scientist
 
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
 
What Are The Latest Trends in Data Science?
What Are The Latest Trends in Data Science?What Are The Latest Trends in Data Science?
What Are The Latest Trends in Data Science?
 
State of AI Report 2022 - ONLINE.pptx
State of AI Report 2022 - ONLINE.pptxState of AI Report 2022 - ONLINE.pptx
State of AI Report 2022 - ONLINE.pptx
 
7 Steps to Design, Build, and Scale an AI Product by Allie Miller at #AgileIn...
7 Steps to Design, Build, and Scale an AI Product by Allie Miller at #AgileIn...7 Steps to Design, Build, and Scale an AI Product by Allie Miller at #AgileIn...
7 Steps to Design, Build, and Scale an AI Product by Allie Miller at #AgileIn...
 
“Autonomous Driving AI Workloads: Technology Trends and Optimization Strategi...
“Autonomous Driving AI Workloads: Technology Trends and Optimization Strategi...“Autonomous Driving AI Workloads: Technology Trends and Optimization Strategi...
“Autonomous Driving AI Workloads: Technology Trends and Optimization Strategi...
 

Semelhante a Executive Briefing: Why managing machines is harder than you think

Bridging the AI Gap: Building Stakeholder Support
Bridging the AI Gap: Building Stakeholder SupportBridging the AI Gap: Building Stakeholder Support
Bridging the AI Gap: Building Stakeholder SupportPeter Skomoroch
 
How to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPOHow to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPOProduct School
 
Productionising Machine Learning Models
Productionising Machine Learning ModelsProductionising Machine Learning Models
Productionising Machine Learning ModelsTash Bickley
 
How an AI-backed recommendation system can help increase revenue for your onl...
How an AI-backed recommendation system can help increase revenue for your onl...How an AI-backed recommendation system can help increase revenue for your onl...
How an AI-backed recommendation system can help increase revenue for your onl...Skyl.ai
 
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...Skyl.ai
 
Five Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data StrategyFive Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data StrategyPerficient, Inc.
 
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...Comit Projects Ltd
 
Tips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsTips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsAbhishek Sood
 
Getting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide WebinarGetting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide WebinarConcept Searching, Inc
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...DATAVERSITY
 
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...Concept Searching, Inc
 
Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...
Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...
Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...Manju Devadas
 
Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...
Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...
Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...Skyl.ai
 
Pluto7 - Tableau Webinar on enabling Organization to be Data Driven in 201...
Pluto7   -  Tableau Webinar on enabling Organization to be Data Driven in 201...Pluto7   -  Tableau Webinar on enabling Organization to be Data Driven in 201...
Pluto7 - Tableau Webinar on enabling Organization to be Data Driven in 201...Manju Devadas
 
FTFCU - How to Become a Data Driven Organization
FTFCU - How to Become a Data Driven OrganizationFTFCU - How to Become a Data Driven Organization
FTFCU - How to Become a Data Driven OrganizationNaveen Jain
 
Atlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdfAtlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdfSubrat Kumar Dash
 
Scaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsScaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsApplause
 
Analytics in manufacturing
Analytics in manufacturingAnalytics in manufacturing
Analytics in manufacturingSaurav Kumar
 
How to classify documents automatically using NLP
How to classify documents automatically using NLPHow to classify documents automatically using NLP
How to classify documents automatically using NLPSkyl.ai
 

Semelhante a Executive Briefing: Why managing machines is harder than you think (20)

Bridging the AI Gap: Building Stakeholder Support
Bridging the AI Gap: Building Stakeholder SupportBridging the AI Gap: Building Stakeholder Support
Bridging the AI Gap: Building Stakeholder Support
 
How to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPOHow to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPO
 
Productionising Machine Learning Models
Productionising Machine Learning ModelsProductionising Machine Learning Models
Productionising Machine Learning Models
 
How an AI-backed recommendation system can help increase revenue for your onl...
How an AI-backed recommendation system can help increase revenue for your onl...How an AI-backed recommendation system can help increase revenue for your onl...
How an AI-backed recommendation system can help increase revenue for your onl...
 
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
test - Future of Ecommerce: How to Improve the Online Shopping Experience Usi...
 
Five Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data StrategyFive Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data Strategy
 
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...
ETDP 2015 D1 SMAC & the Journey from Automation to Digital Factory - Snjeev K...
 
Tips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsTips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data Analytics
 
Getting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide WebinarGetting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide Webinar
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
 
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
 
Managing AI Products
Managing AI ProductsManaging AI Products
Managing AI Products
 
Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...
Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...
Machine Learning Application to Manufacturing using Tableau, Tableau and Goog...
 
Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...
Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...
Future of Ecommerce: How to Improve the Online Shopping Experience Using Mach...
 
Pluto7 - Tableau Webinar on enabling Organization to be Data Driven in 201...
Pluto7   -  Tableau Webinar on enabling Organization to be Data Driven in 201...Pluto7   -  Tableau Webinar on enabling Organization to be Data Driven in 201...
Pluto7 - Tableau Webinar on enabling Organization to be Data Driven in 201...
 
FTFCU - How to Become a Data Driven Organization
FTFCU - How to Become a Data Driven OrganizationFTFCU - How to Become a Data Driven Organization
FTFCU - How to Become a Data Driven Organization
 
Atlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdfAtlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdf
 
Scaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsScaling Training Data for AI Applications
Scaling Training Data for AI Applications
 
Analytics in manufacturing
Analytics in manufacturingAnalytics in manufacturing
Analytics in manufacturing
 
How to classify documents automatically using NLP
How to classify documents automatically using NLPHow to classify documents automatically using NLP
How to classify documents automatically using NLP
 

Mais de Peter Skomoroch

Managing Machines: The New AI Dev Stack
Managing Machines: The New AI Dev StackManaging Machines: The New AI Dev Stack
Managing Machines: The New AI Dev StackPeter Skomoroch
 
Building Competitive Moats With Data
Building Competitive Moats With DataBuilding Competitive Moats With Data
Building Competitive Moats With DataPeter Skomoroch
 
O'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data ExhaustO'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data ExhaustPeter Skomoroch
 
SF Data Science: Developing Data Products
SF Data Science: Developing Data ProductsSF Data Science: Developing Data Products
SF Data Science: Developing Data ProductsPeter Skomoroch
 
Skills, Reputation, and Search
Skills, Reputation, and SearchSkills, Reputation, and Search
Skills, Reputation, and SearchPeter Skomoroch
 
LinkedIn Endorsements: Reputation, Virality, and Social Tagging
LinkedIn Endorsements: Reputation, Virality, and Social TaggingLinkedIn Endorsements: Reputation, Virality, and Social Tagging
LinkedIn Endorsements: Reputation, Virality, and Social TaggingPeter Skomoroch
 
Developing Data Products
Developing Data ProductsDeveloping Data Products
Developing Data ProductsPeter Skomoroch
 
Practical Problem Solving with Data - Onlab Data Conference, Tokyo
Practical Problem Solving with Data - Onlab Data Conference, TokyoPractical Problem Solving with Data - Onlab Data Conference, Tokyo
Practical Problem Solving with Data - Onlab Data Conference, TokyoPeter Skomoroch
 
Street Fighting Data Science
Street Fighting Data ScienceStreet Fighting Data Science
Street Fighting Data SciencePeter Skomoroch
 
Data Mashups -Data Science Summit
Data Mashups -Data Science SummitData Mashups -Data Science Summit
Data Mashups -Data Science SummitPeter Skomoroch
 
Geo Analytics Tutorial - Where 2.0 2011
Geo Analytics Tutorial - Where 2.0 2011Geo Analytics Tutorial - Where 2.0 2011
Geo Analytics Tutorial - Where 2.0 2011Peter Skomoroch
 
Rapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopRapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopPeter Skomoroch
 
Prototyping Data Intensive Apps: TrendingTopics.org
Prototyping Data Intensive Apps: TrendingTopics.orgPrototyping Data Intensive Apps: TrendingTopics.org
Prototyping Data Intensive Apps: TrendingTopics.orgPeter Skomoroch
 

Mais de Peter Skomoroch (14)

Managing Machines: The New AI Dev Stack
Managing Machines: The New AI Dev StackManaging Machines: The New AI Dev Stack
Managing Machines: The New AI Dev Stack
 
Building Competitive Moats With Data
Building Competitive Moats With DataBuilding Competitive Moats With Data
Building Competitive Moats With Data
 
O'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data ExhaustO'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data Exhaust
 
SF Data Science: Developing Data Products
SF Data Science: Developing Data ProductsSF Data Science: Developing Data Products
SF Data Science: Developing Data Products
 
Skills, Reputation, and Search
Skills, Reputation, and SearchSkills, Reputation, and Search
Skills, Reputation, and Search
 
LinkedIn Endorsements: Reputation, Virality, and Social Tagging
LinkedIn Endorsements: Reputation, Virality, and Social TaggingLinkedIn Endorsements: Reputation, Virality, and Social Tagging
LinkedIn Endorsements: Reputation, Virality, and Social Tagging
 
Developing Data Products
Developing Data ProductsDeveloping Data Products
Developing Data Products
 
Practical Problem Solving with Data - Onlab Data Conference, Tokyo
Practical Problem Solving with Data - Onlab Data Conference, TokyoPractical Problem Solving with Data - Onlab Data Conference, Tokyo
Practical Problem Solving with Data - Onlab Data Conference, Tokyo
 
Street Fighting Data Science
Street Fighting Data ScienceStreet Fighting Data Science
Street Fighting Data Science
 
Data Mashups -Data Science Summit
Data Mashups -Data Science SummitData Mashups -Data Science Summit
Data Mashups -Data Science Summit
 
Geo Analytics Tutorial - Where 2.0 2011
Geo Analytics Tutorial - Where 2.0 2011Geo Analytics Tutorial - Where 2.0 2011
Geo Analytics Tutorial - Where 2.0 2011
 
Rapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopRapid Data Exploration With Hadoop
Rapid Data Exploration With Hadoop
 
Prototyping Data Intensive Apps: TrendingTopics.org
Prototyping Data Intensive Apps: TrendingTopics.orgPrototyping Data Intensive Apps: TrendingTopics.org
Prototyping Data Intensive Apps: TrendingTopics.org
 
Elasticwulf Pycon Talk
Elasticwulf Pycon TalkElasticwulf Pycon Talk
Elasticwulf Pycon Talk
 

Último

Farmer Representative Organization in Lucknow | Rashtriya Kisan Manch
Farmer Representative Organization in Lucknow | Rashtriya Kisan ManchFarmer Representative Organization in Lucknow | Rashtriya Kisan Manch
Farmer Representative Organization in Lucknow | Rashtriya Kisan ManchRashtriya Kisan Manch
 
Measuring True Process Yield using Robust Yield Metrics
Measuring True Process Yield using Robust Yield MetricsMeasuring True Process Yield using Robust Yield Metrics
Measuring True Process Yield using Robust Yield MetricsCIToolkit
 
Management and managerial skills training manual.pdf
Management and managerial skills training manual.pdfManagement and managerial skills training manual.pdf
Management and managerial skills training manual.pdffillmonipdc
 
LPC Warehouse Management System For Clients In The Business Sector
LPC Warehouse Management System For Clients In The Business SectorLPC Warehouse Management System For Clients In The Business Sector
LPC Warehouse Management System For Clients In The Business Sectorthomas851723
 
Introduction to LPC - Facility Design And Re-Engineering
Introduction to LPC - Facility Design And Re-EngineeringIntroduction to LPC - Facility Design And Re-Engineering
Introduction to LPC - Facility Design And Re-Engineeringthomas851723
 
Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)
Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)
Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)jennyeacort
 
Unlocking Productivity and Personal Growth through the Importance-Urgency Matrix
Unlocking Productivity and Personal Growth through the Importance-Urgency MatrixUnlocking Productivity and Personal Growth through the Importance-Urgency Matrix
Unlocking Productivity and Personal Growth through the Importance-Urgency MatrixCIToolkit
 
From Goals to Actions: Uncovering the Key Components of Improvement Roadmaps
From Goals to Actions: Uncovering the Key Components of Improvement RoadmapsFrom Goals to Actions: Uncovering the Key Components of Improvement Roadmaps
From Goals to Actions: Uncovering the Key Components of Improvement RoadmapsCIToolkit
 
Fifteenth Finance Commission Presentation
Fifteenth Finance Commission PresentationFifteenth Finance Commission Presentation
Fifteenth Finance Commission Presentationmintusiprd
 
Beyond the Five Whys: Exploring the Hierarchical Causes with the Why-Why Diagram
Beyond the Five Whys: Exploring the Hierarchical Causes with the Why-Why DiagramBeyond the Five Whys: Exploring the Hierarchical Causes with the Why-Why Diagram
Beyond the Five Whys: Exploring the Hierarchical Causes with the Why-Why DiagramCIToolkit
 
原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证
原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证
原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证jdkhjh
 
Board Diversity Initiaive Launch Presentation
Board Diversity Initiaive Launch PresentationBoard Diversity Initiaive Launch Presentation
Board Diversity Initiaive Launch Presentationcraig524401
 
How-How Diagram: A Practical Approach to Problem Resolution
How-How Diagram: A Practical Approach to Problem ResolutionHow-How Diagram: A Practical Approach to Problem Resolution
How-How Diagram: A Practical Approach to Problem ResolutionCIToolkit
 
Motivational theories an leadership skills
Motivational theories an leadership skillsMotivational theories an leadership skills
Motivational theories an leadership skillskristinalimarenko7
 
Simplifying Complexity: How the Four-Field Matrix Reshapes Thinking
Simplifying Complexity: How the Four-Field Matrix Reshapes ThinkingSimplifying Complexity: How the Four-Field Matrix Reshapes Thinking
Simplifying Complexity: How the Four-Field Matrix Reshapes ThinkingCIToolkit
 
Reflecting, turning experience into insight
Reflecting, turning experience into insightReflecting, turning experience into insight
Reflecting, turning experience into insightWayne Abrahams
 
LPC Operations Review PowerPoint | Operations Review
LPC Operations Review PowerPoint | Operations ReviewLPC Operations Review PowerPoint | Operations Review
LPC Operations Review PowerPoint | Operations Reviewthomas851723
 

Último (18)

Farmer Representative Organization in Lucknow | Rashtriya Kisan Manch
Farmer Representative Organization in Lucknow | Rashtriya Kisan ManchFarmer Representative Organization in Lucknow | Rashtriya Kisan Manch
Farmer Representative Organization in Lucknow | Rashtriya Kisan Manch
 
Measuring True Process Yield using Robust Yield Metrics
Measuring True Process Yield using Robust Yield MetricsMeasuring True Process Yield using Robust Yield Metrics
Measuring True Process Yield using Robust Yield Metrics
 
Management and managerial skills training manual.pdf
Management and managerial skills training manual.pdfManagement and managerial skills training manual.pdf
Management and managerial skills training manual.pdf
 
LPC Warehouse Management System For Clients In The Business Sector
LPC Warehouse Management System For Clients In The Business SectorLPC Warehouse Management System For Clients In The Business Sector
LPC Warehouse Management System For Clients In The Business Sector
 
Introduction to LPC - Facility Design And Re-Engineering
Introduction to LPC - Facility Design And Re-EngineeringIntroduction to LPC - Facility Design And Re-Engineering
Introduction to LPC - Facility Design And Re-Engineering
 
Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)
Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)
Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)
 
Unlocking Productivity and Personal Growth through the Importance-Urgency Matrix
Unlocking Productivity and Personal Growth through the Importance-Urgency MatrixUnlocking Productivity and Personal Growth through the Importance-Urgency Matrix
Unlocking Productivity and Personal Growth through the Importance-Urgency Matrix
 
From Goals to Actions: Uncovering the Key Components of Improvement Roadmaps
From Goals to Actions: Uncovering the Key Components of Improvement RoadmapsFrom Goals to Actions: Uncovering the Key Components of Improvement Roadmaps
From Goals to Actions: Uncovering the Key Components of Improvement Roadmaps
 
Fifteenth Finance Commission Presentation
Fifteenth Finance Commission PresentationFifteenth Finance Commission Presentation
Fifteenth Finance Commission Presentation
 
Beyond the Five Whys: Exploring the Hierarchical Causes with the Why-Why Diagram
Beyond the Five Whys: Exploring the Hierarchical Causes with the Why-Why DiagramBeyond the Five Whys: Exploring the Hierarchical Causes with the Why-Why Diagram
Beyond the Five Whys: Exploring the Hierarchical Causes with the Why-Why Diagram
 
原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证
原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证
原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证
 
Board Diversity Initiaive Launch Presentation
Board Diversity Initiaive Launch PresentationBoard Diversity Initiaive Launch Presentation
Board Diversity Initiaive Launch Presentation
 
How-How Diagram: A Practical Approach to Problem Resolution
How-How Diagram: A Practical Approach to Problem ResolutionHow-How Diagram: A Practical Approach to Problem Resolution
How-How Diagram: A Practical Approach to Problem Resolution
 
Motivational theories an leadership skills
Motivational theories an leadership skillsMotivational theories an leadership skills
Motivational theories an leadership skills
 
Simplifying Complexity: How the Four-Field Matrix Reshapes Thinking
Simplifying Complexity: How the Four-Field Matrix Reshapes ThinkingSimplifying Complexity: How the Four-Field Matrix Reshapes Thinking
Simplifying Complexity: How the Four-Field Matrix Reshapes Thinking
 
Reflecting, turning experience into insight
Reflecting, turning experience into insightReflecting, turning experience into insight
Reflecting, turning experience into insight
 
LPC Operations Review PowerPoint | Operations Review
LPC Operations Review PowerPoint | Operations ReviewLPC Operations Review PowerPoint | Operations Review
LPC Operations Review PowerPoint | Operations Review
 
sauth delhi call girls in Defence Colony🔝 9953056974 🔝 escort Service
sauth delhi call girls in Defence Colony🔝 9953056974 🔝 escort Servicesauth delhi call girls in Defence Colony🔝 9953056974 🔝 escort Service
sauth delhi call girls in Defence Colony🔝 9953056974 🔝 escort Service
 

Executive Briefing: Why managing machines is harder than you think

  • 1. Executive Briefing Why managing machines is harder than you think Peter Skomoroch - @peteskomoroch Strata Data Conference, London - May 1, 2019
  • 2. Background: Machine Learning & Data Products Peter Skomoroch @peteskomoroch • Co-Founder and CEO of SkipFlag, Enterprise AI startup acquired in 2018 by Workday • 18+ years building machine learning products • Principal Data Scientist, ran Data Products team at LinkedIn. ML & Search at MIT, AOL, ProfitLogic • Co-Host of O’Reilly AI Bots Podcast, Startup Advisor
  • 3. Better, Faster Decisions at Scale • Machine learning drove massive growth at consumer internet companies over the last decade • A wave of AI startups and vertical machine learning applications have emerged across other industries • For many problems, machine learning makes better, faster, and more repeatable decisions at scale • Amazon, Google, and Microsoft are now re-organizing themselves around AI
  • 4. Data Products Automated systems that collect and learn from data to make user facing decisions with machine learning
  • 5. Machine Learning Projects are Hard • The transition to machine learning will be about 100x harder than the transition to mobile • Companies that adopt an experimental culture can still succeed • Some of the biggest challenges are organizational, not technical • Data driven companies like Google and Facebook have a strategic advantage building ML products based on their data & compute assets, large user population, tracking & instrumentation, and AI talent
  • 6. If you only do things where you know the answer in advance, your company goes away. Jeff Bezos Founder, Chairman & CEO of Amazon.com • Machine Learning shifts engineering from a deterministic process to a probabilistic one • Take intelligent risks • Most successful ML products are experiments at massive scale • Companies driven by analytics and experimental insights are more likely to succeed Experimental Culture
  • 7. Data Pipelines & Analytics Before AI Credit: @mrogati
  • 8. ML Algorithms Need Lots of Labelled Data Common Crawl: ~4B pages monthly
  • 9. Combined Pools of Data Give Better Results https://www.flickr.com/photos/nakrnsm/3814916578 • Learning patterns across large numbers of customers is the power behind recommendations from companies like Amazon and Netflix • The more precise or nuanced a prediction, the more data will need to be pooled • You need large amounts of labelled training data • Transfer learning may help push these limits further
  • 10. Democratize Data Access • Allow teams across your company to combine real data to improve their product areas, design with data, and discover new insights • Share derived data and input features for ML models across teams • At LinkedIn we had a rich repository of signals like connection strength, inferred skills, and other datasets that greatly accelerated new product development • Empower small teams to build things quickly and compound returns on feature engineering & derived data See https://www.confluent.io/ebook/i-heart-logs-event-data-stream-processing-and-data-integration/
  • 11. Product Management for Machine Learning Image source: Martin Eriksson https://www.mindtheproduct.com/2011/10/what-exactly-is-a-product-manager/ • A Data Product Manager (PM) has core product skills (strategy, roadmaps, prioritization, etc.) along with an intuitive grasp of ML • They help identify and prioritize the highest value applications for machine learning and do what it takes to make them successful
  • 12. Good ML Product Managers Have Data Expertise • Know the difference between easy, hard, and impossible machine learning problems • Even if something is feasible from a machine learning perspective, the level of effort may not justify building the feature • Know your company’s data inside and out including quality issues, limitations, biases, and gaps that need to be addressed • Develop an intuitive understanding of your company’s data and how it can be used to solve customer problems
  • 13. Apply ML to a Metric the Business Cares About
  • 14. Machine Learning Product Development 1. Verify you are solving the right problem 2. Theory + model design (in parallel with UI design) 3. Data collection, labelling, and cleaning 4. Feature engineering, model training, offline validation 5. Model deployment, monitoring & large scale training • Iterate: repeat process, refine live model & improve • 80% of effort and gains come from iterations after shipping v 1.0 • Use derived data from the system to build new products
  • 15. ML Adds Uncertainty to Product Roadmaps • PMs are often uncomfortable with expensive ideas that have an uncertain probability of success • Many organizations will struggle to justify the expense of projects that require significant research investment upfront • Some ML products may need to be split into time boxed projects that get to market in a shorter time frame • What can you productize now vs. much later on? • Keep track of dependencies on other teams and have a “Plan B”
  • 16. Every single company I've worked at and talked to has the same problem without a single exception so far — poor data quality, especially tracking data Ruslan Belkin VP of Engineering, Salesforce.com • Guide user input when you can • Use auto suggest fields • Validate user inputs, emails • Collect user tags, votes, ratings • Track impressions, queries, clicks • Sessionize logs • Disambiguate and annotate entities (company names, locations, etc.) Data Quality & Standardization
  • 17. Testing Machine Learning Products • Algorithm work that drags on without integration in the product where it can be seen and tested by real users is risky • Ship a complete MVP in production ASAP, benchmark, and iterate • Beware unintended consequences from seemingly small product changes • Remember the prototype is not the product - see what happens when you use a more realistic data set or scale up your inputs • Real world data changes over time, ensure your model tests and benchmarks keep up with changes in underlying data • Machine learning systems tend to fail in unexpected ways
  • 18. Look at Your Input Data & Prediction Errors
  • 19. Flywheel Effects & Data Products • Users generate data as a side effect of using most software products • That data in turn, can improve the product’s algorithms and enable new types of recommendations, leading to more data • These “Flywheels” get better the more customers use them leading to unique competitive moats • This works well in platforms, networks or marketplaces where value compounds * https://medium.freecodecamp.org/the-business-implications-of-machine-learning-11480b99184d
  • 20. Final Thoughts • Machine learning products are hard to build, but within reach of teams who invest in data infrastructure • Some of the biggest challenges are organizational, not technical • Good product leaders are a key factor in shipping successful ML products • Find a machine learning application with a direct connection to a metric your organization values and ship it Send me questions! @peteskomoroch