SlideShare uma empresa Scribd logo
1 de 26
Baixar para ler offline
BRIDGING THE GAP
FROM DATA SCIENCE TO
SERVICE
32ND PYDATA LONDON MEETUP
Daniel F Moisset - dmoisset@ /machinalis.com @dmoisset
ABOUT ME
Hi! I'm Daniel Moisset!
I work at Machinalis
A special thanks to Marcos Spontón who also works there and inspired most of this talk.
WARNING: THIS IS NOT A TECH
TALK!
In other words:
THIS TALK IS NOT ABOUT ALGORITHMS,
MODELS, TOOLS, OR USE CASES
In event di ferent words:
THIS TALK IS ABOUT PEOPLE
SO, RAISIN BREAD
By je freyw (Mmm...raisin bread) [ ],CC BY 2.0 via Wikimedia Commons
Machine Learning development is like the raisins in a
raisin bread... you need the bread first. But, it's just a
few tiny raisins but without it you would just have
plain bread
— I don't really know who, but I love the analogy
WHO WANTS RAISIN BREAD
Di ferent organizations use your services:
1. Large companies with a live product and data, but without enough
expertise/manpower in DS: «we'd like to add some raisins to our
bread»
2. Small start-up, with maybe just a prototype, that want to get to
production-ready scalable MVP: «We want some bread». And «it
should have raisins now/at some point in the future»
IS THAT WHAT THEY ACTUALLY
NEED?
“All the cool kids are doing it” is not good enough reason.
— Seen on the internet
Raisin cookies that look like chocolate chip cookies are
the main reason I have trust issues
PART I: COMMUNICATING
WITH THE CUSTOMER
IT'S NOT JUST SOFTWARE
DEVELOPMENT!
It also has a heavy R&D component
Higher uncertainty
Results are probabilistic
THERE'S A PAPER ABOUT IT ≠ A
PRODUCT
The distance may not be something coverable today.
MODELS ARE AN ASSET
Investing time on it is not a “necessary evil”
What's produced on a modelling phase is a critical component
A model emerges from the client data and constraints, so it is
unique to the client and an advantage over competitors.
MACHINE LEARNING ≠
CLAIRVOYANCE
Garbage in, Garbage out
The solution may not be clear; you may be unsure of what problem
is more important; but your business goal should be clear. Data
Science will not make it clear for you.
AGREEING ON METRICS
Explain what are you measuring and why
Explain what are the baselines and how much you think you can
improve
Connect these to the business goals.
A PICTURE IS WORTH A
THOUSAND WORDS
Visualize your proposal.
Be minimalistic.
Use o f the shelf tools for a proposal.
PART II: PROVIDING THE
SERVICE
THE SERVICE IS THE END, DATA
SCIENCE IS THE MEANS
Do not fall in love with the challenge
JUST OUT OF THE BOX MAY BE
ENOUGH
You should always be asking yourself:
1. Have I already covered the expectations?
2. Will an improved result here actually improve value?
MEASURE TWICE, CUT ONCE
Get a look at the object of analysis before starting work. Has it desirable
qualities?
1. Manageable size?
2. It's in an accessible representation?
3. Does it have a reasonable distribution?
4. ...
INVOLVE THE PO
Validate your assumptions with a person familiar with your domain
1. Are there contradictions between your assumptions and their
knowledge?
2. Are there contradictions between the data you already have and
their knowledge?
Keep learning about the business side, encourage your business
counterpart to learn to talk with Data Scientists.
PART OF YOUR SERVICE IS NOT
DS
Make sure you use the right tools and people in each area
PART III: WORKING AS A TEAM
SHARE INFORMATION
Basic descriptive statistics should be shared with all involved, even the
non DS. People in a team must be aware of what's important and
what's not.
SHARE UNCERTAINTY
There are a lot of tradeo fs to make regarding milestones and
deadlines. People can plan better (and have contingency plans) if they
know what parts of the project have higher risks.
IT'S OK TO BUILD FLIMSY CODE,
AS LONG AS IT'S NOT
SOFTWARE
code: programming text that runs on a computer
so tware: programming text that is part of a deliverable.
There are di ferences:
code does not necessarily need tests.
code does not necessarily need to follow other processes.
sometimes the outputs of your code are deliverable and may have
to be treated specially.
THE DISCUSSION IS
JUST BEGINNING
I'D LOVE TO HEAR ABOUT WHAT YOU'VE
LEARNED ELSEWHERE
THANKS!
ANY QUESTIONS?
You can find me at twitter (@dmoisset) or by email (dmoisset@machinalis.com)

Mais conteúdo relacionado

Destaque

Conférence numérique éducatif - semaine de l'innovation
Conférence numérique éducatif - semaine de l'innovationConférence numérique éducatif - semaine de l'innovation
Conférence numérique éducatif - semaine de l'innovationJean-Baptiste Lesaulnier
 
2017 ZRAY SPORTS
2017 ZRAY SPORTS2017 ZRAY SPORTS
2017 ZRAY SPORTSSophia Cui
 
(株)自治体構想による三根庁舎旧議場の利活用
(株)自治体構想による三根庁舎旧議場の利活用(株)自治体構想による三根庁舎旧議場の利活用
(株)自治体構想による三根庁舎旧議場の利活用隆志 杉山
 
E2D3で地図を作画してみよう
E2D3で地図を作画してみようE2D3で地図を作画してみよう
E2D3で地図を作画してみようShigeo Ueda
 
How a CDCL SAT solver works
How a CDCL SAT solver worksHow a CDCL SAT solver works
How a CDCL SAT solver worksMasahiro Sakai
 
Marigo Raftopoulos for Gamification World Congress, Barcelona 2015
Marigo Raftopoulos for Gamification World Congress, Barcelona 2015Marigo Raftopoulos for Gamification World Congress, Barcelona 2015
Marigo Raftopoulos for Gamification World Congress, Barcelona 2015Dr. Marigo Raftopoulos
 
顔認識アルゴリズム:Constrained local model を調べてみた
顔認識アルゴリズム:Constrained local model を調べてみた顔認識アルゴリズム:Constrained local model を調べてみた
顔認識アルゴリズム:Constrained local model を調べてみたJotaro Shigeyama
 
神に近づくx/net/context (Finding God with x/net/context)
神に近づくx/net/context (Finding God with x/net/context)神に近づくx/net/context (Finding God with x/net/context)
神に近づくx/net/context (Finding God with x/net/context)guregu
 
Basculement du monde et géopolitique du monde
Basculement du monde et géopolitique du mondeBasculement du monde et géopolitique du monde
Basculement du monde et géopolitique du mondeJean-François Fiorina
 
298885937-Us-Naval-Incompetence
298885937-Us-Naval-Incompetence298885937-Us-Naval-Incompetence
298885937-Us-Naval-IncompetenceAgha A
 
Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...
Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...
Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...Andre Stern, CTS
 
Gentooプリインストールなノートパソコンの話
Gentooプリインストールなノートパソコンの話Gentooプリインストールなノートパソコンの話
Gentooプリインストールなノートパソコンの話Takuto Matsuu
 

Destaque (15)

Conférence numérique éducatif - semaine de l'innovation
Conférence numérique éducatif - semaine de l'innovationConférence numérique éducatif - semaine de l'innovation
Conférence numérique éducatif - semaine de l'innovation
 
Making The Most Of Internship
Making The Most Of Internship  Making The Most Of Internship
Making The Most Of Internship
 
2017 ZRAY SPORTS
2017 ZRAY SPORTS2017 ZRAY SPORTS
2017 ZRAY SPORTS
 
(株)自治体構想による三根庁舎旧議場の利活用
(株)自治体構想による三根庁舎旧議場の利活用(株)自治体構想による三根庁舎旧議場の利活用
(株)自治体構想による三根庁舎旧議場の利活用
 
E2D3で地図を作画してみよう
E2D3で地図を作画してみようE2D3で地図を作画してみよう
E2D3で地図を作画してみよう
 
GUIA PARA SALIR DE LA PRECARIEDAD LABORAL
GUIA PARA SALIR DE LA PRECARIEDAD LABORALGUIA PARA SALIR DE LA PRECARIEDAD LABORAL
GUIA PARA SALIR DE LA PRECARIEDAD LABORAL
 
How a CDCL SAT solver works
How a CDCL SAT solver worksHow a CDCL SAT solver works
How a CDCL SAT solver works
 
Marigo Raftopoulos for Gamification World Congress, Barcelona 2015
Marigo Raftopoulos for Gamification World Congress, Barcelona 2015Marigo Raftopoulos for Gamification World Congress, Barcelona 2015
Marigo Raftopoulos for Gamification World Congress, Barcelona 2015
 
顔認識アルゴリズム:Constrained local model を調べてみた
顔認識アルゴリズム:Constrained local model を調べてみた顔認識アルゴリズム:Constrained local model を調べてみた
顔認識アルゴリズム:Constrained local model を調べてみた
 
神に近づくx/net/context (Finding God with x/net/context)
神に近づくx/net/context (Finding God with x/net/context)神に近づくx/net/context (Finding God with x/net/context)
神に近づくx/net/context (Finding God with x/net/context)
 
Basculement du monde et géopolitique du monde
Basculement du monde et géopolitique du mondeBasculement du monde et géopolitique du monde
Basculement du monde et géopolitique du monde
 
298885937-Us-Naval-Incompetence
298885937-Us-Naval-Incompetence298885937-Us-Naval-Incompetence
298885937-Us-Naval-Incompetence
 
Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...
Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...
Infocomm Webinar 08/03/17 - Sistemas audiovisuais aplicados em avisos de emer...
 
Hair Extension Courses Manchester
Hair Extension Courses ManchesterHair Extension Courses Manchester
Hair Extension Courses Manchester
 
Gentooプリインストールなノートパソコンの話
Gentooプリインストールなノートパソコンの話Gentooプリインストールなノートパソコンの話
Gentooプリインストールなノートパソコンの話
 

Semelhante a Bridging the gap from data science to service

Future of IT preso
Future of IT presoFuture of IT preso
Future of IT presoLorna Garey
 
Digital transformation studies linkedin
Digital transformation studies linkedinDigital transformation studies linkedin
Digital transformation studies linkedinClaudete Mello
 
Data (by itself) Is Not Enough
Data (by itself) Is Not EnoughData (by itself) Is Not Enough
Data (by itself) Is Not EnoughCory Treffiletti
 
Information modelling (Stefan Berner): Extract
Information modelling (Stefan Berner): ExtractInformation modelling (Stefan Berner): Extract
Information modelling (Stefan Berner): Extractvdf Hochschulverlag AG
 
The Handy Guide to Cashing in the Currency of Networking
The Handy Guide to Cashing in the Currency of NetworkingThe Handy Guide to Cashing in the Currency of Networking
The Handy Guide to Cashing in the Currency of NetworkingHubilo
 
Exploring the Business Decision to Use Cloud Computing
Exploring the Business Decision to Use Cloud ComputingExploring the Business Decision to Use Cloud Computing
Exploring the Business Decision to Use Cloud ComputingDana Gardner
 
Digital Transformation Failure
Digital Transformation FailureDigital Transformation Failure
Digital Transformation FailureFrederik Bernard
 
Marketing Your Tech Talent
Marketing Your Tech TalentMarketing Your Tech Talent
Marketing Your Tech Talentdeirdrestraughan
 
Activities Computing - reading comprehension
Activities Computing - reading comprehensionActivities Computing - reading comprehension
Activities Computing - reading comprehensionCintia Santos
 
Prepare a wow demo - extreme365 2020
Prepare a wow demo  - extreme365 2020Prepare a wow demo  - extreme365 2020
Prepare a wow demo - extreme365 2020Nico Fernandez
 
Open Web Technologies and You - Durham College Student Integration Presentation
Open Web Technologies and You - Durham College Student Integration PresentationOpen Web Technologies and You - Durham College Student Integration Presentation
Open Web Technologies and You - Durham College Student Integration Presentationdarryl_lehmann
 
Accessibility Buy-In for Inclusive Product Week
Accessibility Buy-In for Inclusive Product WeekAccessibility Buy-In for Inclusive Product Week
Accessibility Buy-In for Inclusive Product WeekKat K. Richards
 
Why do most machine learning projects never make it to production
Why do most machine learning projects never make it to productionWhy do most machine learning projects never make it to production
Why do most machine learning projects never make it to productionCameron Vetter
 
Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...
Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...
Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...Dana Gardner
 
New Era Of Corporate Communications Riaan Vanmeulen Fnb
New Era Of Corporate Communications Riaan Vanmeulen   FnbNew Era Of Corporate Communications Riaan Vanmeulen   Fnb
New Era Of Corporate Communications Riaan Vanmeulen Fnbguest22cb1ea7
 
Interview for saby upadhyay
Interview for  saby upadhyayInterview for  saby upadhyay
Interview for saby upadhyayCameronDonovan
 

Semelhante a Bridging the gap from data science to service (20)

Future of IT preso
Future of IT presoFuture of IT preso
Future of IT preso
 
Digital transformation studies linkedin
Digital transformation studies linkedinDigital transformation studies linkedin
Digital transformation studies linkedin
 
Data (by itself) Is Not Enough
Data (by itself) Is Not EnoughData (by itself) Is Not Enough
Data (by itself) Is Not Enough
 
2014 Technical Communication Conference Program
2014 Technical Communication Conference Program2014 Technical Communication Conference Program
2014 Technical Communication Conference Program
 
Information modelling (Stefan Berner): Extract
Information modelling (Stefan Berner): ExtractInformation modelling (Stefan Berner): Extract
Information modelling (Stefan Berner): Extract
 
Matchbox presentation
Matchbox presentation Matchbox presentation
Matchbox presentation
 
The Handy Guide to Cashing in the Currency of Networking
The Handy Guide to Cashing in the Currency of NetworkingThe Handy Guide to Cashing in the Currency of Networking
The Handy Guide to Cashing in the Currency of Networking
 
Exploring the Business Decision to Use Cloud Computing
Exploring the Business Decision to Use Cloud ComputingExploring the Business Decision to Use Cloud Computing
Exploring the Business Decision to Use Cloud Computing
 
Digital Transformation Failure
Digital Transformation FailureDigital Transformation Failure
Digital Transformation Failure
 
Marketing Your Tech Talent
Marketing Your Tech TalentMarketing Your Tech Talent
Marketing Your Tech Talent
 
Activities Computing - reading comprehension
Activities Computing - reading comprehensionActivities Computing - reading comprehension
Activities Computing - reading comprehension
 
slide3.pptx
slide3.pptxslide3.pptx
slide3.pptx
 
Prepare a wow demo - extreme365 2020
Prepare a wow demo  - extreme365 2020Prepare a wow demo  - extreme365 2020
Prepare a wow demo - extreme365 2020
 
Open Web Technologies and You - Durham College Student Integration Presentation
Open Web Technologies and You - Durham College Student Integration PresentationOpen Web Technologies and You - Durham College Student Integration Presentation
Open Web Technologies and You - Durham College Student Integration Presentation
 
Boursiquot "Privacy and The Effective Search Experience"
Boursiquot "Privacy and The Effective Search Experience"Boursiquot "Privacy and The Effective Search Experience"
Boursiquot "Privacy and The Effective Search Experience"
 
Accessibility Buy-In for Inclusive Product Week
Accessibility Buy-In for Inclusive Product WeekAccessibility Buy-In for Inclusive Product Week
Accessibility Buy-In for Inclusive Product Week
 
Why do most machine learning projects never make it to production
Why do most machine learning projects never make it to productionWhy do most machine learning projects never make it to production
Why do most machine learning projects never make it to production
 
Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...
Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...
Industrializing Data Science: Transform into an End-to-End, Analytics-Oriente...
 
New Era Of Corporate Communications Riaan Vanmeulen Fnb
New Era Of Corporate Communications Riaan Vanmeulen   FnbNew Era Of Corporate Communications Riaan Vanmeulen   Fnb
New Era Of Corporate Communications Riaan Vanmeulen Fnb
 
Interview for saby upadhyay
Interview for  saby upadhyayInterview for  saby upadhyay
Interview for saby upadhyay
 

Último

2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 

Último (20)

2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 

Bridging the gap from data science to service

  • 1. BRIDGING THE GAP FROM DATA SCIENCE TO SERVICE 32ND PYDATA LONDON MEETUP Daniel F Moisset - dmoisset@ /machinalis.com @dmoisset
  • 2. ABOUT ME Hi! I'm Daniel Moisset! I work at Machinalis A special thanks to Marcos Spontón who also works there and inspired most of this talk.
  • 3. WARNING: THIS IS NOT A TECH TALK! In other words: THIS TALK IS NOT ABOUT ALGORITHMS, MODELS, TOOLS, OR USE CASES In event di ferent words: THIS TALK IS ABOUT PEOPLE
  • 4. SO, RAISIN BREAD By je freyw (Mmm...raisin bread) [ ],CC BY 2.0 via Wikimedia Commons
  • 5. Machine Learning development is like the raisins in a raisin bread... you need the bread first. But, it's just a few tiny raisins but without it you would just have plain bread — I don't really know who, but I love the analogy
  • 6. WHO WANTS RAISIN BREAD Di ferent organizations use your services: 1. Large companies with a live product and data, but without enough expertise/manpower in DS: «we'd like to add some raisins to our bread» 2. Small start-up, with maybe just a prototype, that want to get to production-ready scalable MVP: «We want some bread». And «it should have raisins now/at some point in the future»
  • 7. IS THAT WHAT THEY ACTUALLY NEED? “All the cool kids are doing it” is not good enough reason. — Seen on the internet Raisin cookies that look like chocolate chip cookies are the main reason I have trust issues
  • 9. IT'S NOT JUST SOFTWARE DEVELOPMENT! It also has a heavy R&D component Higher uncertainty Results are probabilistic
  • 10. THERE'S A PAPER ABOUT IT ≠ A PRODUCT The distance may not be something coverable today.
  • 11. MODELS ARE AN ASSET Investing time on it is not a “necessary evil” What's produced on a modelling phase is a critical component A model emerges from the client data and constraints, so it is unique to the client and an advantage over competitors.
  • 12. MACHINE LEARNING ≠ CLAIRVOYANCE Garbage in, Garbage out The solution may not be clear; you may be unsure of what problem is more important; but your business goal should be clear. Data Science will not make it clear for you.
  • 13. AGREEING ON METRICS Explain what are you measuring and why Explain what are the baselines and how much you think you can improve Connect these to the business goals.
  • 14. A PICTURE IS WORTH A THOUSAND WORDS Visualize your proposal. Be minimalistic. Use o f the shelf tools for a proposal.
  • 15. PART II: PROVIDING THE SERVICE
  • 16. THE SERVICE IS THE END, DATA SCIENCE IS THE MEANS Do not fall in love with the challenge
  • 17. JUST OUT OF THE BOX MAY BE ENOUGH You should always be asking yourself: 1. Have I already covered the expectations? 2. Will an improved result here actually improve value?
  • 18. MEASURE TWICE, CUT ONCE Get a look at the object of analysis before starting work. Has it desirable qualities? 1. Manageable size? 2. It's in an accessible representation? 3. Does it have a reasonable distribution? 4. ...
  • 19. INVOLVE THE PO Validate your assumptions with a person familiar with your domain 1. Are there contradictions between your assumptions and their knowledge? 2. Are there contradictions between the data you already have and their knowledge? Keep learning about the business side, encourage your business counterpart to learn to talk with Data Scientists.
  • 20. PART OF YOUR SERVICE IS NOT DS Make sure you use the right tools and people in each area
  • 21. PART III: WORKING AS A TEAM
  • 22. SHARE INFORMATION Basic descriptive statistics should be shared with all involved, even the non DS. People in a team must be aware of what's important and what's not.
  • 23. SHARE UNCERTAINTY There are a lot of tradeo fs to make regarding milestones and deadlines. People can plan better (and have contingency plans) if they know what parts of the project have higher risks.
  • 24. IT'S OK TO BUILD FLIMSY CODE, AS LONG AS IT'S NOT SOFTWARE code: programming text that runs on a computer so tware: programming text that is part of a deliverable. There are di ferences: code does not necessarily need tests. code does not necessarily need to follow other processes. sometimes the outputs of your code are deliverable and may have to be treated specially.
  • 25. THE DISCUSSION IS JUST BEGINNING I'D LOVE TO HEAR ABOUT WHAT YOU'VE LEARNED ELSEWHERE
  • 26. THANKS! ANY QUESTIONS? You can find me at twitter (@dmoisset) or by email (dmoisset@machinalis.com)