SlideShare uma empresa Scribd logo
1 de 22
Baixar para ler offline
How Will AI Change the Role of
the Data Scientist?


Hugo Gävert
@hgavert
Helsinki Data Science meet-up 2017-02-16
Who am I?
Currently:

Chief Data Scientist @ Sanoma
Past:
• HUT Infolab
• Xtract
• Nokia
Hugo Gävert, 2017-02-16
Artificial Intelligence
World Goals Use cases Examples
Special
purpose AI
Restricted, clear
inputs
Well defined,
finite
- Recommendation
engines,
- Credit scoring,
- Insurance claim
handling
- Image recognition
- Playing games;
chess, go, ping
pong, …
- Driving car
- GOFAI,
- ML,
- ANN / Deep
Learning
- Expert systems
- Supervised
- Unsupervised
- Reinforcement
General AI
Open, chaotic,
messy inputs
Poorly defined,
unconstrained
Requirements:
- Reasoning,
- communication,
- learning new
things
- ability to apply
skills to new
problems
- Design better AI
- Whole brain
simulation?
- Robotic form?
- Sensing?
- Manipulating the
world?
Super human intelligence?
Hugo Gävert, 2017-02-16
Artificial Super-Intelligence
Human
Intelligence
Artificial
Intelligence
Intelligence/Performance
Time
Games Expert tasks Mundane tasks
- Checkers, 1952 / 1994
- Backgammon, 1979
- Othello, Chess, 1997
- Jeopardy, 2010
- Go, 2016
- Poker, 2017
- Theorem proving, eq solving
- Credit scoring / probability
to default, insurance claim
fraud
- Medical diagnosis
- Speech to text, translation…
- Image recognition
- Natural language /
understanding text
- Walking
- Object manipulation
- Driving cars
Lieutenant Commander Data, year 2338?
Human Level

Machine Intelligence:
10%: 2020
50%: 2040-2050
90%: 2080-2100
Hugo Gävert, 2017-02-16
• Original ideas inspired by brains, but nowadays it’s more engineering for machine
learning tasks.
• Artificial Neural Network ≈ Layers of connected simple neurons
• Multiple different architectures for different uses
Neural Networks?
A cartoon drawing of a biological neuron (left) and its mathematical model (right).
Stanford CS231n: Convolutional Neural Networks for Visual Recognition
Hugo Gävert, 2017-02-16
http://playground.tensorflow.org/
Hugo Gävert, 2017-02-16
Why Deep Learning?
• Rebranded artificial neural networks, so what is different now?
Big Data
- Text, images, video
- Large annotated data
sources, like images
155k words, 117k senses
14M images, 1M BBoxes, 22k synsets
Computational power
Some new algorithms;
ReLU, dropouts,
initializations, ConvNets
-4 -3 -2 -1 0 1 2 3 4
-1
1
-4 -3 -2 -1 0 1 2 3 4
-1
1
-4 -3 -2 -1 0 1 2 3 4
-1
1
Hugo Gävert, 2017-02-16
Deep Belief Networks
• 2006, Geoff Hinton: A Fast Learning Algorithm for Deep Belief Networks
• First major results in 2009 in Acoustic Model using Deep Belief Networks

—> Speech recognition
• What is it?
• Multilayer feedforward network with
• Input layer
• Many hidden layers
• Output layer
• Training…
Train as RBM
Train as RBM
Train with
backpropagation
Hugo Gävert, 2017-02-16
From feature engineering to feature learning
Input Output
Hand
designed
program
Rule-based AI
Trained
classifier
Input Output
Hand
designed
features
Classic ML
Features
Trained
classifier
Input Output
Representation

Learning
Simple
features
Mid level
abstract
features
Trained
classifier
Input Output
High level
abstract
features
Deep

Learning
Hugo Gävert, 2017-02-16
• Deep Belief Networks have largely been replaced by convolutional networks for image recognition
• Architecture, layers:
• Input (width, height, depth = RGB)
• Convolutional layer
• Neuron calculates convolution of the weights over the local image area
• N filters with size (width, height, N)
• Relu activation layer
• Pooling layer
• Downsampling along the spatial width and height dimension
• Fully connected layer (output: 1 x 1 x num of classes)
• The conv + relu + pooling layers are repeated.
• Of course, other architectures also…
Convolutional networks?
Hugo Gävert, 2017-02-16
Example, 17 layers, 7000 params.
http://cs231n.stanford.edu/Hugo Gävert, 2017-02-16
More example layers…
Hugo Gävert, 2017-02-16
Convolutional networks - What is deep?
• AlexNet, 2012
• ImageNet challenge, top 5 error rate 16% (previous 26%)
• 5 conv, max-pooling, drop-out layers, 3 fully connected
• ZF Net, 2013
• Top 5 error rate 11.2%
• Similar architecture, only 10% of training data
• DeConvNet - visualisation of the layers
• VGG Net, 2014
• Top 5 error rate 7.3%
• 19 layers, but simple 3x3 convolution and 2x2 max pooling
• CNNs need to be deep, but otherwise simple
• GoogLeNet, 2015
• Top 5 error rate 6.7%
• 22 layers, but has inception-modules that do work in parallel
• Microsoft ResNet, 2015
• Top 5 error rate 3.6% (better than human)
• 152 layers, ultra deep
Hugo Gävert, 2017-02-16
Speech Recognition at Google
Brandon Ballinger: Deep Learning and the Dream of AI, Strata 2013
Jaitly et al (2012), Application of pretrained deep neural networks to LVSRHugo Gävert, 2017-02-16
Chatbots and AI
• Speech recognition ok
• Natural language
understanding needs work
• Logic
• If … then…
• No memory in session
• Behavior / approach
• Reactive, just answers
questions
• Proactive would be helpful…
Hugo Gävert, 2017-02-16
Products you should test / use
• Google APIs
• Machine learning platform (Deep
Learning: TensorFlow)
• Natural Language API
• Speech API
• Translation API
• Vision API
• IBM Watson analytics…
• Also, some of the famous image
ConvNets are downloadable in pre-
trained format
• MS Azure ML (Cortana analytics,
cognitive services)
• Deep Learning: CNTK
• Vision: Face API, Emotion API,
Computer Vision API, Content
Moderation API
• Recommendations API, Academic
knowledge API, Entity linking API,
Anomaly Detection
• Language: Text Analysis, Web
Language Model, spell checking,
translation
• Speech: Speech to text, speaker
identification, translation
Hugo Gävert, 2017-02-16
So is AI going to take the job of Data Scientists?
• Yes, absolutely
• Why?
• We, the data scientists, are building the
AI - we’re lazy, we build AI to do our
job…
• Harder to build the robots (or cars,
trucks, flying machines) than to just run
the AI inside computer. The early use
cases will be confined in the computers.
• When?
• Not very soon…
Hugo Gävert, 2017-02-16
What does typical data science project look like?
Business
understanding
Data understanding
and quality
Data pre-processing
Feature engineering
Modeling
Evaluation
Production
deployment
Hugo Gävert, 2017-02-16
What does typical data science project look like?
Business
understanding
Data understanding
and quality
Data pre-processing
Feature engineering
Modeling
Evaluation
Production
deployment
Data collection
design
Monitoring, control
Feature learning
Deep Learning
architecture
Communications,
internal consulting
How do we get
representative data for
the network?
Ok, images easy - how
about others?
Does it work?
Still expected results?
Fraudulent use?
What is this Black Box?
APIs
Hugo Gävert, 2017-02-16
Recommendations for
Data Scientists
• Keep on doing what you do
• Evolve with the world
• You still need
• Math; stats, probabilities, linear algebra…
• Algorithms and data structures
• You also need now
• Deep Learning (hype!)
• More communications skills
• Software writing & engineering skills (APIs)
• Google and Stack Overflow helps…
Hugo Gävert, 2017-02-16
Recommendations for companies
• Data
• Create data strategy; collect, store and make data available
• Data is key business asset in building AI capability. Deep
Learning needs data in training. Software can be replicated,
but data cannot - if a business has data, then it’s already in
better position than competitors.
• Hire talent
• AI models need to be customized for the business need,
application and context.
• Downloading open source software is not enough.
Applying it is far from trivial. The APIs solve only specific
problems and are too much black boxes.
• You need to be able to explain the models to customers -
specially in the legal, finance, insurance, health etc.
business.
“The best ideas
come from the guys
closest to the data.”
Todd Holloway
Head of Data Science at Trulia.
Hugo Gävert, 2017-02-16
Thanks!
Hugo Gävert
hgavert@gmail.com
@hgavert

Mais conteúdo relacionado

Mais procurados

International Journal of Artificial Intelligence and Soft Computing (IJAISC)
International Journal of Artificial Intelligence and Soft Computing (IJAISC)International Journal of Artificial Intelligence and Soft Computing (IJAISC)
International Journal of Artificial Intelligence and Soft Computing (IJAISC)MiajackB
 
Mathematics, Machine Learning and ML Engineering
Mathematics, Machine Learning and ML EngineeringMathematics, Machine Learning and ML Engineering
Mathematics, Machine Learning and ML EngineeringGopi Krishna Nuti
 
Andrii Belas "Modern approaches to working with categorical data in machine l...
Andrii Belas "Modern approaches to working with categorical data in machine l...Andrii Belas "Modern approaches to working with categorical data in machine l...
Andrii Belas "Modern approaches to working with categorical data in machine l...Lviv Startup Club
 
Bringing ML To Production, What Is Missing? AMLD 2020
Bringing ML To Production, What Is Missing? AMLD 2020Bringing ML To Production, What Is Missing? AMLD 2020
Bringing ML To Production, What Is Missing? AMLD 2020Mikio L. Braun
 
Machine Learning, Artificial General Intelligence, and Robots with Human Minds
Machine Learning, Artificial General Intelligence, and Robots with Human MindsMachine Learning, Artificial General Intelligence, and Robots with Human Minds
Machine Learning, Artificial General Intelligence, and Robots with Human MindsUniversity of Huddersfield
 
Artificial general intelligence research project at Keen Software House (3/2015)
Artificial general intelligence research project at Keen Software House (3/2015)Artificial general intelligence research project at Keen Software House (3/2015)
Artificial general intelligence research project at Keen Software House (3/2015)Marek Rosa
 
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...AI Frontiers
 
3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine LearningNeo4j
 
General introduction to AI ML DL DS
General introduction to AI ML DL DSGeneral introduction to AI ML DL DS
General introduction to AI ML DL DSRoopesh Kohad
 
10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systemsXavier Amatriain
 
Introduction to Artificial Intelligence
Introduction to Artificial IntelligenceIntroduction to Artificial Intelligence
Introduction to Artificial Intelligenceananth
 
AI Deep Learning - CF Machine Learning
AI Deep Learning - CF Machine LearningAI Deep Learning - CF Machine Learning
AI Deep Learning - CF Machine LearningKarl Seiler
 
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4jAI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4jIvan Zoratti
 

Mais procurados (16)

International Journal of Artificial Intelligence and Soft Computing (IJAISC)
International Journal of Artificial Intelligence and Soft Computing (IJAISC)International Journal of Artificial Intelligence and Soft Computing (IJAISC)
International Journal of Artificial Intelligence and Soft Computing (IJAISC)
 
Ml - A shallow dive
Ml  - A shallow diveMl  - A shallow dive
Ml - A shallow dive
 
Presentation v3
Presentation v3Presentation v3
Presentation v3
 
Mathematics, Machine Learning and ML Engineering
Mathematics, Machine Learning and ML EngineeringMathematics, Machine Learning and ML Engineering
Mathematics, Machine Learning and ML Engineering
 
Andrii Belas "Modern approaches to working with categorical data in machine l...
Andrii Belas "Modern approaches to working with categorical data in machine l...Andrii Belas "Modern approaches to working with categorical data in machine l...
Andrii Belas "Modern approaches to working with categorical data in machine l...
 
Bringing ML To Production, What Is Missing? AMLD 2020
Bringing ML To Production, What Is Missing? AMLD 2020Bringing ML To Production, What Is Missing? AMLD 2020
Bringing ML To Production, What Is Missing? AMLD 2020
 
Machine Learning, Artificial General Intelligence, and Robots with Human Minds
Machine Learning, Artificial General Intelligence, and Robots with Human MindsMachine Learning, Artificial General Intelligence, and Robots with Human Minds
Machine Learning, Artificial General Intelligence, and Robots with Human Minds
 
Artificial general intelligence research project at Keen Software House (3/2015)
Artificial general intelligence research project at Keen Software House (3/2015)Artificial general intelligence research project at Keen Software House (3/2015)
Artificial general intelligence research project at Keen Software House (3/2015)
 
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
 
3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning
 
General introduction to AI ML DL DS
General introduction to AI ML DL DSGeneral introduction to AI ML DL DS
General introduction to AI ML DL DS
 
AI for HRM
AI for HRMAI for HRM
AI for HRM
 
10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems
 
Introduction to Artificial Intelligence
Introduction to Artificial IntelligenceIntroduction to Artificial Intelligence
Introduction to Artificial Intelligence
 
AI Deep Learning - CF Machine Learning
AI Deep Learning - CF Machine LearningAI Deep Learning - CF Machine Learning
AI Deep Learning - CF Machine Learning
 
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4jAI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
 

Destaque

TEDx Manchester: AI & The Future of Work
TEDx Manchester: AI & The Future of WorkTEDx Manchester: AI & The Future of Work
TEDx Manchester: AI & The Future of WorkVolker Hirsch
 
GreenBiz 17 Tutorial Slides: "How Corporates are Aligning with the Sustainabl...
GreenBiz 17 Tutorial Slides: "How Corporates are Aligning with the Sustainabl...GreenBiz 17 Tutorial Slides: "How Corporates are Aligning with the Sustainabl...
GreenBiz 17 Tutorial Slides: "How Corporates are Aligning with the Sustainabl...GreenBiz Group
 
FIWARE: Managing Context Information at large scale
FIWARE: Managing Context Information at large scaleFIWARE: Managing Context Information at large scale
FIWARE: Managing Context Information at large scaleFermin Galan
 
3 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 20173 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 2017Drift
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks남주 김
 
Deep Learning and Reinforcement Learning
Deep Learning and Reinforcement LearningDeep Learning and Reinforcement Learning
Deep Learning and Reinforcement LearningRenārs Liepiņš
 
Tiedonhallinnan ongelmat ja semanttisen teknologian keinot
Tiedonhallinnan ongelmat ja semanttisen teknologian keinotTiedonhallinnan ongelmat ja semanttisen teknologian keinot
Tiedonhallinnan ongelmat ja semanttisen teknologian keinotHeimo Hänninen
 
Smart Data Webinar: A Roadmap for Deploying Modern AI in Business
Smart Data Webinar: A Roadmap for Deploying Modern AI in BusinessSmart Data Webinar: A Roadmap for Deploying Modern AI in Business
Smart Data Webinar: A Roadmap for Deploying Modern AI in BusinessDATAVERSITY
 
GLOA:A New Job Scheduling Algorithm for Grid Computing
GLOA:A New Job Scheduling Algorithm for Grid ComputingGLOA:A New Job Scheduling Algorithm for Grid Computing
GLOA:A New Job Scheduling Algorithm for Grid ComputingLINE+
 
How to build Open Hardware self-navigating car robot
How to build Open Hardware self-navigating car robotHow to build Open Hardware self-navigating car robot
How to build Open Hardware self-navigating car robotTomáš Jukin
 
(Kpi summer school 2015) theano tutorial part1
(Kpi summer school 2015) theano tutorial part1(Kpi summer school 2015) theano tutorial part1
(Kpi summer school 2015) theano tutorial part1Serhii Havrylov
 
(Kpi summer school 2015) theano tutorial part2
(Kpi summer school 2015) theano tutorial part2(Kpi summer school 2015) theano tutorial part2
(Kpi summer school 2015) theano tutorial part2Serhii Havrylov
 
"Continuous Digital Biomarkers from Wearable Devices" - Brandon Ballinger (Co...
"Continuous Digital Biomarkers from Wearable Devices" - Brandon Ballinger (Co..."Continuous Digital Biomarkers from Wearable Devices" - Brandon Ballinger (Co...
"Continuous Digital Biomarkers from Wearable Devices" - Brandon Ballinger (Co...Hyper Wellbeing
 
H2O World - GLM - Tomas Nykodym
H2O World - GLM - Tomas NykodymH2O World - GLM - Tomas Nykodym
H2O World - GLM - Tomas NykodymSri Ambati
 
AI State of Play Dec 2016 NYC
AI State of Play Dec 2016 NYCAI State of Play Dec 2016 NYC
AI State of Play Dec 2016 NYCPeter Morgan
 
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
Real-world applications of AI - Daniel Hulme @ PAPIs ConnectReal-world applications of AI - Daniel Hulme @ PAPIs Connect
Real-world applications of AI - Daniel Hulme @ PAPIs ConnectPAPIs.io
 
The reliability of IoT solutions in the healthcare sector
The reliability of IoT solutions in the healthcare sectorThe reliability of IoT solutions in the healthcare sector
The reliability of IoT solutions in the healthcare sectorPhu H. Nguyen
 
Individual-In-The-Loop (for Ethically Aligned Artificial Intelligence)
Individual-In-The-Loop (for Ethically Aligned Artificial Intelligence)Individual-In-The-Loop (for Ethically Aligned Artificial Intelligence)
Individual-In-The-Loop (for Ethically Aligned Artificial Intelligence)John C. Havens
 
Using python to analyze spatial data
Using python to analyze spatial dataUsing python to analyze spatial data
Using python to analyze spatial dataKudos S.A.S
 
Deep Learning Jeff-Shomaker_1-20-17_Final_
Deep Learning Jeff-Shomaker_1-20-17_Final_Deep Learning Jeff-Shomaker_1-20-17_Final_
Deep Learning Jeff-Shomaker_1-20-17_Final_Jeffrey Shomaker
 

Destaque (20)

TEDx Manchester: AI & The Future of Work
TEDx Manchester: AI & The Future of WorkTEDx Manchester: AI & The Future of Work
TEDx Manchester: AI & The Future of Work
 
GreenBiz 17 Tutorial Slides: "How Corporates are Aligning with the Sustainabl...
GreenBiz 17 Tutorial Slides: "How Corporates are Aligning with the Sustainabl...GreenBiz 17 Tutorial Slides: "How Corporates are Aligning with the Sustainabl...
GreenBiz 17 Tutorial Slides: "How Corporates are Aligning with the Sustainabl...
 
FIWARE: Managing Context Information at large scale
FIWARE: Managing Context Information at large scaleFIWARE: Managing Context Information at large scale
FIWARE: Managing Context Information at large scale
 
3 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 20173 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 2017
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks
 
Deep Learning and Reinforcement Learning
Deep Learning and Reinforcement LearningDeep Learning and Reinforcement Learning
Deep Learning and Reinforcement Learning
 
Tiedonhallinnan ongelmat ja semanttisen teknologian keinot
Tiedonhallinnan ongelmat ja semanttisen teknologian keinotTiedonhallinnan ongelmat ja semanttisen teknologian keinot
Tiedonhallinnan ongelmat ja semanttisen teknologian keinot
 
Smart Data Webinar: A Roadmap for Deploying Modern AI in Business
Smart Data Webinar: A Roadmap for Deploying Modern AI in BusinessSmart Data Webinar: A Roadmap for Deploying Modern AI in Business
Smart Data Webinar: A Roadmap for Deploying Modern AI in Business
 
GLOA:A New Job Scheduling Algorithm for Grid Computing
GLOA:A New Job Scheduling Algorithm for Grid ComputingGLOA:A New Job Scheduling Algorithm for Grid Computing
GLOA:A New Job Scheduling Algorithm for Grid Computing
 
How to build Open Hardware self-navigating car robot
How to build Open Hardware self-navigating car robotHow to build Open Hardware self-navigating car robot
How to build Open Hardware self-navigating car robot
 
(Kpi summer school 2015) theano tutorial part1
(Kpi summer school 2015) theano tutorial part1(Kpi summer school 2015) theano tutorial part1
(Kpi summer school 2015) theano tutorial part1
 
(Kpi summer school 2015) theano tutorial part2
(Kpi summer school 2015) theano tutorial part2(Kpi summer school 2015) theano tutorial part2
(Kpi summer school 2015) theano tutorial part2
 
"Continuous Digital Biomarkers from Wearable Devices" - Brandon Ballinger (Co...
"Continuous Digital Biomarkers from Wearable Devices" - Brandon Ballinger (Co..."Continuous Digital Biomarkers from Wearable Devices" - Brandon Ballinger (Co...
"Continuous Digital Biomarkers from Wearable Devices" - Brandon Ballinger (Co...
 
H2O World - GLM - Tomas Nykodym
H2O World - GLM - Tomas NykodymH2O World - GLM - Tomas Nykodym
H2O World - GLM - Tomas Nykodym
 
AI State of Play Dec 2016 NYC
AI State of Play Dec 2016 NYCAI State of Play Dec 2016 NYC
AI State of Play Dec 2016 NYC
 
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
Real-world applications of AI - Daniel Hulme @ PAPIs ConnectReal-world applications of AI - Daniel Hulme @ PAPIs Connect
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
 
The reliability of IoT solutions in the healthcare sector
The reliability of IoT solutions in the healthcare sectorThe reliability of IoT solutions in the healthcare sector
The reliability of IoT solutions in the healthcare sector
 
Individual-In-The-Loop (for Ethically Aligned Artificial Intelligence)
Individual-In-The-Loop (for Ethically Aligned Artificial Intelligence)Individual-In-The-Loop (for Ethically Aligned Artificial Intelligence)
Individual-In-The-Loop (for Ethically Aligned Artificial Intelligence)
 
Using python to analyze spatial data
Using python to analyze spatial dataUsing python to analyze spatial data
Using python to analyze spatial data
 
Deep Learning Jeff-Shomaker_1-20-17_Final_
Deep Learning Jeff-Shomaker_1-20-17_Final_Deep Learning Jeff-Shomaker_1-20-17_Final_
Deep Learning Jeff-Shomaker_1-20-17_Final_
 

Semelhante a How Will AI Change the Role of the Data Scientist?

Using Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsUsing Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsRakuten Group, Inc.
 
Hoe een efficiënte Machine of Deep Learning backend ontwikkelen?
Hoe een efficiënte Machine of Deep Learning backend ontwikkelen?Hoe een efficiënte Machine of Deep Learning backend ontwikkelen?
Hoe een efficiënte Machine of Deep Learning backend ontwikkelen?Agentschap Innoveren & Ondernemen
 
Big Data & Artificial Intelligence
Big Data & Artificial IntelligenceBig Data & Artificial Intelligence
Big Data & Artificial IntelligenceZavain Dar
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Safe Software
 
2023-My AI Experience - Colm Dunphy.pdf
2023-My AI Experience - Colm Dunphy.pdf2023-My AI Experience - Colm Dunphy.pdf
2023-My AI Experience - Colm Dunphy.pdfColm Dunphy
 
Think Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial IntelligenceThink Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial IntelligenceData Science Milan
 
AI hype or reality
AI  hype or realityAI  hype or reality
AI hype or realityAwantik Das
 
Data science meetup - Spiros Antonatos
Data science meetup - Spiros AntonatosData science meetup - Spiros Antonatos
Data science meetup - Spiros AntonatosSpiros Antonatos
 
Deep Learning with CNTK
Deep Learning with CNTKDeep Learning with CNTK
Deep Learning with CNTKAshish Jaiman
 
AI Technology Overview and Career Advice
AI Technology Overview and Career AdviceAI Technology Overview and Career Advice
AI Technology Overview and Career AdviceKunling Geng
 
Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"Diego Oppenheimer
 
Deep Learning and the state of AI / 2016
Deep Learning and the state of AI / 2016Deep Learning and the state of AI / 2016
Deep Learning and the state of AI / 2016Grigory Sapunov
 
Start Getting Your Feet Wet in Open Source Machine and Deep Learning
Start Getting Your Feet Wet in Open Source Machine and Deep Learning Start Getting Your Feet Wet in Open Source Machine and Deep Learning
Start Getting Your Feet Wet in Open Source Machine and Deep Learning Ian Gomez
 
Art of artificial intelligence and automation
Art of artificial intelligence and automationArt of artificial intelligence and automation
Art of artificial intelligence and automationLiew Wei Da Andrew
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...Mihai Criveti
 
Artificial Intelligence in practice - Gerbert Kaandorp - Codemotion Amsterdam...
Artificial Intelligence in practice - Gerbert Kaandorp - Codemotion Amsterdam...Artificial Intelligence in practice - Gerbert Kaandorp - Codemotion Amsterdam...
Artificial Intelligence in practice - Gerbert Kaandorp - Codemotion Amsterdam...Codemotion
 
Slides galvin-widjaja
Slides galvin-widjajaSlides galvin-widjaja
Slides galvin-widjajaCodePolitan
 
NLU-MAP. IBM Watson NLU with Mind Mapping automation
NLU-MAP. IBM Watson NLU with Mind Mapping automationNLU-MAP. IBM Watson NLU with Mind Mapping automation
NLU-MAP. IBM Watson NLU with Mind Mapping automationJosé M. Guerrero
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsIgniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsSafe Software
 
Introducing TensorFlow: The game changer in building "intelligent" applications
Introducing TensorFlow: The game changer in building "intelligent" applicationsIntroducing TensorFlow: The game changer in building "intelligent" applications
Introducing TensorFlow: The game changer in building "intelligent" applicationsRokesh Jankie
 

Semelhante a How Will AI Change the Role of the Data Scientist? (20)

Using Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsUsing Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIs
 
Hoe een efficiënte Machine of Deep Learning backend ontwikkelen?
Hoe een efficiënte Machine of Deep Learning backend ontwikkelen?Hoe een efficiënte Machine of Deep Learning backend ontwikkelen?
Hoe een efficiënte Machine of Deep Learning backend ontwikkelen?
 
Big Data & Artificial Intelligence
Big Data & Artificial IntelligenceBig Data & Artificial Intelligence
Big Data & Artificial Intelligence
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
 
2023-My AI Experience - Colm Dunphy.pdf
2023-My AI Experience - Colm Dunphy.pdf2023-My AI Experience - Colm Dunphy.pdf
2023-My AI Experience - Colm Dunphy.pdf
 
Think Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial IntelligenceThink Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial Intelligence
 
AI hype or reality
AI  hype or realityAI  hype or reality
AI hype or reality
 
Data science meetup - Spiros Antonatos
Data science meetup - Spiros AntonatosData science meetup - Spiros Antonatos
Data science meetup - Spiros Antonatos
 
Deep Learning with CNTK
Deep Learning with CNTKDeep Learning with CNTK
Deep Learning with CNTK
 
AI Technology Overview and Career Advice
AI Technology Overview and Career AdviceAI Technology Overview and Career Advice
AI Technology Overview and Career Advice
 
Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"
 
Deep Learning and the state of AI / 2016
Deep Learning and the state of AI / 2016Deep Learning and the state of AI / 2016
Deep Learning and the state of AI / 2016
 
Start Getting Your Feet Wet in Open Source Machine and Deep Learning
Start Getting Your Feet Wet in Open Source Machine and Deep Learning Start Getting Your Feet Wet in Open Source Machine and Deep Learning
Start Getting Your Feet Wet in Open Source Machine and Deep Learning
 
Art of artificial intelligence and automation
Art of artificial intelligence and automationArt of artificial intelligence and automation
Art of artificial intelligence and automation
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
 
Artificial Intelligence in practice - Gerbert Kaandorp - Codemotion Amsterdam...
Artificial Intelligence in practice - Gerbert Kaandorp - Codemotion Amsterdam...Artificial Intelligence in practice - Gerbert Kaandorp - Codemotion Amsterdam...
Artificial Intelligence in practice - Gerbert Kaandorp - Codemotion Amsterdam...
 
Slides galvin-widjaja
Slides galvin-widjajaSlides galvin-widjaja
Slides galvin-widjaja
 
NLU-MAP. IBM Watson NLU with Mind Mapping automation
NLU-MAP. IBM Watson NLU with Mind Mapping automationNLU-MAP. IBM Watson NLU with Mind Mapping automation
NLU-MAP. IBM Watson NLU with Mind Mapping automation
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsIgniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
 
Introducing TensorFlow: The game changer in building "intelligent" applications
Introducing TensorFlow: The game changer in building "intelligent" applicationsIntroducing TensorFlow: The game changer in building "intelligent" applications
Introducing TensorFlow: The game changer in building "intelligent" applications
 

Último

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 

Último (20)

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 

How Will AI Change the Role of the Data Scientist?

  • 1. How Will AI Change the Role of the Data Scientist? 
 Hugo Gävert @hgavert Helsinki Data Science meet-up 2017-02-16
  • 2. Who am I? Currently:
 Chief Data Scientist @ Sanoma Past: • HUT Infolab • Xtract • Nokia Hugo Gävert, 2017-02-16
  • 3. Artificial Intelligence World Goals Use cases Examples Special purpose AI Restricted, clear inputs Well defined, finite - Recommendation engines, - Credit scoring, - Insurance claim handling - Image recognition - Playing games; chess, go, ping pong, … - Driving car - GOFAI, - ML, - ANN / Deep Learning - Expert systems - Supervised - Unsupervised - Reinforcement General AI Open, chaotic, messy inputs Poorly defined, unconstrained Requirements: - Reasoning, - communication, - learning new things - ability to apply skills to new problems - Design better AI - Whole brain simulation? - Robotic form? - Sensing? - Manipulating the world? Super human intelligence? Hugo Gävert, 2017-02-16
  • 4. Artificial Super-Intelligence Human Intelligence Artificial Intelligence Intelligence/Performance Time Games Expert tasks Mundane tasks - Checkers, 1952 / 1994 - Backgammon, 1979 - Othello, Chess, 1997 - Jeopardy, 2010 - Go, 2016 - Poker, 2017 - Theorem proving, eq solving - Credit scoring / probability to default, insurance claim fraud - Medical diagnosis - Speech to text, translation… - Image recognition - Natural language / understanding text - Walking - Object manipulation - Driving cars Lieutenant Commander Data, year 2338? Human Level
 Machine Intelligence: 10%: 2020 50%: 2040-2050 90%: 2080-2100 Hugo Gävert, 2017-02-16
  • 5. • Original ideas inspired by brains, but nowadays it’s more engineering for machine learning tasks. • Artificial Neural Network ≈ Layers of connected simple neurons • Multiple different architectures for different uses Neural Networks? A cartoon drawing of a biological neuron (left) and its mathematical model (right). Stanford CS231n: Convolutional Neural Networks for Visual Recognition Hugo Gävert, 2017-02-16
  • 7. Why Deep Learning? • Rebranded artificial neural networks, so what is different now? Big Data - Text, images, video - Large annotated data sources, like images 155k words, 117k senses 14M images, 1M BBoxes, 22k synsets Computational power Some new algorithms; ReLU, dropouts, initializations, ConvNets -4 -3 -2 -1 0 1 2 3 4 -1 1 -4 -3 -2 -1 0 1 2 3 4 -1 1 -4 -3 -2 -1 0 1 2 3 4 -1 1 Hugo Gävert, 2017-02-16
  • 8. Deep Belief Networks • 2006, Geoff Hinton: A Fast Learning Algorithm for Deep Belief Networks • First major results in 2009 in Acoustic Model using Deep Belief Networks
 —> Speech recognition • What is it? • Multilayer feedforward network with • Input layer • Many hidden layers • Output layer • Training… Train as RBM Train as RBM Train with backpropagation Hugo Gävert, 2017-02-16
  • 9. From feature engineering to feature learning Input Output Hand designed program Rule-based AI Trained classifier Input Output Hand designed features Classic ML Features Trained classifier Input Output Representation
 Learning Simple features Mid level abstract features Trained classifier Input Output High level abstract features Deep
 Learning Hugo Gävert, 2017-02-16
  • 10. • Deep Belief Networks have largely been replaced by convolutional networks for image recognition • Architecture, layers: • Input (width, height, depth = RGB) • Convolutional layer • Neuron calculates convolution of the weights over the local image area • N filters with size (width, height, N) • Relu activation layer • Pooling layer • Downsampling along the spatial width and height dimension • Fully connected layer (output: 1 x 1 x num of classes) • The conv + relu + pooling layers are repeated. • Of course, other architectures also… Convolutional networks? Hugo Gävert, 2017-02-16
  • 11. Example, 17 layers, 7000 params. http://cs231n.stanford.edu/Hugo Gävert, 2017-02-16
  • 12. More example layers… Hugo Gävert, 2017-02-16
  • 13. Convolutional networks - What is deep? • AlexNet, 2012 • ImageNet challenge, top 5 error rate 16% (previous 26%) • 5 conv, max-pooling, drop-out layers, 3 fully connected • ZF Net, 2013 • Top 5 error rate 11.2% • Similar architecture, only 10% of training data • DeConvNet - visualisation of the layers • VGG Net, 2014 • Top 5 error rate 7.3% • 19 layers, but simple 3x3 convolution and 2x2 max pooling • CNNs need to be deep, but otherwise simple • GoogLeNet, 2015 • Top 5 error rate 6.7% • 22 layers, but has inception-modules that do work in parallel • Microsoft ResNet, 2015 • Top 5 error rate 3.6% (better than human) • 152 layers, ultra deep Hugo Gävert, 2017-02-16
  • 14. Speech Recognition at Google Brandon Ballinger: Deep Learning and the Dream of AI, Strata 2013 Jaitly et al (2012), Application of pretrained deep neural networks to LVSRHugo Gävert, 2017-02-16
  • 15. Chatbots and AI • Speech recognition ok • Natural language understanding needs work • Logic • If … then… • No memory in session • Behavior / approach • Reactive, just answers questions • Proactive would be helpful… Hugo Gävert, 2017-02-16
  • 16. Products you should test / use • Google APIs • Machine learning platform (Deep Learning: TensorFlow) • Natural Language API • Speech API • Translation API • Vision API • IBM Watson analytics… • Also, some of the famous image ConvNets are downloadable in pre- trained format • MS Azure ML (Cortana analytics, cognitive services) • Deep Learning: CNTK • Vision: Face API, Emotion API, Computer Vision API, Content Moderation API • Recommendations API, Academic knowledge API, Entity linking API, Anomaly Detection • Language: Text Analysis, Web Language Model, spell checking, translation • Speech: Speech to text, speaker identification, translation Hugo Gävert, 2017-02-16
  • 17. So is AI going to take the job of Data Scientists? • Yes, absolutely • Why? • We, the data scientists, are building the AI - we’re lazy, we build AI to do our job… • Harder to build the robots (or cars, trucks, flying machines) than to just run the AI inside computer. The early use cases will be confined in the computers. • When? • Not very soon… Hugo Gävert, 2017-02-16
  • 18. What does typical data science project look like? Business understanding Data understanding and quality Data pre-processing Feature engineering Modeling Evaluation Production deployment Hugo Gävert, 2017-02-16
  • 19. What does typical data science project look like? Business understanding Data understanding and quality Data pre-processing Feature engineering Modeling Evaluation Production deployment Data collection design Monitoring, control Feature learning Deep Learning architecture Communications, internal consulting How do we get representative data for the network? Ok, images easy - how about others? Does it work? Still expected results? Fraudulent use? What is this Black Box? APIs Hugo Gävert, 2017-02-16
  • 20. Recommendations for Data Scientists • Keep on doing what you do • Evolve with the world • You still need • Math; stats, probabilities, linear algebra… • Algorithms and data structures • You also need now • Deep Learning (hype!) • More communications skills • Software writing & engineering skills (APIs) • Google and Stack Overflow helps… Hugo Gävert, 2017-02-16
  • 21. Recommendations for companies • Data • Create data strategy; collect, store and make data available • Data is key business asset in building AI capability. Deep Learning needs data in training. Software can be replicated, but data cannot - if a business has data, then it’s already in better position than competitors. • Hire talent • AI models need to be customized for the business need, application and context. • Downloading open source software is not enough. Applying it is far from trivial. The APIs solve only specific problems and are too much black boxes. • You need to be able to explain the models to customers - specially in the legal, finance, insurance, health etc. business. “The best ideas come from the guys closest to the data.” Todd Holloway Head of Data Science at Trulia. Hugo Gävert, 2017-02-16