Copyright © 2017 Anki, Inc. 1
Andrew Stein
May 2017
Making Cozmo See
Introducing Anki & Cozmo
• About Anki [http://www.anki.com]
  • Consumer Robotics & AI Company, currently focused on entertainment
  • Two Products: Overdrive and Cozmo
• About Cozmo [http://www.anki.com/en-us/cozmo]
  • App-driven, autonomous robot character
  • Launched Fall 2016, $179
  • Full-featured SDK (low- and high-level API)
About Cozmo
• Primary Goal: Bring Cozmo to Life
• All about personality, via animation
• Interactive, aware
Cozmo’s Personality
Cozmo’s Core Hardware Capabilities
• Outputs:
  • 4 Mechanical DoF: Head, Lift, Treaded Wheels
  • Speaker
  • OLED face
• Sensors:
  • Camera (VGA, 15fps)
  • Cliff sensor
  • Encoders and IMU
• Battery Life: 1-2 hours
• 3 Processors: comms, sensors/motor control, supervisor
Simple Linear Flow (I wish!)
[Diagram blocks: Script, Animation Library]
Cozmo Flow
[Diagram blocks: Reaction Triggers, Game Logic, World Model, Path Planning, Sensors, Emotional State, Controlled Movement, Animation Library, Behaviors]
Not a Static Surveillance Camera
• Cozmo moves
  • Motion blur
  • Must not drive off cliffs
  • Odometry not reliable (treads!)
• Cozmo gets moved
  • “Robot Kidnapping” not just academic with six-year-olds!
  • How to maintain and communicate world state?
• Cozmo interacts: Faces, cubes, charger, etc.
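
One way to picture the "kidnapping" problem above: object poses are stored relative to some origin, and once the robot is lifted and set down elsewhere, odometry can no longer relate the old origin to the robot's new position. The sketch below assumes a simple scheme in which a pickup event starts a new coordinate frame and older poses are kept but treated as stale until the object is seen again. All class and method names are hypothetical; the slides do not describe how Cozmo's world model actually handles this.

```python
from dataclasses import dataclass, field


@dataclass
class ObservedObject:
    name: str
    x_mm: float
    y_mm: float
    origin_id: int  # coordinate frame in which this pose was measured


@dataclass
class WorldModel:
    origin_id: int = 0
    objects: dict = field(default_factory=dict)

    def observe(self, name, x_mm, y_mm):
        """Record or refresh an object's pose in the current frame."""
        self.objects[name] = ObservedObject(name, x_mm, y_mm, self.origin_id)

    def on_picked_up(self):
        """The robot was moved by hand: start a new frame, keep old poses as stale."""
        self.origin_id += 1

    def located(self):
        """Names of objects whose pose is expressed in the current frame."""
        return sorted(name for name, obj in self.objects.items()
                      if obj.origin_id == self.origin_id)


world = WorldModel()
world.observe("cube1", 120.0, 40.0)
world.observe("charger", -200.0, 0.0)
world.on_picked_up()                  # moved by a six-year-old
world.observe("cube1", 80.0, -10.0)   # cube seen again after being set down
located_after_pickup = world.located()
```

After the pickup only `cube1` counts as located; the charger's old pose is still stored and could be reused if the two frames are later related to each other.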
Vision Capabilities
• Detect/track/recognize: cubes, charger, faces, motion, etc.
• Robust enrollment for face recognition
• Make eye contact
• Estimate 3D pose of objects
• Maintain map of the world (even if picked up)
• Rolling shutter correction
• Flexible scheduling according to needs and computational budget
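
The "flexible scheduling" point can be illustrated with a small budgeted scheduler: each vision task declares how often it needs to run and roughly what it costs, and each frame the scheduler runs the due tasks in priority order until the time budget is used up. This is only a sketch of the general idea; the task names, costs, periods and the 60 ms budget are made-up assumptions, not figures from the talk (only the 15 fps camera rate comes from the slides).

```python
from dataclasses import dataclass


@dataclass
class VisionTask:
    name: str
    cost_ms: float          # assumed average processing time per frame
    period: int             # run at most once every `period` frames
    last_run: int = -10**9  # frame index of the most recent run


def schedule(tasks, frame_index, budget_ms):
    """Pick due tasks in list (priority) order until the budget is spent."""
    chosen, spent = [], 0.0
    for task in tasks:
        due = frame_index - task.last_run >= task.period
        if due and spent + task.cost_ms <= budget_ms:
            task.last_run = frame_index
            spent += task.cost_ms
            chosen.append(task.name)
    return chosen


tasks = [
    VisionTask("markers", cost_ms=20.0, period=1),
    VisionTask("faces", cost_ms=35.0, period=2),
    VisionTask("motion", cost_ms=15.0, period=1),
]
# A 15 fps camera leaves roughly 66 ms per frame; 60 ms is used as the budget here.
plan = [schedule(tasks, i, budget_ms=60.0) for i in range(3)]
```

With these numbers, marker detection runs every frame while face and motion detection alternate, because both do not fit in the budget together.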
Face Detection, Tracking, & Recognition
• Store up to 10 identities across play sessions, enrolled via app activity
• Remember 3D poses for eye contact
Cat & Dog Face Detection
Markers: Cubes, Charger, & Custom Objects (SDK)
Custom Markers for SDK
Markers: Design vs. Vision
Detecting “Interesting Stuff”
System Architecture
• Use Smart Device for heavy computation
  • Reduce robot cost
  • Better development environment
• CLAD: Custom Cross-Platform Messaging (similar to protobufs)
[Diagram blocks: UI + Game Logic; Robotics Engine (Vision / AI); Hardware Drivers; Low-Level Controllers; Comms; Python SDK. Links: CLAD via Wifi, CLAD via USB, CLAD (Local)]
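
The slides describe CLAD only as a custom cross-platform messaging layer similar to protobufs, so its real schema language and wire format are not shown here. As a rough illustration of the general idea (a typed message packed into a compact, byte-order-independent form that any endpoint can decode), the sketch below uses Python's `struct` module. The message type, tag value and field layout are hypothetical and are not CLAD's.

```python
import struct

# Hypothetical message layout: 1-byte tag, then little-endian fields.
HEAD_ANGLE_TAG = 0x01
_HEAD_ANGLE = struct.Struct("<BfI")  # tag, angle in radians, timestamp in ms


def pack_head_angle(angle_rad, timestamp_ms):
    """Serialize a head-angle message to bytes."""
    return _HEAD_ANGLE.pack(HEAD_ANGLE_TAG, angle_rad, timestamp_ms)


def unpack_message(payload):
    """Decode a message by dispatching on its leading tag byte."""
    tag = payload[0]
    if tag == HEAD_ANGLE_TAG:
        _, angle_rad, timestamp_ms = _HEAD_ANGLE.unpack(payload)
        return {"type": "head_angle",
                "angle_rad": angle_rad,
                "timestamp_ms": timestamp_ms}
    raise ValueError(f"unknown message tag: {tag:#x}")


wire = pack_head_angle(0.25, 1234)
message = unpack_message(wire)
```

The explicit `<` prefix fixes the byte order and disables padding, so the same nine bytes decode identically on the robot, the phone and an SDK host.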
Architecture Complications
• Animations don’t play instantaneously (must buffer audio)
• Face Enrollment in “Meet Cozmo” activity (limited visual feedback)
• State Mirroring (from engine to Robot / Game / SDK)
• Latency for vision and control
Under the Hood of the Full System
Under the Hood (Manual Drive)
Under the Hood (Autonomous)
Computer Vision in a Product at Scale
Computer Vision is Not the Product
[Diagram labels: Realistic Expectations, Product Requirements, Computer Vision]
Computer Vision at Scale: Calibration
Manipulating cubes requires high precision
• Camera variability (lens placement)
• Mechanical tolerances
Must calibrate each camera at factory
• Verify camera placement inside head
• Estimate intrinsic parameters
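
To see why per-unit calibration matters for cube manipulation, consider the standard pinhole camera model, in which the intrinsic parameters are the focal lengths (fx, fy) and the principal point (cx, cy). The sketch below shows how a small lens-placement shift turns into a position error if nominal intrinsics are used instead of calibrated ones. All numeric values (focal length, the 6-pixel shift, the 150 mm distance) are illustrative assumptions, not Cozmo's actual parameters.

```python
from dataclasses import dataclass


@dataclass
class Intrinsics:
    fx: float  # focal length in pixels, x direction
    fy: float  # focal length in pixels, y direction
    cx: float  # principal point, x
    cy: float  # principal point, y


def project(K, x, y, z):
    """Pinhole projection of a camera-frame point (z forward) to pixel coordinates."""
    if z <= 0:
        raise ValueError("point must be in front of the camera")
    return (K.fx * x / z + K.cx, K.fy * y / z + K.cy)


def back_project_x(K, u, z):
    """Lateral offset implied by pixel column u at a known depth z."""
    return (u - K.cx) * z / K.fx


# Nominal values for a VGA (640x480) image versus a unit whose lens sits 6 px off-center.
nominal = Intrinsics(fx=300.0, fy=300.0, cx=320.0, cy=240.0)
actual = Intrinsics(fx=300.0, fy=300.0, cx=326.0, cy=240.0)

u, v = project(actual, 0.0, 0.0, 150.0)       # cube straight ahead, 150 mm away
error_mm = back_project_x(nominal, u, 150.0)  # offset the uncalibrated model would infer
```

In this example the uncalibrated model places a cube that is straight ahead about 3 mm to the side, and the error grows in proportion to distance.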
Computer Vision at Scale: Illumination
• Cannot control lighting in people’s homes
• Cube lights complicate local binarization
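
Local binarization, as mentioned above, thresholds each pixel against its own neighborhood rather than against one global value, which helps when lighting varies across the image. The slides do not say which method Cozmo uses; the sketch below assumes a plain local-mean threshold, with a hypothetical function name and a tiny one-row "image" whose background brightens from left to right.

```python
def local_binarize(image, radius=1, offset=0):
    """Mark a pixel as dark (1) if it is below its neighborhood mean minus offset."""
    h, w = len(image), len(image[0])
    out = [[0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            rows = range(max(0, r - radius), min(h, r + radius + 1))
            cols = range(max(0, c - radius), min(w, c + radius + 1))
            window = [image[i][j] for i in rows for j in cols]
            mean = sum(window) / len(window)
            out[r][c] = 1 if image[r][c] < mean - offset else 0
    return out


# Background rises from 100 to 180; three dark marks sit 60 levels below it.
row = [100, 50, 120, 130, 80, 150, 160, 110, 180]

global_mask = [1 if p < 100 else 0 for p in row]          # one threshold for all pixels
local_mask = local_binarize([row], radius=1, offset=10)   # threshold per neighborhood
```

The global threshold misses the third mark because it sits in the brighter part of the row, while the local version finds all three. The same mechanism suggests why a lit cube LED is awkward: a very bright pixel raises the local mean and can make ordinary neighbors look dark.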
Failure Detection
• Detecting and responding to failures is as important as being successful
Conclusions
• Cozmo: interactive, complex, full of personality
• Computer vision is only a piece of the product, does not drive all decisions
• Computer vision at scale, in a consumer product, comes with special challenges and considerations
• Setting expectations for what is (not) possible with vision is important in a world of sci-fi TV & movies
• Cozmo is a fun little robot to develop for
What Cozmo is All About
Join us! We’re hiring!
