SlideShare uma empresa Scribd logo
1 de 28
Image Recognition
Technology, Guidelines and Trends
David Marimon
CEO & Co-founder
david@catchoom.com
+34 654 906 753
The visual recognition market is estimated to grow
from $9.65 billion in 2014
to $25.65 billion by 2019
According to Image Recognition Market, Markets and Markets, May 2014
Image
Recognition
Face
Recognition
Object
Classification
Object
Character
Recognition
Visual Recognition
Augmented Reality and Image Recognition,
the happy couple
Outline
What works with image recognition
How to put image recognition into your app
Vendor comparison
Trends
Outline
What works with image recognition
How to put image recognition into your app
Vendor comparison
Trends
How does the world look like for a machine?
Textured Textureless Transparent
Deformable Rigid
What’s possible with Image Recognition?
Textured Textureless Transparent
Deformable Rigid
Outline
What objects work with image recognition
How to put image recognition into your app
Vendor comparison
Trends
What do you need to build an app?
Content
On-device or Cloud
Image Recognition
Image database
Curate the Image Database
Choose the IR mode that fits best
Cloud Service On-Device SDK
Choose the IR mode that fits best
Cloud Service On-Device SDK
IR requires Internet Yes No
IR speed Depends on network Controlled
Content updates Immediate Require local sync
Analytics Latest available Rely on app connection
Outline
What works with image recognition
How to put image recognition into your app
Vendor comparison
Trends
Cloud Service On-Device SDK
Vendors in the AR space
Service On Premises On-Device
Vendors in the IR space
Why my favorite is Catchoom
Real World-
tested
Built for
usability
Fast,
accurate,
reliable
Scalable
Catchoom has delivered
over 420 million
image recognitions to date
Outline
Image recognition: approaches and limitations
How to put image recognition into your app
Vendor comparison
Trends
Extended Search
On-Device SDK Cloud Service
Industrial Applications
Apparel recognition
Takeaways
1. Image recognition is the door to a broad range of
applications and services
2. Improve performance with better image databases
3. Choose on-device or cloud IR depending on your use
case.
4. Catchoom is already behind 420M interactions and
looking to meet upcoming trends
Image Recognition
Technology, Guidelines and Trends
David Marimon
CEO & Co-founder
david@catchoom.com
+34 654 906 753
Visit our booth
for live demos!
Annex
Image Recognition vs
Object Classification
Textured Textureless Transparent Deformable Rigid
Image
Recognition
Object
Classification
Challenges with benchmarks
Label a database with both reference and test
images
Identify infrastructure differences
Understand performance is not necessarily
optimized for your use case
How to benchmark
Small dataset Full test
1. Contact the vendor 1. Contact the vendor
2. Label your database
3. Use APIs

Mais conteúdo relacionado

Destaque

Image Recognition
Image RecognitionImage Recognition
Image Recognitionguestbe3cbf
 
Image Recognition With TensorFlow
Image Recognition With TensorFlowImage Recognition With TensorFlow
Image Recognition With TensorFlowYaz Santissi
 
Augmented reality using Triggered by Image Recognition
Augmented reality using Triggered by Image RecognitionAugmented reality using Triggered by Image Recognition
Augmented reality using Triggered by Image RecognitionNilesh Pawar
 
ppt on LIFI TECHNOLOGY
ppt on LIFI TECHNOLOGYppt on LIFI TECHNOLOGY
ppt on LIFI TECHNOLOGYtanshu singh
 
Using of Augmented Reality Technology in Food and Drink Industry
Using of Augmented Reality Technology in Food and Drink IndustryUsing of Augmented Reality Technology in Food and Drink Industry
Using of Augmented Reality Technology in Food and Drink IndustryAraz Davud
 
DIGITAL IMAGE TECHNOLOGY
DIGITAL IMAGE TECHNOLOGYDIGITAL IMAGE TECHNOLOGY
DIGITAL IMAGE TECHNOLOGY100677809
 
Imago OCR: Open-source toolkit for chemical structure image recognition
Imago OCR: Open-source toolkit for chemical structure image recognitionImago OCR: Open-source toolkit for chemical structure image recognition
Imago OCR: Open-source toolkit for chemical structure image recognitionMikhail Rybalkin
 
Golang 으로 vision api 적용하기
Golang 으로 vision api 적용하기Golang 으로 vision api 적용하기
Golang 으로 vision api 적용하기동철 박
 
Augmented Reality - the next big thing in mobile
Augmented Reality - the next big thing in mobileAugmented Reality - the next big thing in mobile
Augmented Reality - the next big thing in mobileHari Gottipati
 
Process for Big Data Analysis
Process for Big Data AnalysisProcess for Big Data Analysis
Process for Big Data AnalysisMyunggoon Choi
 
[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.
[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.
[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.kosena
 
Facial recognition technology by vaibhav
Facial recognition technology by vaibhavFacial recognition technology by vaibhav
Facial recognition technology by vaibhavVaibhav P
 
Image to text Converter
Image to text ConverterImage to text Converter
Image to text ConverterDhiraj Raj
 
Lifi technology(nimesh bariya)
Lifi technology(nimesh bariya)Lifi technology(nimesh bariya)
Lifi technology(nimesh bariya)Nimesh Bariya
 
빅데이터미래전략세미나발표자료 빅데이터기술현황및전망-황승구-20120410
빅데이터미래전략세미나발표자료 빅데이터기술현황및전망-황승구-20120410빅데이터미래전략세미나발표자료 빅데이터기술현황및전망-황승구-20120410
빅데이터미래전략세미나발표자료 빅데이터기술현황및전망-황승구-20120410Peter Woo
 
Deep Learning with TensorFlow: Understanding Tensors, Computations Graphs, Im...
Deep Learning with TensorFlow: Understanding Tensors, Computations Graphs, Im...Deep Learning with TensorFlow: Understanding Tensors, Computations Graphs, Im...
Deep Learning with TensorFlow: Understanding Tensors, Computations Graphs, Im...Altoros
 

Destaque (20)

Image Recognition
Image RecognitionImage Recognition
Image Recognition
 
Image Recognition With TensorFlow
Image Recognition With TensorFlowImage Recognition With TensorFlow
Image Recognition With TensorFlow
 
Augmented reality using Triggered by Image Recognition
Augmented reality using Triggered by Image RecognitionAugmented reality using Triggered by Image Recognition
Augmented reality using Triggered by Image Recognition
 
Lifi ppt
Lifi pptLifi ppt
Lifi ppt
 
ppt on LIFI TECHNOLOGY
ppt on LIFI TECHNOLOGYppt on LIFI TECHNOLOGY
ppt on LIFI TECHNOLOGY
 
Using of Augmented Reality Technology in Food and Drink Industry
Using of Augmented Reality Technology in Food and Drink IndustryUsing of Augmented Reality Technology in Food and Drink Industry
Using of Augmented Reality Technology in Food and Drink Industry
 
Digital Imaging
Digital ImagingDigital Imaging
Digital Imaging
 
DIGITAL IMAGE TECHNOLOGY
DIGITAL IMAGE TECHNOLOGYDIGITAL IMAGE TECHNOLOGY
DIGITAL IMAGE TECHNOLOGY
 
Integrating Text and Image
Integrating Text and ImageIntegrating Text and Image
Integrating Text and Image
 
Imago OCR: Open-source toolkit for chemical structure image recognition
Imago OCR: Open-source toolkit for chemical structure image recognitionImago OCR: Open-source toolkit for chemical structure image recognition
Imago OCR: Open-source toolkit for chemical structure image recognition
 
Golang 으로 vision api 적용하기
Golang 으로 vision api 적용하기Golang 으로 vision api 적용하기
Golang 으로 vision api 적용하기
 
Augmented Reality - the next big thing in mobile
Augmented Reality - the next big thing in mobileAugmented Reality - the next big thing in mobile
Augmented Reality - the next big thing in mobile
 
Process for Big Data Analysis
Process for Big Data AnalysisProcess for Big Data Analysis
Process for Big Data Analysis
 
[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.
[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.
[코세나, kosena] 빅데이터 기반의 End-to-End APM과 비정형 데이터 분석 자료입니다.
 
Facial recognition technology by vaibhav
Facial recognition technology by vaibhavFacial recognition technology by vaibhav
Facial recognition technology by vaibhav
 
Et lab
Et labEt lab
Et lab
 
Image to text Converter
Image to text ConverterImage to text Converter
Image to text Converter
 
Lifi technology(nimesh bariya)
Lifi technology(nimesh bariya)Lifi technology(nimesh bariya)
Lifi technology(nimesh bariya)
 
빅데이터미래전략세미나발표자료 빅데이터기술현황및전망-황승구-20120410
빅데이터미래전략세미나발표자료 빅데이터기술현황및전망-황승구-20120410빅데이터미래전략세미나발표자료 빅데이터기술현황및전망-황승구-20120410
빅데이터미래전략세미나발표자료 빅데이터기술현황및전망-황승구-20120410
 
Deep Learning with TensorFlow: Understanding Tensors, Computations Graphs, Im...
Deep Learning with TensorFlow: Understanding Tensors, Computations Graphs, Im...Deep Learning with TensorFlow: Understanding Tensors, Computations Graphs, Im...
Deep Learning with TensorFlow: Understanding Tensors, Computations Graphs, Im...
 

Semelhante a Image Recognition. Technology, Guidelines and Trends

VR & AR Visual techniques in Market Research: a real case study.
VR & AR Visual techniques in Market Research: a real case study.VR & AR Visual techniques in Market Research: a real case study.
VR & AR Visual techniques in Market Research: a real case study.Elio Dalprato
 
Mobile App Development Service for Idea Cellular | Success Story
Mobile App Development Service for Idea Cellular | Success StoryMobile App Development Service for Idea Cellular | Success Story
Mobile App Development Service for Idea Cellular | Success StoryiProgrammer Solutions
 
Blocking Viral SaaS Adoption is Blocking Innovation - Novosco & Amplipahe
Blocking Viral SaaS Adoption is Blocking Innovation - Novosco & AmplipaheBlocking Viral SaaS Adoption is Blocking Innovation - Novosco & Amplipahe
Blocking Viral SaaS Adoption is Blocking Innovation - Novosco & AmplipaheNovosco
 
Building retail moments that matter across digital and physical environments
Building retail moments that matter across digital and physical environmentsBuilding retail moments that matter across digital and physical environments
Building retail moments that matter across digital and physical environmentsNational Retail Federation
 
i-Verve Company Brochure.pdf
i-Verve Company Brochure.pdfi-Verve Company Brochure.pdf
i-Verve Company Brochure.pdfI-Verve Inc
 
ENGAGE 2015 - Inn-App Retargeting On Mobile Devices The Way Forward - Addicti...
ENGAGE 2015 - Inn-App Retargeting On Mobile Devices The Way Forward - Addicti...ENGAGE 2015 - Inn-App Retargeting On Mobile Devices The Way Forward - Addicti...
ENGAGE 2015 - Inn-App Retargeting On Mobile Devices The Way Forward - Addicti...IAB Canada
 
Roundtable Discussion: Revlon, SAP and VMware See huge Benefits from Aggressi...
Roundtable Discussion: Revlon, SAP and VMware See huge Benefits from Aggressi...Roundtable Discussion: Revlon, SAP and VMware See huge Benefits from Aggressi...
Roundtable Discussion: Revlon, SAP and VMware See huge Benefits from Aggressi...Dana Gardner
 
Company-Profile-Ciber.pdf
Company-Profile-Ciber.pdfCompany-Profile-Ciber.pdf
Company-Profile-Ciber.pdfHarvey C
 
Rendered.ai - Intro to Synthetic data for Computer Vision.pdf
Rendered.ai - Intro to Synthetic data for Computer Vision.pdfRendered.ai - Intro to Synthetic data for Computer Vision.pdf
Rendered.ai - Intro to Synthetic data for Computer Vision.pdfChris Andrews
 
Rewarded Video: Benefits and Best Practices
Rewarded Video: Benefits and Best PracticesRewarded Video: Benefits and Best Practices
Rewarded Video: Benefits and Best PracticesironSource
 
PWC: Why we believe VR/AR will boost global GDP by $1.5 trillion
PWC: Why we believe VR/AR will boost global GDP by $1.5 trillionPWC: Why we believe VR/AR will boost global GDP by $1.5 trillion
PWC: Why we believe VR/AR will boost global GDP by $1.5 trillionAlejandro Franceschi
 
Unlocking The Marketing Potential Behind the Beacon Technology Outbreak
Unlocking The Marketing Potential Behind the Beacon Technology Outbreak Unlocking The Marketing Potential Behind the Beacon Technology Outbreak
Unlocking The Marketing Potential Behind the Beacon Technology Outbreak Klyp
 
Computer Vision Software Development.pdf
Computer Vision Software Development.pdfComputer Vision Software Development.pdf
Computer Vision Software Development.pdfJohnAdams514191
 
Review-2 LSM-1.pptx
Review-2 LSM-1.pptxReview-2 LSM-1.pptx
Review-2 LSM-1.pptxSid9832
 
Cloud Journey- Partner Advantage
Cloud Journey- Partner AdvantageCloud Journey- Partner Advantage
Cloud Journey- Partner AdvantageSalesforce Partners
 
Alternative to SolarWinds
Alternative to SolarWindsAlternative to SolarWinds
Alternative to SolarWindsSite24x7
 
So, Mobile is Big. Now What?
So, Mobile is Big. Now What?So, Mobile is Big. Now What?
So, Mobile is Big. Now What?BMA Carolinas
 

Semelhante a Image Recognition. Technology, Guidelines and Trends (20)

VR & AR Visual techniques in Market Research: a real case study.
VR & AR Visual techniques in Market Research: a real case study.VR & AR Visual techniques in Market Research: a real case study.
VR & AR Visual techniques in Market Research: a real case study.
 
915.pptx
915.pptx915.pptx
915.pptx
 
Augmented reality
Augmented realityAugmented reality
Augmented reality
 
Mobile App Development Service for Idea Cellular | Success Story
Mobile App Development Service for Idea Cellular | Success StoryMobile App Development Service for Idea Cellular | Success Story
Mobile App Development Service for Idea Cellular | Success Story
 
Blocking Viral SaaS Adoption is Blocking Innovation - Novosco & Amplipahe
Blocking Viral SaaS Adoption is Blocking Innovation - Novosco & AmplipaheBlocking Viral SaaS Adoption is Blocking Innovation - Novosco & Amplipahe
Blocking Viral SaaS Adoption is Blocking Innovation - Novosco & Amplipahe
 
Building retail moments that matter across digital and physical environments
Building retail moments that matter across digital and physical environmentsBuilding retail moments that matter across digital and physical environments
Building retail moments that matter across digital and physical environments
 
i-Verve Company Brochure.pdf
i-Verve Company Brochure.pdfi-Verve Company Brochure.pdf
i-Verve Company Brochure.pdf
 
ENGAGE 2015 - Inn-App Retargeting On Mobile Devices The Way Forward - Addicti...
ENGAGE 2015 - Inn-App Retargeting On Mobile Devices The Way Forward - Addicti...ENGAGE 2015 - Inn-App Retargeting On Mobile Devices The Way Forward - Addicti...
ENGAGE 2015 - Inn-App Retargeting On Mobile Devices The Way Forward - Addicti...
 
Roundtable Discussion: Revlon, SAP and VMware See huge Benefits from Aggressi...
Roundtable Discussion: Revlon, SAP and VMware See huge Benefits from Aggressi...Roundtable Discussion: Revlon, SAP and VMware See huge Benefits from Aggressi...
Roundtable Discussion: Revlon, SAP and VMware See huge Benefits from Aggressi...
 
Company-Profile-Ciber.pdf
Company-Profile-Ciber.pdfCompany-Profile-Ciber.pdf
Company-Profile-Ciber.pdf
 
Mobility-RC
Mobility-RCMobility-RC
Mobility-RC
 
Rendered.ai - Intro to Synthetic data for Computer Vision.pdf
Rendered.ai - Intro to Synthetic data for Computer Vision.pdfRendered.ai - Intro to Synthetic data for Computer Vision.pdf
Rendered.ai - Intro to Synthetic data for Computer Vision.pdf
 
Rewarded Video: Benefits and Best Practices
Rewarded Video: Benefits and Best PracticesRewarded Video: Benefits and Best Practices
Rewarded Video: Benefits and Best Practices
 
PWC: Why we believe VR/AR will boost global GDP by $1.5 trillion
PWC: Why we believe VR/AR will boost global GDP by $1.5 trillionPWC: Why we believe VR/AR will boost global GDP by $1.5 trillion
PWC: Why we believe VR/AR will boost global GDP by $1.5 trillion
 
Unlocking The Marketing Potential Behind the Beacon Technology Outbreak
Unlocking The Marketing Potential Behind the Beacon Technology Outbreak Unlocking The Marketing Potential Behind the Beacon Technology Outbreak
Unlocking The Marketing Potential Behind the Beacon Technology Outbreak
 
Computer Vision Software Development.pdf
Computer Vision Software Development.pdfComputer Vision Software Development.pdf
Computer Vision Software Development.pdf
 
Review-2 LSM-1.pptx
Review-2 LSM-1.pptxReview-2 LSM-1.pptx
Review-2 LSM-1.pptx
 
Cloud Journey- Partner Advantage
Cloud Journey- Partner AdvantageCloud Journey- Partner Advantage
Cloud Journey- Partner Advantage
 
Alternative to SolarWinds
Alternative to SolarWindsAlternative to SolarWinds
Alternative to SolarWinds
 
So, Mobile is Big. Now What?
So, Mobile is Big. Now What?So, Mobile is Big. Now What?
So, Mobile is Big. Now What?
 

Último

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Último (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Image Recognition. Technology, Guidelines and Trends

Notas do Editor

  1. The visual recognition market is growing extremely quickly. The two main reasons for this growth are kind of obvious: There is a big proliferation of images on the Internet and; There has also been a big expansion in the use of mobile for searching and purchasing
  2. On December 1975, Kodak and Sasson invented the digital camera. Ever since we can process images and videos digitally, we’ve been developing visual recognition, trying to make machines understand the environment. Visual Recognition at large is a field of activity that has many branches. It is important to know that each one uses different computer vision approaches and there is not yet one ring to rule them all. The most prominent branches are Image Recognition, Face Recognition, Object classification, and Object Character Recognition, and each one has a different level of maturity. Image Recognition enables a fast search for images in a database to match an image taken by a smartphone or tablet. The image match pulls up related content, and users can interact, shop or rate products. Face Recognition is basically the same but instead of comparing with images or any object, it focuses on faces. Most face recognition solutions work by training a system with very large databases of images of faces previously labelled. The main use case is security or photo album organization. Object classification is a bit different in scope. Instead of searching for a very specific match in a database, it is trying to understand the elements present in a picture. This is the closest to what a kid does: this is a chair, this is a dog, or more complex descriptions like this is a steem train under Swiss Matterhorn. The use case is simple: Google. Object Character Recognition identfies letters and numbers from an image. It is used in digitizing ancient books for instance. In this tutorial, I’ll talk about Image Recognition and give you an overview of the technology, guidelines to build apps and services, and trends that we see in the market.
  3. Why am I talking about IR in an AR conf? Image Recognition is the door to most AR interactions in the world. Via Image Recognition, a machine can tell what the user is seeing through her camera. If we know that, we can provide limitless options connected to the digital world. For instance, we can augment the environment with an inmersive experience that helps the user make a better decision.
  4. Computer Vision tries to understand what is there and what is happening in the world via images and videos. Let’s take a look at the world with the eyes of a machine and try to see what will make us suffer.
  5. In the first row, you can find samples of objects that differ from the amount of visual pattern that is available for recognition. In the second row, you see two sorts that differ a lot in the amount of different samples that can exist from the same very object.
  6. It is important to set the expectations right with respect to the kinds of objects that I showed before and the technology that is available. If an object has a lot of texture, it has a higher probability of being more distinguishable within a large set of images, for instance, book covers. It does not work so well when the difference between two hundred objects have no pattern and all the same shade of grey. On the other hand, if the goal is to say “this is a blue shirt”, object classification works smoothly. If an object is deformable, we could create a database with tones of samples but it would become unbearable if you want to do that for a hundred thousand object. On the other hand, you can still train a classification system with many samples of that object in different deformations. What happens if an object is transparent? Let me tell you a story: when Logitech launched a mouse that could work over glass surfaces a few years ago,... well, rumor has it that on the day of the demo, they had to scratch the glass to make it work. The reason was that the sensor needed to "see" the dirty dots and scratches to translate that into motion. As another example, time-of-flight cameras like Kinect see through glass, or in other words, they do not see the glass in front of them. These examples showcase the challenge that glass puts into any sensing. ------- I’ve been restrictive here and for instance Catchoom’s IR engine works with deformable objects, as along as they are textured. Textureless is possible, but depends on the size of the database and how close two objects can be.
  7. In in this 2nd part, I’ll cover fundamental aspects of project development and discuss the pieces that are necessary to deploy an app that includes Image Recognition.
  8. There are three elements that you need for an Image Recognition app to be built. The base of the pyramid is the Image database. This is something that is often overlooked at the beginning. Sometimes, we find customers that consider the collection of images that trigger experiences after they’ve spent resources on building the app. We suggest to spend as much time with the reference images as possible to get the best experience for your users. The second piece is the technology component. There are many options here and I’ll give you some pointers in a minute. And last but not least, Content is always king. Make sure your app is valuable to your users. Image recognition is impressive, but even more impressive is when users want to repeat and come back to your app.
  9. Imagine you prepared your database with any of the images below. Then you try to recognize that logo with a query image like the one on top. For different reference images, you’ll get very different results. The message here is to devote time to the image database. Typically, you’ll learn what works and what doesn’t, but it is good to chat with us to know what will work and what may be an issue. One of our customers augments tattoos. You definitively want to get it right before tattooing your skin.
  10. On-device IR makes sense especially on cases that it is preferable to offload a server infrastructure and provide quick responses to users. This is the case for second screen environments where the user gets content or offers in sync with a TV show. Cloud IR on the other hand is very well suited for magazines or any content that is frequently updated and has a rather uniform traffic.
  11. Let’s compare both at the feature level. While OD looks technically more appealing, it has some limitations when it comes to enable common business interests like content updates or analytics. In general, you will achieve the same results with both, so it depends on the use case or even your business model.
  12. I’ll give you an overview of the vendors in the AR and outside the AR space that can help you with that.
  13. In this list we have AR-vendors. AR vendors offer IR that is used to trigger AR experiences at scale. In other words, they allow to search through larger databases that what would fit into a smartphone by relying on the cloud. The disadvantage of most AR vendors who offer cloud IR, is that they’re designed only for AR and are not that flexible when used for non-AR use cases. Also, for augmented reality it is now commonly known that patterns need well-spread texture. Image recognition is not as demanding, but benefits from curation.
  14. In this list, we have vendors that offer the core service, independently of how you want to use it whether it is to render an AR experience, compare products or anything you’d like to do. The table shows one additional column, which is “On Premises”. Instead of a SaaS, some vendors including Catchoom, license the core server technology to allow others build entire platforms. For example Times of India, the largest publisher in India, among other AR browsers run Catchoom inside their servers. As you can see from this and the previous slide, Catchoom is the only one who offer in both spaces AR and IR, and also have the full set of options.
  15. But the real reason why I like Catchoom is that we have a unique combination of ingredients in our magic sauce. First, our image recognition tests are performed using pictures snapped by users in real world environments – so our technology knows how to handle difficult angles, blurry images, low light conditions and reflections. Second, our passion for seamless interactions. Catchoom was built to give users an easy, seamless image recognition experience – with no knowledge of the technology required. They just keep snapping photos like they always do. Third, the results speak for themselves. An independent benchmark study using images taken by real users rated Catchoom 20% higher on image recognition than our competitors. We also ensure a response within half a second regardless of your location thanks to our servers in the US and EU. And last, you can build entire platforms. Whether you use our service or an on-premises installation, our image recognition software is designed to deliver outstanding performance regardless of the traffic or size of your database. From hundreds of requests per second, to millions of images, we’ve engineered our software to be prepared.
  16. Catchoom is, in fact, already one of the most used IR engines. Even though you may not have not heard of the brand Catchoom, our solution is already behind 420 million image recognitions globally.
  17. And now I’m getting to the last part of the talk to discuss some of the trends that we see in this space.
  18. There is a number of businesses with a long list of products that have a head and a long-tail of popularity. This is typically the case for eCommerce sites. What we see is an increasing demand to search on-device on a subset of images and if there is no match, continue with a cloud request. We have patented technology to provide support for this kind of environments without cutting any corner on the performance.
  19. Imagine you’re a technician that has to repair a very specific part in a Star Destroyer. How can you search through all the catalogue of parts in a fraction of a second just by scanning that part. This is another research line that Catchoom is working on right now.
  20. Fashion is one of the most exhiting sectors for image recognition. Being able to recognize a pair of shoes, a handbag or a complete look is in the mindset of thousands of fashionistas around the world. Catchoom is investing in recent advances in the field of computer vision using a technique that is called deep learning. Deep learning allows neural networks to learn the visual properties of certain objects and be able to classify them with very high precision. ----- Those three are the main trends that we see in the IR space, and Catchoom’s Labs are heavily investing in building the technology that will make them possible in the near future.
  21. 1. Image recognition is the door to a broad range of applications and services in a fast growing market. 2. You can significantly improve the performance with better image databases. 3. Choose on-device or cloud depending on your technical and business needs. 4. Catchoom is already behind 420M interactions and is working on the current trends to meet them in the near future.
  22. Please visit our booth in the next couple of days for live demos. Thanks you very much for your time!
  23. Catchoom in fact is already one of the most used IR engines out there. While maybe you have not heard of the brand Catchoom, our solution is already behind 420 million image recognition interactions across the world.
  24. There are a number of challenges when trying to compare the performance of image recognition vendors. 1. How many of you have around 100,000 images on both sides of the equation, references and test images? That’s probably around the number you need to build to 1M images. 2. Is the infrastructure showing the real experience that your users will have? Let me give you an example, Catchoom has servers in the US and in EU that allow apps to connect to the closest server wherever you are in the world. Is your app global, or simply your customer is in another continent? Take that into account. 3. Performance is not necessarily optimized for a specific use case. So the question is, does that vendor perform so well / wrong? Most vendors provide the same experience to all customers because they cannot fine-tune parameters, but rather offer performance that is on average good for a large variety of cases. If you use 100,000 images, you probably have multiple use cases represented, but if you just have a few, you may not show the full benchmark of that solution.
  25. You’re probably under two situations: Situation #1: you have a customer, with ver few images and you just want it to work like charm. Situation #2: you’re building a self-served service, where your customers or partners will upload images without any supervision. In both cases, my suggestion is to contact the vendor to know exactly what is possible and what not, and whether some tweeks here and there can improve significantly the results. For instance, at Catchoom, we look at particular cases in your results to try to identify improvements or simply different profiles of the internal paramenters that can be tuned for your case. But the reality is that unless you have an On Premises license, you won’t be able to fine tune any paramenter as all cloud service providers have the same performance across all customers.