Large-scale Semantic Visual Search
NGUYEN ANH TUAN
tuannguyen.research@gmail.com
2016/07/17
About me
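• The University of Tokyo, Graduate School of Information Science and Technology, second-year master's student
• Research topics: object retrieval, information retrieval, etc.
• Hobbies: swimming, Go
• Blog: https://imsmarxen68.tumblr.com/
NGUYEN ANH TUAN, The University of Tokyo, Information Science and Technology, second-year master's student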
A picture is worth a thousand words
Outline
• Semantic Visual Search
• A visual search framework
Image credits: http://ai.stanford.edu/~jkrause/cars/car_dataset.html
Pipeline: Feature extraction → Feature aggregation → Feature matching (preliminary results) → Re-ranking (final results)
Visual search
Image credits: http://ai.stanford.edu/~jkrause/cars/car_dataset.html
Image credits: http://google.com
What’s the problem?
• Semantic difficulties: fine-grained differences
Image credits: http://ai.stanford.edu/~jkrause/cars/car_dataset.html
But what about the search problem?
Image credits: http://ai.stanford.edu/~jkrause/cars/car_dataset.html
Figure: a query image compared against database images, each labeled with a score (0.1, 0.5, 0.2)
• Search is a ranking problem over images that differ by fine-grained changes
• Goal: find visual representations that capture all fine-grained local information in images
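The ranking view of search can be sketched in a few lines. This is only an illustration: it assumes each image is already represented by a fixed-length vector and that cosine similarity is the score, neither of which the slides state. The vectors, the image names and the `rank_database` helper are hypothetical.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def rank_database(query, database):
    """Return (image_id, score) pairs sorted from most to least similar."""
    scored = [(image_id, cosine_similarity(query, vec))
              for image_id, vec in database.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

# Hypothetical 3-D representations of one query and three database images.
query = [1.0, 0.0, 1.0]
database = {
    "car_a": [0.0, 1.0, 0.0],  # shares nothing with the query
    "car_b": [1.0, 0.1, 0.9],  # differs from the query only slightly
    "car_c": [1.0, 1.0, 0.0],  # shares one component with the query
}
ranking = rank_database(query, database)
for image_id, score in ranking:
    print(image_id, round(score, 3))
```

The closer two representations are, the harder it is to separate fine-grained classes by score alone, which is why the choice of representation matters so much here.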
Large-scale Visual Search
Robust feature extraction
• Robust to
– Scale changes
– Rotation and affine changes
– Blur, sharpening, …
Image credits: http://ai.stanford.edu/~jkrause/cars/car_dataset.html
Statistical kernels
• Bag-of-Features (BoF)
• Fisher kernel (GMM) [1]
• VLAD (K-means) [2]
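As a rough illustration of two of the aggregation schemes listed above, the sketch below turns a set of local descriptors into one BoF histogram and one VLAD vector. It assumes a tiny hand-picked codebook (in practice the centroids would come from k-means) and 2-D descriptors; all values and function names are hypothetical. The Fisher kernel variant, which uses a GMM instead of hard assignments, is not covered.

```python
import math

def nearest_centroid(descriptor, codebook):
    """Index of the codebook centroid closest to the descriptor (squared L2)."""
    def sq_dist(centroid):
        return sum((d - c) ** 2 for d, c in zip(descriptor, centroid))
    return min(range(len(codebook)), key=lambda k: sq_dist(codebook[k]))

def bag_of_features(descriptors, codebook):
    """BoF: count how many local descriptors are assigned to each visual word."""
    hist = [0] * len(codebook)
    for d in descriptors:
        hist[nearest_centroid(d, codebook)] += 1
    return hist

def vlad(descriptors, codebook):
    """VLAD: per-centroid sum of residuals, concatenated and L2-normalized."""
    dim = len(codebook[0])
    sums = [[0.0] * dim for _ in codebook]
    for d in descriptors:
        k = nearest_centroid(d, codebook)
        for j in range(dim):
            sums[k][j] += d[j] - codebook[k][j]
    flat = [v for row in sums for v in row]
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat

# Hypothetical codebook with two visual words and three local descriptors.
codebook = [[0.0, 0.0], [10.0, 10.0]]
descriptors = [[1.0, 0.0], [0.0, 1.0], [9.0, 10.0]]

bof = bag_of_features(descriptors, codebook)  # one count per visual word
v = vlad(descriptors, codebook)               # len(codebook) * dim values
print(bof, v)
```

BoF keeps only the counts per visual word, while VLAD also keeps where the descriptors sit relative to each centroid, at the cost of a longer vector.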
Image credits: http://www.mathworks.com/matlabcentral/
[1] F. Perronnin, C. Dance, “Fisher Kernels on Visual Vocabularies for Image Categorization,” in Proc. CVPR, IEEE, 2007.
[2] H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez, C. Schmid, “Aggregating Local Image Descriptors into Compact Codes,” IEEE Trans. Pattern Anal. Mach. Intell. 34 (2012) 1704–1716.
Image matching = Feature matching
• Feature matching → nearest neighbor search
– Fast search with inverted indices
– Compressed data for better memory usage [3]
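The two ideas above can be sketched on toy data. Part 1 builds an inverted index from visual word ids to image ids, so a query only touches images that share a word with it. Part 2 follows the product quantization idea of [3]: each vector is stored as a few centroid indices, and query distances are approximated by table look-ups. The codebooks are hand-picked rather than trained, and every name and value is hypothetical.

```python
from collections import defaultdict

# Part 1: inverted index (visual word id -> images containing that word).
def build_inverted_index(image_words):
    index = defaultdict(set)
    for image_id, words in image_words.items():
        for word in words:
            index[word].add(image_id)
    return index

def candidates(query_words, index):
    """Images sharing at least one visual word with the query."""
    found = set()
    for word in query_words:
        found |= index.get(word, set())
    return found

index = build_inverted_index({"img_a": [3, 7], "img_b": [7, 9], "img_c": [1]})
shortlist = candidates([7, 2], index)

# Part 2: product quantization with asymmetric distance computation.
def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def split(vector, num_subvectors):
    """Cut a vector into equal-length consecutive sub-vectors."""
    step = len(vector) // num_subvectors
    return [vector[i * step:(i + 1) * step] for i in range(num_subvectors)]

def pq_encode(vector, codebooks):
    """Replace each sub-vector by the index of its nearest centroid."""
    parts = split(vector, len(codebooks))
    return [min(range(len(cb)), key=lambda k: sq_dist(part, cb[k]))
            for part, cb in zip(parts, codebooks)]

def adc_tables(query, codebooks):
    """Per sub-space table of squared distances from the query to each centroid."""
    parts = split(query, len(codebooks))
    return [[sq_dist(part, c) for c in cb] for part, cb in zip(parts, codebooks)]

def adc_distance(code, tables):
    """Approximate squared distance: one table look-up per sub-space."""
    return sum(table[idx] for idx, table in zip(code, tables))

# Hypothetical codebooks: 2 sub-spaces of dimension 2, 2 centroids each.
codebooks = [
    [[0.0, 0.0], [1.0, 1.0]],
    [[0.0, 0.0], [4.0, 4.0]],
]
vectors = {"img_a": [0.1, 0.0, 3.9, 4.2], "img_b": [0.9, 1.1, 0.2, 0.1]}
codes = {name: pq_encode(vec, codebooks) for name, vec in vectors.items()}

tables = adc_tables([1.0, 1.0, 0.0, 0.0], codebooks)
approx = {name: adc_distance(code, tables) for name, code in codes.items()}
best = min(approx, key=approx.get)
print(sorted(shortlist), codes, approx, best)
```

Only the short codes need to stay in memory for the database side; the query itself is never quantized, which is what "asymmetric" refers to.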
Figure label: Data compression
[3] H. Jégou, M. Douze, C. Schmid, “Product Quantization for Nearest Neighbor Search,” IEEE Trans. Pattern Anal. Mach. Intell. 33 (2011) 117–128.
Verification
• Geometry verification
– RANSAC methods [4]
– Reject outlier matches and re-rank by the number of good inliers
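A minimal RANSAC sketch in the spirit of [4] is given below. It assumes the simplest possible geometric model, a pure 2-D translation between matched keypoints, so that one match is enough to form a hypothesis; real systems usually fit an affine transform or a homography instead. The match coordinates, the threshold and the `ransac_translation` helper are hypothetical.

```python
import random

def ransac_translation(matches, threshold=1.0, iterations=50, seed=0):
    """Estimate a 2-D translation from point matches and return its inliers.

    matches: list of ((qx, qy), (dx, dy)) query/database keypoint pairs.
    """
    rng = random.Random(seed)
    best_model, best_inliers = None, []
    for _ in range(iterations):
        # Minimal sample: a single match fully determines a translation.
        (qx, qy), (dx, dy) = rng.choice(matches)
        tx, ty = dx - qx, dy - qy
        # Consensus set: matches that agree with this translation.
        inliers = [m for m in matches
                   if abs(m[0][0] + tx - m[1][0]) <= threshold
                   and abs(m[0][1] + ty - m[1][1]) <= threshold]
        if len(inliers) > len(best_inliers):
            best_model, best_inliers = (tx, ty), inliers
    return best_model, best_inliers

# Four matches consistent with a shift of (10, 5) and one wrong match.
matches = [
    ((0, 0), (10, 5)),
    ((1, 2), (11, 7)),
    ((3, 1), (13, 6)),
    ((5, 5), (15, 10)),
    ((2, 2), (40, -3)),  # outlier
]
model, inliers = ransac_translation(matches)
print(model, len(inliers))
```

The size of the consensus set can then serve as a verification score for re-ranking the preliminary results, since a wrong database image rarely yields many geometrically consistent matches.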
Image credits: http://ai.stanford.edu/~jkrause/cars/car_dataset.html
[4] M.A. Fischler, R.C. Bolles, “Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography,” Commun. ACM 24 (1981) 381–395.
Thank you for listening

Artificial Intelligence × Machine Learning Starting Today Meetup (今日から始める人工知能 × 機械学習 Meetup), Lightning Talk 1
