NIPS2015 reading explores visual recognition biases

1) The document discusses a paper on improving visual recognition systems by leveraging human visual biases and generating images from random features. 2) It describes estimating visual biases from human psychophysics experiments, then using those biases to reconstruct images from random features. The reconstructed images can then be used to train machine learning models. 3) The document outlines experiments showing that incorporating estimated human visual biases into machine learning models, such as SVMs, can help improve visual recognition performance compared to models trained without biases.

1-page summary
• Improving visual recognition systems with the help of
– image reconstruction from (random) features
– a huge amount of human annotations for generated images
• Three components
– Image reconstruction (Computer vision)
– Visual bias understanding (Human psychophysics)
– Model training (Machine learning)

Visual biases in human visual systems
• Canonical views: preferred ways of viewing objects [Mezuman+ NIPS12]
• Gestalt law: tendency to organize visual elements into unified groups
http://graphicdesign.spokanefalls.edu/tutorials/process/gestaltprinciples/gestaltprinc.htm

Classification images
• Finding the internal template in the human visual system that discriminates 2 classes [Ahumada Perception96]
$f(\mathbf{x}; \mathbf{c}) = \mathbf{c}^\top \mathbf{x}$
$\mathbf{c} = (\boldsymbol{\mu}_{AA} + \boldsymbol{\mu}_{BA}) - (\boldsymbol{\mu}_{AB} + \boldsymbol{\mu}_{BB})$
$\boldsymbol{\mu}_{AB}$: average of the stimuli where
• The true class = A
• The predicted class = B
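
The template formula above can be sketched in a few lines. This is a minimal illustration, not the original authors' code: the function names, the list-based vectors, and the toy stimuli and responses are all assumptions made up for the example. Each mean groups the stimuli by (true class, predicted class), and the template is the difference between the "predicted A" means and the "predicted B" means.

```python
def mean_vector(vectors, dim):
    """Element-wise mean of equal-length vectors (zero vector if the list is empty)."""
    if not vectors:
        return [0.0] * dim
    return [sum(v[k] for v in vectors) / len(vectors) for k in range(dim)]


def classification_image(stimuli, true_labels, predicted_labels):
    """Return c = (mu_AA + mu_BA) - (mu_AB + mu_BB).

    mu_XY is the average of the stimuli whose true class is X and whose
    predicted (observer-reported) class is Y.
    """
    dim = len(stimuli[0])
    groups = {("A", "A"): [], ("A", "B"): [], ("B", "A"): [], ("B", "B"): []}
    for x, true, predicted in zip(stimuli, true_labels, predicted_labels):
        groups[(true, predicted)].append(x)
    mu = {key: mean_vector(members, dim) for key, members in groups.items()}
    return [
        (mu[("A", "A")][k] + mu[("B", "A")][k])
        - (mu[("A", "B")][k] + mu[("B", "B")][k])
        for k in range(dim)
    ]


# Toy data, invented for illustration: four 2-D stimuli and observer answers.
stimuli = [[1.0, 0.0], [0.8, 0.2], [0.1, 0.9], [0.0, 1.0]]
true_labels = ["A", "A", "B", "B"]
predicted_labels = ["A", "B", "A", "B"]
c = classification_image(stimuli, true_labels, predicted_labels)
```

Note that the true label only decides which group a stimulus falls into; the sign of each term depends on the observer's answer alone, which is why the result reflects the observer's template rather than the ground truth.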

Estimating biases in feature spaces
• No real images required
– robust to many issues in dataset bias
– It scales
• Pipeline: random feature → feature inverse → generated image → human answer to "Is this a television or not?" (Yes / No)
– 150,000 features sampled from a standard Gaussian
– Feature inversion: HOGgles [Vondrick+ ICCV13]
– Annotation: Amazon Mechanical Turk (MT)
• $\mathbf{c} = \boldsymbol{\mu}_{Yes} - \boldsymbol{\mu}_{No}$: approximate internal template of people
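
The estimation loop on this slide can be simulated end to end without any images. The sketch below is only an illustration under strong assumptions: `simulated_observer` and `hidden_template` are hypothetical stand-ins for a human annotator, the feature inversion step is skipped entirely, and the dimension and sample count are far smaller than the 150,000 features mentioned above.

```python
import math
import random


def estimate_bias(num_samples, dim, observer, seed=0):
    """Estimate c = mu_yes - mu_no from standard-Gaussian random features.

    `observer(x)` returns True ("yes") or False ("no"). On the slide this
    answer comes from a person looking at the image generated from x.
    """
    rng = random.Random(seed)
    sums = {True: [0.0] * dim, False: [0.0] * dim}
    counts = {True: 0, False: 0}
    for _ in range(num_samples):
        x = [rng.gauss(0.0, 1.0) for _ in range(dim)]  # one random feature
        answer = bool(observer(x))
        counts[answer] += 1
        for k in range(dim):
            sums[answer][k] += x[k]
    mu_yes = [s / max(counts[True], 1) for s in sums[True]]
    mu_no = [s / max(counts[False], 1) for s in sums[False]]
    return [a - b for a, b in zip(mu_yes, mu_no)]


def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Hypothetical annotator: says "yes" when the feature agrees with a hidden
# template. Real annotators are noisier and see images, not features.
hidden_template = [1.0, -0.5, 0.0, 0.25, 0.0, 0.0, 2.0, -1.0]


def simulated_observer(x):
    return sum(t * xi for t, xi in zip(hidden_template, x)) > 0


c = estimate_bias(num_samples=5000, dim=len(hidden_template),
                  observer=simulated_observer)
similarity = cosine(c, hidden_template)
```

In this toy setting the estimated `c` points in nearly the same direction as the hidden template, which is the sense in which the difference of means approximates the observer's internal template.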

Reconstructing images from features
[Weinzaepfel+ CVPR11]
[Vondrick+ ICCV13]
[Kato+ CVPR14]
[Mahendran+ CVPR15]

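HOGgles
• Paired dictionary learning
– Can be applied to other features such as CNN.
[Vondrick+ ICCV13]
(Figure: feature $\mathbf{y}_i$ and reconstructed image $\hat{\mathbf{x}}_i$, linked through the paired dictionaries $\mathbf{V}$ and $\mathbf{U}$.)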
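
The paired-dictionary idea can be written out roughly as follows. The slide shows only the symbols, so the roles assigned here are assumptions based on how HOGgles is usually described: $\mathbf{U}$ as the image dictionary, $\mathbf{V}$ as the feature dictionary, a shared sparse code $\boldsymbol{\alpha}_i$, and a sparsity budget $\lambda$. An image and its feature share one code, so inverting a feature means solving for the code in feature space and decoding it in image space.

```latex
\begin{aligned}
\mathbf{x}_i &\approx \mathbf{U}\boldsymbol{\alpha}_i, \qquad
\mathbf{y}_i \approx \mathbf{V}\boldsymbol{\alpha}_i \\
\boldsymbol{\alpha}_i^{*} &= \arg\min_{\boldsymbol{\alpha}}
  \lVert \mathbf{V}\boldsymbol{\alpha} - \mathbf{y}_i \rVert_2^2
  \quad \text{s.t.} \quad \lVert \boldsymbol{\alpha} \rVert_1 \le \lambda \\
\hat{\mathbf{x}}_i &= \mathbf{U}\boldsymbol{\alpha}_i^{*}
\end{aligned}
```

Because only $\mathbf{V}$ depends on the feature type, the same scheme carries over to other features by relearning the dictionary pair, which is consistent with the remark that it can be applied to CNN features.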

Leveraging biases for recognition
• Directly utilizing the visual biases c as a classifier
$f(\mathbf{x}; \mathbf{c}) = \mathbf{c}^\top \mathbf{x}$
• Shape is an important bias to discriminate objects in CNN features.
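
Using the bias directly as a classifier amounts to a single dot product. The snippet below is a minimal sketch: the function names, the zero decision threshold, and the 3-D template and features are assumptions invented for the example, not values from the slides.

```python
def bias_score(x, c):
    """f(x; c) = c^T x for a feature vector x and a bias template c."""
    if len(x) != len(c):
        raise ValueError("feature and template must have the same length")
    return sum(ci * xi for ci, xi in zip(c, x))


def classify(x, c, threshold=0.0):
    """Label x positive when its score exceeds the threshold (assumed to be 0)."""
    return bias_score(x, c) > threshold


# Made-up template and features, for illustration only.
c = [0.5, -1.0, 0.25]
features = [[2.0, 0.0, 0.0], [0.0, 1.0, 0.0], [1.0, 1.0, 4.0]]
labels = [classify(x, c) for x in features]
```

No training images are involved at this stage: the template comes entirely from the human responses, so this corresponds to recognition with zero real training examples.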

Learning with human biases
• Incorporating human biases into learning algorithms for visual recognition
– SVM with orientation (= bias) constraints
(Figure: hyperplane for classification and visual bias.)
• It can be solved as a conic program by introducing $\alpha$ satisfying
$\sqrt{\mathbf{w}^\top \mathbf{w}} \le \alpha \le \mathbf{w}^\top \mathbf{c} / \theta$
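
A sketch of where the bound on $\alpha$ comes from, assuming a standard soft-margin SVM with training pairs $(\mathbf{x}_i, y_i)$, slack variables $\xi_i$, regularization weight $\lambda$, and a unit-norm bias $\mathbf{c}$. Everything beyond $\mathbf{w}$, $\mathbf{c}$, $\alpha$ and $\theta$ is notation assumed here, not taken from the slide. The orientation constraint is read as a lower bound $\theta > 0$ on the cosine between the hyperplane normal $\mathbf{w}$ and the bias $\mathbf{c}$, so the left-hand term of the bound is the norm $\sqrt{\mathbf{w}^\top \mathbf{w}}$:

```latex
\begin{aligned}
\min_{\mathbf{w},\,\boldsymbol{\xi}} \quad
  & \frac{\lambda}{2}\,\mathbf{w}^\top \mathbf{w} + \sum_i \xi_i \\
\text{s.t.} \quad
  & y_i\,\mathbf{w}^\top \mathbf{x}_i \ge 1 - \xi_i, \qquad \xi_i \ge 0, \\
  & \frac{\mathbf{w}^\top \mathbf{c}}{\sqrt{\mathbf{w}^\top \mathbf{w}}} \ge \theta
    \;\Longleftrightarrow\;
    \sqrt{\mathbf{w}^\top \mathbf{w}} \le \frac{\mathbf{w}^\top \mathbf{c}}{\theta}
    \quad (\mathbf{w} \ne \mathbf{0}).
\end{aligned}
```

The rewritten constraint bounds a Euclidean norm by a linear function of $\mathbf{w}$, which is a second-order cone constraint. Any $\alpha$ with $\sqrt{\mathbf{w}^\top \mathbf{w}} \le \alpha \le \mathbf{w}^\top \mathbf{c} / \theta$ certifies it, which is why introducing $\alpha$ turns the problem into a conic program.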

Experiments
• The performance is improved as the number of positive samples increases.
• The proposed method (SVM + Human) significantly improves the performance.

Experiments (cont.)
• The proposed method (SVM + bias) MAY help alleviate some dataset bias issues.
(Figure panels: SVM only; bias only.)

Slides: "NIPS2015 reading" by Akisato Kimura, covering a paper presented at NIPS2015.

Slides 9 and 10, "Visualizing biases" and "Visualizing biases (cont.)": figures only.
