Why Self-Supervised Learning Transfers Better to New Tasks

•Transferir como PPTX, PDF•

0 gostou•18 visualizações

ssuserbafbd0

Self-supervised learning performance

Tecnologia

Why self-supervised
learning?
Using Self-Supervised Learning Can Improve Model Robustness and Uncertainty,
2019, arXIV
How well do self-supervised models transfer? 2021 , CPVR

Supervised
learning
• Supervised Pipeline
• 1000s of Hours (Human
annotators)
• Limits No of images

Self-supervised
learning
• Automatically generated pseudo-labels from structure of data
• More images More powerful discriminative framework
• After Training Transfer representations to new task
• Reduce need of large volume of data(For downstream tasks)

How well do self-supervised
models transfer?
• Significant improvement (Recent years)
• Approach supervised perform, (ImageNet benchmark
dataset Identical architecture)
Despite still being behind on ImageNet (Initial evidences),
Self-supervised models transfers better to new tasks.

How do you validate the statement?
• Study Transfer performance of self-supervised
pretrained models

Four
concerns
of Transfer
performance
How : Self-supervised transfer
vs Supervised transfer ?
Is there a best self-supervised
method overall ?
Has SSL overfit ? (ImageNet
Benchmark dataset)
Same information represented?
(Sup VS self-sup)

Datasets
Wide varieties of
datasets
Similar datasets such as
ImageNet , CIFAR-10
Quite different dataset :
Medical x-ray images

Answering the
concerns: Method
• 13 pre-trained self-supervised models +
supervised baseline.
• All models : RESNET50 architecture
(Pretrained on ImageNet) without labels.
• Model differences (Hyperparms such as
epochs , batch size, data augmentation )

Transfer Evaluation
(wide range of tasks)
• Linear Evaluation, fine tuning: MSR
• Small/Large domain shift: FSR
• A frozen and Finetuned backbone: OD
• Dense Prediction tasks: SNE | SS
• Surface Normal prediction: Predict the surface orientation
of the object inside a scene

Results: Self-supervised methods are approaching supervised performance on ImageNet

Transfer performance
• Highly correlated with Many-shot recognition
• Increasingly less correlated with few-shot R. | OD | Dense Predictions

1. How does self-
supervised transfer
compare to
supervised transfer ?
Compare self-supervised models to the supervised base line
represented by the green star.

2. Is there Is there a
best self supervised
method overall ?
Across all but one
setting, self-
supervised models
outperform
supervision, showing
their superior
transferability.
But there is no single
model that dominates
all setting, showing
the community still a
way to go to reach
truly general feature.

3. Has SSL overfit to
imageNet as a
benchmark?
• ImageNet performance is highly correlated with many-shot recognition

Correlation is weaker in other cases
In order to achieve more generalizable representations in the future,
the community needs to consider the wider benchmarks for
evaluation.

4. Do self-supervised and supervised features represent the
same information?
• Model features analysis
Analysis
• Image reconstruction from feature vector (Supervised better)
• Why? Self-supervised (lose color information due to heavy data Augm.)
Reconstruction
• SSM: Have wide attentive focus (Attention)
SSM
• Sup: High spatial focus (Location)
Sup

Conclusion
Self-supervision tend to produce
better calibrated classifiers for
downstream recognition tasks.

Mais conteúdo relacionado

Semelhante a Why Self-Supervised Learning Transfers Better to New Tasks

How to use transfer learning to bootstrap image classification and question a...Wee Hyong Tok

Brief History of Visual Representation LearningSangwoo Mo

CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018Universitat Politècnica de Catalunya

OReilly AI Transfer LearningDanielle Dean

TIP_TAViT_presentation.pdfBoahKim2

Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...Ian Morgan

Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...Bayes Nets meetup London

Presentation of master thesisSeoung-Ho Choi

Talk@rmit 09112017Shuai Zhang

Graph Based Machine Learning with Applications to Media AnalyticsNYC Predictive Analytics

Emr a scalable graph based ranking model for content-based image retrievalPvrtechnologies Nellore

Integrated Hidden Markov Model and Kalman Filter for Online Object Trackingijsrd.com

ResNeSt: Split-Attention NetworksSeunghyun Hwang

Tomáš Mikolov - Distributed Representations for NLPMachine Learning Prague

Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]Dongmin Choi

Long-term Face Tracking in the Wild using Deep LearningElaheh Rashedi

Utilizing additional information in factorization methods (research overview,...Balázs Hidasi

Unsupervised/Self-supervvised visual object trackingYu Huang

Pratik ibm-open power-pptVaibhav R

The Analytics Frontier of the Hadoop Eco-Systeminside-BigData.com

Semelhante a Why Self-Supervised Learning Transfers Better to New Tasks (20)

How to use transfer learning to bootstrap image classification and question a...

Brief History of Visual Representation Learning

CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018

OReilly AI Transfer Learning

TIP_TAViT_presentation.pdf

Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...

Presentation of master thesis

Talk@rmit 09112017

Graph Based Machine Learning with Applications to Media Analytics

Emr a scalable graph based ranking model for content-based image retrieval

Integrated Hidden Markov Model and Kalman Filter for Online Object Tracking

ResNeSt: Split-Attention Networks

Tomáš Mikolov - Distributed Representations for NLP

Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]

Long-term Face Tracking in the Wild using Deep Learning

Utilizing additional information in factorization methods (research overview,...

Unsupervised/Self-supervvised visual object tracking

Pratik ibm-open power-ppt

The Analytics Frontier of the Hadoop Eco-System

Último

Scaling API-first – The story of a global engineering organizationRadu Cotescu

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106

04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG

Understanding the Laravel MVC ArchitecturePixlogix Infotech

Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik

Pigging Solutions Piggable Sweeping ElbowsPigging Solutions

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren

08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Key Features Of Token Development (1).pptxLBM Solutions

My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix

[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745

Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies

Why Self-Supervised Learning Transfers Better to New Tasks

1. Why self-supervised learning? Using Self-Supervised Learning Can Improve Model Robustness and Uncertainty, 2019, arXIV How well do self-supervised models transfer? 2021 , CPVR

2. Supervised learning • Supervised Pipeline • 1000s of Hours (Human annotators) • Limits No of images

3. Self-supervised learning • Automatically generated pseudo-labels from structure of data • More images More powerful discriminative framework • After Training Transfer representations to new task • Reduce need of large volume of data(For downstream tasks)

4. How well do self-supervised models transfer? • Significant improvement (Recent years) • Approach supervised perform, (ImageNet benchmark dataset Identical architecture) Despite still being behind on ImageNet (Initial evidences), Self-supervised models transfers better to new tasks.

5. How do you validate the statement? • Study Transfer performance of self-supervised pretrained models

6. Four concerns of Transfer performance How : Self-supervised transfer vs Supervised transfer ? Is there a best self-supervised method overall ? Has SSL overfit ? (ImageNet Benchmark dataset) Same information represented? (Sup VS self-sup)

7. Datasets Wide varieties of datasets Similar datasets such as ImageNet , CIFAR-10 Quite different dataset : Medical x-ray images

8. Answering the concerns: Method • 13 pre-trained self-supervised models + supervised baseline. • All models : RESNET50 architecture (Pretrained on ImageNet) without labels. • Model differences (Hyperparms such as epochs , batch size, data augmentation )

9. Transfer Evaluation (wide range of tasks) • Linear Evaluation, fine tuning: MSR • Small/Large domain shift: FSR • A frozen and Finetuned backbone: OD • Dense Prediction tasks: SNE | SS • Surface Normal prediction: Predict the surface orientation of the object inside a scene

10. Results: Self-supervised methods are approaching supervised performance on ImageNet

11. Transfer performance • Highly correlated with Many-shot recognition • Increasingly less correlated with few-shot R. | OD | Dense Predictions

12. 1. How does self- supervised transfer compare to supervised transfer ? Compare self-supervised models to the supervised base line represented by the green star.

13. 2. Is there Is there a best self supervised method overall ? Across all but one setting, self- supervised models outperform supervision, showing their superior transferability. But there is no single model that dominates all setting, showing the community still a way to go to reach truly general feature.

14. 3. Has SSL overfit to imageNet as a benchmark? • ImageNet performance is highly correlated with many-shot recognition

15. Correlation is weaker in other cases In order to achieve more generalizable representations in the future, the community needs to consider the wider benchmarks for evaluation.

16. 4. Do self-supervised and supervised features represent the same information? • Model features analysis Analysis • Image reconstruction from feature vector (Supervised better) • Why? Self-supervised (lose color information due to heavy data Augm.) Reconstruction • SSM: Have wide attentive focus (Attention) SSM • Sup: High spatial focus (Location) Sup

17. Conclusion Self-supervision tend to produce better calibrated classifiers for downstream recognition tasks.

Why Self-Supervised Learning Transfers Better to New Tasks

Recomendados

Recomendados

Mais conteúdo relacionado

Semelhante a Why Self-Supervised Learning Transfers Better to New Tasks

Semelhante a Why Self-Supervised Learning Transfers Better to New Tasks (20)

Último

Último (20)

Why Self-Supervised Learning Transfers Better to New Tasks