[PR12] PR-063: Peephole predicting network performance before training

•

0 gostou•617 visualizações

Paper review for "Peephole: Predicting Network Performance Before Training (2017)" https://www.youtube.com/watch?v=ZO4bXgdcCQA

Engenharia

Paper reviewed by Taegyun Jeon
Peephole: Predicting Network
Performance Before Training
Boyang Deng, Junjie Yan, Dahua Lin,
“Peephole: Predicting Network Performance Before Training” (2017)
https://arxiv.org/abs/1712.03351
[TensorFlow-KR] PR12

배경 | 높은 성능을 얻으려면?
▪ 결론: 좋은 네트워크를 써야한다.
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 2

배경 | 좋은 네트워크를 얻으려면?
▪ 2가지 고려요소
▫ Large design space
• For Convolutional Neural Networks (CNN)
◦ the number of layers
◦ the number of channels within these layers
◦ whether to insert a pooling layer at certain points
▫ Costly training process
• Z. Zhong, J. Yan, and C. L. Liu. “Practical network blocks design with q-learning”. arXiv preprint
arXiv:1708.05552, 2017.
• B. Zoph and Q. V. Le. “Neural architecture search with reinforcement learning”. arXiv preprint
arXiv:1611.01578, 2016.
• B. Zoph, V. Vasudevan, J. Shlens, and Q. V. Le. “Learning transferable architectures for scalable
image recognition.” arXiv preprint arXiv:1707.07012, 2017.
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 3

문제정의 | 모델 성능 예측
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 4

아이디어 | “네트워크 구조에 대한 성능”을 학습
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 5
𝑦 = 𝑓(𝑥, 𝑡)

제안 | “네트워크 구조” 표현
▪ Unified Layer Code and Layer Embedding
▫ Integer code: TY, KW, KH, CH
• index of 8-bins: CH = [0.25, 0.5, 0.75, 1.0, 1.5, 2.0, 2.5, 3.0]
▫ Layer embedding
• Hidden state of LSTM cell: structural features
• Epoch index: embedded into real-vector
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 6

제안 | “네트워크 구조”와 “성능” 의 데이터
▪ 막연한 생각
▫ Random sampling sequences of layers
• The design space grows exponentially as the number of layers increases.
• Many combinations of layers are not reasonable options from a practical point of view.
▪ Block-based generation
▫ Skeleton + generated blocks
▫ One block contains less than 10 layers
• First layer is convolution layer w/ random
kernel size.
▫ Markov chain
• For predefined transition prob.
from practical networks
▫ Restrict the number of convolution layers
within a block to less than 4
▫ 1x1 convolution for dimension matching
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 7

제안 | 기존 네트워크 구조 = Markov Chain
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 8

제안 | “X:네트워크, Y:성능” 데이터셋
▪ 데이터셋 구성
▫ N 개의 네트워크: {𝑥𝑖}1:𝑁
▫ Performance curves 𝑦𝑖(𝑡)
• Training data로 학습시키면서 epoch 𝑡에서 validation data에 대한 validation accuracy
▫ 𝒟 = {𝑥𝑖, 𝑦𝑖}1:𝑁
▪ Objective function with smooth L1 loss
▫ ℒ(𝒟; 𝜃) =
1
𝑁
σ𝑖=1
𝑛
𝑙(𝑓 𝑥𝑖, 𝑇 , 𝑦𝑖(𝑇))
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 9

실험 | 무엇을 학습할 것인가?
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 10

실험 | 무엇을 학습할 것인가?
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 11

실험 | 무엇을 학습할 것인가?
▪ Comparison
▫ Bayesian Neural Networks and 𝜐-SVR (Support Vector Regression)
▪ Evaluation metrics
▫ Mean Square Error (MSE)
▫ Kendall’s Tau (Tau)
▫ Coefficient of Determination (R2)
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 12

실험 | Transfer to ImageNet
▪ a
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 13

결론 |
▪ Block-based generation
▫ Skeleton + generated blocks
▪ 다른 요소들에 대한 실험은..?
▫ Residual block, Dense connection 등
▪ 결국 평가를 위해선 모든 세팅에 대한 학습 필요
▪ Transfer learning을 위한 최적의 방법인가?
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 14

Mais conteúdo relacionado

Mais procurados

Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017Universitat Politècnica de Catalunya

Electricity price forecasting with Recurrent Neural NetworksTaegyun Jeon

Unsupervised Deep Learning (D2L1 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya

Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017Universitat Politècnica de Catalunya

Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...Universitat Politècnica de Catalunya

Convolutional Neural Networks (D1L3 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya

Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya

D1L5 Visualization (D1L2 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya

Convolutional neural networks 이론과 응용홍배 김

Methodology (DLAI D6L2 2017 UPC Deep Learning for Artificial Intelligence)Universitat Politècnica de Catalunya

DLD meetup 2017, Efficient Deep LearningBrodmann17

Deep Neural Networks (D1L2 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya

Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya

Convolutional Neural NetworkJunho Cho

Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)Universitat Politècnica de Catalunya

Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya

Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...Universitat Politècnica de Catalunya

Attention Models (D3L6 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya

Learning where to look: focus and attention in deep visionUniversitat Politècnica de Catalunya

Deep LearningフレームワークChainerと最近の技術動向Shunta Saito

Mais procurados (20)

Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017

Electricity price forecasting with Recurrent Neural Networks

Unsupervised Deep Learning (D2L1 Insight@DCU Machine Learning Workshop 2017)

Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017

Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...

Convolutional Neural Networks (D1L3 2017 UPC Deep Learning for Computer Vision)

Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)

D1L5 Visualization (D1L2 Insight@DCU Machine Learning Workshop 2017)

Convolutional neural networks 이론과 응용

Methodology (DLAI D6L2 2017 UPC Deep Learning for Artificial Intelligence)

DLD meetup 2017, Efficient Deep Learning

Deep Neural Networks (D1L2 Insight@DCU Machine Learning Workshop 2017)

Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)

Convolutional Neural Network

Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)

Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)

Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...

Attention Models (D3L6 2017 UPC Deep Learning for Computer Vision)

Learning where to look: focus and attention in deep vision

Deep LearningフレームワークChainerと最近の技術動向

Semelhante a [PR12] PR-063: Peephole predicting network performance before training

Finding the best solution for Image ProcessingTech Triveni

DLD_WeightSharing_SlideKang-Ho Lee

An Introduction to Neural Architecture SearchBill Liu

On the value of Sampling and Pruning for SBSEJianfeng Chen

Resnet.pdfYanhuaSi

Introduction to ChainerPreferred Networks

Introduction to ChainerShunta Saito

モデル高速化百選Yusuke Uchida

Resnet.pptxYanhuaSi

State-of-the-art Image Processing across all domainsKnoldus Inc.

Deep Learning Initiative @ NECSTLabNECST Lab @ Politecnico di Milano

CNNs: from the Basics to Recent AdvancesDmytro Mishkin

Pruning convolutional neural networks for resource efficient inferenceKaushalya Madhawa

Image classification with neural networksSepehr Rasouli

Small Deep-Neural-Networks: Their Advantages and Their DesignForrest Iandola

Scalable image recognition model with deep embedding捷恩蔡

Convolutional neural network Yan Xu

Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015Jia-Bin Huang

Issues in AI product development and practices in audio applicationsTaesu Kim

Deep Learning for Computer Vision: Data Augmentation (UPC 2016)Universitat Politècnica de Catalunya

Semelhante a [PR12] PR-063: Peephole predicting network performance before training (20)

Finding the best solution for Image Processing

DLD_WeightSharing_Slide

An Introduction to Neural Architecture Search

On the value of Sampling and Pruning for SBSE

Resnet.pdf

Introduction to Chainer

モデル高速化百選

Resnet.pptx

State-of-the-art Image Processing across all domains

Deep Learning Initiative @ NECSTLab

CNNs: from the Basics to Recent Advances

Pruning convolutional neural networks for resource efficient inference

Image classification with neural networks

Small Deep-Neural-Networks: Their Advantages and Their Design

Scalable image recognition model with deep embedding

Convolutional neural network

Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015

Issues in AI product development and practices in audio applications

Deep Learning for Computer Vision: Data Augmentation (UPC 2016)

Mais de Taegyun Jeon

TensorFlow-KR 3rd meetup - Lightning Talk for SI AnalyticsTaegyun Jeon

TensorFlow Dev Summit 2018 Extended: TensorFlow Eager ExecutionTaegyun Jeon

[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-ResolutionTaegyun Jeon

GDG DevFest Xiamen 2017Taegyun Jeon

[PR12] PR-050: Convolutional LSTM Network: A Machine Learning Approach for Pr...Taegyun Jeon

GDG DevFest Seoul 2017: Codelab - Time Series Analysis for Kaggle using Tenso...Taegyun Jeon

[PR12] PR-036 Learning to Remember Rare EventsTaegyun Jeon

[대전AI포럼] 위성영상 분석 기술 개발 현황 소개Taegyun Jeon

[PR12] PR-026: Notes for CVPR Machine Learning SessionsTaegyun Jeon

[PR12] You Only Look Once (YOLO): Unified Real-Time Object DetectionTaegyun Jeon

[PR12] image super resolution using deep convolutional networksTaegyun Jeon

Google Dev Summit Extended Seoul - TensorFlow: Tensorboard & KerasTaegyun Jeon

TensorFlow KR 2nd Meetup - Lightening talk (Satrec Initiative)Taegyun Jeon

인공지능: 변화와 능력개발Taegyun Jeon

Mais de Taegyun Jeon (14)

TensorFlow-KR 3rd meetup - Lightning Talk for SI Analytics

TensorFlow Dev Summit 2018 Extended: TensorFlow Eager Execution

[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution

GDG DevFest Xiamen 2017

[PR12] PR-050: Convolutional LSTM Network: A Machine Learning Approach for Pr...

GDG DevFest Seoul 2017: Codelab - Time Series Analysis for Kaggle using Tenso...

[PR12] PR-036 Learning to Remember Rare Events

[대전AI포럼] 위성영상 분석 기술 개발 현황 소개

[PR12] PR-026: Notes for CVPR Machine Learning Sessions

[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection

[PR12] image super resolution using deep convolutional networks

Google Dev Summit Extended Seoul - TensorFlow: Tensorboard & Keras

TensorFlow KR 2nd Meetup - Lightening talk (Satrec Initiative)

인공지능: 변화와 능력개발

Último

Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile

Intze Overhead Water Tank Design by Working Stress - IS Method.pdfSuman Jyoti

KubeKraft presentation @CloudNativeHooghlysanyuktamishra911

Online banking management system project.pdfKamal Acharya

Double rodded leveling 1 pdf activity 01KreezheaRecto

Block diagram reduction techniques in control systems.pptNANDHAKUMARA10

(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7Call Girls in Nagpur High Profile Call Girls

Thermal Engineering -unit - III & IV.pptDineshKumar4165

Intro To Electric Vehicles PDF Notes.pdfrs7054576148

Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi

Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Arindam Chakraborty, Ph.D., P.E. (CA, TX)

Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi

VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698

(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7Call Girls in Nagpur High Profile Call Girls

ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya

UNIT - IV - Air Compressors and its Performancesivaprakash250

FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsArindam Chakraborty, Ph.D., P.E. (CA, TX)

Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control

Thermal Engineering-R & A / C - unit - VDineshKumar4165

Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile

[PR12] PR-063: Peephole predicting network performance before training

1. Paper reviewed by Taegyun Jeon Peephole: Predicting Network Performance Before Training Boyang Deng, Junjie Yan, Dahua Lin, “Peephole: Predicting Network Performance Before Training” (2017) https://arxiv.org/abs/1712.03351 [TensorFlow-KR] PR12

2. 배경 | 높은 성능을 얻으려면? ▪ 결론: 좋은 네트워크를 써야한다. [PR12] Peephole: Predicting Network Performance Before Training (2017) Page 2

3. 배경 | 좋은 네트워크를 얻으려면? ▪ 2가지 고려요소 ▫ Large design space • For Convolutional Neural Networks (CNN) ◦ the number of layers ◦ the number of channels within these layers ◦ whether to insert a pooling layer at certain points ▫ Costly training process • Z. Zhong, J. Yan, and C. L. Liu. “Practical network blocks design with q-learning”. arXiv preprint arXiv:1708.05552, 2017. • B. Zoph and Q. V. Le. “Neural architecture search with reinforcement learning”. arXiv preprint arXiv:1611.01578, 2016. • B. Zoph, V. Vasudevan, J. Shlens, and Q. V. Le. “Learning transferable architectures for scalable image recognition.” arXiv preprint arXiv:1707.07012, 2017. [PR12] Peephole: Predicting Network Performance Before Training (2017) Page 3

4. 문제정의 | 모델 성능 예측 [PR12] Peephole: Predicting Network Performance Before Training (2017) Page 4

5. 아이디어 | “네트워크 구조에 대한 성능”을 학습 [PR12] Peephole: Predicting Network Performance Before Training (2017) Page 5 𝑦 = 𝑓(𝑥, 𝑡)

6. 제안 | “네트워크 구조” 표현 ▪ Unified Layer Code and Layer Embedding ▫ Integer code: TY, KW, KH, CH • index of 8-bins: CH = [0.25, 0.5, 0.75, 1.0, 1.5, 2.0, 2.5, 3.0] ▫ Layer embedding • Hidden state of LSTM cell: structural features • Epoch index: embedded into real-vector [PR12] Peephole: Predicting Network Performance Before Training (2017) Page 6

7. 제안 | “네트워크 구조”와 “성능” 의 데이터 ▪ 막연한 생각 ▫ Random sampling sequences of layers • The design space grows exponentially as the number of layers increases. • Many combinations of layers are not reasonable options from a practical point of view. ▪ Block-based generation ▫ Skeleton + generated blocks ▫ One block contains less than 10 layers • First layer is convolution layer w/ random kernel size. ▫ Markov chain • For predefined transition prob. from practical networks ▫ Restrict the number of convolution layers within a block to less than 4 ▫ 1x1 convolution for dimension matching [PR12] Peephole: Predicting Network Performance Before Training (2017) Page 7

8. 제안 | 기존 네트워크 구조 = Markov Chain [PR12] Peephole: Predicting Network Performance Before Training (2017) Page 8

9. 제안 | “X:네트워크, Y:성능” 데이터셋 ▪ 데이터셋 구성 ▫ N 개의 네트워크: {𝑥𝑖}1:𝑁 ▫ Performance curves 𝑦𝑖(𝑡) • Training data로 학습시키면서 epoch 𝑡에서 validation data에 대한 validation accuracy ▫ 𝒟 = {𝑥𝑖, 𝑦𝑖}1:𝑁 ▪ Objective function with smooth L1 loss ▫ ℒ(𝒟; 𝜃) = 1 𝑁 σ𝑖=1 𝑛 𝑙(𝑓 𝑥𝑖, 𝑇 , 𝑦𝑖(𝑇)) [PR12] Peephole: Predicting Network Performance Before Training (2017) Page 9

10. 실험 | 무엇을 학습할 것인가? [PR12] Peephole: Predicting Network Performance Before Training (2017) Page 10

11. 실험 | 무엇을 학습할 것인가? [PR12] Peephole: Predicting Network Performance Before Training (2017) Page 11

12. 실험 | 무엇을 학습할 것인가? ▪ Comparison ▫ Bayesian Neural Networks and 𝜐-SVR (Support Vector Regression) ▪ Evaluation metrics ▫ Mean Square Error (MSE) ▫ Kendall’s Tau (Tau) ▫ Coefficient of Determination (R2) [PR12] Peephole: Predicting Network Performance Before Training (2017) Page 12

13. 실험 | Transfer to ImageNet ▪ a [PR12] Peephole: Predicting Network Performance Before Training (2017) Page 13

14. 결론 | ▪ Block-based generation ▫ Skeleton + generated blocks ▪ 다른 요소들에 대한 실험은..? ▫ Residual block, Dense connection 등 ▪ 결국 평가를 위해선 모든 세팅에 대한 학습 필요 ▪ Transfer learning을 위한 최적의 방법인가? [PR12] Peephole: Predicting Network Performance Before Training (2017) Page 14

[PR12] PR-063: Peephole predicting network performance before training

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Semelhante a [PR12] PR-063: Peephole predicting network performance before training

Semelhante a [PR12] PR-063: Peephole predicting network performance before training (20)

Mais de Taegyun Jeon

Mais de Taegyun Jeon (14)

Último

Último (20)

[PR12] PR-063: Peephole predicting network performance before training