SlideShare uma empresa Scribd logo
1 de 33
Baixar para ler offline
[course site]
Javier Ruiz Hidalgo
j.ruiz@upc.edu
Associate Professor
Universitat Politecnica de Catalunya
Technical University of Catalonia
3D
Day 4 Lecture 1
#DLUPC
Acknowledgments
2
Belen Luque López
CV Master student
Alba Pujol Miró
PhD Student
Outline
● Motivation
● Point Clouds
● 3D datasets
● Deep Learning considerations
● Techniques
○ 2.5D
○ Voxelization
○ Projection
○ Direct
● Conclusions
3
Motivation
● New / cheaper / smaller sensors to acquire 3D structure of the scene
○ Microsoft Kinect, Structure Sensor , Primesense Carmine
○ New datasets
● Virtual and Augmented reality applications
Oculus rift
Primesense Carmine
Kinect
4
Point clouds (1)
● Common representation for 3D data
● Collection of data points defined by a given
coordinates system
● Represents surface of objects
● Usually in Cartesian Coordinate System
○ X,Y,Z coordinates for each point of the cloud
3D Point cloud of a Torus
5
Point clouds (2)
Extra data can be added
for each point:
● Color (RGB)
● Orientation
● Curvature
Example of a point cloud
6
List of unordered XYZ values
Point clouds vs. Meshes
● A more complete 3D representation may
include faces defined between points / vertices
Mesh representing a dolphin
Mesh representation as a list of face-Vertex
7
3D datasets: Classification
● Large Dataset of Object Scans
○ PrimeSense Carmine sensor
○ 10k scans
○ 43 objects
8
3D datasets: Pose estimation
● T-less: An RGB-D Dataset for 6D Pose Estimation of Texture-less Objects
○ Primesense Carmine, Kinect v2 and Canon IXUS 950 sensors
○ 38k (training) + 10k (test) scans
○ 30 objects + groundtruth pose
9
3D datasets: Segmentation
● Sematic3d
○ Velodyne LIDAR sensor
○ 30 scenes, 1 billion labelled points
○ 8 classes
● ScanNet
○ Structure sensor
○ 1.5k scenes, 2.5M views
○ 20 classes
10
3D datasets: Autonomous driving
● Cityscapes: semantic understanding of urban street scenes
○ Stereo cameras
○ 5 cities, 20k images
○ 20 classes (instances)
11
3D datasets: Scene understanding (1)
● SUN RGB-D: A RGB-D Scene Understanding Benchmark Suite
○ Kinect sensor
○ 10k scans
12
3D datasets: Scene understanding (2)
● Stanford 2D-3D-Semantics dataset
○ Kinect sensor
○ 70k scans, 6 areas with over 6000 m2
○ 13 classes
13
Deep learning from 3D point clouds? (1)
There are several challenges when using 3D point clouds in a deep learning
framework:
1. Undefined neighborhood
Image neighbours are easily defined by
their connectivity
14
Point cloud neighbours need to be
explicitly defined (Euclidean distance)
Deep learning from 3D point clouds? (2)
There are several challenges when using 3D point clouds in a deep learning
framework:
2. No lattice (convolution layers?)
15
Deep learning from 3D point clouds? (3)
There are several challenges when using 3D point clouds in a deep learning
framework:
3. Different density
16
Velodyne LIDAR data
Techniques
17
RGB-D / 2.5D Data
Voxelization
Multiview Projection Point clouds
RGB-D / 2.5D data (1)
Use depth as RGB + Depth (RGB-D) images
● Very common and used in the deep learning literature
○ RAW data from Kinect / Structure / Primesense sensors
● Multiple applications (classification, gesture recognition, semantic
segmentation, etc.)
18
RGB DEPTH POINT
CLOUD
RGB-D / 2.5D data (2)
● Straight-forward solution → include depth as a new channel (RGBD input)
19
RGB-D / 2.5D data (3)
● However, better results are obtained when depth is incorporated as a
two-stream network
20
N. Audebert, et al. Fusion of heterogeneous data in convolutional networks for urban semantic labeling, JURSE 2017
Voxelization
● Discretize 3D space with occupancy grid
● Use 3D convolutional layers
● Difficult to define a voxel size for all applications (density of point clouds)
● Memory requirements 21
semantic3d.net
Point cloud Voxel occupancy
grid
Voxelization: Examples
22Z. Wu, et al. 3D ShapeNets: A Deep Representation for Volumetric
Shapes, CVPR 2015
D. Maturana, et al. VoxNet: A 3D Convolutional Neural Network for
real-time object recognition, IROS 2015
3D ShapeNets VoxNet
Projection
● Instead of using the point cloud directly or voxelize it, project back the point
cloud into 1 (single) or several (multi-view) images
● Use the projected images as input tensors for the network
23
Correspondence matching
● Find correspondent 3D points between two point clouds
Classify both projections as
correspondences or not (binary
classification)
Projection: Example (1)
24
Point cloud 1
Point cloud 2
Project neighbouring points into
principal plane for the two
candidates
A Pujol, J.R. Casas, J. Ruiz-Hidalgo, (Work in progress)
Projection: Example (2)
Multiview projections
25
H. Su, et al. Multi-view Convolutional Neural Networks for 3D Shape Recognition, ICCV, 2015
Voxelization vs. Projection (1)
Experiments indicate that projection based multiview CNNs perform better than
voxelized volumetric representations
● Accuracy in object classification
26
C. R. Qi, et al. Volumetric and Multi-View CNNs for Object Classification on 3D Data, CVPR 2016
Voxelization vs. Projection (2)
Experiments indicate that projection based multiview CNNs perform better than
voxelized volumetric representations
● New architectures are closing the gap
27
C. R. Qi, et al. Volumetric and Multi-View CNNs for Object Classification on 3D Data, CVPR 2016
Point clouds
Is it possible to use point clouds directly in a learning framework?
28
PointNet (1)
Novel deep learning architecture that directly consumes point clouds
29
C. R. Qi, et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation, CVPR 2017
PointNet (2)
Input is an unordered list of XYZ coordinates (point cloud)
30
C. R. Qi, et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation, CVPR 2017
Mini-network to learn transformation
matrix to make the network invariant to
rotation, translation and scale
Similar idea to learn a transformation in
the feature space to align features
PointNet (3)
Classification results
31
C. R. Qi, et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation, CVPR 2017
Conclusions
● Point clouds rich representation of 3D data
● Working with point clouds on deep learning frameworks is a challenge due to
the unorganized nature of point clouds
● Techniques
○ RGB-D / 2.5D data
○ Voxelization
○ Multi-view projections
○ Point clouds as input gaining momentum
32
Questions?
33

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Semantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesSemantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network Approaches
 
Deep Learning for Computer Vision: Transfer Learning and Domain Adaptation (U...
Deep Learning for Computer Vision: Transfer Learning and Domain Adaptation (U...Deep Learning for Computer Vision: Transfer Learning and Domain Adaptation (U...
Deep Learning for Computer Vision: Transfer Learning and Domain Adaptation (U...
 
Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)
Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)
Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)
 
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic SegmentationSemantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
 
Unsupervised Deep Learning (D2L1 Insight@DCU Machine Learning Workshop 2017)
Unsupervised Deep Learning (D2L1 Insight@DCU Machine Learning Workshop 2017)Unsupervised Deep Learning (D2L1 Insight@DCU Machine Learning Workshop 2017)
Unsupervised Deep Learning (D2L1 Insight@DCU Machine Learning Workshop 2017)
 
Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018
Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018
Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018
 
Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...
Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...
Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...
 
Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)
Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)
Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)
 
Deep Learning for Computer Vision: Data Augmentation (UPC 2016)
Deep Learning for Computer Vision: Data Augmentation (UPC 2016)Deep Learning for Computer Vision: Data Augmentation (UPC 2016)
Deep Learning for Computer Vision: Data Augmentation (UPC 2016)
 
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
 
Deep Learning for Computer Vision: Deep Networks (UPC 2016)
Deep Learning for Computer Vision: Deep Networks (UPC 2016)Deep Learning for Computer Vision: Deep Networks (UPC 2016)
Deep Learning for Computer Vision: Deep Networks (UPC 2016)
 
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
 
Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018
Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018
Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018
 
Deep Learning for Computer Vision: Segmentation (UPC 2016)
Deep Learning for Computer Vision: Segmentation (UPC 2016)Deep Learning for Computer Vision: Segmentation (UPC 2016)
Deep Learning for Computer Vision: Segmentation (UPC 2016)
 
Conditional Image Generation with PixelCNN Decoders
Conditional Image Generation with PixelCNN DecodersConditional Image Generation with PixelCNN Decoders
Conditional Image Generation with PixelCNN Decoders
 
Generative Models and Adversarial Training (D2L3 Insight@DCU Machine Learning...
Generative Models and Adversarial Training (D2L3 Insight@DCU Machine Learning...Generative Models and Adversarial Training (D2L3 Insight@DCU Machine Learning...
Generative Models and Adversarial Training (D2L3 Insight@DCU Machine Learning...
 
Joint unsupervised learning of deep representations and image clusters
Joint unsupervised learning of deep representations and image clustersJoint unsupervised learning of deep representations and image clusters
Joint unsupervised learning of deep representations and image clusters
 
Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...
Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...
Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...
 
Centernet
CenternetCenternet
Centernet
 
Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
Convolutional Neural Networks on Graphs with Fast Localized Spectral FilteringConvolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
 

Semelhante a Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017

Semelhante a Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017 (20)

Introduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable RenderingIntroduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable Rendering
 
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Point-GNN: Graph Neural Network for 3D Object Detection in a Point CloudPoint-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
 
Semantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImagerySemantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite Imagery
 
Point Cloud Stream on Spatial Mixed Reality: Toward Telepresence in Architect...
Point Cloud Stream on Spatial Mixed Reality: Toward Telepresence in Architect...Point Cloud Stream on Spatial Mixed Reality: Toward Telepresence in Architect...
Point Cloud Stream on Spatial Mixed Reality: Toward Telepresence in Architect...
 
Weakly supervised semantic segmentation of 3D point cloud
Weakly supervised semantic segmentation of 3D point cloudWeakly supervised semantic segmentation of 3D point cloud
Weakly supervised semantic segmentation of 3D point cloud
 
[3D勉強会@関東] Deep Reinforcement Learning of Volume-guided Progressive View Inpa...
[3D勉強会@関東] Deep Reinforcement Learning of Volume-guided Progressive View Inpa...[3D勉強会@関東] Deep Reinforcement Learning of Volume-guided Progressive View Inpa...
[3D勉強会@関東] Deep Reinforcement Learning of Volume-guided Progressive View Inpa...
 
3-d interpretation from single 2-d image for autonomous driving II
3-d interpretation from single 2-d image for autonomous driving II3-d interpretation from single 2-d image for autonomous driving II
3-d interpretation from single 2-d image for autonomous driving II
 
Best Techniques of Point cloud to 3D.pdf
Best Techniques of Point cloud to 3D.pdfBest Techniques of Point cloud to 3D.pdf
Best Techniques of Point cloud to 3D.pdf
 
FastV2C-HandNet - ICICC 2020
FastV2C-HandNet - ICICC 2020FastV2C-HandNet - ICICC 2020
FastV2C-HandNet - ICICC 2020
 
“Efficient Deep Learning for 3D Point Cloud Understanding,” a Presentation fr...
“Efficient Deep Learning for 3D Point Cloud Understanding,” a Presentation fr...“Efficient Deep Learning for 3D Point Cloud Understanding,” a Presentation fr...
“Efficient Deep Learning for 3D Point Cloud Understanding,” a Presentation fr...
 
Non-Photorealistic Rendering of 3D Point Clouds for Cartographic Visualization
Non-Photorealistic Rendering of 3D Point Clouds for Cartographic VisualizationNon-Photorealistic Rendering of 3D Point Clouds for Cartographic Visualization
Non-Photorealistic Rendering of 3D Point Clouds for Cartographic Visualization
 
TransNeRF
TransNeRFTransNeRF
TransNeRF
 
LiDAR-based Autonomous Driving III (by Deep Learning)
LiDAR-based Autonomous Driving III (by Deep Learning)LiDAR-based Autonomous Driving III (by Deep Learning)
LiDAR-based Autonomous Driving III (by Deep Learning)
 
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
 
Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...
Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...
Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...
 
Color and 3D Semantic Reconstruction of Indoor Scenes from RGB-D stream
Color and 3D Semantic Reconstruction of Indoor Scenes from RGB-D streamColor and 3D Semantic Reconstruction of Indoor Scenes from RGB-D stream
Color and 3D Semantic Reconstruction of Indoor Scenes from RGB-D stream
 
Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)
Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)
Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)
 
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph KernelsDDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
 
V2 v posenet
V2 v posenetV2 v posenet
V2 v posenet
 
lecture_16_jiajun.pdf
lecture_16_jiajun.pdflecture_16_jiajun.pdf
lecture_16_jiajun.pdf
 

Mais de Universitat Politècnica de Catalunya

Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in VideosGeneration of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Universitat Politècnica de Catalunya
 

Mais de Universitat Politècnica de Catalunya (20)

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Deep Generative Learning for All
Deep Generative Learning for AllDeep Generative Learning for All
Deep Generative Learning for All
 
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
 
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
Towards Sign Language Translation & Production | Xavier Giro-i-NietoTowards Sign Language Translation & Production | Xavier Giro-i-Nieto
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
 
The Transformer - Xavier Giró - UPC Barcelona 2021
The Transformer - Xavier Giró - UPC Barcelona 2021The Transformer - Xavier Giró - UPC Barcelona 2021
The Transformer - Xavier Giró - UPC Barcelona 2021
 
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
 
Open challenges in sign language translation and production
Open challenges in sign language translation and productionOpen challenges in sign language translation and production
Open challenges in sign language translation and production
 
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in VideosGeneration of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
 
Discovery and Learning of Navigation Goals from Pixels in Minecraft
Discovery and Learning of Navigation Goals from Pixels in MinecraftDiscovery and Learning of Navigation Goals from Pixels in Minecraft
Discovery and Learning of Navigation Goals from Pixels in Minecraft
 
Learn2Sign : Sign language recognition and translation using human keypoint e...
Learn2Sign : Sign language recognition and translation using human keypoint e...Learn2Sign : Sign language recognition and translation using human keypoint e...
Learn2Sign : Sign language recognition and translation using human keypoint e...
 
Intepretability / Explainable AI for Deep Neural Networks
Intepretability / Explainable AI for Deep Neural NetworksIntepretability / Explainable AI for Deep Neural Networks
Intepretability / Explainable AI for Deep Neural Networks
 
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
 
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
 
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
 
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
 
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
 
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
 
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
 
Curriculum Learning for Recurrent Video Object Segmentation
Curriculum Learning for Recurrent Video Object SegmentationCurriculum Learning for Recurrent Video Object Segmentation
Curriculum Learning for Recurrent Video Object Segmentation
 
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
 

Último

Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
gajnagarg
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
ranjankumarbehera14
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
nirzagarg
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
vexqp
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
HyderabadDolls
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 

Último (20)

Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat  9332606886Call Girls Advance Cash On Delivery Ser...Top Call Girls in Balaghat  9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
 
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 

Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017

  • 1. [course site] Javier Ruiz Hidalgo j.ruiz@upc.edu Associate Professor Universitat Politecnica de Catalunya Technical University of Catalonia 3D Day 4 Lecture 1 #DLUPC
  • 2. Acknowledgments 2 Belen Luque López CV Master student Alba Pujol Miró PhD Student
  • 3. Outline ● Motivation ● Point Clouds ● 3D datasets ● Deep Learning considerations ● Techniques ○ 2.5D ○ Voxelization ○ Projection ○ Direct ● Conclusions 3
  • 4. Motivation ● New / cheaper / smaller sensors to acquire 3D structure of the scene ○ Microsoft Kinect, Structure Sensor , Primesense Carmine ○ New datasets ● Virtual and Augmented reality applications Oculus rift Primesense Carmine Kinect 4
  • 5. Point clouds (1) ● Common representation for 3D data ● Collection of data points defined by a given coordinates system ● Represents surface of objects ● Usually in Cartesian Coordinate System ○ X,Y,Z coordinates for each point of the cloud 3D Point cloud of a Torus 5
  • 6. Point clouds (2) Extra data can be added for each point: ● Color (RGB) ● Orientation ● Curvature Example of a point cloud 6 List of unordered XYZ values
  • 7. Point clouds vs. Meshes ● A more complete 3D representation may include faces defined between points / vertices Mesh representing a dolphin Mesh representation as a list of face-Vertex 7
  • 8. 3D datasets: Classification ● Large Dataset of Object Scans ○ PrimeSense Carmine sensor ○ 10k scans ○ 43 objects 8
  • 9. 3D datasets: Pose estimation ● T-less: An RGB-D Dataset for 6D Pose Estimation of Texture-less Objects ○ Primesense Carmine, Kinect v2 and Canon IXUS 950 sensors ○ 38k (training) + 10k (test) scans ○ 30 objects + groundtruth pose 9
  • 10. 3D datasets: Segmentation ● Sematic3d ○ Velodyne LIDAR sensor ○ 30 scenes, 1 billion labelled points ○ 8 classes ● ScanNet ○ Structure sensor ○ 1.5k scenes, 2.5M views ○ 20 classes 10
  • 11. 3D datasets: Autonomous driving ● Cityscapes: semantic understanding of urban street scenes ○ Stereo cameras ○ 5 cities, 20k images ○ 20 classes (instances) 11
  • 12. 3D datasets: Scene understanding (1) ● SUN RGB-D: A RGB-D Scene Understanding Benchmark Suite ○ Kinect sensor ○ 10k scans 12
  • 13. 3D datasets: Scene understanding (2) ● Stanford 2D-3D-Semantics dataset ○ Kinect sensor ○ 70k scans, 6 areas with over 6000 m2 ○ 13 classes 13
  • 14. Deep learning from 3D point clouds? (1) There are several challenges when using 3D point clouds in a deep learning framework: 1. Undefined neighborhood Image neighbours are easily defined by their connectivity 14 Point cloud neighbours need to be explicitly defined (Euclidean distance)
  • 15. Deep learning from 3D point clouds? (2) There are several challenges when using 3D point clouds in a deep learning framework: 2. No lattice (convolution layers?) 15
  • 16. Deep learning from 3D point clouds? (3) There are several challenges when using 3D point clouds in a deep learning framework: 3. Different density 16 Velodyne LIDAR data
  • 17. Techniques 17 RGB-D / 2.5D Data Voxelization Multiview Projection Point clouds
  • 18. RGB-D / 2.5D data (1) Use depth as RGB + Depth (RGB-D) images ● Very common and used in the deep learning literature ○ RAW data from Kinect / Structure / Primesense sensors ● Multiple applications (classification, gesture recognition, semantic segmentation, etc.) 18 RGB DEPTH POINT CLOUD
  • 19. RGB-D / 2.5D data (2) ● Straight-forward solution → include depth as a new channel (RGBD input) 19
  • 20. RGB-D / 2.5D data (3) ● However, better results are obtained when depth is incorporated as a two-stream network 20 N. Audebert, et al. Fusion of heterogeneous data in convolutional networks for urban semantic labeling, JURSE 2017
  • 21. Voxelization ● Discretize 3D space with occupancy grid ● Use 3D convolutional layers ● Difficult to define a voxel size for all applications (density of point clouds) ● Memory requirements 21 semantic3d.net Point cloud Voxel occupancy grid
  • 22. Voxelization: Examples 22Z. Wu, et al. 3D ShapeNets: A Deep Representation for Volumetric Shapes, CVPR 2015 D. Maturana, et al. VoxNet: A 3D Convolutional Neural Network for real-time object recognition, IROS 2015 3D ShapeNets VoxNet
  • 23. Projection ● Instead of using the point cloud directly or voxelize it, project back the point cloud into 1 (single) or several (multi-view) images ● Use the projected images as input tensors for the network 23
  • 24. Correspondence matching ● Find correspondent 3D points between two point clouds Classify both projections as correspondences or not (binary classification) Projection: Example (1) 24 Point cloud 1 Point cloud 2 Project neighbouring points into principal plane for the two candidates A Pujol, J.R. Casas, J. Ruiz-Hidalgo, (Work in progress)
  • 25. Projection: Example (2) Multiview projections 25 H. Su, et al. Multi-view Convolutional Neural Networks for 3D Shape Recognition, ICCV, 2015
  • 26. Voxelization vs. Projection (1) Experiments indicate that projection based multiview CNNs perform better than voxelized volumetric representations ● Accuracy in object classification 26 C. R. Qi, et al. Volumetric and Multi-View CNNs for Object Classification on 3D Data, CVPR 2016
  • 27. Voxelization vs. Projection (2) Experiments indicate that projection based multiview CNNs perform better than voxelized volumetric representations ● New architectures are closing the gap 27 C. R. Qi, et al. Volumetric and Multi-View CNNs for Object Classification on 3D Data, CVPR 2016
  • 28. Point clouds Is it possible to use point clouds directly in a learning framework? 28
  • 29. PointNet (1) Novel deep learning architecture that directly consumes point clouds 29 C. R. Qi, et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation, CVPR 2017
  • 30. PointNet (2) Input is an unordered list of XYZ coordinates (point cloud) 30 C. R. Qi, et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation, CVPR 2017 Mini-network to learn transformation matrix to make the network invariant to rotation, translation and scale Similar idea to learn a transformation in the feature space to align features
  • 31. PointNet (3) Classification results 31 C. R. Qi, et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation, CVPR 2017
  • 32. Conclusions ● Point clouds rich representation of 3D data ● Working with point clouds on deep learning frameworks is a challenge due to the unorganized nature of point clouds ● Techniques ○ RGB-D / 2.5D data ○ Voxelization ○ Multi-view projections ○ Point clouds as input gaining momentum 32