Connecting the Dots 2015
Tuesday Meeting
Tim Head
École Polytechnique Fédérale de Lausanne
24 March 2015
Question: What is pattern recognition in sparsely sampled data?
Obvious answer: Track reconstruction!
Interesting answer: Computer vision, track reconstruction, space object tracking, face recognition, jet reconstruction, self-driving cars, "Ok, Google ..."
Tim Head (EPFL) 24 March 2015 2
© BerkeleyLab
• A new conference series, this time in Berkeley
• February 2015
• Check the agenda for lots of interesting talks
• (the views are amazing)
What is the future of fast track finding for trigger applications beyond ATLAS and CMS Phase II Upgrade?
1. Is an aggressive R&D in this field sufficiently motivated?
2. Which are the most promising directions we should explore?
   1. Associative Memory ASICs vs. FPGAs
   2. Retina/Hough transform
   3. Tracklets
   4. Cellular Automata
   5. GPUs
   6. Commercial CPUs
   7. ...
Where charm leads, beauty goes. Followed by the Higgs.
Luciano Ristori
Is an aggressive R&D in this field sufficiently motivated? An example
• In the post-Higgs era, in the absence of new physics, the key to progress in our field will be precision measurements
• The HL-LHC at 10^35 will produce ~10^14 Beauty and Charm decays/year. If we can harvest most of them we could bring the precision of CP violation measurements in rare decays from the present ~10^-2 to below ~10^-4
• To do this we will need to change the way we perform experiments
• 10^14 x 1 MB = 10^20 bytes = 10^5 PB/year -> No way!
• We need to read out the detector for every single crossing, perform an almost complete analysis in real time and retain only the information relevant to the process of interest (e.g. few tracks involved in the decay)
• This involves finding all tracks down to low momentum, identifying decay vertices, computing invariant masses ... the complexity of this problem is 10-100 times worse than what we are now trying to solve for CMS Phase II
• 10^14 x 1 KB = 100 PB/year -> Possible!
To stay ahead, we need completely new ideas.
Luciano Ristori
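The storage arithmetic on this slide can be checked in a few lines. This is only a back-of-the-envelope sketch: the decimal unit sizes (1 KB = 10^3 bytes, 1 MB = 10^6 bytes, 1 PB = 10^15 bytes) and the function name `yearly_volume_pb` are assumptions made here, not part of the talk.

```python
# Back-of-the-envelope check of the yearly data volume.
# Assumption: decimal units (1 KB = 10**3 B, 1 MB = 10**6 B, 1 PB = 10**15 B).
KB = 10**3
MB = 10**6
PB = 10**15

DECAYS_PER_YEAR = 10**14  # beauty and charm decays per year, as quoted on the slide


def yearly_volume_pb(bytes_per_decay: int, decays: int = DECAYS_PER_YEAR) -> int:
    """Return the stored volume per year in petabytes (integer arithmetic)."""
    return decays * bytes_per_decay // PB


if __name__ == "__main__":
    print("full event, 1 MB each:", yearly_volume_pb(MB), "PB/year")
    print("reduced record, 1 KB each:", yearly_volume_pb(KB), "PB/year")
```

The factor of 1000 between the two cases comes entirely from the record size, which is why the slide argues for keeping only the few tracks involved in the decay.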
It is all About Representation
[Figure: scatter plot of the original data in the X-Y plane]
Separating black from white is hard work ...
[Figure: the same data in a one dimensional representation]
... until you learn about spherical co-ordinates.
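A minimal sketch of the idea, under stated assumptions: the data from the talk is not available, so the two concentric rings generated here are a made-up stand-in, and in two dimensions the "spherical co-ordinate" that matters reduces to the polar radius. All function names are hypothetical.

```python
import math
import random


def make_rings(n_per_class: int = 200, seed: int = 0):
    """Toy data (assumption): class 0 on an inner ring, class 1 on an outer ring."""
    rng = random.Random(seed)
    points, labels = [], []
    for label, ring_radius in ((0, 0.5), (1, 1.2)):
        for _ in range(n_per_class):
            angle = rng.uniform(0.0, 2.0 * math.pi)
            r = ring_radius + rng.uniform(-0.1, 0.1)  # small radial smearing
            points.append((r * math.cos(angle), r * math.sin(angle)))
            labels.append(label)
    return points, labels


def radius(point):
    """One dimensional representation: distance from the origin."""
    x, y = point
    return math.hypot(x, y)


def accuracy_of_cut(points, labels, cut: float) -> float:
    """Classify as class 1 when the radius exceeds `cut`; return the fraction correct."""
    correct = sum((radius(p) > cut) == bool(y) for p, y in zip(points, labels))
    return correct / len(labels)


if __name__ == "__main__":
    pts, ys = make_rings()
    print("accuracy of a single cut on r:", accuracy_of_cut(pts, ys, cut=0.85))
```

No straight line in (X, Y) separates the two rings, but after the change of representation a single threshold on the radius does.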
How Jets are Like YouTube
Jet Clustering 101: Detecting Jets
Michael Kagan
Jet Clustering 101: The HEP Problem at Hand
[Figure: jets labelled QCD]
Decay products of the W and Z all end up in the same jet.
Michael Kagan
N-subjettiness: HEP Approach to Boosted Particle Tagging
• “Substructure” techniques to analyze constituents of jet, e.g.
  – Is it a 1-prong, 2-prong, or 3-prong like decay?
  – Is the energy split evenly amongst “sub-jets”?
  – Many sub-structure related variables / algorithms
• Example substructure variable:
  – N-subjettiness τ21 = τ2 / τ1
  – Continuous version of subjet counting
• Example classification problem: separate a W boson jet from a QCD light jet
[Figure: normalised distributions of τ21 for QCD jets and W jets. ATLAS Simulation Preliminary, √s = 8 TeV, trimmed anti-kt jets with R = 1.0, |η^TRUTH| < 1.2, 200 < pT^TRUTH < 350 GeV, M^RECO window]
N-subjettiness: after a lot of thinking, cook up a variable that can separate QCD from W jets.
Michael Kagan
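To make the ratio τ21 = τ2 / τ1 concrete, here is a small sketch using the commonly quoted definition τ_N = Σ_k pT,k · min_j ΔR(j, k) / (R0 · Σ_k pT,k). The slide does not spell this formula out, so treat it as an assumption. The candidate subjet axes are passed in by hand here, whereas a real analysis finds them with a clustering or minimisation step that is not shown. The function names and the choice R0 = 1.0 are hypothetical.

```python
import math


def delta_r(eta1, phi1, eta2, phi2):
    """Angular distance in the (eta, phi) plane, with the phi difference wrapped to [-pi, pi)."""
    dphi = (phi1 - phi2 + math.pi) % (2.0 * math.pi) - math.pi
    return math.hypot(eta1 - eta2, dphi)


def tau_n(constituents, axes, r0=1.0):
    """N-subjettiness for N = len(axes).

    constituents: list of (pt, eta, phi) tuples
    axes:         list of (eta, phi) candidate subjet axes (assumed given)
    """
    d0 = r0 * sum(pt for pt, _, _ in constituents)
    total = sum(
        pt * min(delta_r(eta, phi, a_eta, a_phi) for a_eta, a_phi in axes)
        for pt, eta, phi in constituents
    )
    return total / d0


if __name__ == "__main__":
    # Toy two-prong jet: two hard cores, each with a soft neighbour.
    jet = [(40.0, 0.3, 0.0), (10.0, 0.35, 0.05), (40.0, -0.3, 0.0), (10.0, -0.35, -0.05)]
    tau1 = tau_n(jet, axes=[(0.0, 0.0)])
    tau2 = tau_n(jet, axes=[(0.3, 0.0), (-0.3, 0.0)])
    print("tau21 =", tau2 / tau1)
```

A small τ21 means the jet is described much better by two axes than by one, which is the two-prong signature expected of a W jet.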
N-subjettiness: The Jet-Image
• Jets built from calorimeter towers
• Build NxN grid of towers containing the jet (here 25x25)
• The Jet-Image → calorimeter towers like pixels in image!
[Figure: example jet from a W → qq’ decay, shown as a jet and as a jet-image]
Calorimeter towers are like the pixels of an image.
Michael Kagan
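One way to picture the jet-image construction is as a two dimensional histogram centred on the jet axis. The sketch below is an illustration only: the pixel size of 0.1 x 0.1 in (η, φ), the use of pT as pixel intensity and the function name `jet_image` are assumptions, and only the 25x25 grid size comes from the slide.

```python
import math


def jet_image(constituents, jet_eta, jet_phi, n_pixels=25, half_width=1.25):
    """Bin (pt, eta, phi) constituents into an n_pixels x n_pixels grid centred on the jet axis.

    Rows index phi, columns index eta; each pixel holds the summed pt.
    """
    image = [[0.0] * n_pixels for _ in range(n_pixels)]
    pixel = 2.0 * half_width / n_pixels
    for pt, eta, phi in constituents:
        d_eta = eta - jet_eta
        d_phi = (phi - jet_phi + math.pi) % (2.0 * math.pi) - math.pi  # wrap to [-pi, pi)
        col = math.floor((d_eta + half_width) / pixel)
        row = math.floor((d_phi + half_width) / pixel)
        if 0 <= row < n_pixels and 0 <= col < n_pixels:
            image[row][col] += pt  # constituents outside the window are dropped
    return image


if __name__ == "__main__":
    towers = [(30.0, 0.0, 0.0), (20.0, 0.3, 0.0), (5.0, 3.0, 0.0)]
    img = jet_image(towers, jet_eta=0.0, jet_phi=0.0)
    print("centre pixel:", img[12][12], "total in image:", sum(map(sum, img)))
```

Flattening the 25x25 grid gives the 625-element vector that the later slides feed into the discriminant.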
N-subjettiness: Class Averages
[Figure: two jet-image panels, "Average W jet" and "Average Light jet from QCD", with axes Q1 and Q2 and a logarithmic cell coefficient scale from 10^-9 to 10^-1]
How can we extract the important features?
How can we convert this into discrimination power?
After some preprocessing, there is a difference!
Michael Kagan
N-subjettiness: Fisher Discriminant
• Finds direction that maximizes between-class scatter / within-class scatter
  – Extract “most important” feature, a, for discrimination for this metric
  – This can be written as a “Generalized” eigenvalue problem
• If data is high dimensional, e.g. 625 elements, then S_t has a huge number of independent components, e.g. 192,495!
  – Not enough data to build full rank matrix → Must regularize!
  – Details of analytic solution: Z. Zhang et al., Regularized Discriminant Analysis, Ridge Regression and Beyond, Journal of Machine Learning Research 11 (2010) 2199-2228
A complicated way of saying ...
Michael Kagan
Fisher's Linear Discriminant
[Figure: two dimensional scatter plot of the data]
Find an axis along which we can separate the data.
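For two classes the Fisher direction has a closed form, w ∝ S_w^-1 (m1 - m0), where S_w is the within-class scatter and m0, m1 are the class means. The sketch below works in two dimensions so the matrix inverse can be written by hand. The small `ridge` term added to the diagonal is a simple stand-in for regularisation and is an assumption here, not the regularised method of Zhang et al. The toy points are made up.

```python
import math


def mean(points):
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)


def scatter(points, m):
    """Entries (a, b, d) of the symmetric 2x2 scatter matrix [[a, b], [b, d]]."""
    a = sum((p[0] - m[0]) ** 2 for p in points)
    b = sum((p[0] - m[0]) * (p[1] - m[1]) for p in points)
    d = sum((p[1] - m[1]) ** 2 for p in points)
    return a, b, d


def fisher_direction(class0, class1, ridge=1e-6):
    """Unit vector along S_w^-1 (m1 - m0), with `ridge` added to the diagonal of S_w."""
    m0, m1 = mean(class0), mean(class1)
    a0, b0, d0 = scatter(class0, m0)
    a1, b1, d1 = scatter(class1, m1)
    a, b, d = a0 + a1 + ridge, b0 + b1, d0 + d1 + ridge
    det = a * d - b * b
    dx, dy = m1[0] - m0[0], m1[1] - m0[1]
    # Inverse of [[a, b], [b, d]] is (1 / det) * [[d, -b], [-b, a]].
    wx = (d * dx - b * dy) / det
    wy = (-b * dx + a * dy) / det
    norm = math.hypot(wx, wy)
    return wx / norm, wy / norm


def project(point, w):
    """Position of a point along the axis w."""
    return point[0] * w[0] + point[1] * w[1]


if __name__ == "__main__":
    # Toy classes: wide in x, narrow in y, offset from each other by (1, 1).
    c0 = [(-2.0, -0.1), (0.0, 0.1), (2.0, -0.1), (0.0, 0.1)]
    c1 = [(-1.0, 0.9), (1.0, 1.1), (3.0, 0.9), (1.0, 1.1)]
    w = fisher_direction(c0, c1)
    print("axis:", w)
    print("class 0 projections:", [round(project(p, w), 3) for p in c0])
    print("class 1 projections:", [round(project(p, w), 3) for p in c1])
```

The axis ends up almost parallel to y, the direction in which the classes are narrow, rather than along the line joining the two means.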
Performance
[Figure: background rejection versus signal efficiency [%] for Fisher-Jet and N-subjettiness (τ2/τ1)]
We did not have to think long and hard about a variable, and are competitive!
Michael Kagan
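A point on a curve like this can be computed from classifier scores as follows. This is a sketch under one assumption that the slide does not state: background rejection is taken to mean 1 / (background efficiency). The scores and the function name are hypothetical.

```python
def background_rejection(signal_scores, background_scores, signal_efficiency):
    """Background rejection (assumed: 1 / background efficiency) at a target signal efficiency.

    Events are selected when their score is >= a threshold chosen so that the
    requested fraction of signal events passes.
    """
    ranked = sorted(signal_scores, reverse=True)
    n_keep = max(1, round(signal_efficiency * len(ranked)))
    threshold = ranked[n_keep - 1]
    n_bkg_pass = sum(score >= threshold for score in background_scores)
    if n_bkg_pass == 0:
        return float("inf")  # no background survives the cut
    return len(background_scores) / n_bkg_pass


if __name__ == "__main__":
    sig = [0.9, 0.8, 0.7, 0.6]
    bkg = [0.75, 0.65, 0.3, 0.2, 0.1, 0.05, 0.02, 0.01]
    for eff in (0.75, 1.0):
        print(f"signal efficiency {eff:.0%}: rejection {background_rejection(sig, bkg, eff)}")
```

Scanning the target efficiency from 0 to 1 traces out the whole curve, so two taggers can be compared at the same working point.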
Computer Vision Applied Blindly
• By mapping concepts from images to jets you gain access to well studied CV techniques
• No need to think up "clever" variables a priori: a flexible method!
• Computers can discover good ways to represent the data "by themselves"
• Fisher's Linear Discriminant was state of the art in 1997, things have moved on since then!
What About YouTube?
Let a computer watch YouTube and
it will learn that cats are a useful
thing (variable) to know about.
The automatic physicist?
Deep Learning: detecting the Higgs boson
A two-class supervised learning problem: Higgs production vs. primary background
Machine learning classifier:
∙ 28 features
  ∙ 21 low-level features
  ∙ 7 high-level features derived by physicists
∙ 10M simulated collisions for training (50% each)
∙ 500k validation set
∙ 500k test set
Do the seven high level variables help?
Peter Sadowski
Deep Learning: detecting the Higgs boson
∙ Current approach: shallow models
  ∙ Boosted decision trees* (BDT)
  ∙ Shallow neural networks (NN)
∙ Our approach: deep neural networks (DNN)
[Figure: BDT, NN and DNN shown side by side]
*Used for Higgs discovery in 2012
Things we knew in the 80s have finally started working!
Peter Sadowski
Deep Learning: deep learning for particle collider data analysis
Motivated by successes of deep learning in vision and speech.
∙ Huge progress on benchmark supervised learning tasks
∙ Replacement of engineered features with learned features
[Figure: engineered features compared with learned features]
Deep Neural Networks can learn better representations of the data without human input.
Peter Sadowski
Deep Learning: detecting the Higgs boson
Area Under ROC Curve for Test Set
Technique   Low-level features   All features
BDT         0.73                 0.81
NN          0.733 (0.007)        0.816 (0.004)
DNN         0.880 (0.001)        0.885 (0.002)
Deep learning improves AUC by 8% over shallow methods.
Deep learning does not require engineered features.
Baldi et al, Nature Communications 2014
No, adding high level features does not improve performance.
Peter Sadowski
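For readers unfamiliar with the metric in the table: the area under the ROC curve equals the probability that a randomly chosen signal event scores higher than a randomly chosen background event. The sketch below computes it by brute force over all pairs, which is fine for illustration but not how one would do it on millions of events. The scores are invented. The last lines show one possible reading of the 8% figure, comparing the DNN and NN values in the "All features" column, and that reading is an assumption.

```python
def auc(signal_scores, background_scores):
    """Area under the ROC curve from pairwise comparisons (ties count one half)."""
    wins = 0.0
    for s in signal_scores:
        for b in background_scores:
            if s > b:
                wins += 1.0
            elif s == b:
                wins += 0.5
    return wins / (len(signal_scores) * len(background_scores))


def relative_gain(new, old):
    """Fractional improvement of `new` over `old`."""
    return (new - old) / old


if __name__ == "__main__":
    print("toy AUC:", auc([0.9, 0.8, 0.4], [0.7, 0.3, 0.2]))
    # Values taken from the table above (all features): DNN 0.885, NN 0.816.
    print("relative gain, DNN vs NN:", round(relative_gain(0.885, 0.816), 3))
```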
Nice, ... what does all this have to do with LHCb?
The Physics Equivalent of the Cat
What variables does a NN learn when you show it physics? We should find out!
Learn Expensive Parts of the Simulation
Detecting the Higgs boson
Mean Squared Error of networks trained to compute 7 high-level features from 21 low-level features.
Technique           Feature Regression MSE
Linear Regression   0.1468
NN                  0.0885
DNN 3 layers        0.0821
DNN 4 layers        0.0818
DNN 5 layers        0.0815
DNN 6 layers        0.0812
High-level features easier to learn with deep nets
Use a NN with multiple regression outputs to learn a fast simulation of some parts of the simulation?
Peter Sadowski
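As a reminder of what the table measures, here is a sketch of a mean squared error for a regression with several outputs per event. The slide does not say whether the quoted MSE is averaged over the 7 features or normalised in some other way, so the plain average over all events and outputs used here is an assumption, as is the function name.

```python
def mean_squared_error(predictions, targets):
    """MSE averaged over all events and all regression outputs.

    predictions, targets: lists of equal-length rows, one row per event.
    """
    total, count = 0.0, 0
    for pred_row, target_row in zip(predictions, targets):
        for p, t in zip(pred_row, target_row):
            total += (p - t) ** 2
            count += 1
    return total / count


if __name__ == "__main__":
    preds = [[1.0, 2.0], [3.0, 4.0]]
    truth = [[1.0, 0.0], [3.0, 2.0]]
    print("MSE:", mean_squared_error(preds, truth))
```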
Isolation or Flavour Tagging
Can we use "jets-are-like-images" ideas for this?
Visualisation
[Figure: two dimensional scatter of points, each drawn as its class label, a digit from 0 to 9]
t-SNE projecting a 64 dimensional space into 2D, without using labels.
The End
• It is all about representation.
• A small conference with an unusual mix of attendees. Check the agenda for more on traditional tracking, etc.
• LHCb is leading the way when it comes to "real time" tracking, others are following.
• To stay ahead of the other experiments we should investigate these new ML tools.

Mais conteúdo relacionado

Destaque

No touch porfis de fernando jose duarte tipton
No touch porfis de fernando jose duarte tiptonNo touch porfis de fernando jose duarte tipton
No touch porfis de fernando jose duarte tipton12345tyuiop
 
P ppresentation
P ppresentationP ppresentation
P ppresentationpastuhhov
 
Responsible Innovation for the pursuit of Sustainability
Responsible Innovation for the pursuit of SustainabilityResponsible Innovation for the pursuit of Sustainability
Responsible Innovation for the pursuit of SustainabilityRene Von schomberg
 
INTS2301_Portfolio
INTS2301_PortfolioINTS2301_Portfolio
INTS2301_PortfolioJordan Lee
 
Data Science at LHCb
Data Science at LHCbData Science at LHCb
Data Science at LHCbTimothy Head
 
Responsible Innovation: A Paradigm Shift in Innovation Policy
Responsible Innovation: A Paradigm Shift in Innovation PolicyResponsible Innovation: A Paradigm Shift in Innovation Policy
Responsible Innovation: A Paradigm Shift in Innovation PolicyRene Von schomberg
 
Track Finding in LHCb's 2020 Trigger
Track Finding in LHCb's 2020 TriggerTrack Finding in LHCb's 2020 Trigger
Track Finding in LHCb's 2020 TriggerTimothy Head
 
Nozzle dryers...
Nozzle dryers...Nozzle dryers...
Nozzle dryers...ridhamridz
 
Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts'; Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts'; Rene Von schomberg
 
modified starch
modified starchmodified starch
modified starchridhamridz
 

Destaque (12)

Reusable science
Reusable scienceReusable science
Reusable science
 
No touch porfis de fernando jose duarte tipton
No touch porfis de fernando jose duarte tiptonNo touch porfis de fernando jose duarte tipton
No touch porfis de fernando jose duarte tipton
 
P ppresentation
P ppresentationP ppresentation
P ppresentation
 
Responsible Innovation for the pursuit of Sustainability
Responsible Innovation for the pursuit of SustainabilityResponsible Innovation for the pursuit of Sustainability
Responsible Innovation for the pursuit of Sustainability
 
INTS2301_Portfolio
INTS2301_PortfolioINTS2301_Portfolio
INTS2301_Portfolio
 
Data Science at LHCb
Data Science at LHCbData Science at LHCb
Data Science at LHCb
 
Responsible Innovation: A Paradigm Shift in Innovation Policy
Responsible Innovation: A Paradigm Shift in Innovation PolicyResponsible Innovation: A Paradigm Shift in Innovation Policy
Responsible Innovation: A Paradigm Shift in Innovation Policy
 
SC Historic Cemeteries
SC Historic CemeteriesSC Historic Cemeteries
SC Historic Cemeteries
 
Track Finding in LHCb's 2020 Trigger
Track Finding in LHCb's 2020 TriggerTrack Finding in LHCb's 2020 Trigger
Track Finding in LHCb's 2020 Trigger
 
Nozzle dryers...
Nozzle dryers...Nozzle dryers...
Nozzle dryers...
 
Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts'; Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts';
 
modified starch
modified starchmodified starch
modified starch
 

Semelhante a Tim connecting-the-dots

Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713Mathieu DESPRIEE
 
Big Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloBig Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloOCTO Technology
 
6 data envelopment_analysis
6 data envelopment_analysis6 data envelopment_analysis
6 data envelopment_analysisFEG
 
Balancing Infrastructure with Optimization and Problem Formulation
Balancing Infrastructure with Optimization and Problem FormulationBalancing Infrastructure with Optimization and Problem Formulation
Balancing Infrastructure with Optimization and Problem FormulationAlex D. Gaudio
 
Big-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunitiesBig-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunities台灣資料科學年會
 
Kusto (Azure Data Explorer) Training for R&D - January 2019
Kusto (Azure Data Explorer) Training for R&D - January 2019 Kusto (Azure Data Explorer) Training for R&D - January 2019
Kusto (Azure Data Explorer) Training for R&D - January 2019 Tal Bar-Zvi
 
深度學習在AOI的應用
深度學習在AOI的應用深度學習在AOI的應用
深度學習在AOI的應用CHENHuiMei
 
A Cloud-Based Lab Management and Analytics Software for Triangulated Human-Ce...
A Cloud-Based Lab Management and Analytics Software for Triangulated Human-Ce...A Cloud-Based Lab Management and Analytics Software for Triangulated Human-Ce...
A Cloud-Based Lab Management and Analytics Software for Triangulated Human-Ce...Pierre-Majorique Léger
 
Tackling Open Images Challenge (2019)
Tackling Open Images Challenge (2019)Tackling Open Images Challenge (2019)
Tackling Open Images Challenge (2019)Hiroto Honda
 
Big learning 1.2
Big learning   1.2Big learning   1.2
Big learning 1.2Mohit Garg
 
2015 Bio-IT Trends From the Trenches
2015 Bio-IT Trends From the Trenches2015 Bio-IT Trends From the Trenches
2015 Bio-IT Trends From the TrenchesChris Dagdigian
 
Some "challenges" on the open-source/open-data front
Some "challenges" on the open-source/open-data frontSome "challenges" on the open-source/open-data front
Some "challenges" on the open-source/open-data frontGreg Landrum
 
AI4SE: Challenges and opportunities in the integration of Systems Engineering...
AI4SE: Challenges and opportunities in the integration of Systems Engineering...AI4SE: Challenges and opportunities in the integration of Systems Engineering...
AI4SE: Challenges and opportunities in the integration of Systems Engineering...CARLOS III UNIVERSITY OF MADRID
 
230208 MLOps Getting from Good to Great.pptx
230208 MLOps Getting from Good to Great.pptx230208 MLOps Getting from Good to Great.pptx
230208 MLOps Getting from Good to Great.pptxArthur240715
 
Microservices 101: From DevOps to Docker and beyond
Microservices 101: From DevOps to Docker and beyondMicroservices 101: From DevOps to Docker and beyond
Microservices 101: From DevOps to Docker and beyondDonnie Berkholz
 
Functional Leap of Faith (Keynote at JDay Lviv 2014)
Functional Leap of Faith (Keynote at JDay Lviv 2014)Functional Leap of Faith (Keynote at JDay Lviv 2014)
Functional Leap of Faith (Keynote at JDay Lviv 2014)Tomer Gabel
 
This Helix Nebula Science Cloud Pilot Phase Open Session
This Helix Nebula Science Cloud Pilot Phase Open SessionThis Helix Nebula Science Cloud Pilot Phase Open Session
This Helix Nebula Science Cloud Pilot Phase Open SessionHelix Nebula The Science Cloud
 
Feature Engineering
Feature Engineering Feature Engineering
Feature Engineering odsc
 

Semelhante a Tim connecting-the-dots (20)

Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
 
Big Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloBig Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao Paulo
 
6 data envelopment_analysis
6 data envelopment_analysis6 data envelopment_analysis
6 data envelopment_analysis
 
Balancing Infrastructure with Optimization and Problem Formulation
Balancing Infrastructure with Optimization and Problem FormulationBalancing Infrastructure with Optimization and Problem Formulation
Balancing Infrastructure with Optimization and Problem Formulation
 
Big-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunitiesBig-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunities
 
Kusto (Azure Data Explorer) Training for R&D - January 2019
Kusto (Azure Data Explorer) Training for R&D - January 2019 Kusto (Azure Data Explorer) Training for R&D - January 2019
Kusto (Azure Data Explorer) Training for R&D - January 2019
 
Deeplearning in finance
Deeplearning in financeDeeplearning in finance
Deeplearning in finance
 
深度學習在AOI的應用
深度學習在AOI的應用深度學習在AOI的應用
深度學習在AOI的應用
 
A Cloud-Based Lab Management and Analytics Software for Triangulated Human-Ce...
A Cloud-Based Lab Management and Analytics Software for Triangulated Human-Ce...A Cloud-Based Lab Management and Analytics Software for Triangulated Human-Ce...
A Cloud-Based Lab Management and Analytics Software for Triangulated Human-Ce...
 
Tackling Open Images Challenge (2019)
Tackling Open Images Challenge (2019)Tackling Open Images Challenge (2019)
Tackling Open Images Challenge (2019)
 
Big learning 1.2
Big learning   1.2Big learning   1.2
Big learning 1.2
 
2015 Bio-IT Trends From the Trenches
2015 Bio-IT Trends From the Trenches2015 Bio-IT Trends From the Trenches
2015 Bio-IT Trends From the Trenches
 
Some "challenges" on the open-source/open-data front
Some "challenges" on the open-source/open-data frontSome "challenges" on the open-source/open-data front
Some "challenges" on the open-source/open-data front
 
Fa19_P1.pptx
Fa19_P1.pptxFa19_P1.pptx
Fa19_P1.pptx
 
AI4SE: Challenges and opportunities in the integration of Systems Engineering...
AI4SE: Challenges and opportunities in the integration of Systems Engineering...AI4SE: Challenges and opportunities in the integration of Systems Engineering...
AI4SE: Challenges and opportunities in the integration of Systems Engineering...
 
230208 MLOps Getting from Good to Great.pptx
230208 MLOps Getting from Good to Great.pptx230208 MLOps Getting from Good to Great.pptx
230208 MLOps Getting from Good to Great.pptx
 
Microservices 101: From DevOps to Docker and beyond
Microservices 101: From DevOps to Docker and beyondMicroservices 101: From DevOps to Docker and beyond
Microservices 101: From DevOps to Docker and beyond
 
Functional Leap of Faith (Keynote at JDay Lviv 2014)
Functional Leap of Faith (Keynote at JDay Lviv 2014)Functional Leap of Faith (Keynote at JDay Lviv 2014)
Functional Leap of Faith (Keynote at JDay Lviv 2014)
 
This Helix Nebula Science Cloud Pilot Phase Open Session
This Helix Nebula Science Cloud Pilot Phase Open SessionThis Helix Nebula Science Cloud Pilot Phase Open Session
This Helix Nebula Science Cloud Pilot Phase Open Session
 
Feature Engineering
Feature Engineering Feature Engineering
Feature Engineering
 

Último

GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Silpa
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Silpa
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfSumit Kumar yadav
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Monika Rani
 
chemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdfchemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdfTukamushabaBismark
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Servicemonikaservice1
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceAlex Henderson
 
Introduction to Viruses
Introduction to VirusesIntroduction to Viruses
Introduction to VirusesAreesha Ahmad
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsOrtegaSyrineMay
 

Último (20)

GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...

Tim connecting-the-dots

  • 1. Connecting the Dots 2015 Tuesday Meeting Tim Head École Polytechnique Fédérale de Lausanne 24 March 2015
  • 2. Question: What is pattern recognition in sparsely sampled data? Obvious answer: Track reconstruction! Interesting answer: Computer vision, track reconstruction, space object tracking, face recognition, jet reconstruction, self driving cars, ''Ok, Google ...'' Tim Head (EPFL) 24 March 2015 2
  • 3. © BerkeleyLab • A new conference series, this time in Berkeley • February 2015 • Check the agenda for lots of interesting talks • (the views are amazing)
  • 4. What is the future of fast track finding for trigger applications beyond the ATLAS and CMS Phase II Upgrade? 1. Is an aggressive R&D in this field sufficiently motivated? 2. Which are the most promising directions we should explore? 1. Associative Memory ASICs vs. FPGAs 2. Retina/Hough transform 3. Tracklets 4. Cellular Automata 5. GPUs 6. Commercial CPUs 7. ..... Where charm leads, beauty goes. Followed by the Higgs. (Luciano Ristori)
  • 5. Is an aggressive R&D in this field sufficiently motivated? An example • In the post-Higgs era, in the absence of new physics, the key to progress in our field will be precision measurements • The HL-LHC at 10^35 will produce ~10^14 Beauty and Charm decays/year. If we can harvest most of them we could bring the precision of CP violation measurement in rare decays from the present ~10^-2 to below ~10^-4 • To do this we will need to change the way we perform experiments • 10^14 x 1 MB = 10^20 bytes = 10^5 PB/year -> No way! • We need to read out the detector for every single crossing, perform an almost complete analysis in real time and retain only the information relevant to the process of interest (e.g. the few tracks involved in the decay) • This involves finding all tracks down to low momentum, identifying decay vertices, computing invariant masses ... the complexity of this problem is 10-100 times worse than what we are now trying to solve for CMS Phase II • 10^14 x 1 KB = 100 PB/year -> Possible! To stay ahead, we need completely new ideas. (Luciano Ristori)
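The storage arithmetic on this slide can be checked with a few lines of Python. This is only a sketch: it assumes decimal units (1 MB = 10^6 bytes, 1 PB = 10^15 bytes), and the helper name `yearly_volume_pb` is made up for illustration.

```python
# Back-of-the-envelope check of the yearly data volume quoted on the slide.
# Assumption: decimal units, i.e. 1 KB = 10**3 bytes, 1 MB = 10**6 bytes, 1 PB = 10**15 bytes.
DECAYS_PER_YEAR = 10**14
BYTES_PER_PB = 10**15


def yearly_volume_pb(bytes_per_decay: int) -> float:
    """Return the storage needed per year, in petabytes, for a given size per decay."""
    return DECAYS_PER_YEAR * bytes_per_decay / BYTES_PER_PB


full_event = yearly_volume_pb(10**6)  # keep the full 1 MB event
reduced = yearly_volume_pb(10**3)     # keep only about 1 KB of relevant information

print(f"1 MB per decay: {full_event:.0f} PB/year")
print(f"1 KB per decay: {reduced:.0f} PB/year")
```

Shrinking what is kept per decay by a factor of 1000 is what moves the total from 10^5 PB/year to 100 PB/year.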
  • 6. It is all About Representation [Figure: scatter plot "Original Data", axes X and Y] Separating black from white is hard work ...
  • 7. It is all About Representation [Figure: scatter plot "Original Data", axes X and Y, next to a "One dimensional representation" of the same points] Separating black from white is hard work ... until you learn about spherical coordinates.
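A minimal sketch of the representation idea, using invented toy data rather than the data from the slide: two classes are drawn as concentric rings, which no straight line in (X, Y) separates, yet a single threshold on the radius separates them. The ring radii, noise level and threshold are all assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)


def ring(radius, n):
    """Draw n points scattered around a circle of the given radius (toy data)."""
    angle = rng.uniform(0.0, 2.0 * np.pi, n)
    r = radius + rng.normal(0.0, 0.05, n)
    return np.column_stack([r * np.cos(angle), r * np.sin(angle)])


# Two classes: an inner ring (label 0) and an outer ring (label 1).
points = np.vstack([ring(0.5, 200), ring(1.2, 200)])
labels = np.concatenate([np.zeros(200, dtype=int), np.ones(200, dtype=int)])

# Change of representation: keep only the distance from the origin.
radius = np.hypot(points[:, 0], points[:, 1])

# In the one-dimensional representation a single cut does the job.
predicted = (radius > 0.85).astype(int)
accuracy = (predicted == labels).mean()
print(f"accuracy with a cut on the radius: {accuracy:.2f}")
```

The classifier itself is trivial here; all the work is done by choosing a coordinate in which the classes no longer overlap.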
  • 8. How Jets are Like YouTube
  • 9. Jet Clustering 101: Detecting Jets (Michael Kagan)
  • 10. Jet Clustering 101: The HEP Problem at Hand [Figure: jets labelled QCD] Decay products of the W and Z all end up in the same jet. (Michael Kagan)
  • 11. N-subjettiness: HEP Approach to Boosted Particle Tagging • “Substructure” techniques to analyze constituents of jet, e.g. – Is it a 1-prong, 2-prong, or 3-prong like decay? – Is the energy split evenly amongst “sub-jets”? – Many sub-structure related variables / algorithms • Example substructure variable: – N-subjettiness τ21 = τ2 / τ1 – Continuous version of subjet counting • Example classification problem: separate W boson jet from a QCD light jet [Figure: normalised entries vs τ21 for QCD jets and W jets; ATLAS Simulation Preliminary, √s = 8 TeV, anti-kt jets with R = 1.0, trimmed, |η(truth)| < 1.2, 200 < pT(truth) < 350 GeV, reconstructed mass window] N-subjettiness: after a lot of thinking, cook up a variable that can separate QCD from W jets. (Michael Kagan)
  • 12. N-subjettiness: The Jet-Image • Jets built from calorimeter towers • Build NxN grid of towers containing the jet (here 25x25) • The Jet-Image → calorimeter towers like pixels in image! [Figure: example jet from W → qq′ decay, shown as a jet and as a jet-image] Calorimeter towers are like the pixels of an image. (Michael Kagan)
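One way to picture the jet-image construction is as a weighted two-dimensional histogram. The sketch below is an illustration and not the procedure used in the talk: it assumes tower positions are given as (eta, phi) offsets from the jet axis, that the image covers ±1.0 in each direction, and the function name `jet_image` and the four example towers are invented.

```python
import numpy as np


def jet_image(eta, phi, energy, n_pixels=25, half_width=1.0):
    """Bin calorimeter towers into an n_pixels x n_pixels grid of summed energy.

    eta, phi: tower positions relative to the jet axis (assumed convention).
    energy:   tower energies, used as the pixel intensities.
    """
    edges = np.linspace(-half_width, half_width, n_pixels + 1)
    image, _, _ = np.histogram2d(eta, phi, bins=[edges, edges], weights=energy)
    return image


# Four hypothetical towers forming two clusters, as in a two-prong decay.
eta = np.array([-0.30, -0.28, 0.35, 0.36])
phi = np.array([0.10, 0.12, -0.20, -0.22])
energy = np.array([40.0, 25.0, 30.0, 5.0])

image = jet_image(eta, phi, energy)
print(image.shape, image.sum())
```

Once a jet is an array of this kind, it can be flattened to a 625-element vector and passed to any image-classification technique.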
  • 13. N-subjettiness: Class Averages [Figure: two jet-image panels, "Average W jet" and "Average Light jet from QCD", axes Q1 and Q2 from 0.0 to 2.5, colour scale "Cell Coefficient" from 10^-9 to 10^-1] How can we extract the important features? How can we convert this into discrimination power? After some preprocessing, there is a difference! (Michael Kagan)
  • 14. N-subjettiness: Fisher Discriminant • Finds direction that maximizes between-class scatter / within-class scatter – Extract “most important” feature, a, for discrimination for this metric – This can be written as a “Generalized” eigenvalue problem • If data is high dimensional, e.g. 625 elements, then St has huge number of independent components, e.g. 192,495! – Not enough data to build full rank matrix → Must regularize! – Details of analytic solution: Z. Zhang et al., Regularized Discriminant Analysis, Ridge Regression and Beyond, Journal of Machine Learning Research 11 (2010) 2199-2228 A complicated way of saying ... (Michael Kagan)
  • 15. Fisher's Linear Discriminant [Figure: scatter plot] Find an axis along which we can separate the data.
  • 16. Fisher's Linear Discriminant [Figure: scatter plot] Find an axis along which we can separate the data.
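For two classes, the axis Fisher's linear discriminant looks for has a standard closed form: it is proportional to the inverse within-class scatter matrix applied to the difference of the class means. The sketch below implements that textbook form on invented Gaussian toy data. The small ridge term is a simple stand-in for regularisation and is an assumption of this example, not the regularised method of Zhang et al. cited in the talk.

```python
import numpy as np


def fisher_direction(signal, background, ridge=1e-3):
    """Two-class Fisher direction: w proportional to S_w^-1 (mean_signal - mean_background).

    ridge adds a multiple of the identity to the within-class scatter so that
    the linear system stays solvable when there are few samples per dimension.
    """
    mean_s = signal.mean(axis=0)
    mean_b = background.mean(axis=0)
    centred_s = signal - mean_s
    centred_b = background - mean_b
    # Within-class scatter: summed outer products of the centred samples.
    s_w = centred_s.T @ centred_s + centred_b.T @ centred_b
    s_w += ridge * np.eye(s_w.shape[0])
    w = np.linalg.solve(s_w, mean_s - mean_b)
    return w / np.linalg.norm(w)


# Toy data: two correlated Gaussian clouds with different means.
rng = np.random.default_rng(1)
cov = np.array([[1.0, 0.8], [0.8, 1.0]])
signal = rng.multivariate_normal([1.0, 0.0], cov, size=500)
background = rng.multivariate_normal([-1.0, 0.0], cov, size=500)

w = fisher_direction(signal, background)
proj_s = signal @ w       # one number per signal sample
proj_b = background @ w   # one number per background sample
print("direction:", w)
print("projected means:", proj_s.mean(), proj_b.mean())
```

Projecting each sample onto `w` reduces it to a single number, which can then be cut on or histogrammed like any hand-crafted variable.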
  • 17. Performance [Figure: background rejection vs signal efficiency [%] for Fisher-Jet and N-subjettiness (τ2/τ1)] We did not have to think long and hard about a variable, and are competitive! (Michael Kagan)
  • 18. Computer Vision Applied Blindly • By mapping concepts from images to jets you gain access to well studied CV techniques • No need to think up ''clever'' variables a priori: a flexible method! • Computers can discover good ways to represent the data ''by themselves'' • Fisher's Linear Discriminant was state of the art in 1997, things have moved on since then!
  • 19. What About YouTube? Let a computer watch YouTube and it will learn that cats are a useful thing (variable) to know about.
  • 21. Deep Learning: detecting the Higgs boson. A two-class supervised learning problem: Higgs-production / Primary background. Machine learning classifier: ∙ 28 features ∙ 21 low-level features ∙ 7 high-level features derived by physicists ∙ 10M simulated collisions for training (50% each) ∙ 500k validation set ∙ 500k test set Do the seven high level variables help? (Peter Sadowski)
  • 22. Deep Learning: detecting the Higgs boson ∙ Current approach: shallow models ∙ Boosted decision trees* (BDT) ∙ Shallow neural networks (NN) ∙ Our approach: deep neural networks (DNN) [Figure: BDT, NN, DNN] *Used for Higgs discovery in 2012. Things we knew in the 80s have finally started working! (Peter Sadowski)
  • 23. Deep Learning: deep learning for particle collider data analysis. Motivated by successes of deep learning in vision and speech. ∙ Huge progress on benchmark supervised learning tasks ∙ Replacement of engineered features with learned features [Figure: engineered features vs learned features] Deep Neural Networks can learn better representations of the data without human input. (Peter Sadowski)
  • 24. Deep Learning: detecting the Higgs boson. Area Under ROC Curve for Test Set (low-level features / all features): BDT 0.73 / 0.81; NN 0.733 (0.007) / 0.816 (0.004); DNN 0.880 (0.001) / 0.885 (0.002). Deep learning improves AUC by 8% over shallow methods. Deep learning does not require engineered features. Baldi et al, Nature Communications 2014. No, adding high level features does not improve performance. (Peter Sadowski)
  • 25. Nice, ... what does all this have to do with LHCb?
  • 26. The Physics Equivalent of the Cat: What variables does a NN learn when you show it physics? We should find out!
  • 27. Learn Expensive Parts of the Simulation. Detecting the Higgs boson: Mean Squared Error of networks trained to compute 7 high-level features from 21 low-level features. Feature regression MSE by technique: Linear Regression 0.1468; NN 0.0885; DNN 3 layers 0.0821; DNN 4 layers 0.0818; DNN 5 layers 0.0815; DNN 6 layers 0.0812. High-level features easier to learn with deep nets. Use a NN with multiple regression outputs to learn a fast simulation of some parts of the simulation? (Peter Sadowski)
  • 28. Isolation or Flavour Tagging: Can we use "jets-are-like-images" ideas for this?
  • 29. Visualisation [Figure: 2D scatter of points labelled with the digits 0 to 9] t-SNE projecting a 64 dimensional space into 2D, without using labels.
  • 30. The End • It is all about representation. • A small conference with an unusual mix of attendees. Check the agenda for more on traditional tracking, etc. • LHCb is leading the way when it comes to ''real time'' tracking, others are following. • To stay ahead of the other experiments we should investigate these new ML tools.