Continual Learning: Another Step Towards Truly Intelligent Machines

Continual Learning: Another Step
TowardsTruly Intelligent Machines
Introduction Meetup @ Numenta
16-09-2019
Vincenzo Lomonaco
vincenzo.lomonaco@unibo.it
Postdoctoral Researcher @ University of Bologna
Supervisor: Davide Maltoni

About me
• Post-Doc @ University of Bologna
• Research Aﬃliate @ AI Labs
• Teaching Assistant of the courses
Machine Learning and Computer
Architectures @ UniBo
• Author andTechnical reviewer of the
online course Deep Learning with R and
book R Deep Learning Essentials.
• Co-Founder and President of
ContinualAI.org
• Co-Founder and Board Member of Data
Science Bologna and AIforPeople.org

What’s ContinualAI?
• ContinualAI is a non-proﬁt research organization and
the largest research community on Continual Learning
for AI.
• It counts more than 550+ members in 17 diﬀerent
time-zones and from top-notch research institutions.
• Learn more about ContinualAI at www.continualai.org

ContinualAI Board Members and Advisors

Machine Intelligence @ BioLab
Davide Maltoni
Vincenzo Lomonaco Lorenzo Pellegrini Gabriele Graﬃeti

Outline
1. Personal ResearchTrajectory andVision
2. Continual Learning: State-of-the-art
3. Rehearsal-free and Task-agnostic
Online Continual Learning
4. CurrentWork and Research Direction

PersonalResearch Trajectory
andVision

ResearchTrajectory andVision
I meet Davide Maltoni
who was working at
HTMs from 2011.
I read “On
Intelligence” and join
his quest for
understanding
intelligence and build it
in silicon.
MasterThesis Published:
“Comparing HTMs and CNNs
on Object RecognitionTasks”
2014
Visiting Scholar at Purdue
University.
Working on Continual
Reinforcement /
Unsupervised Learning.
Visiting Scholar at ENSTA
ParisTech.
Working on Continual for
Robotics and a more
comprehensive CL
framework deﬁnition.
2015 2017 2018
I defend my PhD
Dissertation “Continual
Learning with Deep
Architectures”.
Putting everything
together.
Post-Doc @ UniBo
on the same topic.
2019
We abandon HTM (1st Gen.) to
work on top of deep learning
directly with a focus on
Continual Learning.
In particular, on Continual
Learning from video sequences.
2016
Long-term vision: “Understand the key computational
principles of intelligence and build truly intelligent machines.”
Main research goal: “Closing the gap between the HTM
theory and current AI systems.”

OurWorks with HTMs (1st Gen.)
1. D. Maltoni, Pattern Recognition by HierarchicalTemporal
Memory,Technical Report, DEIS - University of Bologna technical
report, April 2011.
2. D. Maltoni and E.M. Rehn, Incremental Learning by Message
Passing in HierarchicalTemporal Memory in 5thWorkshop on
Artiﬁcial Neural Networks in Pattern Recognition (ANNPR12),
Trento (Italy), pp.24-35, September 2012.
3. E.M. Rehn and D. Maltoni, Incremental Learning by Message
Passing in HierarchicalTemporal Memory, Neural Computation,
vol.26, no.8, pp.1763-1809, August 2014.
4. D. Maltoni andV. Lomonaco, Semi-supervisedTuning from
Temporal Coherence, in International Conference on Pattern
Recognition (ICPR16), Cancun, Mexico, December 2016.

Semi-SupervisedTuning from
Temporal Coherence (with HTMs)

HTM theory
Principles of Intelligence
1. Hierarchical Learning
2. Sequence Learning
3. Continual Learning
4. Sparse Representations
5. Sensory-Motor Integration (Embodiment)
6. Distributed Parallel Modeling (Thousands BrainTheory)
7. … ?
Emerging Properties
Flexibility Robustness Scalability Eﬃciency Adaptation
Autonomy Generalization Compositionality Reasoning
Common Sense ...

Towards “Cortical Learning”
Neuroscience Grounding
PracticalFunctionality
Symbolic AI
Kernel Machines
Feed-Forward
NNs / LSTMs
CNNs
Deep-CNNs
Conv-LSTMs
Deep-RL
Continual
Learning
Cortical
Learning
HTM
(1st Gen.)
CLA
(2nd Gen.)
CLA
(3rd Gen.)
Other Approaches
HTM-based
Neural Networks Based
Bayesian Approaches
Analogism-based
Approaches Evolutionary Approaches

Continual Learning:
State-of-the-art

The Stability-Plasticity Dilemma
Stability-Plasticity Dilemma:
• Remember past concepts
• Learn new concepts
• Generalize
Biggest Problem in Deep Learning:
• Catastrophic Forgetting

The Stability-Plasticity Dilemma

Continual Learning: Approaches
T. Lesort,V. Lomonaco et al. Continual Learning for Robotics. pre-print arxiv arXiv:1907.00182 .

CL Framework
CL Algorithm
Mini-spot Robot from Boston Dynamics, 2018

Rehearsal-free and Task-agnostic

3 Short-term Research Objective for CL
1. Rehearsal-Free: Raw data cannot be stored and re-used
for rehearsal.
2. Task Agnostic: No use of supplementary task supervised
signal “t”.
3. Online: Bounded computational and memory
overheads, eﬃcient, real-time updates (possibly one
data instance at a time).

Task Agnostic Continual Learning
1. New Instances (NI)
2. New Classes (NC)
3. New Instances and Classes (NIC)
Initial Batch Incremental Batches
Τ
. . .

CORe50Website
Dataset, Benchmark, code and additional
information freely available at:
vlomonaco.github.io/core50
LomonacoV. and Maltoni D. CORe50: a New Dataset and Benchmark for Continuous Object Recognition. CoRL2017.

CORe50: aVideo Benchmark for CL
and Object Recognition/Detection

# Images 164,866
Format RGB-D
Image size 350x350
128x128
# Categories 10
# Obj. x Cat. 5
# Sessions 11
# img. x Sess. ~300
# Outdoor Sess. 3
Acquisition Sett. Hand held
CORe50: aVideo Benchmark for CL
and Object Recognition/Detection

Fine-Grained Continual Learning
LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.

AR-1*
Rehearsal-free andTask Agnostic
Maltoni D. and LomonacoV. Continuous Learning in Single-Incremental-Task Scenarios. Neural Networks Journal, 2019.

AR-1*: Overview (with MobileNet-V1)

AR-1*: Supervised / Unsupervised
Pre-Training Phase
● Supervised or
Unsupervised
Pre-Training from
ImageNet.
● Slowly Fine-tuned or
kept ﬁxed.
● future direction:
unsupervised
co-training from
scratch.

AR-1*: Regularization Phase

AR-1*: Regularization Phase
● Computational
Eﬃcient (independent
from the number of
training batches)
● Just one Fisher matrix
(running sum + max
clip)
● Importance of Batch
ReNormalization

AR-1*: Architectural Phase
● CWR*: generalization of
CWR+ to handle
agnostically NI, NC and
NIC settings
● Dual-Memory system for
memory consolidation.
● Based on zero-init for new
classes, weights
consolidation and
ﬁnetuning for already
encountered classes.

CORe50 - NICv2 Results
● (0%-92%) -45% avg. memory.
● (0%-94%) -49% avg. compute.
● -20% price in accuracy at
the end of last batch.

CurrentWork and Research
Direction

Real-World Continual Learning on
Embedded Systems
Pellegrini L., Graﬃeti G. , LomonacoV. and Maltoni D. Towards Continual Learning on the Edge.To be published.

AR-1*: Closing the Accuracy Gap with
Latent Rehearsal

AR-1*: Sparse Representations
● Imposing Sparsity of the
activactivation does not
aﬀect accuracy from
~55% to ~35%.
● It has been shown that
sparsity may help the CL
process.
● Less memory overhead for
latent rehearsal.

FutureWorks and Research Direction
1. Latent Generative Replay
2. Lowering the amount of Supervision (Unsupervised
Reinforcement Learning, Active Learning)
3. Infer or make use of the sparse “task signal” (context
modulation)
4. Sequence Learning/ Temporal Coherence Integration
5. Improve robustness in real-world embedded
applications (Smartphone devices, Nao Robot, …)
Maltoni D. and LomonacoV. Semi-SupervisedTuning fromTemporal Coherence. ICPR 2016.
LomonacoV., Desai K., Maltoni D. and Culurciello, E. Continual Reinforcement Learning in 3D non-stationary
environments. preprint arxiv arXiv:1905.10112, 2019.

AR-1*: Closing the Accuracy Gap with
Latent Generative Replay
●
●
●
●
●
●

Questions?
Introduction Meetup @ Numenta
16-09-2019
Vincenzo Lomonaco
vincenzo.lomonaco@unibo.it
Postdoctoral Researcher @ University of Bologna
Supervisor: Davide Maltoni

Continual Learning: Another Step Towards Truly Intelligent Machines

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (17)

Semelhante a Continual Learning: Another Step Towards Truly Intelligent Machines

Semelhante a Continual Learning: Another Step Towards Truly Intelligent Machines (20)

Mais de Vincenzo Lomonaco

Mais de Vincenzo Lomonaco (14)

Último

Último (20)

Continual Learning: Another Step Towards Truly Intelligent Machines