Artificial agents interacting in highly dynamic environments are required to continually acquire and fine-tune their knowledge over time. In contrast to conventional deep neural networks, which typically rely on a large batch of annotated training samples, lifelong learning systems must account for situations in which the number of tasks is not known a priori and the data samples become available incrementally over time. Despite recent advances in deep learning, lifelong machine learning has remained a long-standing challenge because neural networks are prone to catastrophic forgetting: learning new tasks interferes with previously learned ones and leads to abrupt disruptions of performance. Recently proposed deep supervised and reinforcement learning models for addressing catastrophic forgetting still suffer from flexibility, robustness, and scalability issues when compared to biological systems. In this tutorial, we will present and discuss well-established and emerging neural network approaches motivated by lifelong learning factors in biological systems, such as neurosynaptic plasticity, complementary memory systems, multi-task transfer learning, and intrinsically motivated exploration.
2. A Practical Example
• 50 GB/s of streaming data.
• ~30,240 TB of data after only a week.
• Impossible to re-train the SpotMini's brain from scratch and to adapt fast.
SpotMini robot from Boston Dynamics, 2018
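The ~30,240 TB figure on this slide follows directly from the stream rate; a quick sanity check:

```python
# Back-of-the-envelope check of the streaming figures on this slide.
rate_gb_per_s = 50                 # 50 GB/s sensor stream
seconds_per_week = 7 * 24 * 3600   # 604,800 s in a week
total_gb = rate_gb_per_s * seconds_per_week
total_tb = total_gb / 1000         # decimal TB
print(total_tb)                    # -> 30240.0 TB after one week
```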
3. Continual Learning (CL)
• Ability to continually acquire, fine-tune, and transfer new knowledge and skills.
• Realistic time-scale: data (and tasks) become available only incrementally over time.
• No access to previously encountered data.
• Constrained computational and memory resources.
5. Catastrophic forgetting
• Training a model with new information interferes with previously learned knowledge.
• Abrupt performance decrease, or old knowledge completely overwritten by the new.
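The effect described above can be reproduced in a few lines. The following is an illustrative toy (not from the tutorial): a single logistic-regression neuron is trained on task A, then naively fine-tuned on a task B whose labels conflict with A, after which its task-A accuracy collapses.

```python
import numpy as np

# Toy sketch of catastrophic forgetting: sequential training on two
# conflicting tasks with no protection against interference.
rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train(w, X, y, lr=0.5, epochs=200):
    # plain gradient descent on the logistic loss
    for _ in range(epochs):
        grad = X.T @ (sigmoid(X @ w) - y) / len(y)
        w = w - lr * grad
    return w

def accuracy(w, X, y):
    return np.mean((sigmoid(X @ w) > 0.5) == y)

X = rng.normal(size=(200, 2))
y_a = (X[:, 0] > 0).astype(float)   # task A: sign of feature 0
y_b = 1.0 - y_a                     # task B: the opposite labelling

w = np.zeros(2)
w = train(w, X, y_a)
acc_a_before = accuracy(w, X, y_a)  # high after learning task A
w = train(w, X, y_b)                # naive fine-tuning on task B
acc_a_after = accuracy(w, X, y_a)   # task A knowledge overwritten

print(acc_a_before, acc_a_after)
```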
8. Biological factors of CL
• Structural Plasticity
  • Neurosynaptic adaptation to changes in the environment
  • Change of physical structure as a result of learning
  • Stability-plasticity balance
• Complementary Learning Systems
  • Retaining episodic memories (memorization)
  • Extracting statistical structure (generalization)
  • Memory replay
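The memory-replay idea above is often implemented as a small episodic buffer that keeps a uniform sample of the stream, so old experiences can be interleaved with new ones during training. A minimal sketch using reservoir sampling (class name, capacity, and stream are illustrative, not from the tutorial):

```python
import random

class ReplayBuffer:
    """Fixed-capacity buffer holding a uniform sample of a stream."""

    def __init__(self, capacity, seed=0):
        self.capacity = capacity
        self.data = []
        self.seen = 0
        self.rng = random.Random(seed)

    def add(self, item):
        # Reservoir sampling: the i-th item replaces a random slot
        # with probability capacity / i, so the buffer stays a
        # uniform sample of everything seen so far.
        self.seen += 1
        if len(self.data) < self.capacity:
            self.data.append(item)
        else:
            j = self.rng.randrange(self.seen)
            if j < self.capacity:
                self.data[j] = item

    def sample(self, k):
        # mini-batch of old experiences to interleave with new data
        return self.rng.sample(self.data, min(k, len(self.data)))

buf = ReplayBuffer(capacity=100)
for t in range(10_000):          # a long non-stationary stream
    buf.add(t)

print(len(buf.data))             # never exceeds capacity
```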
18. CRL Environments
Environment       Scenarios
Atari             Multiple 2D games
DeepMind Lab      Maze exploration, object picking
Malmo             Multiple tasks
OpenAI Gym        Multiple 3D tasks
MuJoCo            Multiple joint stiffness
VizDoom           -
Unity 3D          -
StarCraft II      Curriculum learning
19. Some References for CRL
• Al-Shedivat, Maruan, et al. "Continuous adaptation via meta-learning in nonstationary and competitive environments." arXiv preprint arXiv:1710.03641 (2017).
• Tessler, Chen, et al. "A deep hierarchical approach to lifelong learning in Minecraft." Thirty-First AAAI Conference on Artificial Intelligence. 2017.
• Kirkpatrick, James, et al. "Overcoming catastrophic forgetting in neural networks." Proceedings of the National Academy of Sciences 114.13 (2017): 3521-3526.
• Schwarz, Jonathan, et al. "Progress & compress: A scalable framework for continual learning." arXiv preprint arXiv:1805.06370 (2018).
• Kaplanis, Christos, Murray Shanahan, and Claudia Clopath. "Continual reinforcement learning with complex synapses." arXiv preprint arXiv:1802.07239 (2018).
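The Kirkpatrick et al. reference above (Elastic Weight Consolidation) protects old knowledge by adding a quadratic penalty that anchors the weights important for task A while task B is learned. A minimal sketch of that penalty term; the parameter values here are illustrative placeholders, not from the paper:

```python
import numpy as np

def ewc_penalty(theta, theta_a, fisher, lam=1.0):
    # L(theta) = L_B(theta) + sum_i lam/2 * F_i * (theta_i - theta_a_i)^2
    # This computes the second (anchoring) term only.
    return 0.5 * lam * np.sum(fisher * (theta - theta_a) ** 2)

theta_a = np.array([1.0, -2.0, 0.5])    # weights after task A
fisher  = np.array([10.0, 0.1, 0.0])    # per-weight importance on task A
theta   = np.array([1.5, -1.0, 3.0])    # candidate weights during task B

# Moving an important weight (F=10) is costly;
# moving an unimportant one (F=0) is free.
print(ewc_penalty(theta, theta_a, fisher))   # -> 1.3
```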
20. Lomonaco V. and Maltoni D. CORe50: a New Dataset and Benchmark for Continuous Object Recognition. CoRL 2017.
CORe50: a Video Benchmark for CL and Object Recognition, Detection and Segmentation
21.
# Images           164,866
Format             RGB-D
Image size         350x350, 128x128
# Categories       10
# Obj. x Cat.      5
# Sessions         11
# Img. x Sess.     ~300
# Outdoor Sess.    3
Acquisition Sett.  Hand held
24. CRL in 3D non-stationary environment
Lomonaco V., Desai K., Maltoni D. and Culurciello E. Continual Reinforcement Learning in 3D non-stationary environments. Submitted to ECML-PKDD, 2019.
[Video demo]
27. CL Framework and Metrics
CL Algorithm
N. Díaz-Rodríguez, V. Lomonaco et al. Don't forget, there is more than forgetting: new metrics for Continual Learning. CL Workshop, NeurIPS 2018.
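Two of the most common metrics in such a framework can be computed from an accuracy matrix R, where R[i, j] is the test accuracy on task j after training on task i. The Díaz-Rodríguez et al. paper above defines a richer set; this sketch shows the usual starting point, with made-up matrix values for illustration:

```python
import numpy as np

R = np.array([
    [0.90, 0.10, 0.10],   # after training task 1
    [0.70, 0.85, 0.10],   # after training task 2
    [0.60, 0.75, 0.80],   # after training task 3
])
T = R.shape[0]

# Final average accuracy: mean over all tasks after the last one is learned.
avg_acc = R[-1].mean()

# Backward transfer: how much accuracy on earlier tasks changed by the end;
# a negative value indicates forgetting.
bwt = np.mean([R[-1, j] - R[j, j] for j in range(T - 1)])

print(avg_acc, bwt)
```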
31. Limitations and Future Works
Limitations
• Still a young line of research
• Theoretical foundations largely missing
• Few real-world applications so far
What's next?
• Towards biological synaptic plasticity, learning, and memory.
• Robustness, flexibility, and efficiency.
32. CL in autonomous agents & robots
• Progressively acquire, fine-tune, and transfer knowledge and skills through interaction with the environment.
• Data are temporally correlated and increasingly more complex.
• Active exploration through intrinsic motivation.