
Continual Learning: On Machines that can Learn Continually


Official Open-Access Course @ University of Pisa, ContinualAI, AIDA

Lecture 5: Methodologies [Part 1]

Vincenzo Lomonaco
University of Pisa & ContinualAI
vincenzo.lomonaco@unipi.it
TABLE OF CONTENTS

01  Strategies Categorization and History
02  Replay Strategies: Intro & Main Approaches
03  Avalanche Strategies & Plugins
Strategy Categorization and History
Possible 4-way Fuzzy Categorization

With some twists

● No formal definition

● Alternative categorizations are possible

Continual Learning for Robotics: Definition, Framework, Learning Strategies, Opportunities and Challenges, Lesort et al. Information Fusion, 2020.
A continual learning survey: Defying forgetting in classification tasks. De Lange et al, TPAMI 2021.
Continual Learning Baselines

Common Baselines / Control Algorithms

● Naive / Finetuning (just continuing backprop)

● JointTraining / Offline (pure multi-task learning): the best you can do with all the data, training from scratch

● Ensemble: one model for each experience

● Cumulative: for every experience, accumulate all data and re-train from scratch (see the sketch below).

A brief review on multi-task learning. Thung et al, 2018.
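To make the difference between these control baselines concrete, here is a minimal PyTorch-style sketch; it is not taken from the slides, and names such as `make_model`, `train_on`, and the list-of-datasets experience format are illustrative assumptions.

```python
import torch
from torch import nn

def train_on(model, dataset, epochs=1, lr=0.01, batch_size=32):
    """Plain supervised training loop (illustrative helper, not Avalanche API)."""
    loader = torch.utils.data.DataLoader(dataset, batch_size=batch_size, shuffle=True)
    optimizer = torch.optim.SGD(model.parameters(), lr=lr)
    criterion = nn.CrossEntropyLoss()
    model.train()
    for _ in range(epochs):
        for x, y in loader:
            optimizer.zero_grad()
            criterion(model(x), y).backward()
            optimizer.step()
    return model

def naive_finetuning(make_model, experiences):
    """Naive / Finetuning: keep the same model and just continue backprop."""
    model = make_model()
    for exp_dataset in experiences:
        train_on(model, exp_dataset)
    return model

def cumulative(make_model, experiences):
    """Cumulative: at each experience, re-train from scratch on all data seen so far."""
    seen = []
    for exp_dataset in experiences:
        seen.append(exp_dataset)
        model = train_on(make_model(), torch.utils.data.ConcatDataset(seen))
    return model
```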


Fundamental Design Choices

Strategic Choices

● Start from scratch or pre-trained?

● What model architecture to use?

● Such choices may affect the effectiveness of the CL approach

Multi-Head vs Single-Head

Continual Learning for Recurrent Neural Networks: an Empirical Evaluation. Cossu et al, 2021.
Historical Trends

● Initial focus on Task-Incremental settings (a few experiences, one per task, task labels given)

● Simple regularization methods (L1 / L2, Dropout, Elastic Weight Consolidation, Synaptic Intelligence, etc.)

● Simple architectural strategies (Multi-head, Copy-Weight with Reinit, Progressive Neural Networks, etc.)

● Simple Replay Strategies (random Replay, multi-buffer random replay, etc.)

● Current trend: more and more articulated strategies (often starting from pre-trained models), mostly hybrid

● Mostly heuristics rather than principled methods; very difficult to generalize to a large set of scenarios
Effective Solutions

Good News

● Replay is a very general and effective strategy for CL

Bad News

● Replay approximates an i.i.d. distribution
● It can be seen as a form of cheating
● Compute / memory limitations

Replay-Based Methods for Continual Learning, Gabriele Merlin, MS Thesis, University of Pisa, 2021.
Is Forgetting Solved?

Not really

● The gap with an offline strategy may still be very large

● The accuracy improvement with respect to the memory size is often logarithmic

● Huge buffer sizes (approximating a cumulative strategy) may be very inefficient

○ Memory footprint (for ImageNet, 50 images per class means about 7 GB of memory)

○ Additional forward and backward passes over the same examples

Latent Replay for Real-Time Continual Learning. Pellegrini et al. IROS, 2019.
Replay Strategies
Random Replay

A basic approach

● Sample randomly from the current experience data

● Fill your fixed Random Memory (RM)

● Replace examples randomly to maintain an approximately equal number of examples per experience (a minimal sketch follows below)

Latent Replay for Real-Time Continual Learning. Pellegrini et al. IROS, 2019.
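A minimal sketch of this idea, assuming a fixed-size memory with one slot per experience; all names are illustrative and this is not the implementation used in the cited paper.

```python
import random

class RandomMemory:
    """Fixed-size random replay memory, rebalanced so that every experience
    seen so far keeps roughly the same number of stored examples."""

    def __init__(self, capacity=500):
        self.capacity = capacity
        self.slots = []  # one list of (x, y) pairs per experience

    def update(self, experience_samples):
        """Add a new experience: shrink the old slots, then fill the freed
        space with a random subset of the new experience's data."""
        self.slots.append([])
        per_exp = self.capacity // len(self.slots)
        for slot in self.slots[:-1]:
            if len(slot) > per_exp:
                slot[:] = random.sample(slot, per_exp)
        samples = list(experience_samples)
        self.slots[-1] = random.sample(samples, min(per_exp, len(samples)))

    def sample(self, k):
        """Draw k examples uniformly from the whole memory for replay."""
        pool = [item for slot in self.slots for item in slot]
        return random.sample(pool, min(k, len(pool)))
```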
Many Implementation Options
…and many implications

● Fixed or “adaptive” external memory?

● Sample selection: random or representative examples only?

● Mini-batch sample selection: which examples to pick from M and use in the current mini-batch? What augmentations to use? (a simple mixing scheme is sketched after this list)

● Separate buffers per class / tasks / notable distributions?

● Sample based on time: different timescales? Uniform sampling in time?

● Sample replacement: which examples to throw away when the memory is full?

● No clear answer to all these questions: a coherent empirical evaluation is still missing

● It really depends on the scenario / problem you are solving -> more engineering than science

Memory Efficient Experience Replay for Streaming Learning, Hayes et al. 2019
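As an example of one common answer to the mini-batch question, the sketch below concatenates the current mini-batch with a random draw from the memory. It reuses the `RandomMemory` sketch above and is purely illustrative (it assumes the memory stores tensors).

```python
import torch

def replay_minibatch(current_batch, memory, mem_ratio=0.5):
    """Mix the current mini-batch with a random draw from the replay memory.
    `memory` is assumed to expose .sample(k) returning (x, y) tensor pairs,
    as in the RandomMemory sketch above."""
    x_cur, y_cur = current_batch
    k = int(mem_ratio * x_cur.size(0))
    mem_samples = memory.sample(k)
    if not mem_samples:
        return x_cur, y_cur          # empty memory: first experience
    x_mem = torch.stack([x for x, _ in mem_samples])
    y_mem = torch.tensor([y for _, y in mem_samples])
    return torch.cat([x_cur, x_mem]), torch.cat([y_cur, y_mem])
```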
GDumb: Another Control Baseline

Greedy Sampler and Dumb Learner

● Interesting paper that sparked strong discussions in the CL community

● Note that there is no knowledge transfer in this strategy (quite dumb indeed!)

● Despite its simplicity, it was shown to work better than some existing and more complex strategies, questioning the utility of some benchmarks/metrics in our field

● If your strategy cannot beat GDumb, there is something wrong with your strategy or your evaluation setting

GDumb: A Simple Approach that Questions Our Progress in Continual Learning. Prabhu et al. ECCV, 2020.
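A rough sketch of the two ingredients, under the assumption (from the paper's description) of a greedily class-balanced memory plus a learner trained from scratch on that memory only; class/variable names are illustrative.

```python
import random
from collections import defaultdict

class GreedyBalancedSampler:
    """GDumb-style greedy sampler (illustrative sketch): keep a class-balanced
    memory of at most `capacity` examples, updated greedily as data arrives."""

    def __init__(self, capacity=1000):
        self.capacity = capacity
        self.per_class = defaultdict(list)

    def observe(self, x, y):
        counts = {c: len(v) for c, v in self.per_class.items()}
        size = sum(counts.values())
        if size < self.capacity or y not in counts:
            # room left, or a brand new class: if full, trim the largest class
            if size >= self.capacity:
                largest = max(counts, key=counts.get)
                self.per_class[largest].pop(random.randrange(counts[largest]))
            self.per_class[y].append(x)
        elif counts[y] < max(counts.values()):
            # keep classes balanced: replace one example of the largest class
            largest = max(counts, key=counts.get)
            self.per_class[largest].pop(random.randrange(counts[largest]))
            self.per_class[y].append(x)

    def dataset(self):
        return [(x, y) for y, xs in self.per_class.items() for x in xs]

# The "dumb learner": at evaluation time, train a fresh model from scratch on
# sampler.dataset() only -- no knowledge is transferred between experiences.
```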
Maximally Interfered Retrieval (MIR)

Mini-batch Sample Selection

● Select the memory examples that are most negatively impacted by the estimated weight update

● May be quite slow in practice w.r.t. the actual accuracy gain over random selection

Online Continual Learning with Maximally Interfered Retrieval. Aljundi et al. 2019.
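A minimal sketch of MIR-style retrieval (illustrative, not the authors' code): simulate one SGD step on the current mini-batch, then replay the memory candidates whose loss increases the most under that virtual update. A classification loss is assumed.

```python
import copy
import torch
import torch.nn.functional as F

def mir_select(model, x_cur, y_cur, x_mem, y_mem, k, lr=0.01):
    """Pick the k memory examples most interfered by the estimated update."""
    # 1) virtual update on a throw-away copy of the model
    virtual = copy.deepcopy(model)
    virtual_opt = torch.optim.SGD(virtual.parameters(), lr=lr)
    virtual_opt.zero_grad()
    F.cross_entropy(virtual(x_cur), y_cur).backward()
    virtual_opt.step()

    # 2) score candidates by how much their loss increases after the step
    with torch.no_grad():
        loss_before = F.cross_entropy(model(x_mem), y_mem, reduction="none")
        loss_after = F.cross_entropy(virtual(x_mem), y_mem, reduction="none")
        interference = loss_after - loss_before

    # 3) return the most interfered memory examples for replay in this step
    idx = torch.topk(interference, min(k, x_mem.size(0))).indices
    return x_mem[idx], y_mem[idx]
```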
Latent Replay

Key Ideas

● Replay in the input space is inefficient and biologically implausible

● Why not replay in the space of latent activations?

● Good accuracy-memory-computation trade-offs are possible

Latent Replay for Real-Time Continual Learning. Pellegrini et al. IROS, 2019.
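The sketch below illustrates the idea rather than the cited implementation: the backbone below the replay layer is frozen, activations of past data are stored instead of raw inputs, and replayed patterns are injected directly into the trainable head. Layer sizes and names are made up.

```python
import torch
from torch import nn

class LatentReplayNet(nn.Module):
    """Toy network split at a 'latent replay layer' (illustrative)."""

    def __init__(self):
        super().__init__()
        self.backbone = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 256), nn.ReLU())
        self.head = nn.Sequential(nn.Linear(256, 128), nn.ReLU(), nn.Linear(128, 10))
        for p in self.backbone.parameters():   # frozen below the replay layer
            p.requires_grad = False

    def forward(self, x):
        return self.head(self.backbone(x))

def latent_replay_step(net, optimizer, x_new, y_new, z_replay, y_replay):
    """One training step: new data goes through the whole network, while
    stored latent activations skip the backbone and only drive the head."""
    with torch.no_grad():
        z_new = net.backbone(x_new)            # cheap: backbone is frozen
    z = torch.cat([z_new, z_replay])
    y = torch.cat([y_new, y_replay])
    optimizer.zero_grad()
    loss = nn.functional.cross_entropy(net.head(z), y)
    loss.backward()
    optimizer.step()
    return loss.item()
```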
Generative Replay
Key Ideas

● Instead of a replay memory, why not generate examples?

● In theory this would be even better than replay, allowing for the generation of examples that were never seen before (a form of dreaming or imagination)

● Still difficult to scale to high-dimensional data and to find good accuracy-efficiency trade-offs

Continual Learning with Deep Generative Replay, Shin et al, 2017.
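A minimal sketch of the scheme (not the cited paper's code): a generator trained on past experiences produces pseudo-samples, a frozen copy of the previous solver labels them, and they are mixed with the current data. The toy generator and the function names are illustrative assumptions.

```python
import torch
from torch import nn

class ToyGenerator(nn.Module):
    """Maps noise vectors to fake flattened 'images' (toy stand-in for a
    generator trained on the previous experiences, e.g. a VAE or GAN)."""
    def __init__(self, z_dim=32, out_dim=28 * 28):
        super().__init__()
        self.z_dim = z_dim
        self.net = nn.Sequential(nn.Linear(z_dim, 256), nn.ReLU(),
                                 nn.Linear(256, out_dim), nn.Sigmoid())

    def forward(self, z):
        return self.net(z)

def generative_replay_step(solver, optimizer, x_new, y_new,
                           old_generator, old_solver, replay_ratio=1.0):
    """One training step: mix real data with pseudo-samples drawn from the
    old generator and labelled by the old (frozen) solver."""
    if old_generator is not None:
        n_replay = int(replay_ratio * x_new.size(0))
        with torch.no_grad():
            z = torch.randn(n_replay, old_generator.z_dim)
            x_gen = old_generator(z)
            y_gen = old_solver(x_gen).argmax(dim=1)
        x = torch.cat([x_new, x_gen])
        y = torch.cat([y_new, y_gen])
    else:
        x, y = x_new, y_new                     # first experience: no replay
    optimizer.zero_grad()
    loss = nn.functional.cross_entropy(solver(x), y)
    loss.backward()
    optimizer.step()
    return loss.item()
```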


Replay: Summary and Next Steps

● A definitive study of replay in deep continual learning is still missing

● Replay has been shown to be an effective strategy in CL if performance is the main objective

● Replay is unlikely to represent the main computational principle for CL in biological learning systems (not a good efficiency-effectiveness trade-off)

● Many improvements and implementation options have been explored with different degrees of
success

● Generative / latent replay constitutes an interesting future direction, but it is quite challenging at the moment due to the limited capabilities of current generative models
Avalanche Strategies and Plugins
Training: Design

Avalanche provides popular strategies, already implemented and ready to use, as well as easy mechanisms to define custom strategies.

● Many strategies are already available

● Easy modification of the training loop to add logging and custom behavior (mostly through polymorphism)

V. Lomonaco et al. Avalanche: an End-to-End Library for Continual Learning. CLVision Workshop at CVPR 2021.
How to: Strategy Initialization
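The original slide shows this as a code snippet; the sketch below is a reconstruction rather than the slide's exact code. Import paths follow the 2021-era Avalanche API (`avalanche.training.strategies`) and may differ in newer releases; hyperparameter values are illustrative.

```python
import torch
from avalanche.benchmarks.classic import SplitMNIST
from avalanche.models import SimpleMLP
from avalanche.training.strategies import Naive

# benchmark: 5 experiences, each containing a subset of the MNIST classes
benchmark = SplitMNIST(n_experiences=5)

model = SimpleMLP(num_classes=10)
optimizer = torch.optim.SGD(model.parameters(), lr=0.001, momentum=0.9)
criterion = torch.nn.CrossEntropyLoss()

# a ready-to-use strategy: Naive = plain finetuning over the stream
strategy = Naive(
    model, optimizer, criterion,
    train_mb_size=32, train_epochs=2, eval_mb_size=32,
    device="cuda" if torch.cuda.is_available() else "cpu",
)
```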
How to: Training & Evaluation

V. Lomonaco et al. Avalanche: an End-to-End Library for Continual Learning. CLVision Workshop at CVPR 2021.
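The corresponding training and evaluation loop, continuing the sketch above (again a reconstruction, not the slide's exact snippet; the structure of the returned metrics dictionary varies across versions):

```python
# iterate over the stream of experiences: train on each one, then evaluate
# on the whole test stream to measure forgetting on past experiences
results = []
for experience in benchmark.train_stream:
    print("Training on experience", experience.current_experience)
    strategy.train(experience)
    results.append(strategy.eval(benchmark.test_stream))
```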
Training: Design

● Strategy: defines a CL strategy with two simple methods:

○ train and eval.

● Plugin: a simple interface to add custom behavior to the training and eval loops.

V. Lomonaco et al. Avalanche: an End-to-End Library for Continual Learning. CLVision Workshop at CVPR 2021.
How to: Add Plugins

V. Lomonaco et al. Avalanche: an End-to-End Library for Continual Learning. CLVision Workshop at CVPR 2021.
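As a sketch of how plugins are attached (reusing the model, optimizer, and criterion defined above; the plugin hyperparameter values are illustrative):

```python
from avalanche.training.plugins import EWCPlugin, ReplayPlugin
from avalanche.training.strategies import Naive

# Plugins are passed at construction time and hook into the training loop.
# Here plain finetuning is turned into replay + EWC regularization simply
# by composing two ready-made plugins.
strategy = Naive(
    model, optimizer, criterion,
    train_mb_size=32, train_epochs=2,
    plugins=[ReplayPlugin(mem_size=200), EWCPlugin(ewc_lambda=0.4)],
)
```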
Training: Custom Strategies

How to write a custom strategy

● plugin: the easiest way to customize training and define new strategies.

● strategy: override the loop methods directly.

Why should I use Avalanche to implement my own strategies?

● automatic logging & metrics evaluation.

● you write less code, and you can easily share it with the community.

V. Lomonaco et al. Avalanche: an End-to-End Library for Continual Learning. CLVision Workshop at CVPR 2021.
BaseStrategy: Under the hood

● The base class from which to inherit and specialize

● Implemented as a series of callbacks acting as a skeleton for the plugin system: this means you can write plugins “by difference” and compose plugins

V. Lomonaco et al. Avalanche: an End-to-End Library for Continual Learning. CLVision Workshop at CVPR 2021.
Custom Plugin

V. Lomonaco et al. Avalanche: an End-to-End Library for Continual Learning. CLVision Workshop at CVPR 2021.
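The slide shows a plugin snippet; the sketch below is an illustrative reconstruction. The base class is `StrategyPlugin` in 2021-era Avalanche (renamed `SupervisedPlugin` in newer releases), and only the callbacks you need are overridden.

```python
from avalanche.training.plugins import StrategyPlugin

class PrintProgressPlugin(StrategyPlugin):
    """Toy custom plugin: implement only the callbacks you care about;
    everything else falls back to the no-op defaults of the base class."""

    def before_training_exp(self, strategy, **kwargs):
        # the strategy gives access to the model, current experience, etc.
        print("Classes in this experience:",
              strategy.experience.classes_in_this_experience)

    def after_training_exp(self, strategy, **kwargs):
        print("Finished training on experience",
              strategy.experience.current_experience)

# plugged in like any built-in plugin:
# strategy = Naive(model, optimizer, criterion, plugins=[PrintProgressPlugin()])
```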
Custom Strategy

V. Lomonaco et al. Avalanche: an End-to-End Library for Continual Learning. CLVision Workshop at CVPR 2021.
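For the second route (overriding the loop methods directly), a rough sketch is given below. It is not the slide's code: the internal attribute names (`mbatch`, `mb_output`, `loss`, ...) follow the 0.1-era `BaseStrategy` and should be checked against the Avalanche version you have installed.

```python
from avalanche.training.strategies import BaseStrategy

class MyStrategy(BaseStrategy):
    """Illustrative custom strategy overriding the epoch loop directly."""

    def training_epoch(self, **kwargs):
        for self.mbatch in self.dataloader:
            self._unpack_minibatch()
            self.optimizer.zero_grad()
            self.mb_output = self.forward()
            # custom behavior goes here, e.g. an extra penalty on the loss
            self.loss = self.criterion()
            self.loss.backward()
            self.optimizer.step()
            # NOTE: the real training_epoch also fires the plugin callbacks
            # (_before_forward, _after_backward, ...); keep them if you rely
            # on plugins -- they are omitted here to keep the sketch short.
```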
Training: What’s Next?

● More Strategies & Plugins! (and make sure they can reproduce published results)

● Support for Unsupervised / Reinforcement Continual Learning (check the Avalanche ecosystem!)

V. Lomonaco et al. Avalanche: an End-to-End Library for Continual Learning. CLVision Workshop at CVPR 2021.
Training in Avalanche: Demo Session!

https://avalanche.continualai.org/from-zero-to-hero-tutorial/04_training
Replay in Avalanche
Replay in Avalanche: Demo Session!

https://avalanche.continualai.org/how-tos/dataloading_buffers_replay
Next: Methodologies [Part 2]
Do you have any questions?

vincenzo.lomonaco@unipi.it
vincenzolomonaco.com
University of Pisa

THANKS
CREDITS: This presentation template was created by Slidesgo,
including icons by Flaticon, and infographics & images by Freepik
