This repository is the official implementation of "Learning to Learn without Forgetting using Attention", presented at CoLLAs 2024.
We propose meta-learning a transformer-based optimizer to enhance continual learning. This meta-learned optimizer uses attention to learn the complex relationships between model parameters across a stream of tasks, and is designed to generate effective weight updates for the current task while preventing catastrophic forgetting of previously encountered tasks.
The repository includes experiments on three continual learning benchmarks: SplitMNIST, RotatedMNIST, and SplitCIFAR-100.
To run the code, use main_splitmnist.py, main_rotatedmnist.py, or main_cifar100.py, depending on the chosen benchmark.
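For example, running `python main_splitmnist.py` should launch the SplitMNIST experiment with its default configuration (an assumption; check each script for any required command-line arguments).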
Each script trains the model and reports the average accuracy, backward transfer (BWT), and forward transfer (FWT) metrics.
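As a point of reference, these metrics are commonly computed from an accuracy matrix R, where R[i, j] is the test accuracy on task j after training up to task i. The sketch below follows the standard definitions of average accuracy, BWT, and FWT; it is an illustrative assumption rather than the repository's actual implementation, and the helper name and the random-initialization baseline `b` are hypothetical.

```python
import numpy as np

def continual_learning_metrics(R, b=None):
    """Compute standard continual-learning metrics from an accuracy matrix.

    R: (T, T) array, where R[i, j] is the test accuracy on task j
       after training on tasks 0..i.
    b: optional length-T array with the accuracy of a randomly initialized
       model on each task (needed for FWT); if None, FWT is not computed.
    """
    R = np.asarray(R, dtype=float)
    T = R.shape[0]
    # Average accuracy: mean accuracy over all tasks after the final task is learned.
    avg_acc = R[-1, :].mean()
    # Backward transfer (BWT): change in accuracy on earlier tasks after
    # learning the whole stream; negative values indicate forgetting.
    bwt = np.mean([R[-1, i] - R[i, i] for i in range(T - 1)])
    # Forward transfer (FWT): accuracy on a task just before training on it,
    # relative to the random-initialization baseline b.
    fwt = np.mean([R[i - 1, i] - b[i] for i in range(1, T)]) if b is not None else None
    return avg_acc, bwt, fwt
```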
This code has been tested with Python 3.10.12 and PyTorch 1.13.1 with CUDA 11.7.
@inproceedings{vettoruzzo2024learning,
title={Learning to Learn without Forgetting using Attention},
author={Vettoruzzo, Anna and Vanschoren, Joaquin and Bouguelia, Mohamed-Rafik and R{\"o}gnvaldsson, Thorsteinn},
booktitle={Conference on Lifelong Learning Agents (CoLLAs)},
year={2024}
}