Skip to content

LucyDYu/Awesome-Multimodal-Continual-Learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 

Repository files navigation

Awesome-Multimodal-Continual-Learning

Our MMCL Survey

Recent Advances of Multimodal Continual Learning: A Comprehensive Survey

The first comprehensive survey for Multimodal Continual Learning (MMCL) Methods. [PDF] [机器之心]

Methodology

Regularization-based

Paper Method Venue Code
Continual Instruction Tuning for Large Multimodal Models TIR arXiv 2023 -
Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models ZSCL ICCV 2023 Github
Continual Vision-Language Representation Learning with Off-Diagonal Information Mod-X ICML 2023 -
Multi-Domain Lifelong Visual Question Answering via Self-Critical Distillation SCD ACM Multimedia 2023 -
Revisiting Distillation for Continual Learning on Visual Question Localized-Answering in Robotic Surgery CS-VQLA MICCAI 2023 Github
CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation CTP ICCV 2023 Github
Continual Multimodal Knowledge Graph Construction MSPT IJCAI 2024 Github

Architecture-based

Paper Method Venue Code
RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning RATT NeurIPS 2020 Github
Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters MoE-Adapters4CL CVPR 2024 Github
CLAP4CLIP: Continual learning with probabilistic finetuning for vision-language models CLAP NeurIPS 2024 Github
Hierarchical Visual-Textual Knowledge Distillation for Life-Long Correlation Learning VLKD Int. J. Comput. Vis. 2021 -
Continual Instruction Tuning for Large Multimodal Models EProj arXiv 2023 -
Real-world Cross-modal Retrieval via Sequential Learning SCML IEEE Trans. Multim. 2021 -
Multimodal Continual Graph Learning with Neural Architecture Search MSCGL WWW 2022 -
Multimodal Continual Learning Using Online Dictionary Updating ODU IEEE Trans. Cogn. Dev. Syst. 2021 -
Confusion Mixup Regularized Multimodal Fusion Network for Continual Egocentric Activity Recognition CMR-MFN ICCV (Workshops) 2023 Github

Replay-based

Paper Method Venue Code
Task-Attentive Transformer Architecture for Continual Learning of Vision-and-Language Tasks Using Knowledge Distillation TAM-CL EMNLP (Findings) 2023 Github
VQACL: A Novel Visual Question Answering Continual Learning Setting VQACL CVPR 2023 Github
Knowledge Decomposition and Replay: A Novel Cross-modal Image-Text Retrieval Continual Learning Method KDR ACM Multimedia 2023 -
Generative Negative Text Replay for Continual Vision- Language Pretraining IncCLIP ECCV 2022 -
Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task SGP AAAI 2023 Github
Beyond Unimodal Learning: The Importance of Integrating Multiple Modalities for Lifelong Learning SAMM CoLLAs 2024 Github
Vision-Sensor Attention Based Continual Multimodal Egocentric Activity Recognition AID ICASSP 2024 -
Continual Egocentric Activity Recognition With Foreseeable-Generalized Visual–IMU Representations FGVIRs IEEE Sensors Journal 2024 -
Class-Incremental Learning with CLIP: Adaptive Representation Adjustment and Parameter Fusion RAPF ECCV 2024 Github
Generative Multi-modal Models are Good Class Incremental Learners GMM CVPR 2024 Github

Prompt-based

Paper Method Venue Code
Multimodal Parameter-Efficient Few-Shot Class Incremental Learning CPE-CLIP ICCV (Workshops) 2023 Github
Decouple Before Interact: Multi-Modal Prompt Learning for Continual Visual Question Answering TRIPLET ICCV 2023 -
Beyond Anti-Forgetting: Multimodal Continual Instruction Tuning with Positive Forward Transfer Fwd-Prompt arXiv 2024 -
S-Prompts Learning with Pre-trained Transformers: An Occam's Razor for Domain Incremental Learning S-liPrompts NeurIPS 2022 Github

Benchmarks

Paper Name Venue Code
CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks CLiMB NeurIPS 2022 Github
Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task CLOVE AAAI 2023 Github
Continual Multimodal Knowledge Graph Construction IMNER, IMRE IJCAI 2024 Github
Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models MTIL ICCV 2023 Github
CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation VLCP ICCV 2023 Github
Beyond Unimodal Learning: The Importance of Integrating Multiple Modalities for Lifelong Learning MMCL CoLLAs 2024 Github
Towards Continual Egocentric Activity Recognition: A Multi-Modal Egocentric Activity Dataset for Continual Learning CEAR IEEE Trans. Multim. 2024 Github

Other CL Surveys

Paper Venue
A Comprehensive Survey of Continual Learning: Theory, Method and Application IEEE TPAMI 2024
A Continual Learning Survey: Defying Forgetting in Classification Tasks IEEE TPAMI 2022
Class-Incremental Learning: A Survey IEEE TPAMI 2024
Recent Advances of Continual Learning in Computer Vision: An Overview arXiv 2024
Class-Incremental Learning: Survey and Performance Evaluation on Image Classification IEEE TPAMI 2023
Continual Learning of Natural Language Processing Tasks: A Survey arXiv 2023
Continual Learning for Large Language Models: A Survey arXiv 2024
Towards Lifelong Learning of Large Language Models: A Survey arXiv 2024
Continual Learning on Graphs: Challenges, Solutions, and Opportunities arXiv 2024
Continual Learning with Pre-Trained Models: A Survey arXiv 2024
Recent Advances of Foundation Language Models-based Continual Learning: A Survey arXiv 2024

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published