This repo contains the source code for the experiments in our paper
Spectral Adapter: Fine-Tuning in Spectral Space
Fangzhao Zhang, Mert Pilanci
Paper: https://arxiv.org/abs/2405.13952
Recent developments in Parameter-Efficient Fine-Tuning (PEFT) methods for pretrained deep neural networks have captured widespread interest. In this work, we study the enhancement of current PEFT methods by incorporating the spectral information of pretrained weight matrices into the fine-tuning procedure. We investigate two spectral adaptation mechanisms, namely additive tuning and orthogonal rotation of the top singular vectors, both of which are carried out by first performing Singular Value Decomposition (SVD) of the pretrained weights and then fine-tuning the top spectral space.
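As a quick illustration of the additive mechanism, here is a minimal PyTorch sketch (not the repo's implementation; the class name `SpectralAdditiveLinear`, the rank `r`, and the choice to perturb both left and right singular vectors are illustrative assumptions):

```python
import torch
import torch.nn as nn


class SpectralAdditiveLinear(nn.Module):
    """Illustrative sketch: freeze the SVD of a pretrained weight and learn
    additive updates to its top-r singular vectors only."""

    def __init__(self, pretrained_weight: torch.Tensor, r: int = 8):
        super().__init__()
        U, S, Vh = torch.linalg.svd(pretrained_weight, full_matrices=False)
        # Frozen spectral factors of the pretrained weight.
        self.register_buffer("U", U)
        self.register_buffer("S", S)
        self.register_buffer("Vh", Vh)
        self.r = r
        # Trainable additive corrections to the top-r left/right singular vectors.
        self.delta_u = nn.Parameter(torch.zeros(U.shape[0], r))
        self.delta_v = nn.Parameter(torch.zeros(r, Vh.shape[1]))

    def effective_weight(self) -> torch.Tensor:
        # Perturb only the top-r singular directions; the remaining spectrum stays frozen.
        U_new = torch.cat([self.U[:, : self.r] + self.delta_u, self.U[:, self.r :]], dim=1)
        Vh_new = torch.cat([self.Vh[: self.r, :] + self.delta_v, self.Vh[self.r :, :]], dim=0)
        return U_new @ torch.diag(self.S) @ Vh_new

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return nn.functional.linear(x, self.effective_weight())


# Toy usage: a 64x32 "pretrained" weight, rank-4 spectral update.
layer = SpectralAdditiveLinear(torch.randn(64, 32), r=4)
out = layer(torch.randn(8, 32))  # shape (8, 64)
```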
The following plots show the superiority of our spectral adapter when applied to fine-tuning the Llama3-8B model.
In this project, we investigate the performance of fine-tuning the spectral space of pretrained model weights. The following figure illustrates our method. Specifically, we consider both additive fine-tuning and (orthogonal) rotational fine-tuning of the top singular vectors. For the orthogonal rotation, we employ the Cayley parameterization; a minimal sketch of this mechanism is given below.
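The Cayley parameterization maps a trainable skew-symmetric matrix A to an orthogonal matrix R = (I + A)^{-1}(I - A). The sketch below shows one way this could be used to rotate the top-r singular vectors; it is illustrative only (the class name `SpectralRotationLinear`, the rank `r`, and the assumption that the rotation acts within the span of the top-r left and right singular vectors are not taken from the repo):

```python
import torch
import torch.nn as nn


class SpectralRotationLinear(nn.Module):
    """Illustrative sketch: orthogonally rotate the top-r singular vectors of a
    frozen pretrained weight via the Cayley parameterization."""

    def __init__(self, pretrained_weight: torch.Tensor, r: int = 8):
        super().__init__()
        U, S, Vh = torch.linalg.svd(pretrained_weight, full_matrices=False)
        self.register_buffer("U", U)
        self.register_buffer("S", S)
        self.register_buffer("Vh", Vh)
        self.r = r
        # Unconstrained parameters; A = M - M^T is skew-symmetric by construction.
        self.M_u = nn.Parameter(torch.zeros(r, r))
        self.M_v = nn.Parameter(torch.zeros(r, r))

    @staticmethod
    def cayley(M: torch.Tensor) -> torch.Tensor:
        A = M - M.T  # skew-symmetric
        I = torch.eye(M.shape[0], device=M.device, dtype=M.dtype)
        # (I + A)^{-1} (I - A) is orthogonal for skew-symmetric A.
        return torch.linalg.solve(I + A, I - A)

    def effective_weight(self) -> torch.Tensor:
        R_u, R_v = self.cayley(self.M_u), self.cayley(self.M_v)
        U_top = self.U[:, : self.r] @ R_u        # rotate top-r left singular vectors
        V_top = R_v @ self.Vh[: self.r, :]       # rotate top-r right singular vectors
        U_new = torch.cat([U_top, self.U[:, self.r :]], dim=1)
        Vh_new = torch.cat([V_top, self.Vh[self.r :, :]], dim=0)
        return U_new @ torch.diag(self.S) @ Vh_new

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return nn.functional.linear(x, self.effective_weight())
```

Because the Cayley map always produces an orthogonal matrix, the rotated singular vectors remain orthonormal throughout training without any explicit constraint or projection step.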
In this project, we experiment with both language model fine-tuning and diffusion model fine-tuning.
- Llama3-8B Fine-Tuning (see llama3_tune/ for experiment code.)
- DeBERTaV3-base Fine-Tuning (see deberta_tune/ for experiment code.)
- Mistral 7B Fine-Tuning (see mistral_tune/ for experiment code.)
- Adapter Fusion (see adapter_fusion/ for experiment code.)
- Adapter Efficiency (see adapter_efficiency/ for experiment code.)
Our diffusion model experiment code is mainly built on the Mix-of-Show repository.
Please contact us or post an issue if you have any questions.
- Fangzhao Zhang ([email protected])
@misc{zhang2024spectraladapterfinetuningspectral,
title={Spectral Adapter: Fine-Tuning in Spectral Space},
author={Fangzhao Zhang and Mert Pilanci},
year={2024},
eprint={2405.13952},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2405.13952},
}