Skip to content

Commit

Permalink
[CITATION]
Browse files Browse the repository at this point in the history
  • Loading branch information
Kye committed Mar 4, 2024
1 parent 2d60f7b commit ad478a9
Show file tree
Hide file tree
Showing 2 changed files with 17 additions and 2 deletions.
16 changes: 14 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,8 +1,7 @@
[![Multi-Modality](agorabanner.png)](https://discord.gg/qUtxnK2NMf)

# Griffin
Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"

Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models". [PAPER LINK](https://huggingface.co/papers/2402.19427)


## install
Expand Down Expand Up @@ -38,3 +37,16 @@ print(y)

# License
MIT


# Citation
```
@misc{de2024griffin,
title={Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models},
author={Soham De and Samuel L. Smith and Anushan Fernando and Aleksandar Botev and George Cristian-Muraru and Albert Gu and Ruba Haroun and Leonard Berrada and Yutian Chen and Srivatsan Srinivasan and Guillaume Desjardins and Arnaud Doucet and David Budden and Yee Whye Teh and Razvan Pascanu and Nando De Freitas and Caglar Gulcehre},
year={2024},
eprint={2402.19427},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
```
3 changes: 3 additions & 0 deletions griffin_torch/main.py
Original file line number Diff line number Diff line change
Expand Up @@ -319,7 +319,10 @@ def forward(self, x: Tensor) -> Tensor:
Tensor: Output tensor.
"""
# Embed the tokens
x = self.emb(x)

# Normilize
x = self.norm(x)

# Loop
Expand Down

0 comments on commit ad478a9

Please sign in to comment.