We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DALLE2-pytorch/dalle2_pytorch/dalle2_pytorch.py
Line 869 in 00e07b7
It seems we need to scale up Q and K when using cosine sim. But what is the reason for scaling Q before applying rotary emb?
The text was updated successfully, but these errors were encountered:
No branches or pull requests
DALLE2-pytorch/dalle2_pytorch/dalle2_pytorch.py
Line 869 in 00e07b7
It seems we need to scale up Q and K when using cosine sim. But what is the reason for scaling Q before applying rotary emb?
The text was updated successfully, but these errors were encountered: