Issues: fla-org/flash-linear-attention
[RFC] Support for more finetuning Transformers to RNNs methods (e.g., LOLCATS)
Labels: enhancement (New feature or request)
#127 · opened Jan 19, 2025 by sustcsonglin
[Bug] GatedLinearAttention got NaN
Labels: bug (Something isn't working)
#122 · opened Jan 17, 2025 by 980202006
[Feature Request] Weight Conversion
Labels: enhancement (New feature or request)
#120 · opened Jan 14, 2025 by Triang-jyed-driung
[RFC] Autotune should consider batch size and number of heads
Labels: enhancement (New feature or request), urgent
[Bug]: Grad_norm & Loss are NAN when training Gated_Deltanet on fineweb-edu-10BT
Labels: bug (Something isn't working), help wanted (Extra attention is needed)
#111 · opened Jan 6, 2025 by Chris-city
[Bug]: Bunches of Issues in Mamba and Mamba2
Labels: bug (Something isn't working)
#90 · opened Dec 9, 2024 by WorldEditors
[Bug]: GSA and RWKV6 Occasionally Report Gradient=NAN when Backward
Labels: bug (Something isn't working)
#77 · opened Nov 7, 2024 by WorldEditors