You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@karpathy - thank you for the great software. I wrote up a visual walk-through of how it all works in detail. I think I got it all right and am currently using the software along with the llama2.cu fork by @rogerallen and the 4-bit gpu fork by @ankan-ban. On a Windows PC I am seeing 7t/s on just the CPU, 20+ with fp32 on the GPU and 140+ t/s with the 4-bit gpu. Really fun stuff! I hope this visual walk-through is helpful.
@karpathy - thank you for the great software. I wrote up a visual walk-through of how it all works in detail. I think I got it all right and am currently using the software along with the llama2.cu fork by @rogerallen and the 4-bit gpu fork by @ankan-ban. On a Windows PC I am seeing 7t/s on just the CPU, 20+ with fp32 on the GPU and 140+ t/s with the 4-bit gpu. Really fun stuff! I hope this visual walk-through is helpful.
https://www.signalpop.com/2024/02/10/understanding-llama2-c-and-chatgpt-a-visual-design-walkthrough/
Cheers!
The text was updated successfully, but these errors were encountered: