-
Notifications
You must be signed in to change notification settings - Fork 717
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[QUESTION] Recommendations to run large number of tokens on A100 40GB #259
Comments
Hello,
The format is For instance:
IIRC the impact on memory requirements is small (but enough to be useful). A tiny |
Thanks for the insights and clarifications! Currently, only A100s 40GB are available, but new systems with H100 80GB will be available this year. Thanks for the help! |
Actually, one more question, which also asked in #236 Would allocating more CPU memory and adjusting |
Based on description from @Augustin-Zidek in google-deepmind#259
Sorry, I am not sure, but worth trying. |
Hello,
disclaimer: I'm no user of AF3, but support users that are.
What would be the recommendations in order to run a large number of tokens, say >6000 or > 7000 tokens on A100 40 GB ?
A current HPC job has been running for a week now, with unified memory.
What is the implications of the different values of
pair_transition_shard_spec
?After reading the section performance I am uncertain to understand the values in
pair_transition_shard_spec
and thus their implication in performance for an A100 40GB.For instance, what represent the tuple
(2048, None),
and what doesNone
imply ?Also interested in seeing an answer to #236
Thanks,
The text was updated successfully, but these errors were encountered: