Skip to content

v2.6.3-cktile

Latest
Compare
Choose a tag to compare
@github-actions github-actions released this 17 Sep 18:52
e2182cc

We send the PR to upstream in this PR

  1. Update the ROCm backend (CK), so I modify how to call ck due to changing of CK api.
  2. Improve backward performance by updating the CK (1)
  3. Implement mha_fwd_kvcache().
  4. Change of compile flag to support ROCm6.2
  5. Change bf16 rounding to RTN (round to nearest)