Fix `return_attention_scores` bug #1977
Conversation
```python
query=x,
value=x,
attention_mask=self_attention_mask,
return_attention_scores=True,
```
I was looking to see how `return_attention_scores` is used in MHA, and looking at this, I was wondering whether setting `return_attention_scores=True` here could cause any problems. If we set it to `True` here, the user doesn't have a way to disable it if they want to use flash attention, right?
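For context, a minimal standalone sketch of how `keras.layers.MultiHeadAttention` behaves with this flag (the shapes and layer sizes here are illustrative, not from the PR's model):

```python
import numpy as np
import keras

# A small MHA layer; num_heads/key_dim are arbitrary illustrative values.
mha = keras.layers.MultiHeadAttention(num_heads=2, key_dim=4)
x = np.random.rand(1, 8, 16).astype("float32")

# With return_attention_scores=True the layer returns a tuple
# (output, attention_scores), where attention_scores has shape
# (batch, num_heads, query_len, key_len) -> here (1, 2, 8, 8).
out, scores = mha(query=x, value=x, return_attention_scores=True)

# With the default (False) only the attention output is returned,
# which leaves the layer free to take fused/flash attention paths.
out_only = mha(query=x, value=x)
```

This tuple-vs-single return is why the flag changes how the caller has to unpack the result.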
Ah, good spot. I'll keep the `if...else`, then.
I think we can still get rid of the if-else block by directly passing the flag to MHA (i.e., `return_attention_scores=return_attention_scores`). We don't have to pass the flag only when it's `True`. WDYT?
Hmmm, but when the flag is `True`, the layer returns two elements, whereas when it's `False`, it returns just one. If we want to do it that way, it'd have to be something like this:
```python
...
attention_layer_output = self._self_attention_layer(
    query=x,
    value=x,
    attention_mask=self_attention_mask,
    return_attention_scores=return_attention_scores,
    training=training,
)
if return_attention_scores:
    x, attention_scores = attention_layer_output
else:
    x = attention_layer_output
...
```
You're right! Let's keep it the way it is then!
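The approach they keep, branching on the flag so the tuple return is only unpacked when requested, can be sketched standalone like this (a plain `MultiHeadAttention` stands in for the model's self-attention layer; all names and shapes here are illustrative):

```python
import numpy as np
import keras

# Stand-in for the model's self-attention layer.
mha = keras.layers.MultiHeadAttention(num_heads=2, key_dim=4)
x = np.random.rand(1, 8, 16).astype("float32")
return_attention_scores = True  # flag threaded through from the caller

if return_attention_scores:
    # Tuple return: unpack the scores alongside the output.
    x_out, attention_scores = mha(
        query=x, value=x, return_attention_scores=True
    )
else:
    # Single return; MHA can still use fused/flash attention here.
    x_out = mha(query=x, value=x)
```

Keeping the branch means the `False` path never asks for scores at all, which is what preserves the flash-attention option discussed above.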
@abheesht17 I was wondering if you've checked why one of the tests is failing.
I think this might be the root cause: https://github.com/keras-team/keras/pull/20448/files#r1837037902
Thanks, Abheesht, for catching and fixing this bug!
I'll merge, as the test failure is not related to this PR.
I'll take a look later today! Sorry for the delay.
All good! The test failure was unrelated to this PR and it should be fixed now (keras-team/keras#20482).
Awesome!