
feat(attention): add Bi-Directional MLM attention model #721

Draft · wants to merge 1 commit into main
Conversation

TamirFriedman-RecoLabs

I want to implement this kind of mask in xformers, to support bidirectional masked-language-model style attention:

from typing import Tuple

import torch
from xformers.ops import fmha


class BlockDiagoNULLMask(fmha.attn_bias.BlockDiagonalMask):
    """
    Modification of `BlockDiagonalMask` where each block is fully connected
    internally except for the diagonal elements: every token is masked from
    attending to itself.
    """

    def _create_block_mask(
        self,
        shape: Tuple[int, ...],
        dtype: torch.dtype,
        device: str | torch.device,
    ) -> torch.Tensor:
        # Per-block bias: 0 everywhere and -inf on the diagonal, so each
        # token cannot attend to itself within its block.
        return torch.zeros(shape, dtype=dtype, device=device).fill_diagonal_(-torch.inf)
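For context, here is a minimal usage sketch (not part of the PR) of how such a mask could be built and applied. It assumes the subclass inherits `from_seqlens` and `materialize` unchanged from `BlockDiagonalMask`; the sequence lengths and tensor shapes are illustrative, and PyTorch's `scaled_dot_product_attention` is used instead of the fused xformers kernels, since those dispatch on the bias type and may not recognize a custom subclass.

import torch
import torch.nn.functional as F

# Hypothetical usage of the subclass above; sequence lengths are illustrative.
seqlens = [3, 5, 4]
mask = BlockDiagoNULLMask.from_seqlens(seqlens)  # constructor inherited from BlockDiagonalMask

total = sum(seqlens)
q = torch.randn(1, 4, total, 64)  # (batch, heads, tokens, head_dim)
k = torch.randn_like(q)
v = torch.randn_like(q)

# Materialize the dense (total, total) bias: 0 within each block, -inf across
# blocks and on the diagonal, then add it to the attention scores.
bias = mask.materialize((total, total), dtype=q.dtype, device=q.device)
out = F.scaled_dot_product_attention(q, k, v, attn_mask=bias)

Materializing the bias sidesteps the kernel-dispatch question but gives up the memory savings of the block-diagonal fast path, which is presumably why native support in xformers is being proposed here.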

@yiakwy-xpu-ml-framework-team

Hi @TamirFriedman-RecoLabs, are you working on an encoder stack? For example, a generative model for video, music, and so on.

Are you still working on this branch? Happy to hear from you soon!
