Add NaN propagation and array initialization notes to ?GEMM docs #1098

TiborGY · 2025-01-19T14:02:37Z

Description
This PR fixes and closes #1077 and:

Adds this description of NaN/Inf propagation quirks to the ?GEMM docs:

Note: if alpha and/or beta is zero, some parts of the matrix-matrix operations are not performed.
This results in the following NaN/Inf propagation quirks:

 1. If alpha is zero, NaNs or Infs in A or B do not affect the result.
 2. If both alpha and beta are zero, then a zero matrix is returned in C, irrespective of any NaNs or Infs 
 in A, B or C.
 3. If only beta is zero, alpha*op( A )*op( B ) is returned, irrespective of any NaNs or Infs in C.

Adds notes like this to S/D GEMM about why they support complex conjugation:

Note: TRANSA = 'C' is supported for the sake of API consistency between all ?GEMM variants.

Adds this to the descriptions of ALPHA arguments:

If ALPHA is zero the values in A and B do not affect the result.
This also means that NaN/Inf propagation from A and B is inhibited if ALPHA is zero.

Changes descriptions of A arguments from array A must contain the matrix A. to:

array A must contain the matrix A, except if ALPHA is zero.
If ALPHA is zero, none of the values in A affect the result, even if they are NaN/Inf.
This also implies that if ALPHA is zero, the matrix elements of A need not be initialized by the caller.

Changes descriptions of B arguments from array B must contain the matrix B. to:

array B must contain the matrix B, except if ALPHA is zero.
If ALPHA is zero, none of the values in B affect the result, even if they are NaN/Inf.
This also implies that if ALPHA is zero, the matrix elements of B need not be initialized by the caller.

Changes descriptions of BETA arguments from
When BETA is supplied as zero then C need not be set on input.
to:

If BETA is zero the values in C do not affect the result. This also means that NaN/Inf propagation
from C is inhibited if BETA is zero.

Changes descriptions of C arguments from
array C must contain the matrix C, except when beta is zero, in which case C need not be set on entry.
to:

array  C must contain the matrix  C, except if beta is zero. If beta is zero, none of the values in C 
affect the result, even if they are NaN/Inf. This also implies that if beta is zero, the matrix elements of 
C need not be initialized by the caller.

This documents/clarifies existing behaviour without adding anything to the docs that might hypothetically clash with the behaviour of some optimized BLAS implementation. (eg. I have decided against adding no-touch-guarantees to A, B and C, so a conforming implementation might still read data from them, as long as it does not affect the output)

Checklist

The documentation has been updated.
If the PR solves a specific issue, it is set to be closed on merge.

ilayn · 2025-01-19T14:22:17Z

May I suggest a less if, else language such as

If alpha is set to zero, A and B are not referenced to avoid unnecessary math. A potential downside of this is that if the arrays contain NaN/Inf values they are not transferred to C.

This is more to the point and explains the mechanism instead of enumerating rules. I don't claim a command on English but this reads easier to me.

TiborGY · 2025-01-19T14:50:24Z

May I suggest a less if, else language such as

If alpha is set to zero, A and B are not referenced to avoid unnecessary math. A potential downside of this is that if the arrays contain NaN/Inf values they are not transferred to C.

This is more to the point and explains the mechanism instead of enumerating rules. I don't claim a command on English but this reads easier to me.

Thanks for the suggestion, I am of course open to refining the wording based on feedback.

Your version changes the meaning slightly, because "A and B are not referenced" would imply a no-touch-guarantee (no reads, no writes to any array element), which is something I do not want to include in the BLAS specification. Such a guarantee would permit A and B to not only contain arbitrary values, but also to not even be allocated.

Hypothetically, that could be untrue for some other BLAS implementations, eg. hardware vendor optimized BLAS libraries. I guess this could also happen with the NaN propagation quirks I am adding, since the spec has been silent on this until now, but I would rather not make this change any more invasive than it needs to be.

TiborGY · 2025-01-19T14:59:36Z

On a third reading, some of the conditions in the quirks list were redundant, so I have simplified that.

ilayn · 2025-01-19T15:47:34Z

Your version changes the meaning slightly, because "A and B are not referenced" would imply a no-touch-guarantee (no reads, no writes to any array element), which is something I do not want to include in the BLAS specification.

That's a standard LAPACK/BLAS wording that can be found in many routine specifications. See for example vs parameter of ?gees to give an example that came to my mind first. The code does indeed not reference the arrays in case alpha is zero as you can see in the code.

TiborGY · 2025-01-19T16:23:07Z

Your version changes the meaning slightly, because "A and B are not referenced" would imply a no-touch-guarantee (no reads, no writes to any array element), which is something I do not want to include in the BLAS specification.

That's a standard LAPACK/BLAS wording that can be found in many routine specifications. See for example vs parameter of ?gees to give an example that came to my mind first. The code does indeed not reference the arrays in case alpha is zero as you can see in the code.

I see. The reference BLAS implementation does of course satisfy the stronger guarantee of not referencing the arrays, and I would actually be happy to demand it in the spec, but since this is not coordinated with other BLAS implementations, I minimized the amount that implementation freedom is restricted by the spec change.

add NaN propagation and array initialization notes to gemm docs

7ad40db

simplify nan propagation quirks in the purpose section

5f79137

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add NaN propagation and array initialization notes to ?GEMM docs #1098

Add NaN propagation and array initialization notes to ?GEMM docs #1098

TiborGY commented Jan 19, 2025 •

edited

Loading

ilayn commented Jan 19, 2025

TiborGY commented Jan 19, 2025

TiborGY commented Jan 19, 2025

ilayn commented Jan 19, 2025

TiborGY commented Jan 19, 2025

Add NaN propagation and array initialization notes to ?GEMM docs #1098

Are you sure you want to change the base?

Add NaN propagation and array initialization notes to ?GEMM docs #1098

Conversation

TiborGY commented Jan 19, 2025 • edited Loading

ilayn commented Jan 19, 2025

TiborGY commented Jan 19, 2025

TiborGY commented Jan 19, 2025

ilayn commented Jan 19, 2025

TiborGY commented Jan 19, 2025

TiborGY commented Jan 19, 2025 •

edited

Loading