v2.6.2
This is a patch release containing the following changes to v2.6.1:
- Removed unused variables (2500b0f, b4e0032)
- Fixed correctness issue in fp32 convolution implementation for cases with large spatial size (207af06)
- Fixed correctness issue in bfloat16 matmul implementation for processors with Intel AMX support (404b762)
- Fixed correctness issue in int8 reorder implementation with zero points (b340cba)
- Improved int8 matmul and inner product primitives performance with small matrices for processors with Intel AMX support (73b7572, 58b386a)
- Improved int8 convolution performance for processors with Intel DL Boost support (f35a62f)
- Aligned AUGRU formula with Tensorflow definition (e47c6c5, 4ba0a57, b311e24)
- Suppressed 'unvectorized loop' warning for Intel C/C++ Compiler (3932d04)