v1.5.1

jianfeifeng released this 13 Jun 07:14

cf4ca8f

Added

Support Python API
Support AVX-VNNI and ARMv9 instruction sets
Support Intel Desktop GPU (float16 and float32)
Support Windows on Arm platform
Support more operators: Random, Sin, Cos, Einsum, Elu, UnPooling, Flatten, ConvertColor, BilateralSliceApply, Lut
Support more networks: ViTAE, CMT, EfficientFormer, ConvTT, Wenet, NFM, AFM, ONN, wide&deep, DeepFM, MMOE, etc.
Improve multi-threaded parallel inference performance on CPU
Add a simple Chinese deployment guide
Support model file compatibility
Support using external memory (CPU array or OpenCL cl_mem) via the SetInputOutput API
Support data type and format conversion via the C API

Changed

Change the length of TensorDesc's dim array to 20
Remove __FILE__ macro usage and warning logs in release mode
Change enum data and operator parameter size

Fixed

Fix GPU resize bug
Fix GPU concurrent inference bug
Fix ONNX converter bug
Add the missing Chinese automatic speech recognition model
