Skip to content

v1.5.1

Compare
Choose a tag to compare
@jianfeifeng jianfeifeng released this 13 Jun 07:14
· 4 commits to master since this release
cf4ca8f

Added

  • Support Python API
  • Support AVX-VNNI and ARMv9 instruction set
  • Support Intel Desktop GPU (float16 and float32)
  • Support Windows on arm platform
  • Support more operators : Random, Sin, Cos, Einsum, Elu, UnPooling, Flatten, ConvertColor, BilateralSliceApply, Lut
  • Support more networks : ViTAE, CMT, EfficientFormer, ConvTT, Wenet, NFM, AFM, ONN, wide&deep, DeepFM, MMOE, etc
  • Improve multi-threads parallel inference performance on CPU
  • Add simple chinese deployment guide
  • Support model file compatibility
  • Support using outer memory(CPU array or OpenCL cl_mem) by using SetInputOutput API
  • Support data type and format transform by using C API

Changed

  • TensorDesc's dim array is changed to 20.
  • Remove FILE macro usage and warning log under release mode
  • change enum data and operator parameter size

Fixed

  • Fix GPU resize bug
  • Fix GPU concurrent inference bug
  • Fix ONNX converter bug
  • Add missed chinese automatic speech recognition model