Releases · artyom-beilis/pytorch_dlprim · GitHub

04 Sep 21:30

Release 0.2.0 Latest

What is new in 0.2.0

Bug/Issue Fixes

Fixed incorrect use of double constants in some operators
Fixed crash when loading models that were saved on OCL devices
Fixed default parameter of torch.ocl.synchronize
Fixed failure of printing on Intel devices with missing fp64 support

New nets validated

Visual transformers vit_transformets and vit_x_NN etc. validated

New operators implemented:

resize_, arange, mm, bmm, amin, amax, addmm, _native_multi_head_attention and transform_bias_rescale_qkv, round, maximum, minimum, prod, atan, dropout_native
lt,le,gt,ge,eq,ne for tensors
bitwise ^, |, &, ~
upsample_2d : bilinear, nearest and nearest exact, forward and backward

Fixed operators

Fixed softmax and log softmax support of dim that is not last dim
Fixed view operator and set_ storage
cat now supports mixed types
Fixed handling of empty tensors with non-empty storage
Very limited half tensor handling
Fixed tensor >, <, ==, != scalar ops
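
To illustrate what "a dim that is not the last dim" means for the softmax fix above, here is a plain-Python reference sketch. It is not the backend's implementation and does not use torch; the helper names `softmax_1d` and `softmax_2d` are made up for this example. It relies only on the fact that softmax along dim 0 of a 2D tensor equals softmax along the last dim of its transpose.

```python
import math


def softmax_1d(values):
    """Numerically stable softmax over a flat list of floats."""
    m = max(values)  # subtract the max so exp() cannot overflow
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_2d(matrix, dim):
    """Softmax over a list-of-rows matrix along dim 0 (columns) or dim 1/-1 (rows)."""
    if dim in (1, -1):
        # Last dim: normalize each row independently.
        return [softmax_1d(row) for row in matrix]
    if dim == 0:
        # Non-last dim: transpose, normalize rows, transpose back.
        columns = [list(col) for col in zip(*matrix)]
        normalized = [softmax_1d(col) for col in columns]
        return [list(row) for row in zip(*normalized)]
    raise ValueError("dim must be 0, 1 or -1 for a 2D input")


x = [[1.0, 2.0, 3.0],
     [1.0, 0.0, -1.0]]
over_columns = softmax_2d(x, dim=0)  # each column sums to 1
over_rows = softmax_2d(x, dim=-1)    # each row sums to 1
```

A reference like this could be compared against the device result for small inputs when checking an operator along different dims.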

New features:

Added support of profiling via torch.ocl.profile API
Improved benchmark scripts

Performance improvements

Intel Arc, UHD - enabled Winograd convolution, support of OpenCL 3.0 floating point add atomics, enabled k-reduction for GEMM operators
NVIDIA - added use of native atomic float add (via PTX assembly)
GELU - major performance improvement from fixing faulty use of double instead of float

Assets 15

pytorch_ocl-0.2.0+torch2.4-cp310-none-linux_x86_64.whl, 657 KB, 2024-09-04T21:11:02Z
pytorch_ocl-0.2.0+torch2.4-cp311-none-linux_x86_64.whl, 658 KB, 2024-09-04T21:10:58Z
pytorch_ocl-0.2.0+torch2.4-cp311-none-win_amd64.whl, 1.12 MB, 2024-09-04T21:11:13Z
pytorch_ocl-0.2.0+torch2.4-cp312-none-linux_x86_64.whl, 658 KB, 2024-09-04T21:10:54Z
pytorch_ocl-0.2.0+torch2.4-cp312-none-win_amd64.whl, 1.12 MB, 2024-09-04T21:11:19Z
pytorch_ocl-0.2.0+torch2.4-cp38-none-linux_x86_64.whl, 657 KB, 2024-09-04T21:11:09Z
pytorch_ocl-0.2.0+torch2.4-cp39-none-linux_x86_64.whl, 658 KB, 2024-09-04T21:11:06Z
pytorch_ocl-0.2.0+torch2.5-cp310-none-linux_x86_64.whl, 658 KB, 2024-10-24T11:08:10Z
pytorch_ocl-0.2.0+torch2.5-cp311-none-linux_x86_64.whl, 659 KB, 2024-10-24T11:08:07Z
pytorch_ocl-0.2.0+torch2.5-cp311-none-win_amd64.whl, 1.13 MB, 2024-10-24T11:07:51Z
Source code (zip), 2024-09-04T20:40:39Z
Source code (tar.gz), 2024-09-04T20:40:39Z

16 Aug 19:57

Release 0.1.0

This release supports PyTorch 2.4 and introduces a better way to use the OpenCL PyTorch backend.

This time I provided binary distributions of the backend:

Linux: Python 3.8 through 3.12, torch 2.4
Windows: Python 3.11 and 3.12, torch 2.4

To install, first install the CPU version of PyTorch in a virtual environment, then download the whl file from the release. Make sure its torch version, Python version, and architecture match your environment.

For example, for Python 3.10 and torch 2.4 on Linux:

pip install pytorch_ocl-0.1.0+torch2.4-cp310-none-linux_x86_64.whl
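
Matching the wheel to the environment can be sketched in a few lines of Python. The helper below is hypothetical (it is not part of pytorch_ocl); it only reproduces the file name pattern visible in the release assets, and it assumes x86_64 Linux or 64-bit Windows. It does not check that a wheel for a given combination was actually published, so compare the result against the asset list.

```python
import platform
import sys

# Platform tags as they appear in the published asset names.
PLATFORM_TAGS = {"Linux": "linux_x86_64", "Windows": "win_amd64"}


def expected_wheel_name(backend_version, torch_version, py=None, system=None):
    """Build the wheel file name following the release asset naming pattern.

    backend_version: e.g. "0.1.0"
    torch_version:   e.g. "2.4.0+cpu" (only major.minor is used)
    py:              (major, minor) tuple, defaults to the running interpreter
    system:          "Linux" or "Windows", defaults to the current OS
    """
    py = py or sys.version_info[:2]
    system = system or platform.system()
    if system not in PLATFORM_TAGS:
        raise ValueError(f"no binary wheel pattern known for {system!r}")
    # Drop any local suffix such as "+cpu" and keep major.minor only.
    torch_mm = ".".join(torch_version.split("+")[0].split(".")[:2])
    return (
        f"pytorch_ocl-{backend_version}+torch{torch_mm}"
        f"-cp{py[0]}{py[1]}-none-{PLATFORM_TAGS[system]}.whl"
    )


name = expected_wheel_name("0.1.0", "2.4.0+cpu", py=(3, 10), system="Linux")
print(name)
```

With torch already installed, `torch.__version__` could be passed as `torch_version` so that the name is derived from the actual environment.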

To use the backend, import pytorch_ocl.
