# ptx-utils
Here are 2 public repositories matching this topic...
A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.
hpc profiler gpu opencl cuda nvidia gpu-acceleration gpu-computing sycl nvidia-cuda nvidia-gpu ptx gpu-programming roofline-model ptx-utils
Updated Dec 31, 2023 - C++