v0.4.0
This release contains two new kernels, plus substantial changes to the build process and existing kernels.
Please download the RAJAPerf-0.4.0.tar.gz file below. The others will not work due to the way RAJAPerf uses git submodules.
Major changes include:
- Two new kernels: DAXPY and ADI.
- Update to a newer RAJA development version (SHA hash a59e7c4a...) to exercise newer RAJA features.
- All kernels with nested loops have been converted to latest RAJA::kernel API, including OpenMP target.
- Some kernels that use RAJA Views now explicitly specify which indexing dimension is stride-1 to take advantage of new internal RAJA optimizations.
- When building with OpenMP target enabled all other kernel variants are disabled, except for Base_Seq. This is a (hopefully) temporary change to avoid mis-interpretation of kernel timings due to issues with some compilers that require disabling inlining to generate correct results. The executable will have "omptarget" in its name.
- A variety of newer build scripts have been added for Livermore Computing platforms.