Skip to content

v0.4.0

Compare
Choose a tag to compare
@rhornung67 rhornung67 released this 28 Sep 17:15
a8f669c

This release contains two new kernels, plus substantial changes to the build process and existing kernels.

Please download the RAJAPerf-0.4.0.tar.gz file below. The others will not work due to the way RAJAPerf uses git submodules.

Major changes include:

  • Two new kernels: DAXPY and ADI.
  • Update to a newer RAJA development version (SHA hash a59e7c4a...) to exercise newer RAJA features.
  • All kernels with nested loops have been converted to latest RAJA::kernel API, including OpenMP target.
  • Some kernels that use RAJA Views now explicitly specify which indexing dimension is stride-1 to take advantage of new internal RAJA optimizations.
  • When building with OpenMP target enabled all other kernel variants are disabled, except for Base_Seq. This is a (hopefully) temporary change to avoid mis-interpretation of kernel timings due to issues with some compilers that require disabling inlining to generate correct results. The executable will have "omptarget" in its name.
  • A variety of newer build scripts have been added for Livermore Computing platforms.