Skip to content

v2.0.0

Compare
Choose a tag to compare
@ffevotte ffevotte released this 18 Jun 22:58
· 850 commits to master since this release

This version is based on Valgrind-3.13.0.

Added

  • Generation of Valgrind errors for NaN values. This can be useful to debug
    programs in conjunction with vgdb. (fixes gh-4)

  • Instrumentation of all FP binary instructions, as obtained by any combination of:

    • an operation: ADD / SUB / MUL / DIV
    • a vector variant: LLO / SSE / AVX2
    • a precision: single / double
  • Instrumentation of cast instructions (double -> float).

  • Preparation for the common interflop backend interface.

  • (Experimental) Parallelism in verrou_dd. The number of concurrent threads
    can be set using the VERROU_DD_NUM_THREADS environnement variable.

  • (Experimental) New DDmin algorithm for verrou_dd. Instead of computing a
    maximal set of stable symbols/lines using the DDmax algorithm (and outputting
    the complement), the DDmin algorithm computes the union of minimal sets of
    unstable symbols/lines. It can be activated by setting VERROU_DD_ALGO=rddmin
    in the environment.

Changed

  • C++ source files are now compiled using the C++11 standard.

  • Fix the LLO instrumentation bug. This solves problems which sometimes happened
    when LLO and real vector instructions were mixed. Known examples of such bugs
    situations include openblas or binaries compiled with the Intel compiler.

    The new --vr-unsafe-llo-optim allows keeping the old fast and buggy LLO
    instrumentation.

  • Suppression of useless warnings.

  • Fix bugs of next_after and next_prev.

  • More robust rounding mode (upward, downward, toward_zero) with subnormals.

  • Improvement of tests and unit test (with UCB references).

  • Usability improvements of verrou_dd. There is no need for absolute paths to
    argument scripts anymore. Also, error messages are now more readable. (related
    to gh-7)

  • Increase the max. size of symbol names in exclude files. New limit is set to
    4096 characters. (fixes gh-6)