Add sparse matrix to mp #860

Xewar313 · 2024-11-05T10:57:04Z

No description provided.

…d second format

…to benchmark

…uted_vector rma memory access

…ger matrices

lslusarczyk

first few comments, next are comming

lslusarczyk · 2025-01-10T09:21:54Z

benchmarks/gbench/mp/gemv.cpp

+
+namespace mp = dr::mp;
+
+#ifdef STANDALONE_BENCHMARK


I don't see in any of CMakeLists.txt building gemv.cpp with STANDALONE_BENCHMARK option. Code under this ifdef is never compiled. Please remove if you don't need it or add a target to cmakelists to make it being compiled and take care it is compiled during CI (to catch compile time errors in CI)

Fair point, I removed it, since it is not strictly necessary for benchmarking

lslusarczyk · 2025-01-10T19:20:21Z

benchmarks/gbench/mp/gemv.cpp

+}
+} // namespace
+static auto getMatrix() {
+  std::size_t n = std::max(1., std::sqrt(default_vector_size / 100000)) * 50000;


default_vector_size is to be more-less equal to size of data being allocated. However when I run all benchamrks on dnp02 host the only tests which fail with bad_alloc are Gemv_DR.

Please add explanation to the code what are 100000 and 50000 constants for.

Please make the benchamrk allocating more less the same ammount of data like other benchmars so command:\

ONEAPI_DEVICE_SELECTOR='level_zero:gpu' I_MPI_OFFLOAD=1 \ I_MPI_OFFLOAD_CELL_LIST=0-11 I_MPI_HYDRA_BOOTSTRAP=ssh \ mpiexec -n 2 -ppn 2 build/benchmarks/gbench/mp/mp-bench --vector-size 1000000000 \ --reps 50 --v=3 --benchmark_filter=GemvEq_DR/ --sycl

will not fail on kdse-pre-dnp-02

I corrected sizes of the matrix, so that data size is actually similar (not exactly the same, but very close).

The line with 100000 and 50000 was commented out, and added comment that that matrix size is suitable for weak testing with dr-bench.

The command should also be working on kdse-pre-dnp-02

What's important, it may sometimes freeze - it is due to gtest deciding on some hosts to stop iterating state, and not on others. I am not sure why it decides that, but in that case the test needs to be re-run

Xewar313 added 30 commits August 14, 2024 12:07

Add initial implementation of sparse matrix in mp

00f1e39

Fixed row shape calculation

36d7f38

Extract matrix format from matrix implementation

dd57bb1

Add initial gemv implementation

b84ecc0

Move matrix related files from sp to general module

1c1dad7

Separated matrix format from mp sparse matrix implementation and adde…

2456379

…d second format

Improve matrix loading performance

bd63c2c

Add sycl support to mp sparse matrixes

b75b7ed

Added initialization from one node in mp sparse matrix

756fab1

Add concept requirement for gemv operation

0b498ad

Initial improvement to matrix reading

bbf2acf

Add small improvements to matrix loading

9eca244

Fix formatting

7b55a1b

Add sparse benchmark and broadcasted vector

bb2e02e

Add benchmarking tools

18165da

Add gemv benchmark to gbench

94f818e

Add reference gemv implementation

982a0e0

Fixed gemv reference

47a8455

Fixed gemv benchmark implementation

a97a97b

Fix band csr generation

231a09a

Add support for slim matrix multiplication

6b8af49

Fix benchmark and band csr generation

628aa07

Merge branch 'benchmark' of github.com:Xewar313/distributed-ranges in…

2047ecf

…to benchmark

Add support to device based computing in distributed sparse matrix

4f12327

add broadcasted slim matrix device memory support

71bd336

Fix issue with inconsistent timing when using mp gemv

6f96929

Some fixes to sparse matrixes

08a2247

improve work division in csr eq distribution

6a4bd30

Add better work distribution to csr_row_distiribution and fix distrib…

f93961b

…uted_vector rma memory access

improve performance on less dense matrices and allow broadcasting big…

e421523

…ger matrices

Xewar313 added 8 commits November 12, 2024 06:49

Fix compilation on borealis

05f5c63

fix compilation

06a6628

Fix issues with very small and very big matrices

aa706f7

Merge branch 'main' into benchmark

6e0e9d2

Fix compilation on older OneDpl

8f1a2b7

Fix style

28e023e

Merge onedpl fix

e4ee8c7

Some fixes with verions

55185dc

Xewar313 marked this pull request as ready for review November 19, 2024 13:06

Xewar313 added 18 commits November 20, 2024 15:26

Add local to csr_eq_segment

b7704ea

Add proper local method

4acbad6

Add problem to review

1f84ba7

Moved local view to distribution

ba20ee3

Add new example of not working code

bad5606

Fix issue with lambda copy

8e7f1fe

Make local work with shared memory

3503271

Fix device memory when using local in row distribution

44a6e78

Fix local in eq distribution

2bf503e

Fix formatting

dc89bc8

Reverse change in dr::transform_view

7e7f2d2

Fix benchmark when default vector size is small

dd1d6ed

Fix issue when distributed vector is too small

e42cfa2

Improve performance of eq distribution gather

2318a46

Remove unneccessary comment

4cfb110

Add test for reduce and fix type error in sparse matrix local

04191d7

Add broadcast_vector tests

adad4f7

Fix formatting

f17243b

lslusarczyk reviewed Jan 10, 2025

View reviewed changes

Xewar313 added 2 commits January 13, 2025 11:26

Corrected gemv matrix creation

f1639b0

Fix formatting

3dfdac0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add sparse matrix to mp #860

Add sparse matrix to mp #860

Xewar313 commented Nov 5, 2024

lslusarczyk left a comment

lslusarczyk Jan 10, 2025

Xewar313 Jan 13, 2025

lslusarczyk Jan 10, 2025

Xewar313 Jan 13, 2025

Xewar313 Jan 13, 2025

Add sparse matrix to mp #860

Are you sure you want to change the base?

Add sparse matrix to mp #860

Conversation

Xewar313 commented Nov 5, 2024

lslusarczyk left a comment

Choose a reason for hiding this comment

lslusarczyk Jan 10, 2025

Choose a reason for hiding this comment

Xewar313 Jan 13, 2025

Choose a reason for hiding this comment

lslusarczyk Jan 10, 2025

Choose a reason for hiding this comment

Xewar313 Jan 13, 2025

Choose a reason for hiding this comment

Xewar313 Jan 13, 2025

Choose a reason for hiding this comment