o2bench

Raw one-dimensional C-Array storage vs 1D STL containers in C++ (std::array and std::vector), before and after compiler optimizations with g++ -O2. Additionally, cache alignment is implemented in this file, for 64 bit cache lines. This ensures the starting address is at the beginning of a cache line, and is done via alignas(64)

The Goal of this Benchmark

By examining the compiler's ability to optimize memory access, we will determine if a one-dimensional std::array or a raw C-Array is better for computational performance when defining a data structure. std::vector was added to see the performance difference of heap allocations before and after compiler optimizations are added.

Hypothesis

The compiler should be able to optimize std::array as well or better than raw C-arrays. If this is the case, then std::array may be the best choice for defining something like a matrix class when the size is constexpr (or known at compile time). This is because 1D matrix storage is faster than 2D in most cases, and std::array is safer than using a raw array, due to things like bounds checking.

1MB Array Access Results in milliseconds

More results can be viewed here

Directions

Configure

vim CMakeLists.txt

[Set Optimization level in the CMakeLists.txt file and select the test to perform here as well.

Run

mkdir build && cd build 
cmake .. -DCMAKE_CXX_COMPILER=g++ -DCMAKE_C_COMPILER=gcc 
make 
./<exe>      or    ./<exe> <results.csv>

Files and Structure

arrays: Contains the one-dimensional code where memory access is measured via initalization methods and different containers, such as:

A stack based array (C style)
std::array
std::vector

The subfolder arrays/i measures how O2 behaves with simple a[i] read/write.

The subfolder arrays/i_x measures how O2 behaves with a[i] = i*x read/write. In this folder, x is set to the appropriate value, depending on the type passed to value_t. E.g.,

using value_t = float; 
x = 3.0f;

using value_t = int; 
x = 3;

include: These are the header files used by the main programs in arrays
results_csv: Performane results on different machines before and after -O2 Optimization with different access methods (a[i] vs a[i] = i*x)

The subfolder results_csv/4core8thread is my measely laptop, whereas the subfolder results_csv/fireflynode2 is a node of an 80 logical core cluster.

src: The implementation of the header files that are not templated.
typedebuging: The programs in this folder exist as I was using them to double check myself about sematics regarding types/bytes. None of these programs are used in the performance analysis, though they could be a good place to start by compiling with g++ -Wall -std=c++17

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
arrays		arrays
include		include
src		src
typedebugging		typedebugging
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

o2bench

The Goal of this Benchmark

Hypothesis

1MB Array Access Results in milliseconds

Directions

Configure

Run

Files and Structure

About

Releases

Packages

Languages

License

tommygorham/o2bench

Folders and files

Latest commit

History

Repository files navigation

o2bench

The Goal of this Benchmark

Hypothesis

1MB Array Access Results in milliseconds

Directions

Configure

Run

Files and Structure

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages