Pea Hash: A Performant Extendible Adaptive Hashing Index

What's included

src/ is the main source files of pea hash.
util/ is some util functions.
test/ is test files.
compile.sh and run.sh shows the usage of the test_pmem.

Prerequisite

Hardware:

With Intel Optane Memory, you can reproduce all the results.
With AVX512, you can use some optimizations in Pea Hash.
Otherwise, only pea_dram can be deployed.

install cmake, g++
install gflags

sudo apt install libgflags-dev

(Optional) If you want to deploy pea_pmem, please install PMDK. Please refer to the Dependencies of https://github.com/pmem/pmdk if problems occur.

Building

CMakeLists, test/test_pmem*.cpp, compile.sh These are important test files.

Take pea_dram as example.

cd pea_dram/unique
./compile.sh

If your server has no AVX512, please delete this line in CMakeLists.txt

add_definitions(-DAVX512F)

If your server has no access to github.com, please refer to pea_pmem/unique/CMakeLists.txt.offline for help.

Getting started on DRAM

cd pea_dram/unique
./run.sh

Availability and Reproducibility for SIGMOD'23

pea_dram/unique: Fig.6
pea_pmem/unique: Fig.4, Fig.5
pea_pmem/duplicate: Fig.8, Fig.9, Fig.10 (see pea_pmem/duplicate/README.md)

Notes for Multithread Reproducibility

If your affinity setting is different from mine (eg. core 0-3 in socket 0, core 4-7 in socket 1), please change the function in test_pmem.

void set_affinity(uint32_t idx) {
  cpu_set_t my_set;
  CPU_ZERO(&my_set);
  CPU_SET(2 * idx + 1, &my_set);
  sched_setaffinity(0, sizeof(cpu_set_t), &my_set);
}

Running benchmark

As stated in our paper, we run the tests in a single NUMA node with 16 physical CPU cores. We pin threads to physical cores compactly assuming thread ID == core ID (e.g., for a dual-socket system, we assume cores 0,2,4,...,30 are located in socket 0, and cores 1,3,5,...,31 in socket 1).

To run benchmarks, use the test_pmem executable in the build directory. It supports the following arguments:

./build/test_pmem --helpshort
Usage: 
    ./build/test_pmem [OPTION...]

-index      the index to evaluate:dash-ex/dash-lh/cceh/level (default: "dash-ex")
-op         the type of operation to execute:insert/pos/neg/delete/mixed (default: "full")
-n          the number of warm-up workload (default: 0)
-p          the number of operations(insert/search/delete) to execute (default: 20000000)
-t          the number of concurrent threads (default: 1)
-r          search ratio for mixed workload: 0.0~1.0 (default: 1.0)
-s          insert ratio for mixed workload: 0.0~1.0 (default: 0.0)
-d          delete ratio for mixed workload: 0.0~1.0 (default: 0.0)
-e          whether to register epoch in application level: 0/1 (default: 0)
-k          the type of stored keys: fixed/variable (default: "fixed")
-vl         the length of the variable length key (default: 16)

Check out also the run.sh script for example benchmarks and easy testing of the hash tables.

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
.vscode		.vscode
pea_dram/unique		pea_dram/unique
pea_pmem		pea_pmem
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
Makefile		Makefile
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pea Hash: A Performant Extendible Adaptive Hashing Index

Directory

What's included

Prerequisite

Building

Getting started on DRAM

Availability and Reproducibility for SIGMOD'23

Notes for Multithread Reproducibility

Running benchmark

About

Releases

Packages

Contributors 2

Languages

License

schencoding/peahash

Folders and files

Latest commit

History

Repository files navigation

Pea Hash: A Performant Extendible Adaptive Hashing Index

Directory

What's included

Prerequisite

Building

Getting started on DRAM

Availability and Reproducibility for SIGMOD'23

Notes for Multithread Reproducibility

Running benchmark

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages