OpenHuFu is a multi party computation framework used for data query benchmarks. It provides flexibility for researchers to quickly implement their algorithms. With its help, we can quickly take the experiment and get the performance of our algorithms.
Prerequistes:
- Linux
- Java 11
- Maven (version at least 3.5.2)
Run the following commands
git clone https://github.com/BUAA-BDA/OpenHuFu.git
cd OpenHuFu
./build/script/package.sh
Hu-Fu is now installed in release
//todo
- TCP-H
cd dataset/TPC-H V3.0.1/dbgen
cp makefile.suite makefile
make
cd scripts
bash generateData.sh dst x y #其中dst是目标文件夹,x是数据库数量(整数),y是每个数据库的数据量(整数,单位为G)
//todo
- Plan
- DSL(Domain Specific Language): Easy to define and parse, takes time to design
- Function Call: Hard to describe query
- SQL
- Filter
- Projection
- Join: equi-join, theta join
- Cross products
- Aggregate(inc. group-by)
- Limited window aggs
- Distinct
- Sort
- Limit
- Common table expressions
- Spatial Query(TODO):
- range query
- knn
- skyline
- Communication Cost
- Running Time
- Data Access Time
- Encryption Time
- Decryption Time
- Query Time