Sorting Algorithms & Fair comparison testbed

In Nov 2018, I needed to figure out what is the best sort algorithm on small datasets (almost sorted for graphics rendering) so I started my quest gathering good Java Sorting implementations ... reading papers & started experimenting my own benchmarks.

As I found this gem (https://github.com/sebawild/nearly-optimal-mergesort-code) the source code of Sebastian Wild 2018 paper, I started hacking the code to make its benchmark for fair (more reproductible) tests and I fixed several Sorter implementations to use pre-allocation (no GC overhead).

Thanks to Vladimir Yaroslavskiy, I added its DualPivotQuickSort 2011, 2018 & his BentleyBasher implementation, from Tageer Valev his adaptive RadixSort ...

I made lots of improvements on the BentleyBasher

All code is free & open source, under MIT or GPL2 license.

TODO: complete history & motivations

Objective

Use a robust test methodology to test Java Sorting Implementations (working, tested) with all known distributions (random, sawtooth, dithered, ...) as recent Sorting implementations use polymorphism / adaptive strategy to better sort some types of data sets (sorted & reversed runs, random ...)
Include best Sort implementations (based on int[] arrays only, sorry) and also using 2 int[] arrays (data + indices)
Provide a complete Test suite (basher, stats, & analysis) to reproduce reliable experiments and allow optimisation (tuning every details of a particular instance) in a faire manner (reduce OS & JVM biases) to obtain a fair comparison

Data results

See in the results folder to last data & comparison stats

Nice Plots will be coming next...

Fork of the Sebastian Wild's 'Nearly optimal Natural Mergesort — Code'

I derived my work from this fabulous repository on github, that provides source code (MIT license) of several sorting algorithms (merge sorts, like TD/BU, TimSort & variants, Peek & Power Sort).

Here is the original 'README' of the master repository:

Nearly optimal Natural Mergesort — Code

Code for experiments with nearly optimally adaptive mergesort variants peeksort and powersort.

Reproducing the results from the paper

To reproduce the running time study from the paper, execute

ant package
./paper-experiments.sh

The build requires a recent JDK 8, Oracle's version is recommended.

Make sure to use the paper release:

This produces several files in the current directory.

The *.out files show the progress made in the individual runs and contain debug output from JVM's just-in-time compiler. It can be used to check that no massive deoptimization steps happened during the timed experiments. (Endless output during the warmup phase and occasional printed lines during timed runs are normal.)
The *.csv files contain one line per executed sort and report the individual running time. These files were used in the paper to compute average and standard deviations of running times.

Unit Tests

To run harness tests for correctness of the sorting methods, run

ant test

Name		Name	Last commit message	Last commit date
Latest commit History 120 Commits
config/META-INF		config/META-INF
lib		lib
results		results
sort-bench		sort-bench
src		src
test/wildinter/net/mergesort		test/wildinter/net/mergesort
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bentley-basher.sh		bentley-basher.sh
bentley-parser.sh		bentley-parser.sh
bentley-statistic.sh		bentley-statistic.sh
build.xml		build.xml
cpu_fixed.sh		cpu_fixed.sh
isolate.sh		isolate.sh
laurent-experiments.sh		laurent-experiments.sh
mainInt.sh		mainInt.sh
paper-experiments.sh		paper-experiments.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sorting Algorithms & Fair comparison testbed

Objective

Data results

Fork of the Sebastian Wild's 'Nearly optimal Natural Mergesort — Code'

Nearly optimal Natural Mergesort — Code

Reproducing the results from the paper

Unit Tests

About

Releases

Packages

Languages

License

bourgesl/nearly-optimal-mergesort-code

Folders and files

Latest commit

History

Repository files navigation

Sorting Algorithms & Fair comparison testbed

Objective

Data results

Fork of the Sebastian Wild's 'Nearly optimal Natural Mergesort — Code'

Nearly optimal Natural Mergesort — Code

Reproducing the results from the paper

Unit Tests

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages