PrevUpHomeNext

Performance

The following tests were run with an NVIDIA Tesla K40c GPU on a system with an Intel Core i7 920 2.67GHz CPU.

Source code for the benchmarks can be found under the perf directory. All benchmarks were compiled with optimizations enabled (i.e. "gcc -O3").

Accumulate

accumulate_time_plot

Count

count_time_plot

Inner Product

inner_product_time_plot

Merge

merge_time_plot

Partial Sum

partial_sum_time_plot

Partition

partition_time_plot

Reverse

reverse_time_plot

Rotate

rotate_time_plot

Set Difference

set_difference_time_plot

Sort

sort_time_plot

Transform

saxpy_time_plot

Unique

unique_time_plot


PrevUpHomeNext