Experiments

April 2, 2026

This section compares the performance of the SYCL, DPCT-generated, and OpenMP implementations of each program on Intel integrated GPUs. The results below were obtained with the Intel OpenCL intercept layer. "total enqueue" is the total number of low-level OpenCL enqueue commands issued by a parallel program; these commands include "clEnqueueNDRangeKernel", "clEnqueueReadBuffer", and "clEnqueueWriteBuffer". The host timing is the total elapsed time spent executing OpenCL API functions on the CPU host, while the device timing is the total elapsed time spent executing the enqueued OpenCL commands on the GPU device. The plugin interface is OpenCL. Each table reports four implementations, in order: SYCL, DPCT usm, DPCT header, and OpenMP. "NA" marks a result that is not available.
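As a rough illustration of how the "total enqueue" figure is defined, the sketch below sums the per-call counts of every API name that begins with `clEnqueue`. The `call_counts` dictionary and its numbers are hypothetical and are not the intercept layer's actual report format.

```python
def total_enqueue(call_counts):
    """Sum the counts of all OpenCL calls whose name starts with 'clEnqueue'."""
    return sum(n for name, n in call_counts.items() if name.startswith("clEnqueue"))


# Hypothetical per-call counts for one run (illustrative numbers only).
call_counts = {
    "clEnqueueNDRangeKernel": 100,
    "clEnqueueReadBuffer": 1,
    "clEnqueueWriteBuffer": 2,
    "clCreateBuffer": 3,  # not an enqueue command, so it is ignored
}

print(total_enqueue(call_counts))
```

Calls such as `clCreateBuffer` are excluded because they do not enqueue work on a command queue.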

Setup

Software: Intel® oneAPI Beta08* Toolkit, Ubuntu 18.04
Platform 1: Intel® Xeon E3-1284L with a Gen8 P6300 integrated GPU
Platform 2: Intel® Xeon E-2176G with a Gen9.5 UHD630 integrated GPU

*newer versions may be used when they can produce results correctly

Note

We may execute SYCL programs on an Nvidia GPU by building the software from source with support for Nvidia CUDA. Then, go to a directory (e.g., affine-sycl) and type the command "make -f Makefile.cuda run". "Makefile.cuda" may be modified for different versions of Nvidia GPUs and toolchains.

We may execute SYCL programs on an AMD GPU by building the software from source with support for AMD HIP. Then, go to a directory (e.g., affine-sycl) and type the command "make -f Makefile.hip run". "Makefile.hip" may be modified for different versions of AMD GPUs and toolchains.

We may execute HIP programs on an Intel GPU by building the software from source with support for OpenCL. Then, go to a directory (e.g., affine-hip) and type the command "make -f Makefile.hipcl run". "Makefile.hipcl" may be modified for different versions of Intel GPUs and toolchains.

We may also evaluate the OpenMP GPU offload feature with AOMP on an AMD GPU or with the Nvidia HPC SDK on an Nvidia GPU. To do so, go to a directory (e.g., affine-omp) and type "make -f Makefile.aomp run" or "make -f Makefile.nvc run". Both files may be modified for different versions of GPUs and toolchains.

Run

A script "run.sh" attempts to run all tests with the OpenCL plugin interface. To run a single test, go to a test directory and type the command "make run".
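The commands quoted above share one pattern: change into a test directory and invoke make, optionally with an alternative makefile. The following is a minimal sketch of that pattern. The helper names `make_run_command` and `run_test` are hypothetical and not part of the repository, and "run.sh" itself may work differently.

```python
import subprocess


def make_run_command(makefile=None):
    """Build the argv for 'make run', or 'make -f <makefile> run' if a makefile is given."""
    cmd = ["make"]
    if makefile is not None:
        cmd += ["-f", makefile]
    cmd.append("run")
    return cmd


def run_test(test_dir, makefile=None, dry_run=True):
    """Run one test from its directory; with dry_run=True only return the command."""
    cmd = make_run_command(makefile)
    if not dry_run:
        # check=True raises CalledProcessError if make exits with a non-zero status.
        subprocess.run(cmd, cwd=test_dir, check=True)
    return cmd


print(run_test("affine-sycl"))
print(run_test("affine-sycl", makefile="Makefile.cuda"))
```

The dry-run default only returns the command, so the sketch can be inspected on a machine without the toolchain; pass `dry_run=False` to actually invoke make.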

Results on Platform 1

affineSYCLDPCT usmDPCT headerOpenMP
total enqueue101102102507
host timing(s)0.250.550.583.46
device timing(ms)6.76.86.68.1
all-pairs-distanceSYCLDPCT usmDPCT headerOpenMP
total enqueue60616167
host timing(s)0.380.760.7625.6
device timing(s)0.110.160.1622
amgmkSYCLDPCT usmDPCT headerOpenMP
total enqueue5015065062010
host timing(s)0.410.880.883.78
device timing(s)0.180.180.180.18
aobenchSYCLDPCT usmDPCT headerOpenMP
total enqueue20303085
host timing(s)0.580.920.953.71
device timing(s)0.140.140.140.16
astaSYCLDPCT usmDPCT headerOpenMP
total enqueue5505505501105
host timing(s)5.05.75.716.2
device timing(s)4.75.25.112.6
atomicIntrinsicsSYCLDPCT usmDPCT headerOpenMP
total enqueue333NA
host timing(s)9.39.79.7NA
device timing(s)9.19.19.1NA
axhelmSYCLDPCT usmDPCT headerOpenMP
total enqueue101105105NA
host timing(s)8.15/14.69.1/15.99.2/16.0NA
device timing(s)4.4/10.85.0/11.85.0/11.7NA
backpropSYCLDPCT usmDPCT headerOpenMP
total enqueue59919
host timing(s)2.02.42.86.1
device timing(s)0.771.491.492.3
bezier-surfaceSYCLDPCT usmDPCT headerOpenMP
total enqueue23312
host timing(s)1.51.791.874.47
device timing(s)0.70.710.720.75
bitonic-sortSYCLDPCT usmDPCT headerOpenMP
total enqueue3263273271957
host timing(s)2.212.562.675.85
device timing(s)1.921.931.972.36
black-scholesSYCLDPCT usmDPCT headerOpenMP
total enqueue2339
host timing(s)0.571.421.464.67
device timing(s)0.160.350.340.95
bsearchSYCLDPCT usmDPCT headerOpenMP
total enqueue80160160308
host timing(s)2.282.432.432.73
device timing(s)2.112.192.172.25
bspline-vghSYCLDPCT usmDPCT headerOpenMP
total enqueue120003120007120007228009
host timing(s)5.87.68.18.4
device timing(s)0.672.382.151.36
b+treeSYCLDPCT usmDPCT headerOpenMP
total enqueue5202032
host timing(s)3.50.580.653.48
device timing(s)3.10.00680.00680.0082
ccsd-trpdrvSYCLDPCT usmDPCT headerOpenMP
total enqueue500240024003405
host timing(s)7.98.38.212.7
device timing(s)7.98.28.112.3
cedSYCLDPCT usmDPCT headerOpenMP
total enqueue660663663NA
host timing(s)NANANANA
device timing(ms)NANANANA
cfdSYCLDPCT usmDPCT headerOpenMP
total enqueue16005NANA132035
host timing(s)4.44.2NA9.95
device timing(s)3.53.4NA3.76
chi2SYCLDPCT usmDPCT headerOpenMP
total enqueue23312
host timing(s)1.11.411.474.5
device timing(s)0.190.230.350.92
clenergySYCLDPCT usmDPCT headerOpenMP
total enqueue5025025023011
host timing(s)12.311.611.814.9
device timing(s)11.810.810.911.2
clinkSYCLDPCT usmDPCT headerOpenMP
total enqueue40160160205
host timing(s)19.520.424.325.1
device timing(s)13.513.316.917.6
cobahhSYCLDPCT usmDPCT headerOpenMP
total enqueue172424109
host timing(s)2.834.24.47.6
device timing(s)2.533.23.24.0
compute-scoreSYCLDPCT usmDPCT headerOpenMP
total enqueue201207207812
host timing(s)8.48.39.120.2
device timing(s)8.07.48.116.3
d2q9_bgkSYCLDPCT usmDPCT headerOpenMP
total enqueue800118002180021640027
host timing(s)19.314.721.439.2
device timing(s)14.415.416.020.8
diamondSYCLDPCT usmDPCT headerOpenMP
total enqueue25514
host timing(s)41.641.942.343.8
device timing(s)40.840.941.240.2
divergenceSYCLDPCT usmDPCT headerOpenMP
total enqueue40000100000100000280006
host timing(s)1.66.27.48.6
device timing(s)0.330.420.420.34
easyWaveSYCLDPCT usmDPCT headerOpenMP
total enqueue17293172991729969183
host timing(s)29.831.032.041.8
device timing(s)28.529.329.936.6
extend2SYCLDPCT usmDPCT headerOpenMP
total enqueue14000240002400046005
host timing(s)10.911.511.820.2
device timing(s)9.69.69.616.1
extremaSYCLDPCT usmDPCT headerOpenMP
total enqueue96969792979299404
host timing(s)34.237.938.644.8
device timing(s)33.536.837.239.1
filterSYCLDPCT usmDPCT headerOpenMP
total enqueue35511
host timing(s)0.620.920.986.4
device timing(ms)851471422711
fftSYCLDPCT usmDPCT headerOpenMP
total enqueue201202202NA
host timing(s)16.419.519.5NA
device timing(ms)14.317.117.0NA
floydwarshallSYCLDPCT usmDPCT headerOpenMP
total enqueue512515125151251512056
host timing(s)10.37.210.721.8
device timing(ms)8.96.858.910.4
fpcSYCLDPCT usmDPCT headerOpenMP
total enqueue400800800NA
host timing(s)5.14.14.2NA
device timing(ms)0.641.21.2NA
gamma-correctionSYCLDPCT usmDPCT headerOpenMP
total enqueue2339
host timing(s)0.270.660.693.56
device timing(ms)14272473
gaussianSYCLDPCT usmDPCT headerOpenMP
total enqueue81938196819661437
host timing(s)11.611.812.714.7
device timing(s)11.011.111.89.6
geodesicSYCLDPCT usmDPCT headerOpenMP
total enqueue101102102407
host timing(s)5.55.85.89.2
device timing(s)5.15.15.15.6
gmmSYCLDPCT usmDPCT headerOpenMP
total enqueue2028720291NA44753
host timing(s)2.73.6NA7.0
device timing(s)1.131.92NA2.35
haccmkSYCLDPCT usmDPCT headerOpenMP
total enqueue4111121
host timing(s)0.210.720.63.49
device timing(ms)6.86.76.87.6
heartwallSYCLDPCT usmDPCT headerOpenMP
total enqueue212220220637
host timing(s)17.29.19.511.6
device timing(s)16.18.38.47.92
heatSYCLDPCT usmDPCT headerOpenMP
total enqueue10031003100310029
host timing(s)8.548.448.9412.36
device timing(s)7.987.67.928.36
heat2dSYCLDPCT usmDPCT headerOpenMP
total enqueue1021021021107
host timing(s)1.992.332.35.5
device timing(s)1.651.681.631.89
histogramSYCLDPCT usmDPCT headerOpenMP
total enqueue121812211221NA
host timing(s)1.881.391.59NA
device timing(s)0.570.560.59NA
hmmSYCLDPCT usmDPCT headerOpenMP
total enqueue5015045043249
host timing(s)8.911.411.214.1
device timing(s)8.610.710.510.4
hotspot3DSYCLDPCT usmDPCT headerOpenMP
total enqueue50015003500390008
host timing(s)4.54.64.99.4
device timing(s)4.14.14.14.2
hybridsortSYCLDPCT usmDPCT headerOpenMP
total enqueue253333193
host timing(s)1.51.741.874.89
device timing(s)0.820.870.861.21
interleaveSYCLDPCT usmDPCT headerOpenMP
total enqueue2022062061012
host timing(s)13.412.013.914.9
device timing(s)13.111.413.33.1
inversek2jSYCLDPCT usmDPCT headerOpenMP
total enqueue100001100004100004400008
host timing(s)53.755.516.1
device timing(s)1.932.651.993.85
isingSYCLDPCT usmDPCT headerOpenMP
total enqueue4044054054018
host timing(s)6.24.44.39.9
device timing(s)5.83.673.496.2
iso2dfdSYCLDPCT usmDPCT headerOpenMP
total enqueue10011004100410010
host timing(s)2.182.52.65.87
device timing(s)1.911.941.922.1
jaccardSYCLDPCT usmDPCT headerOpenMP
total enqueue102108108NA
host timing(s)28.628.828.9NA
device timing(s)28.228.128.1NA
jenkins-hashSYCLDPCT usmDPCT headerOpenMP
total enqueue101104104509
host timing(s)6.67.27.29.9
device timing(s)6.36.46.46.3
keccaktreehashSYCLDPCT usmDPCT headerOpenMP
total enqueue24272762
host timing(s)0.951.331.3617.8
device timing(s)0.570.580.5714.1
kmeansSYCLDPCT usmDPCT headerOpenMP
total enqueue21500215012150171703
host timing(s)110112114116
device timing(s)106109110111
knnSYCLDPCT usmDPCT headerOpenMP
total enqueue5007007002007
host timing(s)9.812.312.516.2
device timing(s)7.610.110.410.7
lanczosSYCLDPCT usmDPCT headerOpenMP
total enqueue91089156915637620
host timing(s)16.017.918.32642
device timing(s)14.916.716.82637
langfordSYCLDPCT usmDPCT headerOpenMP
total enqueue15202033
host timing(s)7.688.18.110.2
device timing(s)5.85.85.85.3
laplaceSYCLDPCT usmDPCT headerOpenMP
total enqueue12742127511275138237
host timing(s)1.181.271.664.95
device timing(s)0.280.280.270.67
lavaMDSYCLDPCT usmDPCT headerOpenMP
total enqueue26615
host timing(s)2.81.41.484.4
device timing(s)2.50.770.760.8
leukocyteSYCLDPCT usmDPCT headerOpenMP
total enqueue2047117111334
host timing(s)2.32.973.05.8
device timing(s)1.992.172.172.14
lid-driven-cavitySYCLDPCT usmDPCT headerOpenMP
total enqueue1605667160567316056735619820
host timing(s)216276263375
device timing(s)154212228223
lombscargleSYCLDPCT usmDPCT headerOpenMP
total enqueue101104104309
host timing(s)10.410.710.713.6
device timing(s)10.010.010.010.0
ludSYCLDPCT usmDPCT headerOpenMP
total enqueue1535153615366145
host timing(s)8.911.011.214.1
device timing(s)7.89.79.89.7
luleshSYCLDPCT usmDPCT headerOpenMP
total enqueue2970298629866635
host timing(s)35.830.135.933.5
device timing(s)32.027.431.528.1
memcpySYCLDPCT usmDPCT headerOpenMP
total enqueue131072131072131072131072
host timing(s)4.34.94.62.4
device timing(s)1.21.51.21.7
miniFESYCLDPCT usmDPCT headerOpenMP
total enqueue2404241224126645
host timing(s)9.59.810.016.7
device timing(s)8.78.88.812.8
minimap2SYCLDPCT usmDPCT headerOpenMP
total enqueueNA20NA83
host timing(s)NA1.95NA4.86
device timing(s)NA1.14NA1.29
mixbenchSYCLDPCT usmDPCT headerOpenMP
total enqueue2049205020506151
host timing(s)5.15.55.69.6
device timing(s)4.84.84.85.8
mkl-sgemmSYCLDPCT usmDPCT headerOpenMP
total enqueue800011200048000480004
host timing(s)6.38.36.64.7
device timing(s)2.382.522.382.53
mtSYCLDPCT usmDPCT headerOpenMP
total enqueue2032042041018
host timing(s)1.471.751.805.4
device timing(s)1.061.061.061.72
multimaterialSYCLDPCT usmDPCT headerOpenMP
total enqueue100429429689
host timing(s)3.23.53.56.8
device timing(s)0.881.61.62.95
murmurhash3SYCLDPCT usmDPCT headerOpenMP
total enqueue101104104409
host timing(s)6.37.47.220.7
device timing(s)5.96.76.417.0
nbodySYCLDPCT usmDPCT headerOpenMP
total enqueue4024024021308
host timing(s)2.73.03.16.2
device timing(s)2.42.42.42.7
nmsSYCLDPCT usmDPCT headerOpenMP
total enqueue203205205NA
host timing(s)1.871.631.74NA
device timing(s)1.511.051.12NA
nnSYCLDPCT usmDPCT headerOpenMP
total enqueue23313
host timing(s)0.30.600.653.1
device timing(us)375762220
nwSYCLDPCT usmDPCT headerOpenMP
total enqueue20482050205013314
host timing(s)2.02.42.45.6
device timing(s)0.510.850.791.51
page-rankSYCLDPCT usmDPCT headerOpenMP
total enqueue6111130
host timing(s)0.751.251.193.99
device timing(s)0.230.310.310.36
particle-diffusionSYCLDPCT usmDPCT headerOpenMP
total enqueue27718
host timing(s)1.31.71.84.98
device timing(s)0.220.510.531.42
pathfinderSYCLDPCT usmDPCT headerOpenMP
total enqueue102104104709
host timing(s)2.365.725.6511.8
device timing(s)1.995.04.988.1
popcountSYCLDPCT usmDPCT headerOpenMP
total enqueue5055065062015
host timing(s)6.036.376.439.5
device timing(s)5.75.75.75.9
presentSYCLDPCT usmDPCT headerOpenMP
total enqueue200NANA512
host timing(s)1.46NANA4.36
device timing(s)0.94NANA0.67
projectileSYCLDPCT usmDPCT headerOpenMP
total enqueue102102102307
host timing(s)3.03.43.56.5
device timing(s)2.72.72.72.85
quicksortSYCLDPCT usmDPCT headerOpenMP
total enqueue22503670NANA
host timing(s)9.316.9NANA
device timing(s)2.13.4NANA
randomAccessSYCLDPCT usmDPCT headerOpenMP
total enqueue44421
host timing(s)2.62.92.97.1
device timing(s)22.12.03.3
reductionSYCLDPCT usmDPCT headerOpenMP
total enqueue325326326975
host timing(s)1.31.741.84.65
device timing(s)10.951.01.13
reverseSYCLDPCT usmDPCT headerOpenMP
total enqueue1048578104857910485791048584
host timing(s)33.427.174.943.3
device timing(s)2.32.441.474.14
rng-wallaceSYCLDPCT usmDPCT headerOpenMP
total enqueue200202202NA
host timing(s)4.33.94.6NA
device timing(s)3.63.23.6NA
rsbenchSYCLDPCT usmDPCT headerOpenMP
total enqueue2101068
host timing(s)11.611.811.820.8
device timing(s)8.88.88.612.1
rtm8SYCLDPCT usmDPCT headerOpenMP
total enqueue323838220
host timing(s)4.74.85.08.7
device timing(s)3.93.93.954.9
rng-wallaceSYCLDPCT usmDPCT headerOpenMP
total enqueue200202202207
host timing(s)4.33.94.66.6
device timing(s)3.63.23.63.1
s3dSYCLDPCT usmDPCT headerOpenMP
total enqueue27012705541034441
host timing(s)2116132313
device timing(s)0.240.291.450.42
scanSYCLDPCT usmDPCT headerOpenMP
total enqueue1000011000210002200007
host timing(s)3.42.93.810.3
device timing(s)0.691.220.91.39
secp256k1SYCLDPCT usmDPCT headerOpenMP
total enqueue101102102307
host timing(s)11.211.511.416.3
device timing(s)5.435.435.2412.3
shuffleSYCLDPCT usmDPCT headerOpenMP
total enqueue909910910NA
host timing(s)28.726.229.1NA
device timing(s)28.225.428.1NA
simplemocSYCLDPCT usmDPCT headerOpenMP
total enqueue12181873
host timing(s)14.340.216.336.0
device timing(s)14.139.615.632.4
softmaxSYCLDPCT usmDPCT headerOpenMP
total enqueue101102102507
host timing(s)1.64.51.95.1
device timing(s)1.33.81.31.5
sortSYCLDPCT usmDPCT headerOpenMP
total enqueue241242242NA
host timing(s)4.9422.622.6NA
device timing(s)4.4921.821.8NA
sosfilSYCLDPCT usmDPCT headerOpenMP
total enqueue2022082081214
host timing(s)1.822.012.032.59
device timing(s)1.791.951.962.24
sphSYCLDPCT usmDPCT headerOpenMP
total enqueue20022004200413512
host timing(s)14.615.215.412.2
device timing(s)1414.114.210.9
sradSYCLDPCT usmDPCT headerOpenMP
total enqueue80038008800836026
host timing(s)1.31.541.795.0
device timing(s)0.620.780.770.84
ssspSYCLDPCT usmDPCT headerOpenMP
total enqueue883958862688655NA
host timing(s)6.68.18.4NA
device timing(s)2.32.22.2NA
stencilSYCLDPCT usmDPCT headerOpenMP
total enqueue23310
host timing(s)0.731.131.124.1
device timing(s)0.120.190.180.51
streamclusterSYCLDPCT usmDPCT headerOpenMP
total enqueue11278112781127830617
host timing(s)4.54.74.98.5
device timing(s)3.63.73.64.4
su3SYCLDPCT usmDPCT headerOpenMP
total enqueue102104104715
host timing(s)7.98.28.311
device timing(s)7.47.47.47.4
thomasSYCLDPCT usmDPCT headerOpenMP
total enqueue102105105710
host timing(s)5.82.86.29.1
device timing(s)5.42.25.55.6
transposeSYCLDPCT usmDPCT headerOpenMP
total enqueue51NA5964
host timing(s)12.9NA13.625.8
device timing(s)11.9NA12.422
triadSYCLDPCT usmDPCT headerOpenMP
total enqueue204400204400204400407907
host timing(s)7.27.37.498
device timing(s)3.42.83.486
xsbenchSYCLDPCT usmDPCT headerOpenMP
total enqueue29926
host timing(s)3.03.43.06.8
device timing(s)2.62.72.13.1

Results on Platform 2

affineSYCLDPCT usmDPCT headerOpenMP
total enqueue101102102507
host timing(s)0.330.620.623.1
device timing(ms)11.512.711.012.4
all-pairs-distanceSYCLDPCT usmDPCT headerOpenMP
total enqueue60616167
host timing(s)0.490.870.9358
device timing(s)0.140.220.2254
amgmkSYCLDPCT usmDPCT headerOpenMP
total enqueue5015065062010
host timing(s)0.591.040.953.87
device timing(s)0.280.290.280.28
aobenchSYCLDPCT usmDPCT headerOpenMP
total enqueue20303085
host timing(s)0.71.041.023.92
device timing(s)0.270.270.270.31
astaSYCLDPCT usmDPCT headerOpenMP
total enqueue5505505501105
host timing(s)5.66.16.115.8
device timing(s)5.45.85.712.9
atomicIntrinsicsSYCLDPCT usmDPCT headerOpenMP
total enqueue333NA
host timing(s)1.01.371.35NA
device timing(s)0.730.730.73NA
axhelmSYCLDPCT usmDPCT headerOpenMP
total enqueue101105105NA
host timing(s)6.3/9.67.1/10.77.1/10.2NA
device timing(s)3.2/6.63.5/6.83.5/6.8NA
backpropSYCLDPCT usmDPCT headerOpenMP
total enqueue59919
host timing(s)1.61.982.665.8
device timing(s)0.661.161.151.9
bezier-surfaceSYCLDPCT usmDPCT headerOpenMP
total enqueue23312
host timing(s)1.942.12.25.37
device timing(s)1.191.171.180.81
bfsSYCLDPCT usmDPCT headerOpenMP
total enqueue525555133
host timing(s)0.40.70.733.54
device timing(s)0.230.270.260.36
bitonic-sortSYCLDPCT usmDPCT headerOpenMP
total enqueue3263273271957
host timing(s)3.013.223.216.26
device timing(s)2.592.522.522.77
black-scholesSYCLDPCT usmDPCT headerOpenMP
total enqueue2339
host timing(s)0.711.421.434.49
device timing(s)0.270.420.370.96
boxfilterSYCLDPCT usmDPCT headerOpenMP
total enqueue2004200420042016
host timing(s)3.42.53.55.1
device timing(s)2.52.222.34.6
bsearchSYCLDPCT usmDPCT headerOpenMP
total enqueue80160160308
host timing(s)2.32.52.52.5
device timing(s)2.22.32.32.1
bspline-vghSYCLDPCT usmDPCT headerOpenMP
total enqueue120003120007120007228009
host timing(s)5.39.56.610.4
device timing(s)0.681.411.491.29
b+treeSYCLDPCT usmDPCT headerOpenMP
total enqueue5202032
host timing(s)1.040.680.693.44
device timing(s)0.560.00730.00650.0075
ccsd-trpdrvSYCLDPCT usmDPCT headerOpenMP
total enqueue500240024003405
host timing(s)14.918.519.315.2
device timing(s)13.714.215.911.0
cedSYCLDPCT usmDPCT headerOpenMP
total enqueue6606636631548
host timing(s)0.430.750.843.75
device timing(ms)44494960
cfdSYCLDPCT usmDPCT headerOpenMP
total enqueue160051601316013132035
host timing(s)3.758.24.926.5
device timing(s)3.043.03.0218.1
chi2SYCLDPCT usmDPCT headerOpenMP
total enqueue23312
host timing(s)0.961.251.34.51
device timing(s)0.190.310.281.03
clenergySYCLDPCT usmDPCT headerOpenMP
total enqueue5025025023011
host timing(s)16.716.416.820.1
device timing(s)15.9915.715.9315.99
clinkSYCLDPCT usmDPCT headerOpenMP
total enqueue40160160205
host timing(s)22.624.225.430.4
device timing(s)19.219.820.724.7
cobahhSYCLDPCT usmDPCT headerOpenMP
total enqueue172424109
host timing(s)1.933.03.16.56
device timing(s)1.562.01.993.0
compute-scoreSYCLDPCT usmDPCT headerOpenMP
total enqueue201207207812
host timing(s)6.97.17.318.4
device timing(s)6.56.36.514.6
convolutionSeperableSYCLDPCT usmDPCT headerOpenMP
total enqueue203205205218
host timing(s)9.49.59.510.7
device timing(s)1.81.771.761.44
d2q9_bgkSYCLDPCT usmDPCT headerOpenMP
total enqueue800118002180021640027
host timing(s)16.412.916.7100.1
device timing(s)12.713.012.951
diamondSYCLDPCT usmDPCT headerOpenMP
total enqueue25514
host timing(s)26.126.726.129.4
device timing(s)25.425.725.125.9
divergenceSYCLDPCT usmDPCT headerOpenMP
total enqueue40000100000100000280006
host timing(s)13.248.669.863.3
device timing(s)0.710.410.440.33
easyWaveSYCLDPCT usmDPCT headerOpenMP
total enqueue17293172991729969183
host timing(s)29.632.932.338.3
device timing(s)28.127.827.832.8
extend2SYCLDPCT usmDPCT headerOpenMP
total enqueue14000240002400046005
host timing(s)14.916.115.519.8
device timing(s)7.77.87.813.7
extremaSYCLDPCT usmDPCT headerOpenMP
total enqueue96969792979299404
host timing(s)59.449.458.595.1
device timing(s)55.550.155.091.8
filterSYCLDPCT usmDPCT headerOpenMP
total enqueue35511
host timing(s)0.620.870.908.4
device timing(ms)61104954869
fftSYCLDPCT usmDPCT headerOpenMP
total enqueue201202202NA
host timing(s)12.427.127.1NA
device timing(ms)11.424.924.9NA
floydwarshallSYCLDPCT usmDPCT headerOpenMP
total enqueue512515125151251512056
host timing(s)22.424.420.4138
device timing(ms)7.26.77.223.3
fpcSYCLDPCT usmDPCT headerOpenMP
total enqueue400800800NA
host timing(s)3.94.44.3NA
device timing(ms)0.741.61.6NA
gamma-correctionSYCLDPCT usmDPCT headerOpenMP
total enqueue2339
host timing(s)0.330.650.703.41
device timing(ms)16262368
gaussianSYCLDPCT usmDPCT headerOpenMP
total enqueue81938196819661437
host timing(s)11.212.011.515.2
device timing(s)10.710.710.79.0
geodesicSYCLDPCT usmDPCT headerOpenMP
total enqueue101102102407
host timing(s)10.710.810.713.8
device timing(s)10.29.999.9910.2
gmmSYCLDPCT usmDPCT headerOpenMP
total enqueue2028720291NA44753
host timing(s)130450NA222
device timing(s)1.12.8NA15.3
haccmkSYCLDPCT usmDPCT headerOpenMP
total enqueue4111121
host timing(s)0.310.630.633.42
device timing(ms)5.85.85.86.8
heartwallSYCLDPCT usmDPCT headerOpenMP
total enqueue212220220637
host timing(s)26.69.49.714.2
device timing(s)25.58.68.710.5
heatSYCLDPCT usmDPCT headerOpenMP
total enqueue10031003100310029
host timing(s)9.799.7510.016.1
device timing(s)9.198.749.1112.3
heat2dSYCLDPCT usmDPCT headerOpenMP
total enqueue1021021021107
host timing(s)2.652.982.995.26
device timing(s)2.292.292.291.82
histogramSYCLDPCT usmDPCT headerOpenMP* (to be optimized)
total enqueue1218122112213666
host timing(s)2.752.52.615.5
device timing(s)0.830.860.8558
hmmSYCLDPCT usmDPCT headerOpenMP
total enqueue5015045046499
host timing(s)11.311.811.725.7
device timing(s)10.911.010.921.9
hotspot3DSYCLDPCT usmDPCT headerOpenMP
total enqueue50015003500390008
host timing(s)4.25.64.610.8
device timing(s)3.74.03.74.1
hybridsortSYCLDPCT usmDPCT headerOpenMP
total enqueue253333193
host timing(s)1.681.861.914.66
device timing(s)1.211.01.031.44
interleaveSYCLDPCT usmDPCT headerOpenMP
total enqueue2022062061012
host timing(s)20.820.620.023.9
device timing(s)20.519.819.320.2
inversek2jSYCLDPCT usmDPCT headerOpenMP
total enqueue100001100004100003400008
host timing(s)6.45226.950.5
device timing(s)3.914.373.946.1
isingSYCLDPCT usmDPCT headerOpenMP
total enqueue4044054054018
host timing(s)9.25.45.412.2
device timing(s)8.84.64.68.7
iso2dfdSYCLDPCT usmDPCT headerOpenMP
total enqueue10011004100410010
host timing(s)2.773.293.156.42
device timing(s)2.422.452.452.75
jaccardSYCLDPCT usmDPCT headerOpenMP
total enqueue102108108NA
host timing(s)41.742.141.2NA
device timing(s)41.341.740.6NA
jenkins-hashSYCLDPCT usmDPCT headerOpenMP
total enqueue101104104509
host timing(s)4.95.45.68.1
device timing(s)4.64.64.84.9
keccaktreehashSYCLDPCT usmDPCT headerOpenMP
total enqueue24272762
host timing(s)1.361.671.7817.2
device timing(s)0.920.960.9313.7
kmeansSYCLDPCT usmDPCT headerOpenMP
total enqueue21500215012150171703
host timing(s)119121123122.5
device timing(s)114.1114.1114.3114.6
knnSYCLDPCT usmDPCT headerOpenMP
total enqueue5007007002007
host timing(s)12.915.314.617.0
device timing(s)10.411.411.411.6
lanczosSYCLDPCT usmDPCT headerOpenMP
total enqueue91089156915637620
host timing(s)37.340.426.782.8
device timing(s)32.237.921.380.9
langfordSYCLDPCT usmDPCT headerOpenMP
total enqueue15202033
host timing(s)12.913.012.837.4
device timing(s)11.311.311.133
laplaceSYCLDPCT usmDPCT headerOpenMP
total enqueue12742127511275138237
host timing(s)9.987.710.416.4
device timing(s)0.650.940.663.56
lavaMDSYCLDPCT usmDPCT headerOpenMP
total enqueue26615
host timing(s)1.82.02.04.8
device timing(s)1.421.311.271.32
leukocyteSYCLDPCT usmDPCT headerOpenMP
total enqueue2047117111334
host timing(s)4.14.64.66.8
device timing(s)3.583.773.733.64
lid-driven-cavitySYCLDPCT usmDPCT headerOpenMP
total enqueue1605667160567316056735619820
host timing(s)264573289712
device timing(s)201222220231
lombscargleSYCLDPCT usmDPCT headerOpenMP
total enqueue101104104309
host timing(s)2.42.962.915.6
device timing(s)1.952.142.122.1
luleshSYCLDPCT usmDPCT headerOpenMP
total enqueue2970298629866635
host timing(s)46.947.348.8137.9
device timing(s)41.643.643.4132
ludSYCLDPCT usmDPCT headerOpenMP
total enqueue1535153615366145
host timing(s)11.111.912.015.6
device timing(s)10.310.710.811.3
mandelbrotSYCLDPCT usmDPCT headerOpenMP
total enqueue303404404611
host timing(s)0.340.720.743.48
device timing(ms)5.185.485.45.15
matrix-mulSYCLDPCT usmDPCT headerOpenMP
total enqueue24413
host timing(s)6.87.767.7913.37
device timing(s)6.447.097.19.95
matrix-rotateSYCLDPCT usmDPCT headerOpenMP
total enqueue101102102705
host timing(s)8.73.49.1316.9
device timing(s)8.398.428.4313.0
maxpool3dSYCLDPCT usmDPCT headerOpenMP
total enqueue1011021021807
host timing(s)7.07.47.311.8
device timing(s)6.36.46.48.2
mdSYCLDPCT usmDPCT headerOpenMP
total enqueue1002100210028008
host timing(s)7.523.23.110.85
device timing(s)7.172.422.417.17
md5hashSYCLDPCT usmDPCT headerOpenMP
total enqueue16282857
host timing(s)6.566.136.128.65
device timing(s)5.395.395.395.12
memcpySYCLDPCT usmDPCT headerOpenMP
total enqueue131072131072131072131072
host timing(s)4.313.24.71.94
device timing(s)1.43.51.41.13
miniFESYCLDPCT usmDPCT headerOpenMP
total enqueue2404241224126638
host timing(s)12.710.321.428.7
device timing(s)11.19.219.223.4
minimap2SYCLDPCT usmDPCT headerOpenMP
total enqueueNA20NA83
host timing(s)NA1.63NA9.26
device timing(s)NA0.93NA6.09
mixbenchSYCLDPCT usmDPCT headerOpenMP
total enqueue2049205020506151
host timing(s)8.28.68.211.5
device timing(s)7.527.57.477.56
mkl-sgemmSYCLDPCT usmDPCT headerOpenMP
total enqueue20004600072000720007
host timing(s)1.6314.619.16.7
device timing(s)0.454.00.450.64
mtSYCLDPCT usmDPCT headerOpenMP
total enqueue2032042041018
host timing(s)1.481.421.5710.8
device timing(s)1.061.021.057.6
multimaterialSYCLDPCT usmDPCT headerOpenMP
total enqueue100429429689
host timing(s)3.33.44.57.8
device timing(s)1.61.82.64.2
murmurhash3SYCLDPCT usmDPCT headerOpenMP
total enqueue101104104409
host timing(s)10.310.91113.5
device timing(s)9.810.210.210.1
nbodySYCLDPCT usmDPCT headerOpenMP
total enqueue4024024021308
host timing(s)4.14.33.17.3
device timing(s)3.63.63.63.9
nnSYCLDPCT usmDPCT headerOpenMP
total enqueue23313
host timing(s)0.20.550.593.5
device timing(us)384943103
nwSYCLDPCT usmDPCT headerOpenMP
total enqueue20482050205013314
host timing(s)1.72.62.15.7
device timing(s)0.570.880.761.47
page-rankSYCLDPCT usmDPCT headerOpenMP
total enqueue6111130
host timing(s)1.221.671.604.3
device timing(s)0.710.770.740.82
particle-diffusionSYCLDPCT usmDPCT headerOpenMP
total enqueue27718
host timing(s)1.141.491.524.83
device timing(s)0.240.480.411.32
particlefilterSYCLDPCT usmDPCT headerOpenMP
total enqueue394545179
host timing(s)4.924.554.955.37
device timing(s)4.864.484.875.02
pathfinderSYCLDPCT usmDPCT headerOpenMP
total enqueue102104104709
host timing(s)3.14.34.39.3
device timing(s)2.83.63.65.8
popcountSYCLDPCT usmDPCT headerOpenMP
total enqueue5055065062015
host timing(s)8.89.48.912.5
device timing(s)8.48.28.28.9
presentSYCLDPCT usmDPCT headerOpenMP
total enqueue200NANA512
host timing(s)1.37NANA6.1
device timing(s)0.97NANA2.78
projectileSYCLDPCT usmDPCT headerOpenMP
total enqueue102102102307
host timing(s)2.122.52.55.4
device timing(s)1.751.751.741.82
quicksortSYCLDPCT usmDPCT headerOpenMP
total enqueue2184368537006696
host timing(s)15.421.722.423.7
device timing(s)8.49.49.45151.3
randomAccessSYCLDPCT usmDPCT headerOpenMP
total enqueue44421
host timing(s)1.942.22.36.4
device timing(s)1.451.451.452.8
reductionSYCLDPCT usmDPCT headerOpenMP
total enqueue325326326975
host timing(s)1.611.931.944.8
device timing(s)1.21.171.181.3
reverseSYCLDPCT usmDPCT headerOpenMP
total enqueue1048578104857910485791048584
host timing(s)31.328.957173
device timing(s)3.13.31.943.5
rng-wallaceSYCLDPCT usmDPCT headerOpenMP
total enqueue200202202207
host timing(s)2.63.23.215.0
device timing(s)2.12.32.311.7
rsbenchSYCLDPCT usmDPCT headerOpenMP
total enqueue2101034
host timing(s)16.317.216.924.7
device timing(s)14.314.714.319.1
rtm8SYCLDPCT usmDPCT headerOpenMP
total enqueue323838220
host timing(s)3.74.04.17.3
device timing(s)3.13.13.23.6
s3dSYCLDPCT usmDPCT headerOpenMP
total enqueue27012705541034441
host timing(s)9.129647.1
device timing(s)0.210.260.220.2
scanSYCLDPCT usmDPCT headerOpenMP
total enqueue1000011000210002200007
host timing(s)34333150.3
device timing(s)0.791.280.932.6
secp256k1SYCLDPCT usmDPCT headerOpenMP
total enqueue101102102307
host timing(s)5.75.825.8321.7
device timing(s)3.563.583.4617.9
shuffleSYCLDPCT usmDPCT headerOpenMP
total enqueue909910910NA
host timing(s)25.325.325.4NA
device timing(s)24.424.524.4NA
simplemocSYCLDPCT usmDPCT headerOpenMP
total enqueue12181873
host timing(s)46.147.543.850.3
device timing(s)45.646.743.147
snakeSYCLDPCT usmDPCT headerOpenMP
total enqueue52025202520220807
host timing(s)14.636.714.844.7
device timing(s)11.234.311.236.9
softmaxSYCLDPCT usmDPCT headerOpenMP
total enqueue101102102507
host timing(s)5.34.65.77.4
device timing(s)4.93.95.13.9
sortSYCLDPCT usmDPCT headerOpenMP
total enqueue241242242256
host timing(s)10.217.217.926.6
device timing(s)9.616.617.323.1
sosfilSYCLDPCT usmDPCT headerOpenMP
total enqueue202208208NA
host timing(s)1.781.951.95NA
device timing(s)1.731.911.89NA
sphSYCLDPCT usmDPCT headerOpenMP
total enqueue20022004200413512
host timing(s)21.822.322.224.8
device timing(s)21.12121.120.4
sradSYCLDPCT usmDPCT headerOpenMP
total enqueue80038008800836026
host timing(s)2.22.752.26.4
device timing(s)0.740.940.780.84
ssspSYCLDPCT usmDPCT headerOpenMP
total enqueue883558864188640NA
host timing(s)18.024.524.7NA
device timing(s)2.62.32.3NA
stencilSYCLDPCT usmDPCT headerOpenMP
total enqueue23310
host timing(s)0.680.890.984
device timing(s)0.090.140.130.48
streamclusterSYCLDPCT usmDPCT headerOpenMP
total enqueue11278112781127830617
host timing(s)7.89.07.711.9
device timing(s)5.956.15.956.6
su3SYCLDPCT usmDPCT headerOpenMP
total enqueue102104104715
host timing(s)6.86.76.710.1
device timing(s)6.35.85.86.5
thomasSYCLDPCT usmDPCT headerOpenMP
total enqueue102105105710
host timing(s)4.62.64.710.4
device timing(s)4.22.24.27.5
transposeSYCLDPCT usmDPCT headerOpenMP
total enqueue51NA5964
host timing(s)12.3NA12.517.9
device timing(s)11.1NA11.314.1
triadSYCLDPCT usmDPCT headerOpenMP
total enqueue204400204400204400407907
host timing(s)7.98.88.4192
device timing(s)3.83.33.896
vmcSYCLDPCT usmDPCT headerOpenMP
total enqueue110411041104NA
host timing(s)235.9238.1236.8NA
device timing(s)234.3236.5234.9NA
xsbenchSYCLDPCT usmDPCT headerOpenMP
total enqueue29926
host timing(s)2.492.72.85.8
device timing(s)2.12.02.02.3

Results on Platform 3

Intel® Core™ i9-10920X CPU with a Gen12LP discrete GPU (DG1)

minimod             SYCL    DPCT usm    DPCT header    OpenMP
total enqueue       8008    8011        8011           NA
host timing(s)      2.4     1.9         2.0            NA
device timing(s)    0.69    0.68        0.68           NA
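
The tables report device timing in seconds, milliseconds, or microseconds depending on the program, so comparing rows across programs requires a common unit. The sketch below shows one way to normalize a cell, assuming cells are handled as strings and that "NA" marks a missing result. The helper name `to_seconds` is hypothetical.

```python
# Scale factors for the unit suffixes that appear in the row labels: (s), (ms), (us).
UNIT_SCALE = {"s": 1.0, "ms": 1e-3, "us": 1e-6}


def to_seconds(cell, unit):
    """Convert one table cell to seconds; return None for a missing ('NA') entry."""
    if cell == "NA":
        return None
    return float(cell) * UNIT_SCALE[unit]


print(to_seconds("6.7", "ms"))
print(to_seconds("NA", "s"))
```

Returning `None` for "NA" keeps missing results distinct from a measured time of zero.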