Skip to content

Instantly share code, notes, and snippets.

@fo40225
Created June 26, 2019 03:42
Show Gist options
  • Save fo40225/ddf9b4e5207c04ad57f780792ca3dfe3 to your computer and use it in GitHub Desktop.
Save fo40225/ddf9b4e5207c04ad57f780792ca3dfe3 to your computer and use it in GitHub Desktop.
i9 9980xe GFlops
user@ubuntu:~/Flops/version3/binaries-linux$ ./2006-Core2
Running Core 2 tuned binary with 1 thread...
Single-Precision - 128-bit SSE - Add/Sub
GFlops = 34.752
Result = 4.42554e+06
Double-Precision - 128-bit SSE2 - Add/Sub
GFlops = 17.536
Result = 2.21176e+06
Single-Precision - 128-bit SSE - Multiply
GFlops = 34.896
Result = 4.43153e+06
Double-Precision - 128-bit SSE2 - Multiply
GFlops = 17.472
Result = 2.2397e+06
Single-Precision - 128-bit SSE - Multiply + Add
GFlops = 34.272
Result = 3.61454e+06
Double-Precision - 128-bit SSE2 - Multiply + Add
GFlops = 16.992
Result = 1.79621e+06
Running Core 2 tuned binary with 36 thread(s)...
Single-Precision - 128-bit SSE - Add/Sub
GFlops = 543.424
Result = 6.91764e+07
Double-Precision - 128-bit SSE2 - Add/Sub
GFlops = 272.864
Result = 3.4783e+07
Single-Precision - 128-bit SSE - Multiply
GFlops = 545.856
Result = 6.9592e+07
Double-Precision - 128-bit SSE2 - Multiply
GFlops = 272.856
Result = 3.48006e+07
Single-Precision - 128-bit SSE - Multiply + Add
GFlops = 545.904
Result = 5.80786e+07
Double-Precision - 128-bit SSE2 - Multiply + Add
GFlops = 272.688
Result = 2.89509e+07
user@ubuntu:~/Flops/version3/binaries-linux$ ./2017-SkylakePurley
Running Skylake Purley tuned binary with 1 thread...
Single-Precision - 128-bit AVX - Add/Sub
GFlops = 28.832
Result = 3.71883e+06
Double-Precision - 128-bit AVX - Add/Sub
GFlops = 14.432
Result = 1.84323e+06
Single-Precision - 128-bit AVX - Multiply
GFlops = 28.8
Result = 3.67606e+06
Double-Precision - 128-bit AVX - Multiply
GFlops = 14.424
Result = 1.84051e+06
Single-Precision - 128-bit AVX - Multiply + Add
GFlops = 28.848
Result = 3.07346e+06
Double-Precision - 128-bit AVX - Multiply + Add
GFlops = 14.4
Result = 1.52687e+06
Single-Precision - 128-bit FMA3 - Fused Multiply Add
GFlops = 57.6
Result = 3.66402e+06
Double-Precision - 128-bit FMA3 - Fused Multiply Add
GFlops = 28.848
Result = 1.84041e+06
Single-Precision - 256-bit AVX - Add/Sub
GFlops = 57.664
Result = 7.3565e+06
Double-Precision - 256-bit AVX - Add/Sub
GFlops = 28.8
Result = 3.65278e+06
Single-Precision - 256-bit AVX - Multiply
GFlops = 57.696
Result = 7.37138e+06
Double-Precision - 256-bit AVX - Multiply
GFlops = 28.848
Result = 3.69015e+06
Single-Precision - 256-bit AVX - Multiply + Add
GFlops = 57.792
Result = 6.16713e+06
Double-Precision - 256-bit AVX - Multiply + Add
GFlops = 28.944
Result = 3.06769e+06
Single-Precision - 256-bit FMA3 - Fused Multiply Add
GFlops = 115.584
Result = 7.3621e+06
Double-Precision - 256-bit FMA3 - Fused Multiply Add
GFlops = 57.696
Result = 3.67888e+06
Single-Precision - 512-bit AVX512 - Add/Sub
GFlops = 115.2
Result = 1.47602e+07
Double-Precision - 512-bit AVX512 - Add/Sub
GFlops = 57.6
Result = 7.3746e+06
Single-Precision - 512-bit AVX512 - Multiply
GFlops = 115.584
Result = 1.47348e+07
Double-Precision - 512-bit AVX512 - Multiply
GFlops = 57.792
Result = 7.37318e+06
Single-Precision - 512-bit AVX512 - Multiply + Add
GFlops = 115.584
Result = 1.23265e+07
Double-Precision - 512-bit AVX512 - Multiply + Add
GFlops = 57.792
Result = 6.13277e+06
Single-Precision - 512-bit AVX512 - Fused Multiply Add
GFlops = 231.168
Result = 1.46687e+07
Double-Precision - 512-bit AVX512 - Fused Multiply Add
GFlops = 115.2
Result = 7.32398e+06
Running Skylake Purley tuned binary with 36 thread(s)...
Single-Precision - 128-bit AVX - Add/Sub
GFlops = 401.376
Result = 5.11879e+07
Double-Precision - 128-bit AVX - Add/Sub
GFlops = 200.944
Result = 2.56799e+07
Single-Precision - 128-bit AVX - Multiply
GFlops = 402.432
Result = 5.11647e+07
Double-Precision - 128-bit AVX - Multiply
GFlops = 201.168
Result = 2.56195e+07
Single-Precision - 128-bit AVX - Multiply + Add
GFlops = 402.48
Result = 4.28282e+07
Double-Precision - 128-bit AVX - Multiply + Add
GFlops = 201.408
Result = 2.14222e+07
Single-Precision - 128-bit FMA3 - Fused Multiply Add
GFlops = 801.024
Result = 5.10709e+07
Double-Precision - 128-bit FMA3 - Fused Multiply Add
GFlops = 402.288
Result = 2.56592e+07
Single-Precision - 256-bit AVX - Add/Sub
GFlops = 802.368
Result = 1.02344e+08
Double-Precision - 256-bit AVX - Add/Sub
GFlops = 401.888
Result = 5.12359e+07
Single-Precision - 256-bit AVX - Multiply
GFlops = 805.056
Result = 1.02588e+08
Double-Precision - 256-bit AVX - Multiply
GFlops = 402.384
Result = 5.13744e+07
Single-Precision - 256-bit AVX - Multiply + Add
GFlops = 804.672
Result = 8.55144e+07
Double-Precision - 256-bit AVX - Multiply + Add
GFlops = 402.528
Result = 4.28051e+07
Single-Precision - 256-bit FMA3 - Fused Multiply Add
GFlops = 1610.11
Result = 1.02645e+08
Double-Precision - 256-bit FMA3 - Fused Multiply Add
GFlops = 804.96
Result = 5.12877e+07
Single-Precision - 512-bit AVX512 - Add/Sub
GFlops = 1610.5
Result = 2.05298e+08
Double-Precision - 512-bit AVX512 - Add/Sub
GFlops = 805.888
Result = 1.0271e+08
Single-Precision - 512-bit AVX512 - Multiply
GFlops = 1614.72
Result = 2.06027e+08
Double-Precision - 512-bit AVX512 - Multiply
GFlops = 804.864
Result = 1.02642e+08
Single-Precision - 512-bit AVX512 - Multiply + Add
GFlops = 1615.87
Result = 1.71827e+08
Double-Precision - 512-bit AVX512 - Multiply + Add
GFlops = 807.744
Result = 8.57674e+07
Single-Precision - 512-bit AVX512 - Fused Multiply Add
GFlops = 3229.44
Result = 2.05759e+08
Double-Precision - 512-bit AVX512 - Fused Multiply Add
GFlops = 1614.72
Result = 1.02841e+08
user@ubuntu:~/Flops/version3/binaries-linux$
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment