Skip to content

Instantly share code, notes, and snippets.

@luhenry
Last active December 18, 2020 00:36
Show Gist options
  • Save luhenry/2cda93cb40f3edef76cb499c896608a9 to your computer and use it in GitHub Desktop.
Save luhenry/2cda93cb40f3edef76cb499c896608a9 to your computer and use it in GitHub Desktop.
Benchmark Mode Threads Samples Score Score Error (99.9%) Unit Param: k Param: m Param: n Param: trans Param: transa Param: transb
dev.ludovic.blas.benchmarks.DaxpyBenchmark blas thrpt 1 6 10328662.50 167933.39 ops/s 100
dev.ludovic.blas.benchmarks.DaxpyBenchmark f2j thrpt 1 6 17276172.55 194877.38 ops/s 100
dev.ludovic.blas.benchmarks.DaxpyBenchmark vector thrpt 1 6 48170174.19 309585.24 ops/s 100
dev.ludovic.blas.benchmarks.DaxpyBenchmark blas thrpt 1 6 87.06 12.16 ops/s 10000000
dev.ludovic.blas.benchmarks.DaxpyBenchmark f2j thrpt 1 6 106.24 3.88 ops/s 10000000
dev.ludovic.blas.benchmarks.DaxpyBenchmark vector thrpt 1 6 121.42 1.29 ops/s 10000000
dev.ludovic.blas.benchmarks.DdotBenchmark blas thrpt 1 6 9971174.02 70502.50 ops/s 100
dev.ludovic.blas.benchmarks.DdotBenchmark f2j thrpt 1 6 10790365.04 56293.53 ops/s 100
dev.ludovic.blas.benchmarks.DdotBenchmark vector thrpt 1 6 32507396.01 462653.36 ops/s 100
dev.ludovic.blas.benchmarks.DdotBenchmark blas thrpt 1 6 112.21 7.88 ops/s 10000000
dev.ludovic.blas.benchmarks.DdotBenchmark f2j thrpt 1 6 88.53 2.60 ops/s 10000000
dev.ludovic.blas.benchmarks.DdotBenchmark vector thrpt 1 6 132.07 5.83 ops/s 10000000
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2042637.85 8542.60 ops/s 10 10 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 923011.41 1136.49 ops/s 10 10 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 921666.51 4647.82 ops/s 10 10 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 78147.69 386.24 ops/s 1000 10 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 10171.39 276.29 ops/s 1000 10 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 10259.97 32.87 ops/s 1000 10 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 82944.62 422.25 ops/s 10 1000 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 17731.58 129.65 ops/s 10 1000 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 17638.89 83.05 ops/s 10 1000 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2495.89 80.93 ops/s 1000 1000 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 167.11 4.35 ops/s 1000 1000 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 163.95 12.00 ops/s 1000 1000 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 78734.43 4195.57 ops/s 10 10 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 9802.36 26.60 ops/s 10 10 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 9622.01 538.65 ops/s 10 10 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1616.66 965.03 ops/s 1000 10 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 106.44 0.81 ops/s 1000 10 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 106.68 0.41 ops/s 1000 10 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2111.52 148.09 ops/s 10 1000 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 171.32 0.34 ops/s 10 1000 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 171.44 0.14 ops/s 10 1000 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 105.51 0.81 ops/s 1000 1000 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 1.60 0.23 ops/s 1000 1000 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 1.66 0.03 ops/s 1000 1000 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2054290.08 13240.66 ops/s 10 10 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 936262.08 2093.58 ops/s 10 10 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 1163587.20 17250.06 ops/s 10 10 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 79484.61 768.98 ops/s 1000 10 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 10841.49 15.91 ops/s 1000 10 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 25887.80 1341.88 ops/s 1000 10 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 83362.52 2769.93 ops/s 10 1000 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 10094.88 22.52 ops/s 10 1000 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 12025.65 180.84 ops/s 10 1000 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2597.56 98.58 ops/s 1000 1000 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 103.06 0.48 ops/s 1000 1000 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 215.88 11.00 ops/s 1000 1000 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 79787.90 1608.62 ops/s 10 10 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 9753.82 29.97 ops/s 10 10 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 12516.77 1782.32 ops/s 10 10 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1613.01 2603.21 ops/s 1000 10 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 107.18 0.29 ops/s 1000 10 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 243.14 13.07 ops/s 1000 10 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2144.77 121.60 ops/s 10 1000 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 99.29 0.43 ops/s 10 1000 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 115.72 18.19 ops/s 10 1000 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 105.16 1.03 ops/s 1000 1000 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 1.03 0.01 ops/s 1000 1000 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 2.07 0.22 ops/s 1000 1000 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1930441.88 5692.22 ops/s 10 10 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 926376.89 1562.97 ops/s 10 10 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 920685.85 3918.77 ops/s 10 10 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 62923.75 243.08 ops/s 1000 10 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 10077.18 1095.50 ops/s 1000 10 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 9981.89 616.41 ops/s 1000 10 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 81954.14 1084.29 ops/s 10 1000 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 16855.39 96.91 ops/s 10 1000 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 17299.23 1284.96 ops/s 10 1000 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2421.07 445.05 ops/s 1000 1000 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 150.69 6.03 ops/s 1000 1000 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 152.34 0.51 ops/s 1000 1000 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 68111.31 2490.68 ops/s 10 10 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 9324.40 569.98 ops/s 10 10 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 9325.47 533.31 ops/s 10 10 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1897.84 50.92 ops/s 1000 10 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 73.31 2.28 ops/s 1000 10 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 72.61 3.23 ops/s 1000 10 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1559.77 2136.80 ops/s 10 1000 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 155.75 1.01 ops/s 10 1000 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 156.11 0.48 ops/s 10 1000 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 104.29 1.23 ops/s 1000 1000 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 1.49 0.00 ops/s 1000 1000 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 1.47 0.03 ops/s 1000 1000 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1919851.79 92851.33 ops/s 10 10 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 701353.88 16174.11 ops/s 10 10 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 700310.84 12532.63 ops/s 10 10 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 62829.00 2367.83 ops/s 1000 10 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 10606.50 87.29 ops/s 1000 10 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 10605.33 23.56 ops/s 1000 10 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 82212.61 1444.81 ops/s 10 1000 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 7696.81 5.12 ops/s 10 1000 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 7697.56 13.72 ops/s 10 1000 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2117.46 1719.39 ops/s 1000 1000 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 98.67 0.33 ops/s 1000 1000 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 97.85 1.51 ops/s 1000 1000 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 68149.58 803.16 ops/s 10 10 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 7507.17 33.70 ops/s 10 10 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 7475.05 37.89 ops/s 10 10 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1884.05 149.60 ops/s 1000 10 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 46.96 0.40 ops/s 1000 10 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 46.78 0.79 ops/s 1000 10 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2103.09 54.31 ops/s 10 1000 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 73.73 0.16 ops/s 10 1000 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 73.65 0.42 ops/s 10 1000 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 103.76 1.26 ops/s 1000 1000 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 0.50 0.00 ops/s 1000 1000 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 0.46 0.00 ops/s 1000 1000 1000 T T
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 5008783.46 4293.61 ops/s 10 10 N
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 7854869.02 98402.52 ops/s 10 10 N
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 7625954.10 70418.09 ops/s 10 10 N
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 36127.91 28789.86 ops/s 10000 10 N
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 15514.60 649.30 ops/s 10000 10 N
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 15493.31 540.77 ops/s 10000 10 N
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 43836.63 25512.37 ops/s 10 10000 N
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 8939.15 75.31 ops/s 10 10000 N
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 8920.73 46.53 ops/s 10 10000 N
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 23.23 1.93 ops/s 10000 10000 N
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 13.11 0.14 ops/s 10000 10000 N
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 13.16 0.15 ops/s 10000 10000 N
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 4786332.19 100601.19 ops/s 10 10 T
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 8206584.15 25010.44 ops/s 10 10 T
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 11130624.39 454188.19 ops/s 10 10 T
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 127517.19 13292.91 ops/s 10000 10 T
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 10729.20 48.88 ops/s 10000 10 T
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 25482.85 2996.13 ops/s 10000 10 T
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 45755.79 2330.34 ops/s 10 10000 T
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 9565.30 54.52 ops/s 10 10000 T
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 12339.32 477.91 ops/s 10 10000 T
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 23.30 3.03 ops/s 10000 10000 T
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 9.41 0.05 ops/s 10000 10000 T
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 16.72 0.22 ops/s 10000 10000 T
dev.ludovic.blas.benchmarks.DscalBenchmark blas thrpt 1 6 563953.00 1739.09 ops/s 100
dev.ludovic.blas.benchmarks.DscalBenchmark f2j thrpt 1 6 298616.50 1021.02 ops/s 100
dev.ludovic.blas.benchmarks.DscalBenchmark vector thrpt 1 6 1115766.10 7924.88 ops/s 100
dev.ludovic.blas.benchmarks.DscalBenchmark blas thrpt 1 6 55.08 2.32 ops/s 10000000
dev.ludovic.blas.benchmarks.DscalBenchmark f2j thrpt 1 6 2.97 0.04 ops/s 10000000
dev.ludovic.blas.benchmarks.DscalBenchmark vector thrpt 1 6 10.93 0.06 ops/s 10000000
dev.ludovic.blas.benchmarks.DspmvBenchmark blas thrpt 1 6 2739137.38 18297.23 ops/s 10
dev.ludovic.blas.benchmarks.DspmvBenchmark f2j thrpt 1 6 7479406.81 69014.28 ops/s 10
dev.ludovic.blas.benchmarks.DspmvBenchmark vector thrpt 1 6 6682122.03 132656.47 ops/s 10
dev.ludovic.blas.benchmarks.DspmvBenchmark blas thrpt 1 6 6468.91 234.80 ops/s 1000
dev.ludovic.blas.benchmarks.DspmvBenchmark f2j thrpt 1 6 1564.49 26.45 ops/s 1000
dev.ludovic.blas.benchmarks.DspmvBenchmark vector thrpt 1 6 880.45 29.84 ops/s 1000
dev.ludovic.blas.benchmarks.DsprBenchmark blas thrpt 1 6 3833498.05 18192.89 ops/s 10
dev.ludovic.blas.benchmarks.DsprBenchmark f2j thrpt 1 6 10447901.93 40333.68 ops/s 10
dev.ludovic.blas.benchmarks.DsprBenchmark vector thrpt 1 6 17533693.83 66560.52 ops/s 10
dev.ludovic.blas.benchmarks.DsprBenchmark blas thrpt 1 6 26347.24 1385.39 ops/s 1000
dev.ludovic.blas.benchmarks.DsprBenchmark f2j thrpt 1 6 3161.98 375.87 ops/s 1000
dev.ludovic.blas.benchmarks.DsprBenchmark vector thrpt 1 6 5853.33 194.00 ops/s 1000
dev.ludovic.blas.benchmarks.DsyrBenchmark blas thrpt 1 6 3949112.03 95713.30 ops/s 10
dev.ludovic.blas.benchmarks.DsyrBenchmark f2j thrpt 1 6 10566850.53 252425.40 ops/s 10
dev.ludovic.blas.benchmarks.DsyrBenchmark vector thrpt 1 6 17665244.00 93406.18 ops/s 10
dev.ludovic.blas.benchmarks.DsyrBenchmark blas thrpt 1 6 17960.69 5822.88 ops/s 1000
dev.ludovic.blas.benchmarks.DsyrBenchmark f2j thrpt 1 6 2909.92 134.90 ops/s 1000
dev.ludovic.blas.benchmarks.DsyrBenchmark vector thrpt 1 6 5211.19 133.58 ops/s 1000
dev.ludovic.blas.benchmarks.SdotBenchmark blas thrpt 1 6 10620811.19 152330.38 ops/s 100
dev.ludovic.blas.benchmarks.SdotBenchmark f2j thrpt 1 6 10834661.50 122757.07 ops/s 100
dev.ludovic.blas.benchmarks.SdotBenchmark vector thrpt 1 6 30110486.93 3713824.18 ops/s 100
dev.ludovic.blas.benchmarks.SdotBenchmark blas thrpt 1 6 285.67 13.37 ops/s 10000000
dev.ludovic.blas.benchmarks.SdotBenchmark f2j thrpt 1 6 95.90 1.23 ops/s 10000000
dev.ludovic.blas.benchmarks.SdotBenchmark vector thrpt 1 6 212.25 5.01 ops/s 10000000
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment