Last active
January 8, 2016 09:40
$ ./bin/opencl-stream
---OpenCL: Platform Vendor 0: Mesa
---OpenCL: Device found 0. AMD TONGA (DRM 3.1.0, LLVM 3.8.0)
---OpenCL: CL_DEVICE_MAX_MEM_ALLOC_SIZE: 256 MB
---OpenCL: CL_DEVICE_GLOBAL_MEM_CACHELINE_SIZE: 0 B
---OpenCL: CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT: 4 B
---OpenCL: CL_DEVICE_NATIVE_VECTOR_WIDTH_FLOAT: 4 B
---OpenCL: CL_DEVICE_PREFERRED_VECTOR_WIDTH_DOUBLE: 2 B
---OpenCL: CL_DEVICE_NATIVE_VECTOR_WIDTH_DOUBLE: 2 B
---OpenCL: CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT: 4 B
---OpenCL: CL_DEVICE_NATIVE_VECTOR_WIDTH_INT: 4 B
---OpenCL: CL_DEVICE_PREFERRED_VECTOR_WIDTH_LONG: 2 B
---OpenCL: CL_DEVICE_NATIVE_VECTOR_WIDTH_LONG: 2 B
Auto-selecting platform 0.
Auto-selecting device 0.
---------------------------------------------------------------------------------------------------
Function        Best Rate GB/s    Avg time    Min time    Max time  Best Workgroup Size  Best GFLOPS
---------------------------------------------------------------------------------------------------
copyKernel1            136.778    0.088240    0.182778    1.723276                   64       0.000
copyKernel2            136.027    0.057542    0.183787    0.904594                   32       0.000
copyKernel4            112.169    0.063991    0.222879    0.820489                    8       0.000
copyKernel8            119.263    0.063769    0.209621    0.813073                    8       0.000
copyKernel16           106.846    0.075604    0.233981    0.813352                    4       0.000
---------------------------------------------------------------------------------------------------
scaleKernel1           136.355    0.097945    0.183346    1.950134                   64       9.151
scaleKernel2           135.931    0.072307    0.183916    1.302218                   32       9.122
scaleKernel4           135.681    0.063167    0.184256    1.089181                   16       9.105
scaleKernel8           133.663    0.067694    0.187038    1.004416                    8       8.970
scaleKernel16           90.921    0.088369    0.274963    1.007259                    4       6.102
---------------------------------------------------------------------------------------------------
addKernel1              75.402    0.141642    0.331556    2.488551                  256       5.060
addKernel2              86.989    0.097308    0.287392    1.308611                  256       5.838
addKernel4              91.205    0.082391    0.274108    1.217896                   32       6.121
addKernel8              84.097    0.104618    0.297276    1.217038                   16       5.644
addKernel16             73.294    0.109907    0.341091    1.219109                    8       4.919
---------------------------------------------------------------------------------------------------
triadKernel1           113.341    0.143745    0.330859    2.506930                  256      10.142
triadKernel2           132.021    0.111400    0.284046    1.536999                  256      11.813
triadKernel4           136.679    0.085245    0.274366    1.235817                   32      12.230
triadKernel8           123.406    0.107329    0.303874    1.263041                    8      11.042
triadKernel16          111.287    0.111690    0.336968    1.258950                    8       9.958
---------------------------------------------------------------------------------------------------
$ ./bin/opencl-stream
---OpenCL: Platform Vendor 0: Mesa
---OpenCL: Device found 0. AMD TONGA (DRM 3.1.0, LLVM 3.8.0)
Auto-selecting platform 0.
Auto-selecting device 0.
OpenCL Error -61 (CL_INVALID_BUFFER_SIZE), line 98
--------------------------------------------------------------------------------------------------
Function        Best Rate GB/s    Avg time    Min time    Max time  Best Workgroup Size  Best GLOPS
--------------------------------------------------------------------------------------------------
copyKernel1             51.458    0.327213    0.971664    5.200968                   16       0.000
copyKernel2             51.305    0.320870    0.974558    2.957927                    8       0.000
copyKernel4             50.460    0.397019    0.990882    4.089595                    8       0.000
copyKernel8             47.185    0.500957    1.059664    6.938917                    8       0.000
copyKernel16            37.611    1.097749    1.329412   14.879067                    2       0.000
--------------------------------------------------------------------------------------------------
scaleKernel1            51.050    0.349621    0.979433    5.504304                   16       3.426
scaleKernel2            47.575    0.342027    1.050975    3.134202                    8       3.193
scaleKernel4            45.312    0.349201    1.103455    2.965188                   16       3.041
scaleKernel8            42.159    0.528744    1.185989    6.928565                    4       2.829
scaleKernel16           44.376    1.306128    1.126743   16.487554                    4       2.978
--------------------------------------------------------------------------------------------------
addKernel1              45.929    0.360080    1.088644    6.068718                   32       3.082
addKernel2              45.954    0.326798    1.088034    3.135593                   16       3.084
addKernel4              46.913    0.315717    1.065802    2.501885                   16       3.148
addKernel8              47.049    0.416834    1.062731    4.461564                    8       3.157
addKernel16             46.972    0.862984    1.064466   13.083562                    4       3.152
--------------------------------------------------------------------------------------------------
triadKernel1            69.291    0.369788    1.082395    6.350230                   32       6.200
triadKernel2            69.071    0.336216    1.085844    3.356133                   16       6.180
triadKernel4            70.164    0.318887    1.068925    2.627155                   16       6.278
triadKernel8            69.727    0.418932    1.075629    4.313267                    8       6.239
triadKernel16           58.054    0.923546    1.291897   14.418079                    4       5.195
--------------------------------------------------------------------------------------------------
Error in result!