Last active
October 16, 2021 14:44
-
-
Save Muhammad-Yunus/d8a95e91a6e48a0de6409c734487ebac to your computer and use it in GitHub Desktop.
PoCL 1.7 clinfo Jetson TK1 CPU ARM Cortex A15 + GPU NVIDIA GK20a (CUDA 6.5 Enable)
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Number of platforms 1 | |
| Platform Name Portable Computing Language | |
| Platform Vendor The pocl project | |
| Platform Version OpenCL 2.0 pocl 1.7, Debug+Asserts, LLVM 8.0.0, RELOC, SLEEF, FP16, CUDA, POCL_DEBUG | |
| Platform Profile FULL_PROFILE | |
| Platform Extensions cl_khr_icd | |
| Platform Extensions function suffix POCL | |
| Platform Name Portable Computing Language | |
| Number of devices 2 | |
| Device Name pthread-cortex-a15 | |
| Device Vendor ARM | |
| Device Vendor ID 0x13b5 | |
| Device Version OpenCL 1.2 pocl HSTR: pthread-armv7-unknown-linux-gnueabihf-cortex-a15 | |
| Driver Version 1.7 | |
| Device OpenCL C Version OpenCL C 1.2 pocl | |
| Device Type CPU | |
| Device Profile FULL_PROFILE | |
| Max compute units 4 | |
| Max clock frequency 2065MHz | |
| Device Partition (core) | |
| Max number of sub-devices 4 | |
| Supported partition types equally, by counts | |
| Max work item dimensions 3 | |
| Max work item sizes 4096x4096x4096 | |
| Max work group size 4096 | |
| Preferred work group size multiple 8 | |
| Preferred / native vector sizes | |
| char 16 / 16 | |
| short 8 / 8 | |
| int 4 / 4 | |
| long 2 / 2 | |
| half 8 / 8 (cl_khr_fp16) | |
| float 4 / 4 | |
| double 2 / 2 (cl_khr_fp64) | |
| Half-precision Floating-point support (cl_khr_fp16) | |
| Denormals No | |
| Infinity and NANs No | |
| Round to nearest No | |
| Round to zero No | |
| Round to infinity No | |
| IEEE754-2008 fused multiply-add No | |
| Support is emulated in software No | |
| Correctly-rounded divide and sqrt operations No | |
| Single-precision Floating-point support (core) | |
| Denormals No | |
| Infinity and NANs Yes | |
| Round to nearest Yes | |
| Round to zero No | |
| Round to infinity No | |
| IEEE754-2008 fused multiply-add No | |
| Support is emulated in software No | |
| Correctly-rounded divide and sqrt operations No | |
| Double-precision Floating-point support (cl_khr_fp64) | |
| Denormals Yes | |
| Infinity and NANs Yes | |
| Round to nearest Yes | |
| Round to zero Yes | |
| Round to infinity Yes | |
| IEEE754-2008 fused multiply-add Yes | |
| Support is emulated in software No | |
| Correctly-rounded divide and sqrt operations Yes | |
| Address bits 32, Little-Endian | |
| Global memory size 1488288768 (1.386GiB) | |
| Error Correction support No | |
| Max memory allocation 536870912 (512MiB) | |
| Unified memory for Host and Device Yes | |
| Minimum alignment for any data type 128 bytes | |
| Alignment of base address 1024 bits (128 bytes) | |
| Global Memory cache type None | |
| Image support Yes | |
| Max number of samplers per kernel 16 | |
| Max size for 1D images from buffer 33554432 pixels | |
| Max 1D or 2D image array size 2048 images | |
| Max 2D image size 8192x8192 pixels | |
| Max 3D image size 2048x2048x2048 pixels | |
| Max number of read image args 128 | |
| Max number of write image args 128 | |
| Local memory type Global | |
| Local memory size 524288 (512KiB) | |
| Max constant buffer size 524288 (512KiB) | |
| Max number of constant args 8 | |
| Max size of kernel argument 1024 | |
| Queue properties | |
| Out-of-order execution Yes | |
| Profiling Yes | |
| Prefer user sync for interop Yes | |
| Profiling timer resolution 1ns | |
| Execution capabilities | |
| Run OpenCL kernels Yes | |
| Run native kernels Yes | |
| printf() buffer size 16777216 (16MiB) | |
| Built-in kernels | |
| Device Available Yes | |
| Compiler Available Yes | |
| Linker Available Yes | |
| Device Extensions cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_3d_image_writes cl_khr_fp16 cl_khr_fp64 | |
| Device Name GK20A | |
| Device Vendor NVIDIA Corporation | |
| Device Vendor ID 0x10de | |
| Device Version OpenCL 1.2 pocl HSTR: CUDA-sm_32 | |
| Driver Version 1.7 | |
| Device OpenCL C Version OpenCL C 1.2 pocl | |
| Device Type GPU | |
| Device Profile FULL_PROFILE | |
| Device Topology (NV) PCI-E, 00:00.0 | |
| Max compute units 1 | |
| Max clock frequency 852MHz | |
| Compute Capability (NV) 3.2 | |
| Device Partition (core) | |
| Max number of sub-devices 1 | |
| Supported partition types None | |
| Max work item dimensions 3 | |
| Max work item sizes 1024x1024x64 | |
| Max work group size 1024 | |
| Preferred work group size multiple 32 | |
| Warp size (NV) 32 | |
| Preferred / native vector sizes | |
| char 1 / 1 | |
| short 1 / 1 | |
| int 1 / 1 | |
| long 1 / 1 | |
| half 0 / 0 (n/a) | |
| float 1 / 1 | |
| double 1 / 1 (cl_khr_fp64) | |
| Half-precision Floating-point support (n/a) | |
| Single-precision Floating-point support (core) | |
| Denormals Yes | |
| Infinity and NANs Yes | |
| Round to nearest Yes | |
| Round to zero Yes | |
| Round to infinity Yes | |
| IEEE754-2008 fused multiply-add Yes | |
| Support is emulated in software No | |
| Correctly-rounded divide and sqrt operations No | |
| Double-precision Floating-point support (cl_khr_fp64) | |
| Denormals Yes | |
| Infinity and NANs Yes | |
| Round to nearest Yes | |
| Round to zero Yes | |
| Round to infinity Yes | |
| IEEE754-2008 fused multiply-add Yes | |
| Support is emulated in software No | |
| Correctly-rounded divide and sqrt operations No | |
| Address bits 32, Little-Endian | |
| Global memory size 1984385024 (1.848GiB) | |
| Error Correction support No | |
| Max memory allocation 496096256 (473.1MiB) | |
| Unified memory for Host and Device Yes | |
| Integrated memory (NV) Yes | |
| Minimum alignment for any data type 128 bytes | |
| Alignment of base address 4096 bits (512 bytes) | |
| Global Memory cache type None | |
| Image support No | |
| Local memory type Local | |
| Local memory size 49152 (48KiB) | |
| Registers per block (NV) 32768 | |
| Max constant buffer size 65536 (64KiB) | |
| Max number of constant args 8 | |
| Max size of kernel argument 1024 | |
| Queue properties | |
| Out-of-order execution No | |
| Profiling Yes | |
| Prefer user sync for interop Yes | |
| Profiling timer resolution 1ns | |
| Execution capabilities | |
| Run OpenCL kernels Yes | |
| Run native kernels No | |
| Kernel execution timeout (NV) No | |
| Concurrent copy and kernel execution (NV) Yes | |
| Number of async copy engines 1 | |
| printf() buffer size 16777216 (16MiB) | |
| Built-in kernels | |
| Device Available Yes | |
| Compiler Available Yes | |
| Linker Available Yes | |
| Device Extensions cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_nv_device_attribute_query | |
| NULL platform behavior | |
| clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) Portable Computing Language | |
| clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) Success [POCL] | |
| clCreateContext(NULL, ...) [default] Success [POCL] | |
| clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) Success (1) | |
| Platform Name Portable Computing Language | |
| Device Name pthread-cortex-a15 | |
| clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) Success (1) | |
| Platform Name Portable Computing Language | |
| Device Name GK20A | |
| clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No devices found in platform | |
| clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) No devices found in platform | |
| clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) Success (2) | |
| Platform Name Portable Computing Language | |
| Device Name pthread-cortex-a15 | |
| Device Name GK20A | |
| ICD loader properties | |
| ICD loader Name OpenCL ICD Loader | |
| ICD loader Vendor OCL Icd free software | |
| ICD loader Version 2.2.8 | |
| ICD loader Profile OpenCL 1.2 | |
| NOTE: your OpenCL library declares to support OpenCL 1.2, | |
| but it seems to support up to OpenCL 2.1 too. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment