Created October 31, 2013 07:01
Bullet3 on Radeon R9 280X (1050 MHz OC model)
Demo settings: | |
SelectedDemo=1, demoname = BoxBox | |
x_dim=30, y_dim=30, z_dim=30 | |
x_gap=16.299999, y_gap=6.300000, z_gap=16.299999 | |
OpenCL settings: | |
Preferred cl_device index 1 | |
Preferred cl_platform index-1 | |
Platform info: | |
CL_PLATFORM_VENDOR: Advanced Micro Devices, Inc. | |
CL_PLATFORM_NAME: AMD Accelerated Parallel Processing | |
CL_PLATFORM_VERSION: OpenCL 1.2 AMD-APP (1348.4) | |
Device Info: | |
CL_DEVICE_NAME: Tahiti | |
CL_DEVICE_VENDOR: Advanced Micro Devices, Inc. | |
CL_DRIVER_VERSION: 1348.4 (VM) | |
CL_DEVICE_TYPE: CL_DEVICE_TYPE_GPU | |
CL_DEVICE_MAX_COMPUTE_UNITS: 32 | |
CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3 | |
CL_DEVICE_MAX_WORK_ITEM_SIZES: 256 / 256 / 256 | |
CL_DEVICE_MAX_WORK_GROUP_SIZE: 256 | |
CL_DEVICE_MAX_CLOCK_FREQUENCY: 650 MHz | |
CL_DEVICE_ADDRESS_BITS: 32 | |
CL_DEVICE_MAX_MEM_ALLOC_SIZE: 512 MByte | |
CL_DEVICE_GLOBAL_MEM_SIZE: 2048 MByte | |
CL_DEVICE_ERROR_CORRECTION_SUPPORT: no | |
CL_DEVICE_LOCAL_MEM_TYPE: local | |
CL_DEVICE_LOCAL_MEM_SIZE: 32 KByte | |
CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE: 64 KByte | |
CL_DEVICE_QUEUE_PROPERTIES: CL_QUEUE_PROFILING_ENABLE | |
CL_DEVICE_IMAGE_SUPPORT: 1 | |
CL_DEVICE_MAX_READ_IMAGE_ARGS: 128 | |
CL_DEVICE_MAX_WRITE_IMAGE_ARGS: 8 | |
CL_DEVICE_IMAGE <dim> 2D_MAX_WIDTH 16384 | |
2D_MAX_HEIGHT 16384 | |
3D_MAX_WIDTH 2048 | |
3D_MAX_HEIGHT 2048 | |
3D_MAX_DEPTH 2048 | |
CL_DEVICE_EXTENSIONS:cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_atomic_counters_32 cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer | |
CL_DEVICE_PREFERRED_VECTOR_WIDTH_<t> CHAR 4, SHORT 2, INT 1,LONG 1, FLOAT 1, DOUBLE 1 | |
================================== | |
Frame 1: | |
---------------------------------- | |
Profiling: Root (total running time: 207.135 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.15 %) :: 0.320 ms / frame (1 calls) | |
2 -- gui->draw (0.61 %) :: 1.269 ms / frame (1 calls) | |
3 -- renderScene (0.16 %) :: 0.330 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (98.94 %) :: 204.935 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.136 %) :: 0.281 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.330 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.94 %) :: 0.310 ms / frame (2 calls) | |
...Unaccounted: (6.061 %) :: 0.020 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.310 ms) --- | |
......0 -- glUseProgram(0); (1.61 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (3.87 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (41.94 %) :: 0.130 ms / frame (4 calls) | |
......3 -- glFlush2 (5.81 %) :: 0.018 ms / frame (2 calls) | |
......4 -- updateCamera (0.97 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (1.61 %) :: 0.005 ms / frame (2 calls) | |
......Unaccounted: (44.194 %) :: 0.137 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.005 ms) --- | |
.........0 -- texture (80.00 %) :: 0.004 ms / frame (2 calls) | |
.........Unaccounted: (20.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 204.935 ms) --- | |
...0 -- cl2gl_upload (84.75 %) :: 173.689 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (15.25 %) :: 31.243 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 31.243 ms) --- | |
......0 -- solveContacts (66.81 %) :: 20.873 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (8.71 %) :: 2.721 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (7.68 %) :: 2.400 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (16.58 %) :: 5.179 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.05 %) :: 0.015 ms / frame (1 calls) | |
......Unaccounted: (0.176 %) :: 0.055 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.873 ms) --- | |
.........0 -- GPU solveContactConstraint (69.91 %) :: 14.592 ms / frame (1 calls) | |
.........1 -- batching (29.71 %) :: 6.201 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.37 %) :: 0.077 ms / frame (1 calls) | |
.........Unaccounted: (0.014 %) :: 0.003 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.592 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.75 %) :: 6.822 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.25 %) :: 7.770 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.201 ms) --- | |
............0 -- m_contactToConstraintKernel (27.85 %) :: 1.727 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.13 %) :: 0.008 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.46 %) :: 4.431 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.15 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.26 %) :: 0.016 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.113 %) :: 0.007 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.008 ms) --- | |
...............0 -- batch generation (87.50 %) :: 0.007 ms / frame (1 calls) | |
...............Unaccounted: (12.500 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.007 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.007 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.077 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.30 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.701 %) :: 0.076 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.721 ms) --- | |
.........0 -- clipHullHullKernel (57.37 %) :: 1.561 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.87 %) :: 0.187 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (22.38 %) :: 0.609 ms / frame (1 calls) | |
.........Unaccounted: (13.377 %) :: 0.364 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.179 ms) --- | |
.........0 -- sapKernel (73.86 %) :: 3.825 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.56 %) :: 0.081 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.27 %) :: 0.066 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.17 %) :: 0.423 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.02 %) :: 0.053 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (0.97 %) :: 0.050 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (12.15 %) :: 0.629 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.08 %) :: 0.004 ms / frame (1 calls) | |
.........Unaccounted: (0.927 %) :: 0.048 ms | |
================================== | |
Frame 2: | |
---------------------------------- | |
Profiling: Root (total running time: 209.995 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.07 %) :: 0.157 ms / frame (1 calls) | |
2 -- gui->draw (0.59 %) :: 1.239 ms / frame (1 calls) | |
3 -- renderScene (0.17 %) :: 0.355 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (97.87 %) :: 205.526 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (1.294 %) :: 2.718 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.355 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.65 %) :: 0.336 ms / frame (2 calls) | |
...Unaccounted: (5.352 %) :: 0.019 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.336 ms) --- | |
......0 -- glUseProgram(0); (0.89 %) :: 0.003 ms / frame (2 calls) | |
......1 -- glFlush (3.87 %) :: 0.013 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (35.12 %) :: 0.118 ms / frame (4 calls) | |
......3 -- glFlush2 (8.33 %) :: 0.028 ms / frame (2 calls) | |
......4 -- updateCamera (0.60 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (3.57 %) :: 0.012 ms / frame (2 calls) | |
......Unaccounted: (47.619 %) :: 0.160 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.012 ms) --- | |
.........0 -- texture (83.33 %) :: 0.010 ms / frame (2 calls) | |
.........Unaccounted: (16.667 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 205.526 ms) --- | |
...0 -- cl2gl_upload (86.38 %) :: 177.543 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (13.61 %) :: 27.981 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 27.981 ms) --- | |
......0 -- solveContacts (70.43 %) :: 19.706 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.92 %) :: 2.775 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.63 %) :: 0.176 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (18.74 %) :: 5.243 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.261 %) :: 0.073 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 19.706 ms) --- | |
.........0 -- GPU solveContactConstraint (68.57 %) :: 13.512 ms / frame (1 calls) | |
.........1 -- batching (31.02 %) :: 6.113 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.41 %) :: 0.080 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 13.512 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (47.11 %) :: 6.366 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (52.87 %) :: 7.144 ms / frame (1 calls) | |
............Unaccounted: (0.015 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.113 ms) --- | |
............0 -- m_contactToConstraintKernel (27.37 %) :: 1.673 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.08 %) :: 0.005 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.47 %) :: 4.369 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.15 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.28 %) :: 0.017 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.605 %) :: 0.037 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.005 ms) --- | |
...............0 -- batch generation (80.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (20.000 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.080 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (3.75 %) :: 0.003 ms / frame (1 calls) | |
............Unaccounted: (96.250 %) :: 0.077 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.775 ms) --- | |
.........0 -- clipHullHullKernel (56.97 %) :: 1.581 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (8.25 %) :: 0.229 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (22.45 %) :: 0.623 ms / frame (1 calls) | |
.........Unaccounted: (12.324 %) :: 0.342 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.243 ms) --- | |
.........0 -- sapKernel (73.13 %) :: 3.834 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.81 %) :: 0.095 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.74 %) :: 0.091 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.84 %) :: 0.411 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.47 %) :: 0.077 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.41 %) :: 0.074 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.18 %) :: 0.586 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.392 %) :: 0.073 ms | |
================================== | |
Frame 3: | |
---------------------------------- | |
Profiling: Root (total running time: 209.941 ms) --- | |
0 -- glFinish (0.00 %) :: 0.001 ms / frame (1 calls) | |
1 -- window->endRendering (0.07 %) :: 0.144 ms / frame (1 calls) | |
2 -- gui->draw (0.60 %) :: 1.257 ms / frame (1 calls) | |
3 -- renderScene (0.14 %) :: 0.303 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (98.56 %) :: 206.921 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.626 %) :: 1.315 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.303 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (92.41 %) :: 0.280 ms / frame (2 calls) | |
...Unaccounted: (7.591 %) :: 0.023 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.280 ms) --- | |
......0 -- glUseProgram(0); (1.07 %) :: 0.003 ms / frame (2 calls) | |
......1 -- glFlush (4.29 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (35.71 %) :: 0.100 ms / frame (4 calls) | |
......3 -- glFlush2 (9.29 %) :: 0.026 ms / frame (2 calls) | |
......4 -- updateCamera (3.93 %) :: 0.011 ms / frame (2 calls) | |
......5 -- init (1.43 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (44.286 %) :: 0.124 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (75.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (25.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 206.921 ms) --- | |
...0 -- cl2gl_upload (85.81 %) :: 177.553 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.19 %) :: 29.366 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.366 ms) --- | |
......0 -- solveContacts (73.17 %) :: 21.487 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.40 %) :: 2.759 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.51 %) :: 0.150 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (16.74 %) :: 4.915 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.163 %) :: 0.048 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.487 ms) --- | |
.........0 -- GPU solveContactConstraint (70.20 %) :: 15.083 ms / frame (1 calls) | |
.........1 -- batching (29.47 %) :: 6.333 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.33 %) :: 0.070 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.083 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.58 %) :: 7.026 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.42 %) :: 8.057 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.333 ms) --- | |
............0 -- m_contactToConstraintKernel (27.11 %) :: 1.717 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.11 %) :: 0.007 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.85 %) :: 4.550 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.14 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.521 %) :: 0.033 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.007 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.007 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.007 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.007 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.070 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.86 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.143 %) :: 0.068 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.759 ms) --- | |
.........0 -- clipHullHullKernel (57.48 %) :: 1.586 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (10.15 %) :: 0.280 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (21.96 %) :: 0.606 ms / frame (1 calls) | |
.........Unaccounted: (10.402 %) :: 0.287 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 4.915 ms) --- | |
.........0 -- sapKernel (77.25 %) :: 3.797 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.65 %) :: 0.081 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.36 %) :: 0.067 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.91 %) :: 0.389 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.04 %) :: 0.051 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (0.94 %) :: 0.046 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (8.91 %) :: 0.438 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (0.895 %) :: 0.044 ms | |
================================== | |
Frame 4: | |
---------------------------------- | |
Profiling: Root (total running time: 210.589 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.11 %) :: 0.232 ms / frame (1 calls) | |
2 -- gui->draw (0.61 %) :: 1.276 ms / frame (1 calls) | |
3 -- renderScene (0.22 %) :: 0.457 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (98.18 %) :: 206.754 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.888 %) :: 1.870 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.457 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.44 %) :: 0.427 ms / frame (2 calls) | |
...Unaccounted: (6.565 %) :: 0.030 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.427 ms) --- | |
......0 -- glUseProgram(0); (1.17 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.45 %) :: 0.019 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (45.43 %) :: 0.194 ms / frame (4 calls) | |
......3 -- glFlush2 (6.56 %) :: 0.028 ms / frame (2 calls) | |
......4 -- updateCamera (0.94 %) :: 0.004 ms / frame (2 calls) | |
......5 -- init (1.17 %) :: 0.005 ms / frame (2 calls) | |
......Unaccounted: (40.281 %) :: 0.172 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.005 ms) --- | |
.........0 -- texture (80.00 %) :: 0.004 ms / frame (2 calls) | |
.........Unaccounted: (20.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 206.754 ms) --- | |
...0 -- cl2gl_upload (86.12 %) :: 178.063 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (13.88 %) :: 28.689 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 28.689 ms) --- | |
......0 -- solveContacts (71.90 %) :: 20.627 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (8.88 %) :: 2.547 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.87 %) :: 0.249 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (17.86 %) :: 5.123 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.471 %) :: 0.135 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.627 ms) --- | |
.........0 -- GPU solveContactConstraint (69.33 %) :: 14.301 ms / frame (1 calls) | |
.........1 -- batching (30.30 %) :: 6.249 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.36 %) :: 0.074 ms / frame (1 calls) | |
.........Unaccounted: (0.015 %) :: 0.003 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.301 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (43.87 %) :: 6.274 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (56.11 %) :: 8.025 ms / frame (1 calls) | |
............Unaccounted: (0.014 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.249 ms) --- | |
............0 -- m_contactToConstraintKernel (27.09 %) :: 1.693 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.11 %) :: 0.007 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.17 %) :: 4.510 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.00 %) :: 0.000 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.14 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.26 %) :: 0.016 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.192 %) :: 0.012 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.007 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.007 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.007 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.007 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.074 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.70 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.297 %) :: 0.072 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.547 ms) --- | |
.........0 -- clipHullHullKernel (60.31 %) :: 1.536 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (5.38 %) :: 0.137 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.24 %) :: 0.592 ms / frame (1 calls) | |
.........Unaccounted: (11.072 %) :: 0.282 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.123 ms) --- | |
.........0 -- sapKernel (77.16 %) :: 3.953 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.58 %) :: 0.081 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.29 %) :: 0.066 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.65 %) :: 0.392 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.07 %) :: 0.055 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.66 %) :: 0.085 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (8.61 %) :: 0.441 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (0.937 %) :: 0.048 ms | |
================================== | |
Frame 5: | |
---------------------------------- | |
Profiling: Root (total running time: 210.050 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.05 %) :: 0.105 ms / frame (1 calls) | |
2 -- gui->draw (0.60 %) :: 1.268 ms / frame (1 calls) | |
3 -- renderScene (0.18 %) :: 0.372 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.13 %) :: 208.229 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.036 %) :: 0.076 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.372 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (88.44 %) :: 0.329 ms / frame (2 calls) | |
...Unaccounted: (11.559 %) :: 0.043 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.329 ms) --- | |
......0 -- glUseProgram(0); (1.52 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.56 %) :: 0.015 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (44.07 %) :: 0.145 ms / frame (4 calls) | |
......3 -- glFlush2 (7.29 %) :: 0.024 ms / frame (2 calls) | |
......4 -- updateCamera (0.91 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (1.82 %) :: 0.006 ms / frame (2 calls) | |
......Unaccounted: (39.818 %) :: 0.131 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.006 ms) --- | |
.........0 -- texture (66.67 %) :: 0.004 ms / frame (2 calls) | |
.........Unaccounted: (33.333 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 208.229 ms) --- | |
...0 -- cl2gl_upload (85.76 %) :: 178.567 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.003 ms / frame (1 calls) | |
...2 -- stepSimulation (14.24 %) :: 29.657 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.657 ms) --- | |
......0 -- solveContacts (71.12 %) :: 21.092 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (8.66 %) :: 2.569 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.76 %) :: 0.226 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.00 %) :: 5.634 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.04 %) :: 0.011 ms / frame (1 calls) | |
......Unaccounted: (0.421 %) :: 0.125 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.092 ms) --- | |
.........0 -- GPU solveContactConstraint (69.81 %) :: 14.725 ms / frame (1 calls) | |
.........1 -- batching (29.77 %) :: 6.279 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.41 %) :: 0.086 ms / frame (1 calls) | |
.........Unaccounted: (0.009 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.725 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (44.84 %) :: 6.603 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (55.14 %) :: 8.119 ms / frame (1 calls) | |
............Unaccounted: (0.020 %) :: 0.003 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.279 ms) --- | |
............0 -- m_contactToConstraintKernel (28.14 %) :: 1.767 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.14 %) :: 0.009 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.05 %) :: 4.461 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.16 %) :: 0.010 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.29 %) :: 0.018 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.175 %) :: 0.011 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.009 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.009 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.009 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.009 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.086 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.33 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.674 %) :: 0.084 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.569 ms) --- | |
.........0 -- clipHullHullKernel (60.69 %) :: 1.559 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.27 %) :: 0.161 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.28 %) :: 0.598 ms / frame (1 calls) | |
.........Unaccounted: (9.770 %) :: 0.251 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.634 ms) --- | |
.........0 -- sapKernel (71.42 %) :: 4.024 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.92 %) :: 0.108 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.65 %) :: 0.093 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.40 %) :: 0.417 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.40 %) :: 0.079 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.79 %) :: 0.101 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (13.05 %) :: 0.735 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.05 %) :: 0.003 ms / frame (1 calls) | |
.........Unaccounted: (1.313 %) :: 0.074 ms | |
================================== | |
Frame 6: | |
---------------------------------- | |
Profiling: Root (total running time: 210.194 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.09 %) :: 0.199 ms / frame (1 calls) | |
2 -- gui->draw (0.57 %) :: 1.208 ms / frame (1 calls) | |
3 -- renderScene (0.16 %) :: 0.331 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.14 %) :: 208.380 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.036 %) :: 0.076 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.331 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.05 %) :: 0.308 ms / frame (2 calls) | |
...Unaccounted: (6.949 %) :: 0.023 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.308 ms) --- | |
......0 -- glUseProgram(0); (1.30 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (4.22 %) :: 0.013 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (38.64 %) :: 0.119 ms / frame (4 calls) | |
......3 -- glFlush2 (10.06 %) :: 0.031 ms / frame (2 calls) | |
......4 -- updateCamera (0.97 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (1.62 %) :: 0.005 ms / frame (2 calls) | |
......Unaccounted: (43.182 %) :: 0.133 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.005 ms) --- | |
.........0 -- texture (80.00 %) :: 0.004 ms / frame (2 calls) | |
.........Unaccounted: (20.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 208.380 ms) --- | |
...0 -- cl2gl_upload (86.08 %) :: 179.383 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (13.91 %) :: 28.993 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 28.993 ms) --- | |
......0 -- solveContacts (71.84 %) :: 20.829 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.04 %) :: 2.621 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.61 %) :: 0.177 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (18.28 %) :: 5.301 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.009 ms / frame (1 calls) | |
......Unaccounted: (0.193 %) :: 0.056 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.829 ms) --- | |
.........0 -- GPU solveContactConstraint (69.40 %) :: 14.456 ms / frame (1 calls) | |
.........1 -- batching (30.23 %) :: 6.297 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.36 %) :: 0.075 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.456 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.38 %) :: 6.560 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.61 %) :: 7.895 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.297 ms) --- | |
............0 -- m_contactToConstraintKernel (27.12 %) :: 1.708 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.08 %) :: 0.005 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.29 %) :: 4.552 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.16 %) :: 0.010 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.25 %) :: 0.016 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.048 %) :: 0.003 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.005 ms) --- | |
...............0 -- batch generation (80.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (20.000 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.075 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.67 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.333 %) :: 0.073 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.621 ms) --- | |
.........0 -- clipHullHullKernel (60.40 %) :: 1.583 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.45 %) :: 0.169 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (25.72 %) :: 0.674 ms / frame (1 calls) | |
.........Unaccounted: (7.440 %) :: 0.195 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.301 ms) --- | |
.........0 -- sapKernel (73.89 %) :: 3.917 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.55 %) :: 0.135 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.43 %) :: 0.076 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.34 %) :: 0.442 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.08 %) :: 0.057 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.08 %) :: 0.057 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (10.49 %) :: 0.556 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.113 %) :: 0.059 ms | |
================================== | |
Frame 7: | |
---------------------------------- | |
Profiling: Root (total running time: 210.043 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.08 %) :: 0.169 ms / frame (1 calls) | |
2 -- gui->draw (0.58 %) :: 1.221 ms / frame (1 calls) | |
3 -- renderScene (0.15 %) :: 0.315 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.16 %) :: 208.270 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.032 %) :: 0.068 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.315 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.65 %) :: 0.295 ms / frame (2 calls) | |
...Unaccounted: (6.349 %) :: 0.020 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.295 ms) --- | |
......0 -- glUseProgram(0); (1.02 %) :: 0.003 ms / frame (2 calls) | |
......1 -- glFlush (4.07 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (48.81 %) :: 0.144 ms / frame (4 calls) | |
......3 -- glFlush2 (5.08 %) :: 0.015 ms / frame (2 calls) | |
......4 -- updateCamera (1.36 %) :: 0.004 ms / frame (2 calls) | |
......5 -- init (1.36 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (38.305 %) :: 0.113 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (75.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (25.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 208.270 ms) --- | |
...0 -- cl2gl_upload (85.31 %) :: 177.682 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.69 %) :: 30.586 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.586 ms) --- | |
......0 -- solveContacts (70.46 %) :: 21.552 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (8.85 %) :: 2.707 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.72 %) :: 0.219 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.69 %) :: 6.021 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.009 ms / frame (1 calls) | |
......Unaccounted: (0.255 %) :: 0.078 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.552 ms) --- | |
.........0 -- GPU solveContactConstraint (71.14 %) :: 15.333 ms / frame (1 calls) | |
.........1 -- batching (28.47 %) :: 6.136 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.38 %) :: 0.081 ms / frame (1 calls) | |
.........Unaccounted: (0.009 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.333 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (44.97 %) :: 6.896 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (55.03 %) :: 8.437 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.136 ms) --- | |
............0 -- m_contactToConstraintKernel (26.84 %) :: 1.647 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.08 %) :: 0.005 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.51 %) :: 4.449 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.00 %) :: 0.000 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.18 %) :: 0.011 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.26 %) :: 0.016 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.098 %) :: 0.006 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.005 ms) --- | |
...............0 -- batch generation (80.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (20.000 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.081 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.47 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.531 %) :: 0.079 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.707 ms) --- | |
.........0 -- clipHullHullKernel (58.81 %) :: 1.592 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (7.50 %) :: 0.203 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.01 %) :: 0.623 ms / frame (1 calls) | |
.........Unaccounted: (10.676 %) :: 0.289 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.021 ms) --- | |
.........0 -- sapKernel (64.94 %) :: 3.910 ms / frame (1 calls) | |
.........1 -- sap2Kernel (7.79 %) :: 0.469 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.04 %) :: 0.123 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.02 %) :: 0.483 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.76 %) :: 0.106 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.76 %) :: 0.106 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (12.01 %) :: 0.723 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.644 %) :: 0.099 ms | |
================================== | |
Frame 8: | |
---------------------------------- | |
Profiling: Root (total running time: 210.343 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.08 %) :: 0.162 ms / frame (1 calls) | |
2 -- gui->draw (0.63 %) :: 1.324 ms / frame (1 calls) | |
3 -- renderScene (0.17 %) :: 0.358 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.10 %) :: 208.440 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.028 %) :: 0.059 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.358 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (91.34 %) :: 0.327 ms / frame (2 calls) | |
...Unaccounted: (8.659 %) :: 0.031 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.327 ms) --- | |
......0 -- glUseProgram(0); (1.53 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (3.67 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (37.92 %) :: 0.124 ms / frame (4 calls) | |
......3 -- glFlush2 (7.03 %) :: 0.023 ms / frame (2 calls) | |
......4 -- updateCamera (1.22 %) :: 0.004 ms / frame (2 calls) | |
......5 -- init (1.53 %) :: 0.005 ms / frame (2 calls) | |
......Unaccounted: (47.095 %) :: 0.154 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.005 ms) --- | |
.........0 -- texture (80.00 %) :: 0.004 ms / frame (2 calls) | |
.........Unaccounted: (20.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 208.440 ms) --- | |
...0 -- cl2gl_upload (85.57 %) :: 178.359 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.43 %) :: 30.079 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.079 ms) --- | |
......0 -- solveContacts (71.18 %) :: 21.410 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.09 %) :: 2.733 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.68 %) :: 0.205 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (18.77 %) :: 5.645 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.263 %) :: 0.079 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.410 ms) --- | |
.........0 -- GPU solveContactConstraint (70.79 %) :: 15.157 ms / frame (1 calls) | |
.........1 -- batching (28.88 %) :: 6.183 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.33 %) :: 0.070 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.157 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.15 %) :: 6.995 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.85 %) :: 8.162 ms / frame (1 calls) | |
............Unaccounted: (-0.000 %) :: -0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.183 ms) --- | |
............0 -- m_contactToConstraintKernel (26.65 %) :: 1.648 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.05 %) :: 0.003 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.76 %) :: 4.499 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.113 %) :: 0.007 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.003 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.003 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.003 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.070 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.43 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.571 %) :: 0.069 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.733 ms) --- | |
.........0 -- clipHullHullKernel (60.15 %) :: 1.644 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.18 %) :: 0.169 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.34 %) :: 0.638 ms / frame (1 calls) | |
.........Unaccounted: (10.318 %) :: 0.282 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.645 ms) --- | |
.........0 -- sapKernel (70.33 %) :: 3.970 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.92 %) :: 0.165 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.80 %) :: 0.158 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.08 %) :: 0.456 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.61 %) :: 0.091 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.56 %) :: 0.088 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.18 %) :: 0.631 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.488 %) :: 0.084 ms | |
================================== | |
Frame 9: | |
---------------------------------- | |
Profiling: Root (total running time: 210.237 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.08 %) :: 0.175 ms / frame (1 calls) | |
2 -- gui->draw (0.58 %) :: 1.227 ms / frame (1 calls) | |
3 -- renderScene (0.22 %) :: 0.464 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.09 %) :: 208.325 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.022 %) :: 0.046 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.464 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (92.46 %) :: 0.429 ms / frame (2 calls) | |
...Unaccounted: (7.543 %) :: 0.035 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.429 ms) --- | |
......0 -- glUseProgram(0); (0.93 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (3.73 %) :: 0.016 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (39.86 %) :: 0.171 ms / frame (4 calls) | |
......3 -- glFlush2 (5.13 %) :: 0.022 ms / frame (2 calls) | |
......4 -- updateCamera (0.70 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (3.26 %) :: 0.014 ms / frame (2 calls) | |
......Unaccounted: (46.387 %) :: 0.199 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.014 ms) --- | |
.........0 -- texture (28.57 %) :: 0.004 ms / frame (2 calls) | |
.........Unaccounted: (71.429 %) :: 0.010 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 208.325 ms) --- | |
...0 -- cl2gl_upload (85.38 %) :: 177.864 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.62 %) :: 30.458 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.458 ms) --- | |
......0 -- solveContacts (70.00 %) :: 21.320 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.61 %) :: 2.928 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.80 %) :: 0.245 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.18 %) :: 5.842 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.009 ms / frame (1 calls) | |
......Unaccounted: (0.374 %) :: 0.114 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.320 ms) --- | |
.........0 -- GPU solveContactConstraint (70.73 %) :: 15.080 ms / frame (1 calls) | |
.........1 -- batching (28.90 %) :: 6.162 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.37 %) :: 0.078 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.080 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.61 %) :: 7.029 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.39 %) :: 8.051 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.162 ms) --- | |
............0 -- m_contactToConstraintKernel (27.05 %) :: 1.667 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.10 %) :: 0.006 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.25 %) :: 4.452 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.16 %) :: 0.010 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.146 %) :: 0.009 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.006 ms) --- | |
...............0 -- batch generation (83.33 %) :: 0.005 ms / frame (1 calls) | |
...............Unaccounted: (16.667 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.005 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.005 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.078 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.56 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.436 %) :: 0.076 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.928 ms) --- | |
.........0 -- clipHullHullKernel (56.56 %) :: 1.656 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (8.47 %) :: 0.248 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (22.23 %) :: 0.651 ms / frame (1 calls) | |
.........Unaccounted: (12.739 %) :: 0.373 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.842 ms) --- | |
.........0 -- sapKernel (68.61 %) :: 4.008 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.94 %) :: 0.172 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.70 %) :: 0.158 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.25 %) :: 0.482 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.02 %) :: 0.118 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.99 %) :: 0.116 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.54 %) :: 0.674 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.917 %) :: 0.112 ms | |
================================== | |
Frame 10: | |
---------------------------------- | |
Profiling: Root (total running time: 209.691 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.06 %) :: 0.124 ms / frame (1 calls) | |
2 -- gui->draw (0.58 %) :: 1.221 ms / frame (1 calls) | |
3 -- renderScene (0.13 %) :: 0.268 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.20 %) :: 208.016 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.030 %) :: 0.062 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.268 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (88.43 %) :: 0.237 ms / frame (2 calls) | |
...Unaccounted: (11.567 %) :: 0.031 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.237 ms) --- | |
......0 -- glUseProgram(0); (1.69 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (5.49 %) :: 0.013 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (38.40 %) :: 0.091 ms / frame (4 calls) | |
......3 -- glFlush2 (7.17 %) :: 0.017 ms / frame (2 calls) | |
......4 -- updateCamera (0.00 %) :: 0.000 ms / frame (2 calls) | |
......5 -- init (1.69 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (45.570 %) :: 0.108 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (50.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 208.016 ms) --- | |
...0 -- cl2gl_upload (86.03 %) :: 178.961 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (13.97 %) :: 29.052 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.052 ms) --- | |
......0 -- solveContacts (70.69 %) :: 20.538 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.25 %) :: 2.687 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.61 %) :: 0.176 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.17 %) :: 5.570 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.251 %) :: 0.073 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.538 ms) --- | |
.........0 -- GPU solveContactConstraint (68.92 %) :: 14.155 ms / frame (1 calls) | |
.........1 -- batching (30.72 %) :: 6.309 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.36 %) :: 0.074 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.155 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.13 %) :: 6.530 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.86 %) :: 7.624 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.309 ms) --- | |
............0 -- m_contactToConstraintKernel (26.88 %) :: 1.696 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.77 %) :: 4.528 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.14 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.840 %) :: 0.053 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.074 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.70 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.297 %) :: 0.072 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.687 ms) --- | |
.........0 -- clipHullHullKernel (58.50 %) :: 1.572 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.14 %) :: 0.165 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.75 %) :: 0.665 ms / frame (1 calls) | |
.........Unaccounted: (10.607 %) :: 0.285 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.570 ms) --- | |
.........0 -- sapKernel (71.15 %) :: 3.963 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.97 %) :: 0.110 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.53 %) :: 0.085 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.26 %) :: 0.460 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.33 %) :: 0.074 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.40 %) :: 0.078 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (13.12 %) :: 0.731 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.203 %) :: 0.067 ms | |
================================== | |
Frame 11: | |
---------------------------------- | |
Profiling: Root (total running time: 210.144 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.08 %) :: 0.175 ms / frame (1 calls) | |
2 -- gui->draw (0.59 %) :: 1.233 ms / frame (1 calls) | |
3 -- renderScene (0.14 %) :: 0.304 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.17 %) :: 208.390 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.020 %) :: 0.042 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.304 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.42 %) :: 0.284 ms / frame (2 calls) | |
...Unaccounted: (6.579 %) :: 0.020 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.284 ms) --- | |
......0 -- glUseProgram(0); (1.41 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (4.23 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (42.25 %) :: 0.120 ms / frame (4 calls) | |
......3 -- glFlush2 (7.39 %) :: 0.021 ms / frame (2 calls) | |
......4 -- updateCamera (1.06 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (1.41 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (42.254 %) :: 0.120 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (75.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (25.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 208.390 ms) --- | |
...0 -- cl2gl_upload (85.64 %) :: 178.471 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.36 %) :: 29.915 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.915 ms) --- | |
......0 -- solveContacts (70.55 %) :: 21.106 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.33 %) :: 2.791 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.71 %) :: 0.211 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.12 %) :: 5.720 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.006 ms / frame (1 calls) | |
......Unaccounted: (0.271 %) :: 0.081 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.106 ms) --- | |
.........0 -- GPU solveContactConstraint (70.22 %) :: 14.820 ms / frame (1 calls) | |
.........1 -- batching (29.43 %) :: 6.211 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.35 %) :: 0.074 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.820 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.23 %) :: 6.852 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.77 %) :: 7.968 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.211 ms) --- | |
............0 -- m_contactToConstraintKernel (26.76 %) :: 1.662 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.66 %) :: 4.451 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.11 %) :: 0.007 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (1.127 %) :: 0.070 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.074 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.35 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.649 %) :: 0.073 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.791 ms) --- | |
.........0 -- clipHullHullKernel (56.00 %) :: 1.563 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.56 %) :: 0.183 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.90 %) :: 0.667 ms / frame (1 calls) | |
.........Unaccounted: (13.544 %) :: 0.378 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.720 ms) --- | |
.........0 -- sapKernel (70.10 %) :: 4.010 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.36 %) :: 0.135 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.06 %) :: 0.118 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.41 %) :: 0.481 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.75 %) :: 0.100 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.85 %) :: 0.106 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.75 %) :: 0.672 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (1.696 %) :: 0.097 ms | |
================================== | |
Frame 12: | |
---------------------------------- | |
Profiling: Root (total running time: 209.644 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.06 %) :: 0.125 ms / frame (1 calls) | |
2 -- gui->draw (0.58 %) :: 1.217 ms / frame (1 calls) | |
3 -- renderScene (0.13 %) :: 0.264 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.20 %) :: 207.977 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.029 %) :: 0.061 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.264 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (87.50 %) :: 0.231 ms / frame (2 calls) | |
...Unaccounted: (12.500 %) :: 0.033 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.231 ms) --- | |
......0 -- glUseProgram(0); (1.73 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (4.76 %) :: 0.011 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (33.33 %) :: 0.077 ms / frame (4 calls) | |
......3 -- glFlush2 (10.82 %) :: 0.025 ms / frame (2 calls) | |
......4 -- updateCamera (0.43 %) :: 0.001 ms / frame (2 calls) | |
......5 -- init (0.87 %) :: 0.002 ms / frame (2 calls) | |
......Unaccounted: (48.052 %) :: 0.111 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.002 ms) --- | |
.........0 -- texture (50.00 %) :: 0.001 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 207.977 ms) --- | |
...0 -- cl2gl_upload (85.52 %) :: 177.852 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.48 %) :: 30.120 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.002 %) :: 0.004 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.120 ms) --- | |
......0 -- solveContacts (71.69 %) :: 21.594 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (8.76 %) :: 2.638 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.75 %) :: 0.227 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (18.53 %) :: 5.580 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.242 %) :: 0.073 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.594 ms) --- | |
.........0 -- GPU solveContactConstraint (71.13 %) :: 15.359 ms / frame (1 calls) | |
.........1 -- batching (28.52 %) :: 6.159 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.35 %) :: 0.075 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.359 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.07 %) :: 7.076 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.93 %) :: 8.283 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.159 ms) --- | |
............0 -- m_contactToConstraintKernel (26.90 %) :: 1.657 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.45 %) :: 4.462 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.18 %) :: 0.011 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.130 %) :: 0.008 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (75.00 %) :: 0.003 ms / frame (1 calls) | |
...............Unaccounted: (25.000 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.003 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.075 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.67 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.333 %) :: 0.073 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.638 ms) --- | |
.........0 -- clipHullHullKernel (60.96 %) :: 1.608 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.52 %) :: 0.172 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.94 %) :: 0.658 ms / frame (1 calls) | |
.........Unaccounted: (7.581 %) :: 0.200 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.580 ms) --- | |
.........0 -- sapKernel (73.03 %) :: 4.075 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.94 %) :: 0.108 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.38 %) :: 0.077 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.92 %) :: 0.442 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.09 %) :: 0.061 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.47 %) :: 0.082 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (12.28 %) :: 0.685 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (0.860 %) :: 0.048 ms | |
================================== | |
Frame 13: | |
---------------------------------- | |
Profiling: Root (total running time: 210.028 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.09 %) :: 0.192 ms / frame (1 calls) | |
2 -- gui->draw (0.60 %) :: 1.259 ms / frame (1 calls) | |
3 -- renderScene (0.17 %) :: 0.348 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.12 %) :: 208.185 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.021 %) :: 0.044 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.348 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (92.24 %) :: 0.321 ms / frame (2 calls) | |
...Unaccounted: (7.759 %) :: 0.027 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.321 ms) --- | |
......0 -- glUseProgram(0); (1.56 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.36 %) :: 0.014 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (44.86 %) :: 0.144 ms / frame (4 calls) | |
......3 -- glFlush2 (6.54 %) :: 0.021 ms / frame (2 calls) | |
......4 -- updateCamera (0.62 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (1.25 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (40.810 %) :: 0.131 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (50.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 208.185 ms) --- | |
...0 -- cl2gl_upload (85.82 %) :: 178.654 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.18 %) :: 29.530 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.530 ms) --- | |
......0 -- solveContacts (71.12 %) :: 21.001 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.10 %) :: 2.688 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.60 %) :: 0.176 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (18.93 %) :: 5.589 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.006 ms / frame (1 calls) | |
......Unaccounted: (0.237 %) :: 0.070 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.001 ms) --- | |
.........0 -- GPU solveContactConstraint (68.41 %) :: 14.367 ms / frame (1 calls) | |
.........1 -- batching (31.27 %) :: 6.567 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.32 %) :: 0.067 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.367 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.57 %) :: 6.691 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.42 %) :: 7.675 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.567 ms) --- | |
............0 -- m_contactToConstraintKernel (25.77 %) :: 1.692 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.93 %) :: 4.789 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.09 %) :: 0.006 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.21 %) :: 0.014 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.898 %) :: 0.059 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (75.00 %) :: 0.003 ms / frame (1 calls) | |
...............Unaccounted: (25.000 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.003 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.067 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.99 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.015 %) :: 0.065 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.688 ms) --- | |
.........0 -- clipHullHullKernel (59.23 %) :: 1.592 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.06 %) :: 0.163 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.18 %) :: 0.650 ms / frame (1 calls) | |
.........Unaccounted: (10.528 %) :: 0.283 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.589 ms) --- | |
.........0 -- sapKernel (71.59 %) :: 4.001 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.90 %) :: 0.106 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.68 %) :: 0.094 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.27 %) :: 0.462 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.84 %) :: 0.103 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.77 %) :: 0.099 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.72 %) :: 0.655 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (1.217 %) :: 0.068 ms | |
================================== | |
Frame 14: | |
---------------------------------- | |
Profiling: Root (total running time: 209.893 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.09 %) :: 0.181 ms / frame (1 calls) | |
2 -- gui->draw (0.59 %) :: 1.235 ms / frame (1 calls) | |
3 -- renderScene (0.20 %) :: 0.416 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (98.84 %) :: 207.460 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.286 %) :: 0.601 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.416 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.51 %) :: 0.389 ms / frame (2 calls) | |
...Unaccounted: (6.490 %) :: 0.027 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.389 ms) --- | |
......0 -- glUseProgram(0); (1.29 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (3.86 %) :: 0.015 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (37.28 %) :: 0.145 ms / frame (4 calls) | |
......3 -- glFlush2 (5.14 %) :: 0.020 ms / frame (2 calls) | |
......4 -- updateCamera (0.26 %) :: 0.001 ms / frame (2 calls) | |
......5 -- init (1.03 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (51.157 %) :: 0.199 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (50.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 207.460 ms) --- | |
...0 -- cl2gl_upload (85.94 %) :: 178.291 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.06 %) :: 29.167 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.167 ms) --- | |
......0 -- solveContacts (71.23 %) :: 20.777 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.44 %) :: 2.752 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (1.00 %) :: 0.293 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (17.88 %) :: 5.214 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.009 ms / frame (1 calls) | |
......Unaccounted: (0.418 %) :: 0.122 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.777 ms) --- | |
.........0 -- GPU solveContactConstraint (67.88 %) :: 14.103 ms / frame (1 calls) | |
.........1 -- batching (31.77 %) :: 6.601 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.35 %) :: 0.073 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.103 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.74 %) :: 6.451 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.26 %) :: 7.652 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.601 ms) --- | |
............0 -- m_contactToConstraintKernel (26.12 %) :: 1.724 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.08 %) :: 0.005 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (73.22 %) :: 4.833 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.14 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.016 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.152 %) :: 0.010 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.005 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.005 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.005 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.005 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.073 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.74 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.260 %) :: 0.071 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.752 ms) --- | |
.........0 -- clipHullHullKernel (59.88 %) :: 1.648 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (5.45 %) :: 0.150 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (25.36 %) :: 0.698 ms / frame (1 calls) | |
.........Unaccounted: (9.302 %) :: 0.256 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.214 ms) --- | |
.........0 -- sapKernel (77.46 %) :: 4.039 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.86 %) :: 0.097 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.40 %) :: 0.073 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.59 %) :: 0.396 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.17 %) :: 0.061 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.07 %) :: 0.056 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (8.34 %) :: 0.435 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.055 %) :: 0.055 ms | |
================================== | |
Frame 15: | |
---------------------------------- | |
Profiling: Root (total running time: 209.605 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.07 %) :: 0.137 ms / frame (1 calls) | |
2 -- gui->draw (0.60 %) :: 1.256 ms / frame (1 calls) | |
3 -- renderScene (0.20 %) :: 0.410 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.11 %) :: 207.742 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.029 %) :: 0.060 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.410 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.66 %) :: 0.384 ms / frame (2 calls) | |
...Unaccounted: (6.341 %) :: 0.026 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.384 ms) --- | |
......0 -- glUseProgram(0); (1.30 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.43 %) :: 0.017 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (41.93 %) :: 0.161 ms / frame (4 calls) | |
......3 -- glFlush2 (6.77 %) :: 0.026 ms / frame (2 calls) | |
......4 -- updateCamera (0.52 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.78 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (44.271 %) :: 0.170 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (66.67 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (33.333 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 207.742 ms) --- | |
...0 -- cl2gl_upload (85.87 %) :: 178.385 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.13 %) :: 29.354 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.354 ms) --- | |
......0 -- solveContacts (70.10 %) :: 20.578 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.07 %) :: 2.661 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.83 %) :: 0.244 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.62 %) :: 5.758 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.358 %) :: 0.105 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.578 ms) --- | |
.........0 -- GPU solveContactConstraint (67.01 %) :: 13.790 ms / frame (1 calls) | |
.........1 -- batching (32.62 %) :: 6.713 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.35 %) :: 0.073 ms / frame (1 calls) | |
.........Unaccounted: (0.010 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 13.790 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.96 %) :: 6.338 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.02 %) :: 7.450 ms / frame (1 calls) | |
............Unaccounted: (0.015 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.713 ms) --- | |
............0 -- m_contactToConstraintKernel (25.07 %) :: 1.683 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (74.32 %) :: 4.989 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.01 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.25 %) :: 0.017 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.119 %) :: 0.008 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.073 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.74 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.260 %) :: 0.071 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.661 ms) --- | |
.........0 -- clipHullHullKernel (60.20 %) :: 1.602 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.46 %) :: 0.172 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.94 %) :: 0.637 ms / frame (1 calls) | |
.........Unaccounted: (9.395 %) :: 0.250 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.758 ms) --- | |
.........0 -- sapKernel (71.57 %) :: 4.121 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.91 %) :: 0.110 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.75 %) :: 0.101 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.47 %) :: 0.430 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.89 %) :: 0.109 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.15 %) :: 0.124 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.79 %) :: 0.679 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.424 %) :: 0.082 ms | |
================================== | |
Frame 16: | |
---------------------------------- | |
Profiling: Root (total running time: 209.587 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.07 %) :: 0.150 ms / frame (1 calls) | |
2 -- gui->draw (0.58 %) :: 1.214 ms / frame (1 calls) | |
3 -- renderScene (0.17 %) :: 0.352 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.15 %) :: 207.806 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.031 %) :: 0.065 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.352 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.18 %) :: 0.328 ms / frame (2 calls) | |
...Unaccounted: (6.818 %) :: 0.024 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.328 ms) --- | |
......0 -- glUseProgram(0); (0.91 %) :: 0.003 ms / frame (2 calls) | |
......1 -- glFlush (3.96 %) :: 0.013 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (35.67 %) :: 0.117 ms / frame (4 calls) | |
......3 -- glFlush2 (7.01 %) :: 0.023 ms / frame (2 calls) | |
......4 -- updateCamera (3.05 %) :: 0.010 ms / frame (2 calls) | |
......5 -- init (1.52 %) :: 0.005 ms / frame (2 calls) | |
......Unaccounted: (47.866 %) :: 0.157 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.005 ms) --- | |
.........0 -- texture (40.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (60.000 %) :: 0.003 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 207.806 ms) --- | |
...0 -- cl2gl_upload (85.24 %) :: 177.139 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.76 %) :: 30.664 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.664 ms) --- | |
......0 -- solveContacts (68.98 %) :: 21.151 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.91 %) :: 3.040 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.96 %) :: 0.293 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.77 %) :: 6.061 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.365 %) :: 0.112 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.151 ms) --- | |
.........0 -- GPU solveContactConstraint (68.54 %) :: 14.496 ms / frame (1 calls) | |
.........1 -- batching (31.14 %) :: 6.587 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.32 %) :: 0.068 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.496 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.92 %) :: 6.656 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.08 %) :: 7.840 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.587 ms) --- | |
............0 -- m_contactToConstraintKernel (24.81 %) :: 1.634 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.08 %) :: 0.005 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (74.54 %) :: 4.910 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.12 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.21 %) :: 0.014 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.197 %) :: 0.013 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.005 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.005 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.005 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.005 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.068 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.47 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.529 %) :: 0.067 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 3.040 ms) --- | |
.........0 -- clipHullHullKernel (54.64 %) :: 1.661 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (9.05 %) :: 0.275 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (22.73 %) :: 0.691 ms / frame (1 calls) | |
.........Unaccounted: (13.586 %) :: 0.413 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.061 ms) --- | |
.........0 -- sapKernel (68.62 %) :: 4.159 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.84 %) :: 0.172 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.71 %) :: 0.164 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.03 %) :: 0.487 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.03 %) :: 0.123 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.93 %) :: 0.117 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (12.09 %) :: 0.733 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (1.732 %) :: 0.105 ms | |
================================== | |
Frame 17: | |
---------------------------------- | |
Profiling: Root (total running time: 209.324 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.06 %) :: 0.123 ms / frame (1 calls) | |
2 -- gui->draw (0.57 %) :: 1.194 ms / frame (1 calls) | |
3 -- renderScene (0.12 %) :: 0.247 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.22 %) :: 207.695 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.031 %) :: 0.065 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.247 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (91.90 %) :: 0.227 ms / frame (2 calls) | |
...Unaccounted: (8.097 %) :: 0.020 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.227 ms) --- | |
......0 -- glUseProgram(0); (1.76 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (4.85 %) :: 0.011 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (39.21 %) :: 0.089 ms / frame (4 calls) | |
......3 -- glFlush2 (10.57 %) :: 0.024 ms / frame (2 calls) | |
......4 -- updateCamera (0.44 %) :: 0.001 ms / frame (2 calls) | |
......5 -- init (0.88 %) :: 0.002 ms / frame (2 calls) | |
......Unaccounted: (42.291 %) :: 0.096 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.002 ms) --- | |
.........0 -- texture (50.00 %) :: 0.001 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 207.695 ms) --- | |
...0 -- cl2gl_upload (85.75 %) :: 178.108 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.24 %) :: 29.584 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.584 ms) --- | |
......0 -- solveContacts (69.11 %) :: 20.445 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.90 %) :: 2.929 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.74 %) :: 0.219 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.90 %) :: 5.888 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.009 ms / frame (1 calls) | |
......Unaccounted: (0.318 %) :: 0.094 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.445 ms) --- | |
.........0 -- GPU solveContactConstraint (67.91 %) :: 13.885 ms / frame (1 calls) | |
.........1 -- batching (31.71 %) :: 6.483 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.38 %) :: 0.077 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 13.885 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.98 %) :: 6.385 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.02 %) :: 7.500 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.483 ms) --- | |
............0 -- m_contactToConstraintKernel (25.78 %) :: 1.671 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.08 %) :: 0.005 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (73.58 %) :: 4.770 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.12 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.25 %) :: 0.016 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.139 %) :: 0.009 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.005 ms) --- | |
...............0 -- batch generation (80.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (20.000 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.077 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.60 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.403 %) :: 0.075 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.929 ms) --- | |
.........0 -- clipHullHullKernel (56.30 %) :: 1.649 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (8.06 %) :: 0.236 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (22.77 %) :: 0.667 ms / frame (1 calls) | |
.........Unaccounted: (12.871 %) :: 0.377 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.888 ms) --- | |
.........0 -- sapKernel (69.57 %) :: 4.096 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.96 %) :: 0.174 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.82 %) :: 0.107 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.68 %) :: 0.452 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.04 %) :: 0.120 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.00 %) :: 0.118 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (12.11 %) :: 0.713 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.800 %) :: 0.106 ms | |
================================== | |
Frame 18: | |
---------------------------------- | |
Profiling: Root (total running time: 209.788 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.10 %) :: 0.200 ms / frame (1 calls) | |
2 -- gui->draw (0.59 %) :: 1.239 ms / frame (1 calls) | |
3 -- renderScene (0.23 %) :: 0.474 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.07 %) :: 207.833 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.020 %) :: 0.042 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.474 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.88 %) :: 0.445 ms / frame (2 calls) | |
...Unaccounted: (6.118 %) :: 0.029 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.445 ms) --- | |
......0 -- glUseProgram(0); (1.12 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.27 %) :: 0.019 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (46.97 %) :: 0.209 ms / frame (4 calls) | |
......3 -- glFlush2 (5.62 %) :: 0.025 ms / frame (2 calls) | |
......4 -- updateCamera (0.45 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.90 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (40.674 %) :: 0.181 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (50.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 207.833 ms) --- | |
...0 -- cl2gl_upload (85.19 %) :: 177.051 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.81 %) :: 30.780 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.780 ms) --- | |
......0 -- solveContacts (71.39 %) :: 21.974 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (8.86 %) :: 2.727 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.66 %) :: 0.204 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (18.80 %) :: 5.788 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.05 %) :: 0.015 ms / frame (1 calls) | |
......Unaccounted: (0.234 %) :: 0.072 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.974 ms) --- | |
.........0 -- GPU solveContactConstraint (68.00 %) :: 14.942 ms / frame (1 calls) | |
.........1 -- batching (31.71 %) :: 6.967 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.30 %) :: 0.065 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.942 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.13 %) :: 6.893 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.87 %) :: 8.049 ms / frame (1 calls) | |
what's wrong | |
............Unaccounted: (-0.000 %) :: -0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.967 ms) --- | |
............0 -- m_contactToConstraintKernel (24.39 %) :: 1.699 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.09 %) :: 0.006 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (74.59 %) :: 5.197 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.09 %) :: 0.006 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.22 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.01 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.588 %) :: 0.041 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.006 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.006 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.006 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.006 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.065 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (3.08 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (96.923 %) :: 0.063 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.727 ms) --- | |
.........0 -- clipHullHullKernel (57.54 %) :: 1.569 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.53 %) :: 0.178 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.46 %) :: 0.667 ms / frame (1 calls) | |
.........Unaccounted: (11.478 %) :: 0.313 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.788 ms) --- | |
.........0 -- sapKernel (71.48 %) :: 4.137 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.80 %) :: 0.104 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.54 %) :: 0.089 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.45 %) :: 0.489 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.80 %) :: 0.104 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.92 %) :: 0.111 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.83 %) :: 0.685 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.158 %) :: 0.067 ms | |
================================== | |
Frame 19: | |
---------------------------------- | |
Profiling: Root (total running time: 209.127 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.09 %) :: 0.185 ms / frame (1 calls) | |
2 -- gui->draw (0.59 %) :: 1.235 ms / frame (1 calls) | |
3 -- renderScene (0.16 %) :: 0.328 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.13 %) :: 207.315 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.031 %) :: 0.064 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.328 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.60 %) :: 0.307 ms / frame (2 calls) | |
...Unaccounted: (6.402 %) :: 0.021 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.307 ms) --- | |
......0 -- glUseProgram(0); (1.30 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (4.56 %) :: 0.014 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (45.60 %) :: 0.140 ms / frame (4 calls) | |
......3 -- glFlush2 (5.86 %) :: 0.018 ms / frame (2 calls) | |
......4 -- updateCamera (0.98 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (0.98 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (40.717 %) :: 0.125 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (100.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 207.315 ms) --- | |
...0 -- cl2gl_upload (85.37 %) :: 176.987 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.63 %) :: 30.325 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.001 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.325 ms) --- | |
......0 -- solveContacts (70.40 %) :: 21.349 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (8.83 %) :: 2.677 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.92 %) :: 0.279 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.47 %) :: 5.903 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.359 %) :: 0.109 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.349 ms) --- | |
.........0 -- GPU solveContactConstraint (67.50 %) :: 14.411 ms / frame (1 calls) | |
.........1 -- batching (32.14 %) :: 6.861 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.36 %) :: 0.076 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.411 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.10 %) :: 6.643 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.89 %) :: 7.766 ms / frame (1 calls) | |
............Unaccounted: (0.014 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.861 ms) --- | |
............0 -- m_contactToConstraintKernel (25.08 %) :: 1.721 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.10 %) :: 0.007 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (73.69 %) :: 5.056 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.15 %) :: 0.010 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.22 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.700 %) :: 0.048 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.007 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.007 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.007 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.007 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.076 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.63 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.368 %) :: 0.074 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.677 ms) --- | |
.........0 -- clipHullHullKernel (58.27 %) :: 1.560 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (5.98 %) :: 0.160 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.51 %) :: 0.656 ms / frame (1 calls) | |
.........Unaccounted: (11.244 %) :: 0.301 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.903 ms) --- | |
.........0 -- sapKernel (70.69 %) :: 4.173 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.20 %) :: 0.130 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.74 %) :: 0.103 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.71 %) :: 0.455 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.08 %) :: 0.123 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.83 %) :: 0.108 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (12.69 %) :: 0.749 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.016 %) :: 0.060 ms | |
================================== | |
Frame 20: | |
---------------------------------- | |
Profiling: Root (total running time: 209.160 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.08 %) :: 0.162 ms / frame (1 calls) | |
2 -- gui->draw (0.59 %) :: 1.236 ms / frame (1 calls) | |
3 -- renderScene (0.14 %) :: 0.299 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.16 %) :: 207.404 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.028 %) :: 0.059 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.299 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.31 %) :: 0.279 ms / frame (2 calls) | |
...Unaccounted: (6.689 %) :: 0.020 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.279 ms) --- | |
......0 -- glUseProgram(0); (1.79 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (5.02 %) :: 0.014 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (46.95 %) :: 0.131 ms / frame (4 calls) | |
......3 -- glFlush2 (6.45 %) :: 0.018 ms / frame (2 calls) | |
......4 -- updateCamera (1.08 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (1.43 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (37.276 %) :: 0.104 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (75.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (25.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 207.404 ms) --- | |
...0 -- cl2gl_upload (85.79 %) :: 177.930 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.21 %) :: 29.471 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.471 ms) --- | |
......0 -- solveContacts (70.97 %) :: 20.915 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.22 %) :: 2.717 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.61 %) :: 0.181 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (18.89 %) :: 5.567 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.285 %) :: 0.084 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.915 ms) --- | |
.........0 -- GPU solveContactConstraint (66.04 %) :: 13.813 ms / frame (1 calls) | |
.........1 -- batching (33.59 %) :: 7.026 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.36 %) :: 0.076 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 13.813 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.86 %) :: 6.334 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.14 %) :: 7.479 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 7.026 ms) --- | |
............0 -- m_contactToConstraintKernel (23.95 %) :: 1.683 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.07 %) :: 0.005 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (75.49 %) :: 5.304 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.01 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.11 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.21 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.114 %) :: 0.008 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.005 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.005 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.005 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.005 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.076 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.63 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.368 %) :: 0.074 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.717 ms) --- | |
.........0 -- clipHullHullKernel (59.66 %) :: 1.621 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.15 %) :: 0.167 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.51 %) :: 0.666 ms / frame (1 calls) | |
.........Unaccounted: (9.680 %) :: 0.263 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.567 ms) --- | |
.........0 -- sapKernel (74.29 %) :: 4.136 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.89 %) :: 0.105 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.49 %) :: 0.083 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.80 %) :: 0.434 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.58 %) :: 0.088 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.53 %) :: 0.085 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (10.20 %) :: 0.568 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.186 %) :: 0.066 ms | |
================================== | |
Frame 21: | |
---------------------------------- | |
Profiling: Root (total running time: 209.632 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.10 %) :: 0.206 ms / frame (1 calls) | |
2 -- gui->draw (0.62 %) :: 1.304 ms / frame (1 calls) | |
3 -- renderScene (0.23 %) :: 0.487 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.02 %) :: 207.577 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.028 %) :: 0.058 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.487 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.46 %) :: 0.460 ms / frame (2 calls) | |
...Unaccounted: (5.544 %) :: 0.027 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.460 ms) --- | |
......0 -- glUseProgram(0); (1.09 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (3.91 %) :: 0.018 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (45.22 %) :: 0.208 ms / frame (4 calls) | |
......3 -- glFlush2 (4.13 %) :: 0.019 ms / frame (2 calls) | |
......4 -- updateCamera (2.17 %) :: 0.010 ms / frame (2 calls) | |
......5 -- init (0.87 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (42.609 %) :: 0.196 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (75.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (25.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 207.577 ms) --- | |
...0 -- cl2gl_upload (85.43 %) :: 177.340 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.56 %) :: 30.233 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.003 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.233 ms) --- | |
......0 -- solveContacts (71.55 %) :: 21.632 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (8.99 %) :: 2.717 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.60 %) :: 0.180 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (18.57 %) :: 5.614 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.275 %) :: 0.083 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.632 ms) --- | |
.........0 -- GPU solveContactConstraint (67.77 %) :: 14.660 ms / frame (1 calls) | |
.........1 -- batching (31.92 %) :: 6.905 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.31 %) :: 0.067 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.660 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.04 %) :: 6.750 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.95 %) :: 7.909 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.905 ms) --- | |
............0 -- m_contactToConstraintKernel (24.32 %) :: 1.679 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.07 %) :: 0.005 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (75.08 %) :: 5.184 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.22 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.130 %) :: 0.009 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.005 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.005 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.005 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.005 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.067 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.99 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.015 %) :: 0.065 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.717 ms) --- | |
.........0 -- clipHullHullKernel (59.44 %) :: 1.615 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.18 %) :: 0.168 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (25.06 %) :: 0.681 ms / frame (1 calls) | |
.........Unaccounted: (9.312 %) :: 0.253 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.614 ms) --- | |
.........0 -- sapKernel (73.35 %) :: 4.118 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.26 %) :: 0.127 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.82 %) :: 0.102 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.16 %) :: 0.458 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.57 %) :: 0.088 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.59 %) :: 0.089 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (10.17 %) :: 0.571 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.051 %) :: 0.059 ms | |
================================== | |
Frame 22: | |
---------------------------------- | |
Profiling: Root (total running time: 208.769 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.06 %) :: 0.120 ms / frame (1 calls) | |
2 -- gui->draw (0.59 %) :: 1.234 ms / frame (1 calls) | |
3 -- renderScene (0.16 %) :: 0.336 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.16 %) :: 207.018 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.029 %) :: 0.061 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.336 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (92.56 %) :: 0.311 ms / frame (2 calls) | |
...Unaccounted: (7.440 %) :: 0.025 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.311 ms) --- | |
......0 -- glUseProgram(0); (0.96 %) :: 0.003 ms / frame (2 calls) | |
......1 -- glFlush (3.86 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (49.52 %) :: 0.154 ms / frame (4 calls) | |
......3 -- glFlush2 (3.86 %) :: 0.012 ms / frame (2 calls) | |
......4 -- updateCamera (0.64 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.64 %) :: 0.002 ms / frame (2 calls) | |
......Unaccounted: (40.514 %) :: 0.126 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.002 ms) --- | |
.........0 -- texture (50.00 %) :: 0.001 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 207.018 ms) --- | |
...0 -- cl2gl_upload (83.79 %) :: 173.463 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (16.21 %) :: 33.552 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 33.552 ms) --- | |
......0 -- solveContacts (69.16 %) :: 23.204 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (10.28 %) :: 3.450 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (1.15 %) :: 0.387 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (18.91 %) :: 6.344 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.477 %) :: 0.160 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 23.204 ms) --- | |
.........0 -- GPU solveContactConstraint (68.83 %) :: 15.971 ms / frame (1 calls) | |
.........1 -- batching (30.84 %) :: 7.157 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.32 %) :: 0.075 ms / frame (1 calls) | |
.........Unaccounted: (0.004 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.971 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.73 %) :: 7.304 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.26 %) :: 8.666 ms / frame (1 calls) | |
............Unaccounted: (0.006 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 7.157 ms) --- | |
............0 -- m_contactToConstraintKernel (26.02 %) :: 1.862 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.78 %) :: 5.209 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.01 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.27 %) :: 0.019 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.18 %) :: 0.013 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.657 %) :: 0.047 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (75.00 %) :: 0.003 ms / frame (1 calls) | |
...............Unaccounted: (25.000 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.003 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.075 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.33 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.667 %) :: 0.074 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 3.450 ms) --- | |
.........0 -- clipHullHullKernel (51.83 %) :: 1.788 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (8.55 %) :: 0.295 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (22.20 %) :: 0.766 ms / frame (1 calls) | |
.........Unaccounted: (17.420 %) :: 0.601 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.344 ms) --- | |
.........0 -- sapKernel (67.83 %) :: 4.303 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.87 %) :: 0.182 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.99 %) :: 0.126 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.61 %) :: 0.546 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.73 %) :: 0.110 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.66 %) :: 0.105 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (13.68 %) :: 0.868 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (1.624 %) :: 0.103 ms | |
================================== | |
Frame 23: | |
---------------------------------- | |
Profiling: Root (total running time: 209.225 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.08 %) :: 0.162 ms / frame (1 calls) | |
2 -- gui->draw (0.62 %) :: 1.291 ms / frame (1 calls) | |
3 -- renderScene (0.21 %) :: 0.445 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.07 %) :: 207.275 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.025 %) :: 0.052 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.445 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.71 %) :: 0.417 ms / frame (2 calls) | |
...Unaccounted: (6.292 %) :: 0.028 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.417 ms) --- | |
......0 -- glUseProgram(0); (1.20 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.32 %) :: 0.018 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (44.36 %) :: 0.185 ms / frame (4 calls) | |
......3 -- glFlush2 (7.43 %) :: 0.031 ms / frame (2 calls) | |
......4 -- updateCamera (2.64 %) :: 0.011 ms / frame (2 calls) | |
......5 -- init (0.96 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (39.089 %) :: 0.163 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (50.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 207.275 ms) --- | |
...0 -- cl2gl_upload (85.66 %) :: 177.551 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.34 %) :: 29.721 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.721 ms) --- | |
......0 -- solveContacts (69.21 %) :: 20.571 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (10.08 %) :: 2.996 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.77 %) :: 0.229 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.64 %) :: 5.836 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.005 ms / frame (1 calls) | |
......Unaccounted: (0.283 %) :: 0.084 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.571 ms) --- | |
.........0 -- GPU solveContactConstraint (66.81 %) :: 13.744 ms / frame (1 calls) | |
.........1 -- batching (32.77 %) :: 6.741 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.42 %) :: 0.086 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 13.744 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.81 %) :: 6.296 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.19 %) :: 7.448 ms / frame (1 calls) | |
what's wrong | |
............Unaccounted: (-0.000 %) :: -0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.741 ms) --- | |
............0 -- m_contactToConstraintKernel (24.91 %) :: 1.679 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.04 %) :: 0.003 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (74.54 %) :: 5.025 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.13 %) :: 0.009 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.07 %) :: 0.005 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.18 %) :: 0.012 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.089 %) :: 0.006 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.003 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.003 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.003 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.086 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (3.49 %) :: 0.003 ms / frame (1 calls) | |
............Unaccounted: (96.512 %) :: 0.083 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.996 ms) --- | |
.........0 -- clipHullHullKernel (55.97 %) :: 1.677 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (8.78 %) :: 0.263 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (22.36 %) :: 0.670 ms / frame (1 calls) | |
.........Unaccounted: (12.884 %) :: 0.386 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.836 ms) --- | |
.........0 -- sapKernel (71.08 %) :: 4.148 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.81 %) :: 0.164 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.19 %) :: 0.128 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.85 %) :: 0.458 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.49 %) :: 0.087 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.64 %) :: 0.096 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.53 %) :: 0.673 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (1.388 %) :: 0.081 ms | |
================================== | |
Frame 24: | |
---------------------------------- | |
Profiling: Root (total running time: 208.964 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.07 %) :: 0.154 ms / frame (1 calls) | |
2 -- gui->draw (0.60 %) :: 1.245 ms / frame (1 calls) | |
3 -- renderScene (0.25 %) :: 0.518 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.04 %) :: 206.965 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.039 %) :: 0.082 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.518 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.98 %) :: 0.492 ms / frame (2 calls) | |
...Unaccounted: (5.019 %) :: 0.026 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.492 ms) --- | |
......0 -- glUseProgram(0); (1.22 %) :: 0.006 ms / frame (2 calls) | |
......1 -- glFlush (3.66 %) :: 0.018 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (49.19 %) :: 0.242 ms / frame (4 calls) | |
......3 -- glFlush2 (3.86 %) :: 0.019 ms / frame (2 calls) | |
......4 -- updateCamera (0.61 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (2.24 %) :: 0.011 ms / frame (2 calls) | |
......Unaccounted: (39.228 %) :: 0.193 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.011 ms) --- | |
.........0 -- texture (27.27 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (72.727 %) :: 0.008 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 206.965 ms) --- | |
...0 -- cl2gl_upload (85.15 %) :: 176.234 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.85 %) :: 30.728 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.728 ms) --- | |
......0 -- solveContacts (69.72 %) :: 21.425 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.51 %) :: 2.921 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.62 %) :: 0.190 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.88 %) :: 6.109 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.244 %) :: 0.075 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.425 ms) --- | |
.........0 -- GPU solveContactConstraint (68.70 %) :: 14.718 ms / frame (1 calls) | |
.........1 -- batching (30.93 %) :: 6.627 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.37 %) :: 0.079 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.718 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.90 %) :: 6.755 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.10 %) :: 7.963 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.627 ms) --- | |
............0 -- m_contactToConstraintKernel (25.47 %) :: 1.688 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.11 %) :: 0.007 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (73.82 %) :: 4.892 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.15 %) :: 0.010 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.27 %) :: 0.018 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.121 %) :: 0.008 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.007 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.007 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.007 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.007 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.079 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.27 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.734 %) :: 0.078 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.921 ms) --- | |
.........0 -- clipHullHullKernel (55.91 %) :: 1.633 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (10.72 %) :: 0.313 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (22.39 %) :: 0.654 ms / frame (1 calls) | |
.........Unaccounted: (10.989 %) :: 0.321 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.109 ms) --- | |
.........0 -- sapKernel (66.79 %) :: 4.080 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.03 %) :: 0.124 ms / frame (1 calls) | |
.........2 -- scatterKernel (6.81 %) :: 0.416 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.22 %) :: 0.441 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.42 %) :: 0.148 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.06 %) :: 0.126 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.54 %) :: 0.705 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.097 %) :: 0.067 ms | |
================================== | |
Frame 25: | |
---------------------------------- | |
Profiling: Root (total running time: 208.477 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.08 %) :: 0.158 ms / frame (1 calls) | |
2 -- gui->draw (0.60 %) :: 1.258 ms / frame (1 calls) | |
3 -- renderScene (0.11 %) :: 0.227 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.18 %) :: 206.773 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.029 %) :: 0.061 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.227 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (90.31 %) :: 0.205 ms / frame (2 calls) | |
...Unaccounted: (9.692 %) :: 0.022 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.205 ms) --- | |
......0 -- glUseProgram(0); (1.95 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (5.85 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (35.61 %) :: 0.073 ms / frame (4 calls) | |
......3 -- glFlush2 (8.78 %) :: 0.018 ms / frame (2 calls) | |
......4 -- updateCamera (1.46 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (1.46 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (44.878 %) :: 0.092 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (66.67 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (33.333 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 206.773 ms) --- | |
...0 -- cl2gl_upload (85.32 %) :: 176.428 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.67 %) :: 30.343 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.343 ms) --- | |
......0 -- solveContacts (68.72 %) :: 20.851 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.30 %) :: 2.821 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.94 %) :: 0.284 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (20.53 %) :: 6.228 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.501 %) :: 0.152 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.851 ms) --- | |
.........0 -- GPU solveContactConstraint (70.40 %) :: 14.680 ms / frame (1 calls) | |
.........1 -- batching (29.23 %) :: 6.094 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.36 %) :: 0.076 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.680 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.92 %) :: 6.741 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.08 %) :: 7.939 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.094 ms) --- | |
............0 -- m_contactToConstraintKernel (27.06 %) :: 1.649 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.07 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.28 %) :: 4.405 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.15 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.25 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.131 %) :: 0.008 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.076 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.63 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.368 %) :: 0.074 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.821 ms) --- | |
.........0 -- clipHullHullKernel (55.94 %) :: 1.578 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (5.92 %) :: 0.167 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.46 %) :: 0.690 ms / frame (1 calls) | |
.........Unaccounted: (13.683 %) :: 0.386 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.228 ms) --- | |
.........0 -- sapKernel (67.77 %) :: 4.221 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.15 %) :: 0.134 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.53 %) :: 0.095 ms / frame (1 calls) | |
.........3 -- gpu radix sort (10.12 %) :: 0.630 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.01 %) :: 0.125 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.94 %) :: 0.121 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (13.33 %) :: 0.830 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.124 %) :: 0.070 ms | |
================================== | |
Frame 26: | |
---------------------------------- | |
Profiling: Root (total running time: 208.646 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.08 %) :: 0.173 ms / frame (1 calls) | |
2 -- gui->draw (0.60 %) :: 1.249 ms / frame (1 calls) | |
3 -- renderScene (0.14 %) :: 0.297 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.15 %) :: 206.865 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.030 %) :: 0.062 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.297 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (92.26 %) :: 0.274 ms / frame (2 calls) | |
...Unaccounted: (7.744 %) :: 0.023 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.274 ms) --- | |
......0 -- glUseProgram(0); (1.82 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (7.66 %) :: 0.021 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (36.86 %) :: 0.101 ms / frame (4 calls) | |
......3 -- glFlush2 (9.12 %) :: 0.025 ms / frame (2 calls) | |
......4 -- updateCamera (0.73 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (1.46 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (42.336 %) :: 0.116 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (50.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 206.865 ms) --- | |
...0 -- cl2gl_upload (85.45 %) :: 176.765 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.55 %) :: 30.096 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.096 ms) --- | |
......0 -- solveContacts (69.88 %) :: 21.030 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.23 %) :: 2.779 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.69 %) :: 0.207 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.95 %) :: 6.003 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.233 %) :: 0.070 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.030 ms) --- | |
.........0 -- GPU solveContactConstraint (68.95 %) :: 14.500 ms / frame (1 calls) | |
.........1 -- batching (30.64 %) :: 6.444 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.40 %) :: 0.085 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.500 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.97 %) :: 6.665 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.03 %) :: 7.834 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.444 ms) --- | |
............0 -- m_contactToConstraintKernel (26.43 %) :: 1.703 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.11 %) :: 0.007 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.21 %) :: 4.653 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.14 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.25 %) :: 0.016 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.06 %) :: 0.004 ms / frame (1 calls) | |
............Unaccounted: (0.776 %) :: 0.050 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.007 ms) --- | |
...............0 -- batch generation (85.71 %) :: 0.006 ms / frame (1 calls) | |
...............Unaccounted: (14.286 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.006 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.006 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.085 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.35 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.647 %) :: 0.083 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.779 ms) --- | |
.........0 -- clipHullHullKernel (58.80 %) :: 1.634 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (7.09 %) :: 0.197 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.50 %) :: 0.653 ms / frame (1 calls) | |
.........Unaccounted: (10.615 %) :: 0.295 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.003 ms) --- | |
.........0 -- sapKernel (68.48 %) :: 4.111 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.92 %) :: 0.115 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.90 %) :: 0.114 ms / frame (1 calls) | |
.........3 -- gpu radix sort (12.11 %) :: 0.727 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.22 %) :: 0.073 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.22 %) :: 0.073 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.73 %) :: 0.704 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.399 %) :: 0.084 ms | |
================================== | |
Frame 27: | |
---------------------------------- | |
Profiling: Root (total running time: 208.404 ms) --- | |
0 -- glFinish (0.00 %) :: 0.001 ms / frame (1 calls) | |
1 -- window->endRendering (0.06 %) :: 0.117 ms / frame (1 calls) | |
2 -- gui->draw (0.59 %) :: 1.227 ms / frame (1 calls) | |
3 -- renderScene (0.15 %) :: 0.306 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.18 %) :: 206.691 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.030 %) :: 0.062 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.306 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.77 %) :: 0.290 ms / frame (2 calls) | |
...Unaccounted: (5.229 %) :: 0.016 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.290 ms) --- | |
......0 -- glUseProgram(0); (1.03 %) :: 0.003 ms / frame (2 calls) | |
......1 -- glFlush (4.14 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (57.59 %) :: 0.167 ms / frame (4 calls) | |
......3 -- glFlush2 (3.79 %) :: 0.011 ms / frame (2 calls) | |
......4 -- updateCamera (0.34 %) :: 0.001 ms / frame (2 calls) | |
......5 -- init (1.03 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (32.069 %) :: 0.093 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (33.33 %) :: 0.001 ms / frame (2 calls) | |
.........Unaccounted: (66.667 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 206.691 ms) --- | |
...0 -- cl2gl_upload (85.10 %) :: 175.892 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.90 %) :: 30.795 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.795 ms) --- | |
......0 -- solveContacts (69.85 %) :: 21.511 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.16 %) :: 2.822 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.44 %) :: 0.135 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (20.37 %) :: 6.273 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.153 %) :: 0.047 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.511 ms) --- | |
.........0 -- GPU solveContactConstraint (70.66 %) :: 15.200 ms / frame (1 calls) | |
.........1 -- batching (29.00 %) :: 6.239 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.33 %) :: 0.072 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.200 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.14 %) :: 7.013 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.86 %) :: 8.186 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.239 ms) --- | |
............0 -- m_contactToConstraintKernel (27.20 %) :: 1.697 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.10 %) :: 0.006 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.31 %) :: 4.449 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.962 %) :: 0.060 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.006 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.006 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.006 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.006 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.072 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.39 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.611 %) :: 0.071 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.822 ms) --- | |
.........0 -- clipHullHullKernel (57.97 %) :: 1.636 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (8.47 %) :: 0.239 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.81 %) :: 0.672 ms / frame (1 calls) | |
.........Unaccounted: (9.745 %) :: 0.275 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.273 ms) --- | |
.........0 -- sapKernel (64.45 %) :: 4.043 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.36 %) :: 0.085 ms / frame (1 calls) | |
.........2 -- scatterKernel (4.75 %) :: 0.298 ms / frame (1 calls) | |
.........3 -- gpu radix sort (14.16 %) :: 0.888 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.69 %) :: 0.106 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.67 %) :: 0.105 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.11 %) :: 0.697 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (0.781 %) :: 0.049 ms | |
================================== | |
Frame 28: | |
---------------------------------- | |
Profiling: Root (total running time: 208.719 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.10 %) :: 0.204 ms / frame (1 calls) | |
2 -- gui->draw (0.59 %) :: 1.240 ms / frame (1 calls) | |
3 -- renderScene (0.19 %) :: 0.390 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.10 %) :: 206.839 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.022 %) :: 0.046 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.390 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (92.82 %) :: 0.362 ms / frame (2 calls) | |
...Unaccounted: (7.179 %) :: 0.028 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.362 ms) --- | |
......0 -- glUseProgram(0); (1.38 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (3.59 %) :: 0.013 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (29.56 %) :: 0.107 ms / frame (4 calls) | |
......3 -- glFlush2 (6.91 %) :: 0.025 ms / frame (2 calls) | |
......4 -- updateCamera (0.83 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (1.10 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (56.630 %) :: 0.205 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (75.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (25.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 206.839 ms) --- | |
...0 -- cl2gl_upload (85.60 %) :: 177.044 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.40 %) :: 29.793 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.793 ms) --- | |
......0 -- solveContacts (69.78 %) :: 20.789 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.24 %) :: 2.754 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.55 %) :: 0.163 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (20.25 %) :: 6.032 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.005 ms / frame (1 calls) | |
......Unaccounted: (0.168 %) :: 0.050 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.789 ms) --- | |
.........0 -- GPU solveContactConstraint (70.02 %) :: 14.556 ms / frame (1 calls) | |
.........1 -- batching (29.67 %) :: 6.169 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.31 %) :: 0.064 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.556 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.37 %) :: 6.750 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.63 %) :: 7.806 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.169 ms) --- | |
............0 -- m_contactToConstraintKernel (27.33 %) :: 1.686 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.10 %) :: 4.448 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.10 %) :: 0.006 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.26 %) :: 0.016 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.113 %) :: 0.007 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.064 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (3.13 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (96.875 %) :: 0.062 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.754 ms) --- | |
.........0 -- clipHullHullKernel (60.13 %) :: 1.656 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (7.44 %) :: 0.205 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.76 %) :: 0.682 ms / frame (1 calls) | |
.........Unaccounted: (7.662 %) :: 0.211 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.032 ms) --- | |
.........0 -- sapKernel (67.52 %) :: 4.073 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.36 %) :: 0.082 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.01 %) :: 0.061 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.06 %) :: 0.486 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (0.81 %) :: 0.049 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.28 %) :: 0.077 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (19.21 %) :: 1.159 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (0.713 %) :: 0.043 ms | |
================================== | |
Frame 29: | |
---------------------------------- | |
Profiling: Root (total running time: 209.330 ms) --- | |
0 -- glFinish (0.00 %) :: 0.001 ms / frame (1 calls) | |
1 -- window->endRendering (0.05 %) :: 0.112 ms / frame (1 calls) | |
2 -- gui->draw (0.63 %) :: 1.318 ms / frame (1 calls) | |
3 -- renderScene (0.19 %) :: 0.390 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.10 %) :: 207.445 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.031 %) :: 0.064 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.390 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.33 %) :: 0.364 ms / frame (2 calls) | |
...Unaccounted: (6.667 %) :: 0.026 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.364 ms) --- | |
......0 -- glUseProgram(0); (1.37 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.67 %) :: 0.017 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (58.24 %) :: 0.212 ms / frame (4 calls) | |
......3 -- glFlush2 (4.40 %) :: 0.016 ms / frame (2 calls) | |
......4 -- updateCamera (0.55 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.82 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (29.945 %) :: 0.109 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (33.33 %) :: 0.001 ms / frame (2 calls) | |
.........Unaccounted: (66.667 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 207.445 ms) --- | |
...0 -- cl2gl_upload (85.55 %) :: 177.469 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.003 ms / frame (1 calls) | |
...2 -- stepSimulation (14.45 %) :: 29.972 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.972 ms) --- | |
......0 -- solveContacts (68.68 %) :: 20.584 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.67 %) :: 2.899 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.81 %) :: 0.242 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (20.53 %) :: 6.154 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.284 %) :: 0.085 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.584 ms) --- | |
.........0 -- GPU solveContactConstraint (69.81 %) :: 14.369 ms / frame (1 calls) | |
.........1 -- batching (29.81 %) :: 6.137 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.38 %) :: 0.078 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.369 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (47.71 %) :: 6.855 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (52.27 %) :: 7.511 ms / frame (1 calls) | |
............Unaccounted: (0.021 %) :: 0.003 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.137 ms) --- | |
............0 -- m_contactToConstraintKernel (27.26 %) :: 1.673 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.05 %) :: 0.003 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.70 %) :: 4.400 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.18 %) :: 0.011 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.28 %) :: 0.017 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.489 %) :: 0.030 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.003 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.003 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.003 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.078 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.56 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.436 %) :: 0.076 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.899 ms) --- | |
.........0 -- clipHullHullKernel (56.19 %) :: 1.629 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.80 %) :: 0.197 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.18 %) :: 0.672 ms / frame (1 calls) | |
.........Unaccounted: (13.832 %) :: 0.401 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.154 ms) --- | |
.........0 -- sapKernel (68.31 %) :: 4.204 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.27 %) :: 0.140 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.72 %) :: 0.106 ms / frame (1 calls) | |
.........3 -- gpu radix sort (9.62 %) :: 0.592 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.05 %) :: 0.126 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.95 %) :: 0.120 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (12.43 %) :: 0.765 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.609 %) :: 0.099 ms | |
================================== | |
Frame 30: | |
---------------------------------- | |
Profiling: Root (total running time: 208.230 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.07 %) :: 0.146 ms / frame (1 calls) | |
2 -- gui->draw (0.61 %) :: 1.265 ms / frame (1 calls) | |
3 -- renderScene (0.19 %) :: 0.390 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.11 %) :: 206.387 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.020 %) :: 0.042 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.390 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (92.31 %) :: 0.360 ms / frame (2 calls) | |
...Unaccounted: (7.692 %) :: 0.030 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.360 ms) --- | |
......0 -- glUseProgram(0); (1.39 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (5.00 %) :: 0.018 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (40.56 %) :: 0.146 ms / frame (4 calls) | |
......3 -- glFlush2 (7.22 %) :: 0.026 ms / frame (2 calls) | |
......4 -- updateCamera (0.83 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (1.67 %) :: 0.006 ms / frame (2 calls) | |
......Unaccounted: (43.333 %) :: 0.156 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.006 ms) --- | |
.........0 -- texture (66.67 %) :: 0.004 ms / frame (2 calls) | |
.........Unaccounted: (33.333 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 206.387 ms) --- | |
...0 -- cl2gl_upload (79.40 %) :: 163.880 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (20.59 %) :: 42.502 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.003 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 42.502 ms) --- | |
......0 -- solveContacts (55.70 %) :: 23.675 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (10.76 %) :: 4.575 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (1.00 %) :: 0.425 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (32.03 %) :: 13.612 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.01 %) :: 0.006 ms / frame (1 calls) | |
......Unaccounted: (0.492 %) :: 0.209 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 23.675 ms) --- | |
.........0 -- GPU solveContactConstraint (67.49 %) :: 15.978 ms / frame (1 calls) | |
.........1 -- batching (32.24 %) :: 7.633 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.27 %) :: 0.064 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.978 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (47.03 %) :: 7.515 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (52.96 %) :: 8.462 ms / frame (1 calls) | |
............Unaccounted: (0.006 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 7.633 ms) --- | |
............0 -- m_contactToConstraintKernel (23.49 %) :: 1.793 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.04 %) :: 0.003 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (75.63 %) :: 5.773 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.08 %) :: 0.006 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.18 %) :: 0.014 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.01 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.537 %) :: 0.041 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.003 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.003 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.003 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.064 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.56 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.437 %) :: 0.063 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 4.575 ms) --- | |
.........0 -- clipHullHullKernel (49.97 %) :: 2.286 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (7.32 %) :: 0.335 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (29.55 %) :: 1.352 ms / frame (1 calls) | |
.........Unaccounted: (13.158 %) :: 0.602 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 13.612 ms) --- | |
.........0 -- sapKernel (69.12 %) :: 9.408 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.01 %) :: 0.273 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.65 %) :: 0.225 ms / frame (1 calls) | |
.........3 -- gpu radix sort (11.98 %) :: 1.631 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.71 %) :: 0.233 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.19 %) :: 0.162 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.48 %) :: 1.563 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.01 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (0.852 %) :: 0.116 ms | |
================================== | |
Frame 31: | |
---------------------------------- | |
Profiling: Root (total running time: 208.381 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.07 %) :: 0.140 ms / frame (1 calls) | |
2 -- gui->draw (0.61 %) :: 1.269 ms / frame (1 calls) | |
3 -- renderScene (0.23 %) :: 0.470 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.07 %) :: 206.438 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.031 %) :: 0.064 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.470 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.68 %) :: 0.445 ms / frame (2 calls) | |
...Unaccounted: (5.319 %) :: 0.025 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.445 ms) --- | |
......0 -- glUseProgram(0); (1.35 %) :: 0.006 ms / frame (2 calls) | |
......1 -- glFlush (3.82 %) :: 0.017 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (53.26 %) :: 0.237 ms / frame (4 calls) | |
......3 -- glFlush2 (4.04 %) :: 0.018 ms / frame (2 calls) | |
......4 -- updateCamera (0.67 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (0.90 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (35.955 %) :: 0.160 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (50.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 206.438 ms) --- | |
...0 -- cl2gl_upload (84.98 %) :: 175.436 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (15.02 %) :: 30.997 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.003 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.997 ms) --- | |
......0 -- solveContacts (68.32 %) :: 21.178 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (10.03 %) :: 3.110 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.88 %) :: 0.274 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (20.34 %) :: 6.304 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.397 %) :: 0.123 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.178 ms) --- | |
.........0 -- GPU solveContactConstraint (70.52 %) :: 14.935 ms / frame (1 calls) | |
.........1 -- batching (29.12 %) :: 6.168 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.35 %) :: 0.075 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.935 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.01 %) :: 6.872 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.99 %) :: 8.063 ms / frame (1 calls) | |
what's wrong | |
............Unaccounted: (-0.000 %) :: -0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.168 ms) --- | |
............0 -- m_contactToConstraintKernel (27.38 %) :: 1.689 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.63 %) :: 4.418 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.15 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.28 %) :: 0.017 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.438 %) :: 0.027 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (75.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (25.000 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.075 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.67 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.333 %) :: 0.073 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 3.110 ms) --- | |
.........0 -- clipHullHullKernel (54.15 %) :: 1.684 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (10.48 %) :: 0.326 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (22.41 %) :: 0.697 ms / frame (1 calls) | |
.........Unaccounted: (12.958 %) :: 0.403 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.304 ms) --- | |
.........0 -- sapKernel (66.45 %) :: 4.189 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.08 %) :: 0.131 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.54 %) :: 0.097 ms / frame (1 calls) | |
.........3 -- gpu radix sort (6.66 %) :: 0.420 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.22 %) :: 0.077 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.21 %) :: 0.076 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (19.67 %) :: 1.240 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.142 %) :: 0.072 ms | |
================================== | |
Frame 32: | |
---------------------------------- | |
Profiling: Root (total running time: 207.844 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.08 %) :: 0.158 ms / frame (1 calls) | |
2 -- gui->draw (0.60 %) :: 1.244 ms / frame (1 calls) | |
3 -- renderScene (0.21 %) :: 0.442 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.09 %) :: 205.944 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.027 %) :: 0.056 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.442 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.12 %) :: 0.416 ms / frame (2 calls) | |
...Unaccounted: (5.882 %) :: 0.026 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.416 ms) --- | |
......0 -- glUseProgram(0); (1.20 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.33 %) :: 0.018 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (53.13 %) :: 0.221 ms / frame (4 calls) | |
......3 -- glFlush2 (3.85 %) :: 0.016 ms / frame (2 calls) | |
......4 -- updateCamera (0.48 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.72 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (36.298 %) :: 0.151 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (100.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 205.944 ms) --- | |
...0 -- cl2gl_upload (85.06 %) :: 175.174 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.003 ms / frame (1 calls) | |
...2 -- stepSimulation (14.94 %) :: 30.766 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.766 ms) --- | |
......0 -- solveContacts (67.75 %) :: 20.843 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (10.66 %) :: 3.281 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.69 %) :: 0.211 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (20.52 %) :: 6.312 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.006 ms / frame (1 calls) | |
......Unaccounted: (0.367 %) :: 0.113 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.843 ms) --- | |
.........0 -- GPU solveContactConstraint (67.85 %) :: 14.142 ms / frame (1 calls) | |
.........1 -- batching (31.82 %) :: 6.633 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.33 %) :: 0.068 ms / frame (1 calls) | |
what's wrong | |
.........Unaccounted: (-0.000 %) :: -0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.142 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.50 %) :: 6.435 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.49 %) :: 7.706 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.633 ms) --- | |
............0 -- m_contactToConstraintKernel (26.13 %) :: 1.733 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.30 %) :: 4.729 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.12 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.016 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (2.096 %) :: 0.139 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.068 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.94 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.059 %) :: 0.066 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 3.281 ms) --- | |
.........0 -- clipHullHullKernel (56.75 %) :: 1.862 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (5.85 %) :: 0.192 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (21.55 %) :: 0.707 ms / frame (1 calls) | |
.........Unaccounted: (15.849 %) :: 0.520 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.312 ms) --- | |
.........0 -- sapKernel (67.98 %) :: 4.291 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.73 %) :: 0.109 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.57 %) :: 0.162 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.53 %) :: 0.475 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.17 %) :: 0.137 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.17 %) :: 0.137 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (14.26 %) :: 0.900 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.568 %) :: 0.099 ms | |
================================== | |
Frame 33: | |
---------------------------------- | |
Profiling: Root (total running time: 207.809 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.10 %) :: 0.199 ms / frame (1 calls) | |
2 -- gui->draw (0.60 %) :: 1.239 ms / frame (1 calls) | |
3 -- renderScene (0.23 %) :: 0.481 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.05 %) :: 205.827 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.030 %) :: 0.063 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.481 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.18 %) :: 0.453 ms / frame (2 calls) | |
...Unaccounted: (5.821 %) :: 0.028 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.453 ms) --- | |
......0 -- glUseProgram(0); (1.32 %) :: 0.006 ms / frame (2 calls) | |
......1 -- glFlush (3.97 %) :: 0.018 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (49.23 %) :: 0.223 ms / frame (4 calls) | |
......3 -- glFlush2 (5.30 %) :: 0.024 ms / frame (2 calls) | |
......4 -- updateCamera (0.44 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.88 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (38.852 %) :: 0.176 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (50.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 205.827 ms) --- | |
...0 -- cl2gl_upload (85.12 %) :: 175.198 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.003 ms / frame (1 calls) | |
...2 -- stepSimulation (14.88 %) :: 30.625 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.625 ms) --- | |
......0 -- solveContacts (69.54 %) :: 21.296 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.11 %) :: 2.789 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (1.02 %) :: 0.313 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.84 %) :: 6.077 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.467 %) :: 0.143 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.296 ms) --- | |
.........0 -- GPU solveContactConstraint (70.49 %) :: 15.011 ms / frame (1 calls) | |
.........1 -- batching (29.17 %) :: 6.213 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.33 %) :: 0.071 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.011 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.25 %) :: 6.943 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.74 %) :: 8.067 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.213 ms) --- | |
............0 -- m_contactToConstraintKernel (28.20 %) :: 1.752 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (70.48 %) :: 4.379 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.11 %) :: 0.007 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.26 %) :: 0.016 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.837 %) :: 0.052 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.071 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.82 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.183 %) :: 0.069 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.789 ms) --- | |
.........0 -- clipHullHullKernel (55.72 %) :: 1.554 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (7.03 %) :: 0.196 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.81 %) :: 0.692 ms / frame (1 calls) | |
.........Unaccounted: (12.442 %) :: 0.347 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.077 ms) --- | |
.........0 -- sapKernel (69.38 %) :: 4.216 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.16 %) :: 0.131 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.97 %) :: 0.120 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.44 %) :: 0.452 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.76 %) :: 0.107 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.88 %) :: 0.114 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (13.90 %) :: 0.845 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (1.497 %) :: 0.091 ms | |
================================== | |
Frame 34: | |
---------------------------------- | |
Profiling: Root (total running time: 207.480 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.06 %) :: 0.126 ms / frame (1 calls) | |
2 -- gui->draw (0.62 %) :: 1.293 ms / frame (1 calls) | |
3 -- renderScene (0.21 %) :: 0.437 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.08 %) :: 205.561 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.030 %) :: 0.063 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.437 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.51 %) :: 0.413 ms / frame (2 calls) | |
...Unaccounted: (5.492 %) :: 0.024 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.413 ms) --- | |
......0 -- glUseProgram(0); (1.21 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.36 %) :: 0.018 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (50.61 %) :: 0.209 ms / frame (4 calls) | |
......3 -- glFlush2 (6.05 %) :: 0.025 ms / frame (2 calls) | |
......4 -- updateCamera (0.48 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.73 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (36.562 %) :: 0.151 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (33.33 %) :: 0.001 ms / frame (2 calls) | |
.........Unaccounted: (66.667 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 205.561 ms) --- | |
...0 -- cl2gl_upload (85.11 %) :: 174.961 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.88 %) :: 30.597 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.597 ms) --- | |
......0 -- solveContacts (69.41 %) :: 21.237 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.22 %) :: 2.822 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.90 %) :: 0.276 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (20.04 %) :: 6.132 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.009 ms / frame (1 calls) | |
......Unaccounted: (0.395 %) :: 0.121 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.237 ms) --- | |
.........0 -- GPU solveContactConstraint (69.98 %) :: 14.862 ms / frame (1 calls) | |
.........1 -- batching (29.65 %) :: 6.296 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.37 %) :: 0.079 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.862 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.12 %) :: 6.854 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.87 %) :: 8.006 ms / frame (1 calls) | |
............Unaccounted: (0.013 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.296 ms) --- | |
............0 -- m_contactToConstraintKernel (27.49 %) :: 1.731 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.01 %) :: 4.471 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.16 %) :: 0.010 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.969 %) :: 0.061 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (75.00 %) :: 0.003 ms / frame (1 calls) | |
...............Unaccounted: (25.000 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.003 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.079 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.53 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.468 %) :: 0.077 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.822 ms) --- | |
.........0 -- clipHullHullKernel (55.35 %) :: 1.562 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (7.65 %) :: 0.216 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.42 %) :: 0.661 ms / frame (1 calls) | |
.........Unaccounted: (13.572 %) :: 0.383 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.132 ms) --- | |
.........0 -- sapKernel (68.22 %) :: 4.183 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.94 %) :: 0.119 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.86 %) :: 0.114 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.76 %) :: 0.476 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.78 %) :: 0.109 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.84 %) :: 0.113 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (15.04 %) :: 0.922 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.533 %) :: 0.094 ms | |
================================== | |
Frame 35: | |
---------------------------------- | |
Profiling: Root (total running time: 207.456 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.08 %) :: 0.176 ms / frame (1 calls) | |
2 -- gui->draw (0.62 %) :: 1.294 ms / frame (1 calls) | |
3 -- renderScene (0.20 %) :: 0.410 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.07 %) :: 205.518 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.028 %) :: 0.058 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.410 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.41 %) :: 0.383 ms / frame (2 calls) | |
...Unaccounted: (6.585 %) :: 0.027 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.383 ms) --- | |
......0 -- glUseProgram(0); (1.57 %) :: 0.006 ms / frame (2 calls) | |
......1 -- glFlush (3.92 %) :: 0.015 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (39.16 %) :: 0.150 ms / frame (4 calls) | |
......3 -- glFlush2 (10.18 %) :: 0.039 ms / frame (2 calls) | |
......4 -- updateCamera (0.52 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (1.04 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (43.603 %) :: 0.167 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (100.00 %) :: 0.004 ms / frame (2 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 205.518 ms) --- | |
...0 -- cl2gl_upload (85.51 %) :: 175.742 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.49 %) :: 29.774 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.774 ms) --- | |
......0 -- solveContacts (71.33 %) :: 21.237 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (8.88 %) :: 2.645 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.82 %) :: 0.244 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (18.67 %) :: 5.560 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.272 %) :: 0.081 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.237 ms) --- | |
.........0 -- GPU solveContactConstraint (70.62 %) :: 14.998 ms / frame (1 calls) | |
.........1 -- batching (28.99 %) :: 6.157 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.38 %) :: 0.080 ms / frame (1 calls) | |
.........Unaccounted: (0.009 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.998 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.86 %) :: 7.028 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.13 %) :: 7.969 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.157 ms) --- | |
............0 -- m_contactToConstraintKernel (27.32 %) :: 1.682 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.93 %) :: 4.429 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.11 %) :: 0.007 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.39 %) :: 0.024 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.130 %) :: 0.008 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.080 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.25 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.750 %) :: 0.079 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.645 ms) --- | |
.........0 -- clipHullHullKernel (59.85 %) :: 1.583 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.31 %) :: 0.167 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.35 %) :: 0.644 ms / frame (1 calls) | |
.........Unaccounted: (9.490 %) :: 0.251 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.560 ms) --- | |
.........0 -- sapKernel (74.33 %) :: 4.133 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.85 %) :: 0.103 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.56 %) :: 0.087 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.30 %) :: 0.406 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.37 %) :: 0.076 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.47 %) :: 0.082 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (10.83 %) :: 0.602 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.241 %) :: 0.069 ms | |
================================== | |
Frame 36: | |
---------------------------------- | |
Profiling: Root (total running time: 207.185 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.07 %) :: 0.148 ms / frame (1 calls) | |
2 -- gui->draw (0.63 %) :: 1.296 ms / frame (1 calls) | |
3 -- renderScene (0.15 %) :: 0.306 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.13 %) :: 205.374 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.029 %) :: 0.061 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.306 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (91.83 %) :: 0.281 ms / frame (2 calls) | |
...Unaccounted: (8.170 %) :: 0.025 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.281 ms) --- | |
......0 -- glUseProgram(0); (1.78 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.98 %) :: 0.014 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (42.35 %) :: 0.119 ms / frame (4 calls) | |
......3 -- glFlush2 (6.05 %) :: 0.017 ms / frame (2 calls) | |
......4 -- updateCamera (0.71 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (1.07 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (43.060 %) :: 0.121 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (66.67 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (33.333 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 205.374 ms) --- | |
...0 -- cl2gl_upload (84.91 %) :: 174.391 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (15.08 %) :: 30.979 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.979 ms) --- | |
......0 -- solveContacts (70.47 %) :: 21.832 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (8.85 %) :: 2.742 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.91 %) :: 0.281 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.31 %) :: 5.983 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.429 %) :: 0.133 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.832 ms) --- | |
.........0 -- GPU solveContactConstraint (71.15 %) :: 15.534 ms / frame (1 calls) | |
.........1 -- batching (28.53 %) :: 6.228 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.32 %) :: 0.069 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.534 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.82 %) :: 7.118 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.18 %) :: 8.416 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.228 ms) --- | |
............0 -- m_contactToConstraintKernel (26.93 %) :: 1.677 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.10 %) :: 0.006 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.90 %) :: 4.478 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.14 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.26 %) :: 0.016 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.626 %) :: 0.039 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.006 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.006 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.006 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.006 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.069 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.90 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.101 %) :: 0.067 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.742 ms) --- | |
.........0 -- clipHullHullKernel (58.50 %) :: 1.604 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.20 %) :: 0.170 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.73 %) :: 0.678 ms / frame (1 calls) | |
.........Unaccounted: (10.576 %) :: 0.290 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.983 ms) --- | |
.........0 -- sapKernel (69.56 %) :: 4.162 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.29 %) :: 0.137 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.72 %) :: 0.103 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.34 %) :: 0.439 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.69 %) :: 0.101 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.62 %) :: 0.157 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (13.72 %) :: 0.821 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.020 %) :: 0.061 ms | |
================================== | |
Frame 37: | |
---------------------------------- | |
Profiling: Root (total running time: 207.235 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.06 %) :: 0.129 ms / frame (1 calls) | |
2 -- gui->draw (0.60 %) :: 1.245 ms / frame (1 calls) | |
3 -- renderScene (0.19 %) :: 0.395 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.12 %) :: 205.401 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.031 %) :: 0.065 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.395 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.67 %) :: 0.370 ms / frame (2 calls) | |
...Unaccounted: (6.329 %) :: 0.025 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.370 ms) --- | |
......0 -- glUseProgram(0); (1.08 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (3.24 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (43.78 %) :: 0.162 ms / frame (4 calls) | |
......3 -- glFlush2 (6.22 %) :: 0.023 ms / frame (2 calls) | |
......4 -- updateCamera (0.54 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (2.97 %) :: 0.011 ms / frame (2 calls) | |
......Unaccounted: (42.162 %) :: 0.156 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.011 ms) --- | |
.........0 -- texture (27.27 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (72.727 %) :: 0.008 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 205.401 ms) --- | |
...0 -- cl2gl_upload (85.21 %) :: 175.012 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.79 %) :: 30.386 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.386 ms) --- | |
......0 -- solveContacts (71.25 %) :: 21.650 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.12 %) :: 2.772 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.62 %) :: 0.187 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (18.75 %) :: 5.697 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.006 ms / frame (1 calls) | |
......Unaccounted: (0.244 %) :: 0.074 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.650 ms) --- | |
.........0 -- GPU solveContactConstraint (70.47 %) :: 15.256 ms / frame (1 calls) | |
.........1 -- batching (29.20 %) :: 6.321 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.34 %) :: 0.073 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.256 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.35 %) :: 7.071 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.64 %) :: 8.184 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.321 ms) --- | |
............0 -- m_contactToConstraintKernel (29.00 %) :: 1.833 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (70.13 %) :: 4.433 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.09 %) :: 0.006 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.25 %) :: 0.016 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.411 %) :: 0.026 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (75.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (25.000 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.073 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.74 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.260 %) :: 0.071 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.772 ms) --- | |
.........0 -- clipHullHullKernel (58.62 %) :: 1.625 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.85 %) :: 0.190 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.92 %) :: 0.663 ms / frame (1 calls) | |
.........Unaccounted: (10.606 %) :: 0.294 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.697 ms) --- | |
.........0 -- sapKernel (72.07 %) :: 4.106 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.11 %) :: 0.120 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.67 %) :: 0.095 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.02 %) :: 0.457 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.39 %) :: 0.079 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.42 %) :: 0.081 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.76 %) :: 0.670 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (1.545 %) :: 0.088 ms | |
================================== | |
Frame 38: | |
---------------------------------- | |
Profiling: Root (total running time: 207.209 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.08 %) :: 0.173 ms / frame (1 calls) | |
2 -- gui->draw (0.59 %) :: 1.230 ms / frame (1 calls) | |
3 -- renderScene (0.15 %) :: 0.315 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.14 %) :: 205.427 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.031 %) :: 0.064 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.315 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.65 %) :: 0.295 ms / frame (2 calls) | |
...Unaccounted: (6.349 %) :: 0.020 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.295 ms) --- | |
......0 -- glUseProgram(0); (1.02 %) :: 0.003 ms / frame (2 calls) | |
......1 -- glFlush (4.41 %) :: 0.013 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (34.58 %) :: 0.102 ms / frame (4 calls) | |
......3 -- glFlush2 (6.78 %) :: 0.020 ms / frame (2 calls) | |
......4 -- updateCamera (1.02 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (4.41 %) :: 0.013 ms / frame (2 calls) | |
......Unaccounted: (47.797 %) :: 0.141 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.013 ms) --- | |
.........0 -- texture (23.08 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (76.923 %) :: 0.010 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 205.427 ms) --- | |
...0 -- cl2gl_upload (85.62 %) :: 175.882 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.38 %) :: 29.543 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.543 ms) --- | |
......0 -- solveContacts (69.86 %) :: 20.640 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.46 %) :: 2.794 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.71 %) :: 0.210 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.66 %) :: 5.807 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.288 %) :: 0.085 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.640 ms) --- | |
.........0 -- GPU solveContactConstraint (69.71 %) :: 14.389 ms / frame (1 calls) | |
.........1 -- batching (29.95 %) :: 6.182 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.33 %) :: 0.068 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.389 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.98 %) :: 6.616 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.02 %) :: 7.773 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.182 ms) --- | |
............0 -- m_contactToConstraintKernel (27.22 %) :: 1.683 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.08 %) :: 0.005 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.19 %) :: 4.463 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.11 %) :: 0.007 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.097 %) :: 0.006 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.005 ms) --- | |
...............0 -- batch generation (80.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (20.000 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.068 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.47 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.529 %) :: 0.067 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.794 ms) --- | |
.........0 -- clipHullHullKernel (58.95 %) :: 1.647 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.12 %) :: 0.171 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.27 %) :: 0.678 ms / frame (1 calls) | |
.........Unaccounted: (10.666 %) :: 0.298 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.807 ms) --- | |
.........0 -- sapKernel (71.17 %) :: 4.133 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.81 %) :: 0.163 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.70 %) :: 0.157 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.96 %) :: 0.462 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.57 %) :: 0.091 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.48 %) :: 0.086 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (10.78 %) :: 0.626 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.498 %) :: 0.087 ms | |
================================== | |
Frame 39: | |
---------------------------------- | |
Profiling: Root (total running time: 206.882 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.06 %) :: 0.120 ms / frame (1 calls) | |
2 -- gui->draw (0.59 %) :: 1.222 ms / frame (1 calls) | |
3 -- renderScene (0.14 %) :: 0.297 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.18 %) :: 205.185 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.028 %) :: 0.058 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.297 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.61 %) :: 0.281 ms / frame (2 calls) | |
...Unaccounted: (5.387 %) :: 0.016 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.281 ms) --- | |
......0 -- glUseProgram(0); (1.07 %) :: 0.003 ms / frame (2 calls) | |
......1 -- glFlush (3.91 %) :: 0.011 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (48.75 %) :: 0.137 ms / frame (4 calls) | |
......3 -- glFlush2 (4.98 %) :: 0.014 ms / frame (2 calls) | |
......4 -- updateCamera (0.36 %) :: 0.001 ms / frame (2 calls) | |
......5 -- init (0.71 %) :: 0.002 ms / frame (2 calls) | |
......Unaccounted: (40.214 %) :: 0.113 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.002 ms) --- | |
.........0 -- texture (50.00 %) :: 0.001 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 205.185 ms) --- | |
...0 -- cl2gl_upload (86.15 %) :: 176.777 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (13.84 %) :: 28.404 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 28.404 ms) --- | |
......0 -- solveContacts (69.84 %) :: 19.838 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (8.98 %) :: 2.551 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.85 %) :: 0.242 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.91 %) :: 5.655 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.391 %) :: 0.111 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 19.838 ms) --- | |
.........0 -- GPU solveContactConstraint (68.44 %) :: 13.578 ms / frame (1 calls) | |
.........1 -- batching (31.18 %) :: 6.185 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.38 %) :: 0.075 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 13.578 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.91 %) :: 6.233 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.09 %) :: 7.345 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.185 ms) --- | |
............0 -- m_contactToConstraintKernel (27.00 %) :: 1.670 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.06 %) :: 4.457 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.16 %) :: 0.010 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.15 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.39 %) :: 0.024 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.146 %) :: 0.009 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (75.00 %) :: 0.003 ms / frame (1 calls) | |
...............Unaccounted: (25.000 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.003 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.075 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.67 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.333 %) :: 0.073 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.551 ms) --- | |
.........0 -- clipHullHullKernel (62.41 %) :: 1.592 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (5.06 %) :: 0.129 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (25.79 %) :: 0.658 ms / frame (1 calls) | |
.........Unaccounted: (6.742 %) :: 0.172 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.655 ms) --- | |
.........0 -- sapKernel (72.96 %) :: 4.126 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.61 %) :: 0.091 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.34 %) :: 0.076 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.69 %) :: 0.435 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.15 %) :: 0.065 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.65 %) :: 0.150 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.69 %) :: 0.661 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (0.866 %) :: 0.049 ms | |
================================== | |
Frame 40: | |
---------------------------------- | |
Profiling: Root (total running time: 207.359 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.13 %) :: 0.276 ms / frame (1 calls) | |
2 -- gui->draw (0.61 %) :: 1.273 ms / frame (1 calls) | |
3 -- renderScene (0.24 %) :: 0.491 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (98.99 %) :: 205.261 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.028 %) :: 0.058 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.491 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.30 %) :: 0.463 ms / frame (2 calls) | |
...Unaccounted: (5.703 %) :: 0.028 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.463 ms) --- | |
......0 -- glUseProgram(0); (1.08 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.10 %) :: 0.019 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (44.92 %) :: 0.208 ms / frame (4 calls) | |
......3 -- glFlush2 (5.83 %) :: 0.027 ms / frame (2 calls) | |
......4 -- updateCamera (2.16 %) :: 0.010 ms / frame (2 calls) | |
......5 -- init (1.08 %) :: 0.005 ms / frame (2 calls) | |
......Unaccounted: (40.821 %) :: 0.189 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.005 ms) --- | |
.........0 -- texture (40.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (60.000 %) :: 0.003 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 205.261 ms) --- | |
...0 -- cl2gl_upload (85.59 %) :: 175.675 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.41 %) :: 29.583 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.583 ms) --- | |
......0 -- solveContacts (70.31 %) :: 20.800 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.83 %) :: 2.907 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (0.66 %) :: 0.195 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (18.89 %) :: 5.588 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.02 %) :: 0.006 ms / frame (1 calls) | |
......Unaccounted: (0.294 %) :: 0.087 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.800 ms) --- | |
.........0 -- GPU solveContactConstraint (69.63 %) :: 14.483 ms / frame (1 calls) | |
.........1 -- batching (30.02 %) :: 6.245 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.34 %) :: 0.070 ms / frame (1 calls) | |
.........Unaccounted: (0.010 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.483 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.12 %) :: 6.679 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.88 %) :: 7.803 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.245 ms) --- | |
............0 -- m_contactToConstraintKernel (27.46 %) :: 1.715 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.11 %) :: 4.441 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.11 %) :: 0.007 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.945 %) :: 0.059 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.070 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.43 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.571 %) :: 0.069 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.907 ms) --- | |
.........0 -- clipHullHullKernel (55.87 %) :: 1.624 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.71 %) :: 0.195 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.01 %) :: 0.698 ms / frame (1 calls) | |
.........Unaccounted: (13.416 %) :: 0.390 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.588 ms) --- | |
.........0 -- sapKernel (72.83 %) :: 4.070 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.31 %) :: 0.129 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.81 %) :: 0.101 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.78 %) :: 0.435 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.61 %) :: 0.090 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.20 %) :: 0.067 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.36 %) :: 0.635 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (1.074 %) :: 0.060 ms | |
================================== | |
Frame 41: | |
---------------------------------- | |
Profiling: Root (total running time: 206.520 ms) --- | |
0 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
1 -- window->endRendering (0.08 %) :: 0.158 ms / frame (1 calls) | |
2 -- gui->draw (0.61 %) :: 1.252 ms / frame (1 calls) | |
3 -- renderScene (0.21 %) :: 0.436 ms / frame (1 calls) | |
4 -- clientMoveAndDisplay (99.08 %) :: 204.610 ms / frame (1 calls) | |
5 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.031 %) :: 0.064 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.436 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.04 %) :: 0.410 ms / frame (2 calls) | |
...Unaccounted: (5.963 %) :: 0.026 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.410 ms) --- | |
......0 -- glUseProgram(0); (1.22 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.39 %) :: 0.018 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (51.71 %) :: 0.212 ms / frame (4 calls) | |
......3 -- glFlush2 (4.88 %) :: 0.020 ms / frame (2 calls) | |
......4 -- updateCamera (0.49 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.73 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (36.585 %) :: 0.150 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (66.67 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (33.333 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 204.610 ms) --- | |
...0 -- cl2gl_upload (85.59 %) :: 175.127 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.41 %) :: 29.478 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.003 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.478 ms) --- | |
......0 -- solveContacts (69.38 %) :: 20.453 ms / frame (1 calls) | |
......1 -- computeConvexConvexContactsGPUSAT (9.39 %) :: 2.769 ms / frame (1 calls) | |
......2 -- primitiveContactsKernel (1.08 %) :: 0.319 ms / frame (1 calls) | |
......3 -- GPU 1-axis SAP calculateOverlappingPairs (19.68 %) :: 5.801 ms / frame (1 calls) | |
......4 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.434 %) :: 0.128 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.453 ms) --- | |
.........0 -- GPU solveContactConstraint (69.37 %) :: 14.189 ms / frame (1 calls) | |
.........1 -- batching (30.26 %) :: 6.190 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.36 %) :: 0.074 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.189 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.02 %) :: 6.530 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.97 %) :: 7.658 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.190 ms) --- | |
............0 -- m_contactToConstraintKernel (27.61 %) :: 1.709 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.10 %) :: 0.006 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.20 %) :: 4.407 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.26 %) :: 0.016 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.646 %) :: 0.040 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.006 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.006 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.006 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.006 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.074 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.70 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.297 %) :: 0.072 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.769 ms) --- | |
.........0 -- clipHullHullKernel (58.36 %) :: 1.616 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (5.96 %) :: 0.165 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.76 %) :: 0.658 ms / frame (1 calls) | |
.........Unaccounted: (11.918 %) :: 0.330 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.801 ms) --- | |
.........0 -- sapKernel (70.85 %) :: 4.110 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.24 %) :: 0.130 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.76 %) :: 0.102 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.93 %) :: 0.460 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.95 %) :: 0.113 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.90 %) :: 0.110 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (12.27 %) :: 0.712 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.069 %) :: 0.062 ms | |
================================== | |
Frame 42: | |
---------------------------------- | |
Profiling: Root (total running time: 315.771 ms) --- | |
0 -- castRaysGPU (29.30 %) :: 92.513 ms / frame (1 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.06 %) :: 0.201 ms / frame (1 calls) | |
3 -- gui->draw (0.85 %) :: 2.689 ms / frame (1 calls) | |
4 -- renderScene (22.17 %) :: 70.002 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (11.46 %) :: 36.175 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (36.163 %) :: 114.191 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 92.513 ms) --- | |
...0 -- raycast copyToHost (0.27 %) :: 0.248 ms / frame (1 calls) | |
...1 -- raycast launch1D (99.27 %) :: 91.834 ms / frame (1 calls) | |
...2 -- raycast copyFromHost (0.44 %) :: 0.406 ms / frame (1 calls) | |
...Unaccounted: (0.027 %) :: 0.025 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 70.002 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (99.96 %) :: 69.972 ms / frame (2 calls) | |
...Unaccounted: (0.043 %) :: 0.030 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 69.972 ms) --- | |
......0 -- glUseProgram(0); (0.02 %) :: 0.015 ms / frame (2 calls) | |
......1 -- glFlush (0.27 %) :: 0.187 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (99.48 %) :: 69.608 ms / frame (6 calls) | |
......3 -- glFlush2 (0.03 %) :: 0.019 ms / frame (2 calls) | |
......4 -- updateCamera (0.00 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (0.01 %) :: 0.005 ms / frame (2 calls) | |
......Unaccounted: (0.193 %) :: 0.135 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.005 ms) --- | |
.........0 -- texture (60.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (40.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 36.175 ms) --- | |
...0 -- cl2gl_upload (2.99 %) :: 1.080 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (92.92 %) :: 33.613 ms / frame (1 calls) | |
...3 -- gl2cl (4.09 %) :: 1.480 ms / frame (1 calls) | |
...Unaccounted: (0.003 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 33.613 ms) --- | |
......0 -- solveJoints (13.13 %) :: 4.413 ms / frame (1 calls) | |
......1 -- solveContacts (60.07 %) :: 20.191 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (8.57 %) :: 2.882 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.70 %) :: 0.234 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (17.22 %) :: 5.789 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.01 %) :: 0.005 ms / frame (1 calls) | |
......Unaccounted: (0.295 %) :: 0.099 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 4.413 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (7.00 %) :: 0.309 ms / frame (1 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (7.14 %) :: 0.315 ms / frame (1 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (85.81 %) :: 3.787 ms / frame (1 calls) | |
.........Unaccounted: (0.045 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.309 ms) --- | |
............0 -- GPU write back velocities and transforms (96.76 %) :: 0.299 ms / frame (1 calls) | |
............1 -- breakViolatedConstraintsKernel (1.29 %) :: 0.004 ms / frame (1 calls) | |
............Unaccounted: (1.942 %) :: 0.006 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.315 ms) --- | |
............0 -- solveJointConstraintRowsKernels (2.22 %) :: 0.007 ms / frame (4 calls) | |
............1 -- batch joints (18.73 %) :: 0.059 ms / frame (1 calls) | |
............Unaccounted: (79.048 %) :: 0.249 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.059 ms) --- | |
...............0 -- sortConstraintByBatch3 (6.78 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (93.220 %) :: 0.055 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.004 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (1 calls) | |
..................Unaccounted: (100.000 %) :: 0.004 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 3.787 ms) --- | |
............0 -- getInfo2Kernel (7.21 %) :: 0.273 ms / frame (1 calls) | |
............1 -- info1 and init batchConstraint (23.18 %) :: 0.878 ms / frame (1 calls) | |
............2 -- m_initSolverBodiesKernel (14.50 %) :: 0.549 ms / frame (1 calls) | |
............Unaccounted: (55.110 %) :: 2.087 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.878 ms) --- | |
...............0 -- initBatchConstraintsKernel (79.04 %) :: 0.694 ms / frame (1 calls) | |
...............1 -- getInfo1Kernel (20.50 %) :: 0.180 ms / frame (1 calls) | |
...............Unaccounted: (0.456 %) :: 0.004 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.694 ms) --- | |
..................0 -- init batch constraints (29.25 %) :: 0.203 ms / frame (1 calls) | |
..................Unaccounted: (70.749 %) :: 0.491 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.191 ms) --- | |
.........0 -- GPU solveContactConstraint (69.50 %) :: 14.032 ms / frame (1 calls) | |
.........1 -- batching (30.12 %) :: 6.081 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.39 %) :: 0.078 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.032 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.03 %) :: 6.459 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.97 %) :: 7.573 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.081 ms) --- | |
............0 -- m_contactToConstraintKernel (27.28 %) :: 1.659 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.07 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.98 %) :: 4.377 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.21 %) :: 0.013 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.28 %) :: 0.017 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.132 %) :: 0.008 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.078 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.56 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.436 %) :: 0.076 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.882 ms) --- | |
.........0 -- clipHullHullKernel (56.90 %) :: 1.640 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (8.50 %) :: 0.245 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.53 %) :: 0.678 ms / frame (1 calls) | |
.........Unaccounted: (11.069 %) :: 0.319 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.789 ms) --- | |
.........0 -- sapKernel (70.75 %) :: 4.096 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.85 %) :: 0.165 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.75 %) :: 0.159 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.05 %) :: 0.466 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.57 %) :: 0.091 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.50 %) :: 0.087 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.04 %) :: 0.639 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.451 %) :: 0.084 ms | |
================================== | |
Frame 43: | |
---------------------------------- | |
Profiling: Root (total running time: 138.384 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.14 %) :: 0.189 ms / frame (1 calls) | |
3 -- gui->draw (0.90 %) :: 1.240 ms / frame (1 calls) | |
4 -- renderScene (2.55 %) :: 3.525 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (95.93 %) :: 132.751 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.491 %) :: 0.679 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 3.525 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (99.40 %) :: 3.504 ms / frame (2 calls) | |
...Unaccounted: (0.596 %) :: 0.021 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 3.504 ms) --- | |
......0 -- glUseProgram(0); (0.14 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (1.88 %) :: 0.066 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (94.49 %) :: 3.311 ms / frame (6 calls) | |
......3 -- glFlush2 (0.51 %) :: 0.018 ms / frame (2 calls) | |
......4 -- updateCamera (0.09 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (0.11 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (2.768 %) :: 0.097 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (75.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (25.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 132.751 ms) --- | |
...0 -- cl2gl_upload (77.15 %) :: 102.422 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (22.85 %) :: 30.327 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.327 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (70.68 %) :: 21.434 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (8.77 %) :: 2.660 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.80 %) :: 0.243 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (19.23 %) :: 5.832 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.03 %) :: 0.009 ms / frame (1 calls) | |
......Unaccounted: (0.491 %) :: 0.149 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.434 ms) --- | |
.........0 -- GPU solveContactConstraint (70.19 %) :: 15.045 ms / frame (1 calls) | |
.........1 -- batching (29.47 %) :: 6.317 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.34 %) :: 0.072 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.045 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.85 %) :: 7.048 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.15 %) :: 7.997 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.317 ms) --- | |
............0 -- m_contactToConstraintKernel (27.24 %) :: 1.721 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.09 %) :: 0.006 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.65 %) :: 4.526 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.00 %) :: 0.000 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.16 %) :: 0.010 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.22 %) :: 0.014 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.602 %) :: 0.038 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.006 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.006 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.006 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.006 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.072 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.78 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.222 %) :: 0.070 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.660 ms) --- | |
.........0 -- clipHullHullKernel (58.98 %) :: 1.569 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.47 %) :: 0.172 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.55 %) :: 0.653 ms / frame (1 calls) | |
.........Unaccounted: (10.000 %) :: 0.266 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.832 ms) --- | |
.........0 -- sapKernel (70.73 %) :: 4.125 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.47 %) :: 0.086 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.73 %) :: 0.101 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.92 %) :: 0.462 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.46 %) :: 0.085 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.47 %) :: 0.086 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (13.80 %) :: 0.805 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.372 %) :: 0.080 ms | |
================================== | |
Frame 44: | |
---------------------------------- | |
Profiling: Root (total running time: 202.889 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.09 %) :: 0.174 ms / frame (1 calls) | |
3 -- gui->draw (0.61 %) :: 1.238 ms / frame (1 calls) | |
4 -- renderScene (0.19 %) :: 0.390 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.08 %) :: 201.020 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.033 %) :: 0.067 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.390 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (91.54 %) :: 0.357 ms / frame (2 calls) | |
...Unaccounted: (8.462 %) :: 0.033 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.357 ms) --- | |
......0 -- glUseProgram(0); (1.40 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (3.64 %) :: 0.013 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (34.73 %) :: 0.124 ms / frame (6 calls) | |
......3 -- glFlush2 (5.32 %) :: 0.019 ms / frame (2 calls) | |
......4 -- updateCamera (0.56 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (1.40 %) :: 0.005 ms / frame (2 calls) | |
......Unaccounted: (52.941 %) :: 0.189 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.005 ms) --- | |
.........0 -- texture (60.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (40.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 201.020 ms) --- | |
...0 -- cl2gl_upload (86.00 %) :: 172.887 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (13.99 %) :: 28.130 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 28.130 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (69.82 %) :: 19.640 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.19 %) :: 2.585 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.75 %) :: 0.210 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (19.95 %) :: 5.613 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.03 %) :: 0.009 ms / frame (1 calls) | |
......Unaccounted: (0.260 %) :: 0.073 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 19.640 ms) --- | |
.........0 -- GPU solveContactConstraint (67.95 %) :: 13.346 ms / frame (1 calls) | |
.........1 -- batching (31.69 %) :: 6.223 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.36 %) :: 0.071 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 13.346 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.36 %) :: 6.187 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.64 %) :: 7.159 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.223 ms) --- | |
............0 -- m_contactToConstraintKernel (28.20 %) :: 1.755 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.25 %) :: 4.434 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.14 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.22 %) :: 0.014 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.064 %) :: 0.004 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.071 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.82 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.183 %) :: 0.069 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.585 ms) --- | |
.........0 -- clipHullHullKernel (61.47 %) :: 1.589 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.23 %) :: 0.161 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (25.03 %) :: 0.647 ms / frame (1 calls) | |
.........Unaccounted: (7.273 %) :: 0.188 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.613 ms) --- | |
.........0 -- sapKernel (74.02 %) :: 4.155 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.89 %) :: 0.106 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.32 %) :: 0.074 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.89 %) :: 0.443 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.07 %) :: 0.060 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (0.94 %) :: 0.053 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.95 %) :: 0.671 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (0.873 %) :: 0.049 ms | |
================================== | |
Frame 45: | |
---------------------------------- | |
Profiling: Root (total running time: 205.900 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.07 %) :: 0.137 ms / frame (1 calls) | |
3 -- gui->draw (0.60 %) :: 1.238 ms / frame (1 calls) | |
4 -- renderScene (0.14 %) :: 0.296 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.16 %) :: 204.166 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.031 %) :: 0.063 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.296 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.58 %) :: 0.277 ms / frame (2 calls) | |
...Unaccounted: (6.419 %) :: 0.019 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.277 ms) --- | |
......0 -- glUseProgram(0); (1.44 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (4.69 %) :: 0.013 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (39.35 %) :: 0.109 ms / frame (6 calls) | |
......3 -- glFlush2 (6.14 %) :: 0.017 ms / frame (2 calls) | |
......4 -- updateCamera (0.36 %) :: 0.001 ms / frame (2 calls) | |
......5 -- init (0.72 %) :: 0.002 ms / frame (2 calls) | |
......Unaccounted: (47.292 %) :: 0.131 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.002 ms) --- | |
.........0 -- texture (50.00 %) :: 0.001 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 204.166 ms) --- | |
...0 -- cl2gl_upload (85.66 %) :: 174.887 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.003 ms / frame (1 calls) | |
...2 -- stepSimulation (14.34 %) :: 29.274 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.274 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (70.46 %) :: 20.626 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.24 %) :: 2.704 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.64 %) :: 0.187 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (19.36 %) :: 5.667 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.284 %) :: 0.083 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.626 ms) --- | |
.........0 -- GPU solveContactConstraint (69.63 %) :: 14.362 ms / frame (1 calls) | |
.........1 -- batching (30.03 %) :: 6.194 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.33 %) :: 0.069 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.362 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.16 %) :: 6.629 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.84 %) :: 7.732 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.194 ms) --- | |
............0 -- m_contactToConstraintKernel (27.28 %) :: 1.690 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.09 %) :: 4.465 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.00 %) :: 0.000 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.15 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.145 %) :: 0.009 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.069 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.45 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.551 %) :: 0.068 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.704 ms) --- | |
.........0 -- clipHullHullKernel (60.47 %) :: 1.635 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.32 %) :: 0.171 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.74 %) :: 0.642 ms / frame (1 calls) | |
.........Unaccounted: (9.467 %) :: 0.256 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.667 ms) --- | |
.........0 -- sapKernel (72.21 %) :: 4.092 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.98 %) :: 0.112 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.96 %) :: 0.111 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.91 %) :: 0.448 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.66 %) :: 0.094 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.57 %) :: 0.089 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.35 %) :: 0.643 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.341 %) :: 0.076 ms | |
================================== | |
Frame 46: | |
---------------------------------- | |
Profiling: Root (total running time: 206.091 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.07 %) :: 0.139 ms / frame (1 calls) | |
3 -- gui->draw (0.61 %) :: 1.264 ms / frame (1 calls) | |
4 -- renderScene (0.22 %) :: 0.458 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.07 %) :: 204.170 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.029 %) :: 0.060 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.458 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.10 %) :: 0.431 ms / frame (2 calls) | |
...Unaccounted: (5.895 %) :: 0.027 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.431 ms) --- | |
......0 -- glUseProgram(0); (1.16 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.41 %) :: 0.019 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (53.83 %) :: 0.232 ms / frame (6 calls) | |
......3 -- glFlush2 (4.41 %) :: 0.019 ms / frame (2 calls) | |
......4 -- updateCamera (0.70 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (0.70 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (34.803 %) :: 0.150 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (100.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 204.170 ms) --- | |
...0 -- cl2gl_upload (85.77 %) :: 175.121 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.003 ms / frame (1 calls) | |
...2 -- stepSimulation (14.23 %) :: 29.045 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.045 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (70.75 %) :: 20.548 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (8.91 %) :: 2.588 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.86 %) :: 0.249 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (19.28 %) :: 5.599 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.186 %) :: 0.054 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.548 ms) --- | |
.........0 -- GPU solveContactConstraint (69.61 %) :: 14.303 ms / frame (1 calls) | |
.........1 -- batching (30.05 %) :: 6.175 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.34 %) :: 0.070 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.303 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.53 %) :: 6.655 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.46 %) :: 7.647 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.175 ms) --- | |
............0 -- m_contactToConstraintKernel (27.13 %) :: 1.675 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.05 %) :: 0.003 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.40 %) :: 4.471 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.11 %) :: 0.007 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.032 %) :: 0.002 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.003 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.003 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.003 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.070 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.86 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.143 %) :: 0.068 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.588 ms) --- | |
.........0 -- clipHullHullKernel (61.59 %) :: 1.594 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (4.91 %) :: 0.127 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (26.43 %) :: 0.684 ms / frame (1 calls) | |
.........Unaccounted: (7.071 %) :: 0.183 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.599 ms) --- | |
.........0 -- sapKernel (73.85 %) :: 4.135 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.63 %) :: 0.091 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.32 %) :: 0.074 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.66 %) :: 0.429 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.55 %) :: 0.087 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.39 %) :: 0.078 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.68 %) :: 0.654 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (0.875 %) :: 0.049 ms | |
================================== | |
Frame 47: | |
---------------------------------- | |
Profiling: Root (total running time: 205.828 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.07 %) :: 0.139 ms / frame (1 calls) | |
3 -- gui->draw (0.61 %) :: 1.265 ms / frame (1 calls) | |
4 -- renderScene (0.22 %) :: 0.463 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.07 %) :: 203.918 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.021 %) :: 0.043 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.463 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.95 %) :: 0.435 ms / frame (2 calls) | |
...Unaccounted: (6.048 %) :: 0.028 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.435 ms) --- | |
......0 -- glUseProgram(0); (1.15 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.37 %) :: 0.019 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (53.79 %) :: 0.234 ms / frame (6 calls) | |
......3 -- glFlush2 (4.60 %) :: 0.020 ms / frame (2 calls) | |
......4 -- updateCamera (0.69 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (0.69 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (34.713 %) :: 0.151 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (66.67 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (33.333 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 203.918 ms) --- | |
...0 -- cl2gl_upload (84.69 %) :: 172.702 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.003 ms / frame (1 calls) | |
...2 -- stepSimulation (15.31 %) :: 31.211 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 31.211 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (69.59 %) :: 21.720 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.41 %) :: 2.937 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.94 %) :: 0.294 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (19.61 %) :: 6.121 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.423 %) :: 0.132 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.720 ms) --- | |
.........0 -- GPU solveContactConstraint (70.78 %) :: 15.373 ms / frame (1 calls) | |
.........1 -- batching (28.90 %) :: 6.276 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.33 %) :: 0.071 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.373 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.58 %) :: 7.160 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.42 %) :: 8.212 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.276 ms) --- | |
............0 -- m_contactToConstraintKernel (27.61 %) :: 1.733 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.26 %) :: 4.472 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.653 %) :: 0.041 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.071 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.41 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.592 %) :: 0.070 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.937 ms) --- | |
.........0 -- clipHullHullKernel (54.51 %) :: 1.601 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (7.87 %) :: 0.231 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.63 %) :: 0.694 ms / frame (1 calls) | |
.........Unaccounted: (13.994 %) :: 0.411 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.121 ms) --- | |
.........0 -- sapKernel (68.73 %) :: 4.207 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.96 %) :: 0.120 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.94 %) :: 0.119 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.50 %) :: 0.459 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.11 %) :: 0.129 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.90 %) :: 0.116 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (14.34 %) :: 0.878 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (1.503 %) :: 0.092 ms | |
================================== | |
Frame 48: | |
---------------------------------- | |
Profiling: Root (total running time: 205.586 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.10 %) :: 0.204 ms / frame (1 calls) | |
3 -- gui->draw (0.60 %) :: 1.237 ms / frame (1 calls) | |
4 -- renderScene (0.18 %) :: 0.374 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.10 %) :: 203.727 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.021 %) :: 0.044 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.374 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (91.18 %) :: 0.341 ms / frame (2 calls) | |
...Unaccounted: (8.824 %) :: 0.033 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.341 ms) --- | |
......0 -- glUseProgram(0); (1.47 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (3.81 %) :: 0.013 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (32.84 %) :: 0.112 ms / frame (6 calls) | |
......3 -- glFlush2 (4.99 %) :: 0.017 ms / frame (2 calls) | |
......4 -- updateCamera (0.88 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (1.17 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (54.839 %) :: 0.187 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (75.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (25.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 203.727 ms) --- | |
...0 -- cl2gl_upload (85.20 %) :: 173.567 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.80 %) :: 30.157 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.157 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (69.54 %) :: 20.970 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.69 %) :: 2.921 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.91 %) :: 0.275 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (19.47 %) :: 5.873 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.006 ms / frame (1 calls) | |
......Unaccounted: (0.371 %) :: 0.112 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.970 ms) --- | |
.........0 -- GPU solveContactConstraint (70.14 %) :: 14.709 ms / frame (1 calls) | |
.........1 -- batching (29.45 %) :: 6.176 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.41 %) :: 0.085 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.709 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.26 %) :: 6.804 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.74 %) :: 7.904 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.176 ms) --- | |
............0 -- m_contactToConstraintKernel (27.06 %) :: 1.671 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.11 %) :: 0.007 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.73 %) :: 4.430 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.15 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.664 %) :: 0.041 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.007 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.007 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.007 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.007 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.085 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.35 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.647 %) :: 0.083 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.921 ms) --- | |
.........0 -- clipHullHullKernel (61.62 %) :: 1.800 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (5.65 %) :: 0.165 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.52 %) :: 0.687 ms / frame (1 calls) | |
.........Unaccounted: (9.209 %) :: 0.269 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.873 ms) --- | |
.........0 -- sapKernel (71.51 %) :: 4.200 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.20 %) :: 0.129 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.06 %) :: 0.121 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.70 %) :: 0.452 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.86 %) :: 0.109 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.86 %) :: 0.109 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.80 %) :: 0.693 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (1.005 %) :: 0.059 ms | |
================================== | |
Frame 49: | |
---------------------------------- | |
Profiling: Root (total running time: 206.586 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.001 ms / frame (1 calls) | |
2 -- window->endRendering (0.07 %) :: 0.151 ms / frame (1 calls) | |
3 -- gui->draw (0.63 %) :: 1.308 ms / frame (1 calls) | |
4 -- renderScene (0.63 %) :: 1.294 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (98.63 %) :: 203.765 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.032 %) :: 0.067 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 1.294 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (88.56 %) :: 1.146 ms / frame (2 calls) | |
...Unaccounted: (11.437 %) :: 0.148 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 1.146 ms) --- | |
......0 -- glUseProgram(0); (0.44 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (1.66 %) :: 0.019 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (32.20 %) :: 0.369 ms / frame (6 calls) | |
......3 -- glFlush2 (2.27 %) :: 0.026 ms / frame (2 calls) | |
......4 -- updateCamera (0.26 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (0.44 %) :: 0.005 ms / frame (2 calls) | |
......Unaccounted: (62.740 %) :: 0.719 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.005 ms) --- | |
.........0 -- texture (80.00 %) :: 0.004 ms / frame (2 calls) | |
.........Unaccounted: (20.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 203.765 ms) --- | |
...0 -- cl2gl_upload (84.84 %) :: 172.881 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.000 ms / frame (1 calls) | |
...2 -- stepSimulation (15.16 %) :: 30.882 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.882 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (70.05 %) :: 21.634 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.64 %) :: 2.976 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.78 %) :: 0.240 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (19.18 %) :: 5.923 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.330 %) :: 0.102 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.634 ms) --- | |
.........0 -- GPU solveContactConstraint (71.05 %) :: 15.372 ms / frame (1 calls) | |
.........1 -- batching (28.62 %) :: 6.192 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.32 %) :: 0.070 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.372 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.27 %) :: 7.113 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.72 %) :: 8.258 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.192 ms) --- | |
............0 -- m_contactToConstraintKernel (26.92 %) :: 1.667 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.05 %) :: 0.003 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.19 %) :: 4.470 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.23 %) :: 0.014 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.436 %) :: 0.027 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.003 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.003 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.003 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.070 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.86 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.143 %) :: 0.068 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.976 ms) --- | |
.........0 -- clipHullHullKernel (54.94 %) :: 1.635 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (7.59 %) :: 0.226 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.79 %) :: 0.708 ms / frame (1 calls) | |
.........Unaccounted: (13.676 %) :: 0.407 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.923 ms) --- | |
.........0 -- sapKernel (70.49 %) :: 4.175 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.08 %) :: 0.123 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.26 %) :: 0.134 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.36 %) :: 0.495 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.89 %) :: 0.112 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.86 %) :: 0.110 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.57 %) :: 0.685 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.469 %) :: 0.087 ms | |
================================== | |
Frame 50: | |
---------------------------------- | |
Profiling: Root (total running time: 204.910 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.09 %) :: 0.184 ms / frame (1 calls) | |
3 -- gui->draw (0.60 %) :: 1.232 ms / frame (1 calls) | |
4 -- renderScene (0.15 %) :: 0.306 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.12 %) :: 203.116 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.035 %) :: 0.072 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.306 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.14 %) :: 0.285 ms / frame (2 calls) | |
...Unaccounted: (6.863 %) :: 0.021 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.285 ms) --- | |
......0 -- glUseProgram(0); (1.40 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (4.56 %) :: 0.013 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (44.56 %) :: 0.127 ms / frame (6 calls) | |
......3 -- glFlush2 (9.12 %) :: 0.026 ms / frame (2 calls) | |
......4 -- updateCamera (1.05 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (1.05 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (38.246 %) :: 0.109 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (100.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 203.116 ms) --- | |
...0 -- cl2gl_upload (85.27 %) :: 173.205 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.73 %) :: 29.909 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.909 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (71.64 %) :: 21.426 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (8.67 %) :: 2.592 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.65 %) :: 0.193 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (18.73 %) :: 5.601 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.298 %) :: 0.089 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.426 ms) --- | |
.........0 -- GPU solveContactConstraint (70.93 %) :: 15.198 ms / frame (1 calls) | |
.........1 -- batching (28.71 %) :: 6.152 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.35 %) :: 0.076 ms / frame (1 calls) | |
what's wrong | |
.........Unaccounted: (-0.000 %) :: -0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 15.198 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.10 %) :: 7.007 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.90 %) :: 8.191 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.152 ms) --- | |
............0 -- m_contactToConstraintKernel (27.49 %) :: 1.691 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.07 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.93 %) :: 4.425 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.00 %) :: 0.000 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.16 %) :: 0.010 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.081 %) :: 0.005 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.076 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.63 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.368 %) :: 0.074 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.592 ms) --- | |
.........0 -- clipHullHullKernel (59.07 %) :: 1.531 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (5.67 %) :: 0.147 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (25.23 %) :: 0.654 ms / frame (1 calls) | |
.........Unaccounted: (10.031 %) :: 0.260 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.601 ms) --- | |
.........0 -- sapKernel (74.02 %) :: 4.146 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.04 %) :: 0.114 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.68 %) :: 0.094 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.28 %) :: 0.464 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.45 %) :: 0.081 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.45 %) :: 0.081 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (9.66 %) :: 0.541 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.393 %) :: 0.078 ms | |
================================== | |
Frame 51: | |
---------------------------------- | |
Profiling: Root (total running time: 204.938 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.06 %) :: 0.124 ms / frame (1 calls) | |
3 -- gui->draw (0.59 %) :: 1.216 ms / frame (1 calls) | |
4 -- renderScene (0.12 %) :: 0.240 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.20 %) :: 203.295 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.031 %) :: 0.063 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.240 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (91.67 %) :: 0.220 ms / frame (2 calls) | |
...Unaccounted: (8.333 %) :: 0.020 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.220 ms) --- | |
......0 -- glUseProgram(0); (1.82 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (5.45 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (43.18 %) :: 0.095 ms / frame (6 calls) | |
......3 -- glFlush2 (10.00 %) :: 0.022 ms / frame (2 calls) | |
......4 -- updateCamera (0.91 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (1.36 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (37.273 %) :: 0.082 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (66.67 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (33.333 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 203.295 ms) --- | |
...0 -- cl2gl_upload (85.63 %) :: 174.091 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.36 %) :: 29.201 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.201 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (69.93 %) :: 20.420 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.17 %) :: 2.679 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.71 %) :: 0.207 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (19.91 %) :: 5.813 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.257 %) :: 0.075 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.420 ms) --- | |
.........0 -- GPU solveContactConstraint (69.12 %) :: 14.115 ms / frame (1 calls) | |
.........1 -- batching (30.53 %) :: 6.234 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.34 %) :: 0.070 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.115 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.11 %) :: 6.509 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.89 %) :: 7.606 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.234 ms) --- | |
............0 -- m_contactToConstraintKernel (27.06 %) :: 1.687 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.18 %) :: 0.011 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.38 %) :: 4.450 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.962 %) :: 0.060 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.011 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.011 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.011 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.011 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.070 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.43 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.571 %) :: 0.069 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.679 ms) --- | |
.........0 -- clipHullHullKernel (58.64 %) :: 1.571 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.46 %) :: 0.173 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.82 %) :: 0.665 ms / frame (1 calls) | |
.........Unaccounted: (10.078 %) :: 0.270 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.813 ms) --- | |
.........0 -- sapKernel (71.05 %) :: 4.130 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.88 %) :: 0.109 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.25 %) :: 0.131 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.41 %) :: 0.489 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.79 %) :: 0.104 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.75 %) :: 0.102 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.63 %) :: 0.676 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.204 %) :: 0.070 ms | |
================================== | |
Frame 52: | |
---------------------------------- | |
Profiling: Root (total running time: 205.106 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.07 %) :: 0.145 ms / frame (1 calls) | |
3 -- gui->draw (0.60 %) :: 1.235 ms / frame (1 calls) | |
4 -- renderScene (0.14 %) :: 0.277 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.16 %) :: 203.380 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.034 %) :: 0.069 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.277 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (92.06 %) :: 0.255 ms / frame (2 calls) | |
...Unaccounted: (7.942 %) :: 0.022 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.255 ms) --- | |
......0 -- glUseProgram(0); (1.96 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.71 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (40.39 %) :: 0.103 ms / frame (6 calls) | |
......3 -- glFlush2 (7.84 %) :: 0.020 ms / frame (2 calls) | |
......4 -- updateCamera (1.18 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (1.57 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (42.353 %) :: 0.108 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (75.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (25.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 203.380 ms) --- | |
...0 -- cl2gl_upload (85.89 %) :: 174.690 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.11 %) :: 28.687 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 28.687 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (70.46 %) :: 20.212 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.43 %) :: 2.705 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.58 %) :: 0.166 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (19.25 %) :: 5.523 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.006 ms / frame (1 calls) | |
......Unaccounted: (0.261 %) :: 0.075 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.212 ms) --- | |
.........0 -- GPU solveContactConstraint (69.70 %) :: 14.087 ms / frame (1 calls) | |
.........1 -- batching (29.95 %) :: 6.053 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.36 %) :: 0.072 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.087 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.96 %) :: 6.475 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.04 %) :: 7.612 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.053 ms) --- | |
............0 -- m_contactToConstraintKernel (27.13 %) :: 1.642 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.08 %) :: 0.005 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.70 %) :: 4.340 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.23 %) :: 0.014 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.694 %) :: 0.042 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.005 ms) --- | |
...............0 -- batch generation (80.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (20.000 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.072 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.78 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.222 %) :: 0.070 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.705 ms) --- | |
.........0 -- clipHullHullKernel (59.15 %) :: 1.600 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.80 %) :: 0.184 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.07 %) :: 0.651 ms / frame (1 calls) | |
.........Unaccounted: (9.982 %) :: 0.270 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.523 ms) --- | |
.........0 -- sapKernel (74.40 %) :: 4.109 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.15 %) :: 0.119 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.72 %) :: 0.095 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.62 %) :: 0.421 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.48 %) :: 0.082 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.45 %) :: 0.080 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (9.80 %) :: 0.541 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.340 %) :: 0.074 ms | |
================================== | |
Frame 53: | |
---------------------------------- | |
Profiling: Root (total running time: 205.714 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.09 %) :: 0.186 ms / frame (1 calls) | |
3 -- gui->draw (0.63 %) :: 1.292 ms / frame (1 calls) | |
4 -- renderScene (0.25 %) :: 0.516 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.00 %) :: 203.652 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.033 %) :: 0.068 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.516 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.96 %) :: 0.490 ms / frame (2 calls) | |
...Unaccounted: (5.039 %) :: 0.026 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.490 ms) --- | |
......0 -- glUseProgram(0); (1.02 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (3.88 %) :: 0.019 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (48.78 %) :: 0.239 ms / frame (6 calls) | |
......3 -- glFlush2 (4.29 %) :: 0.021 ms / frame (2 calls) | |
......4 -- updateCamera (0.20 %) :: 0.001 ms / frame (2 calls) | |
......5 -- init (0.82 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (41.020 %) :: 0.201 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (50.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 203.652 ms) --- | |
...0 -- cl2gl_upload (85.50 %) :: 174.124 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.003 ms / frame (1 calls) | |
...2 -- stepSimulation (14.50 %) :: 29.524 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.524 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (70.38 %) :: 20.780 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (8.47 %) :: 2.501 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.85 %) :: 0.252 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (19.87 %) :: 5.866 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.396 %) :: 0.117 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.780 ms) --- | |
.........0 -- GPU solveContactConstraint (70.10 %) :: 14.566 ms / frame (1 calls) | |
.........1 -- batching (29.49 %) :: 6.127 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.42 %) :: 0.087 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.566 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.99 %) :: 6.699 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.00 %) :: 7.866 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.127 ms) --- | |
............0 -- m_contactToConstraintKernel (27.40 %) :: 1.679 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.11 %) :: 0.007 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.78 %) :: 4.398 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.00 %) :: 0.000 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.294 %) :: 0.018 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.007 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.007 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.007 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.007 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.087 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.30 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (97.701 %) :: 0.085 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.501 ms) --- | |
.........0 -- clipHullHullKernel (62.65 %) :: 1.567 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (5.28 %) :: 0.132 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.67 %) :: 0.617 ms / frame (1 calls) | |
.........Unaccounted: (7.397 %) :: 0.185 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.866 ms) --- | |
.........0 -- sapKernel (71.10 %) :: 4.171 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.69 %) :: 0.099 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.11 %) :: 0.124 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.86 %) :: 0.461 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.91 %) :: 0.112 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.69 %) :: 0.099 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (12.26 %) :: 0.719 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.347 %) :: 0.079 ms | |
================================== | |
Frame 54: | |
---------------------------------- | |
Profiling: Root (total running time: 204.805 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.001 ms / frame (1 calls) | |
2 -- window->endRendering (0.09 %) :: 0.176 ms / frame (1 calls) | |
3 -- gui->draw (0.60 %) :: 1.235 ms / frame (1 calls) | |
4 -- renderScene (0.17 %) :: 0.344 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.11 %) :: 202.986 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.031 %) :: 0.063 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.344 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.60 %) :: 0.322 ms / frame (2 calls) | |
...Unaccounted: (6.395 %) :: 0.022 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.322 ms) --- | |
......0 -- glUseProgram(0); (1.55 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (3.73 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (43.79 %) :: 0.141 ms / frame (6 calls) | |
......3 -- glFlush2 (6.52 %) :: 0.021 ms / frame (2 calls) | |
......4 -- updateCamera (0.62 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (1.55 %) :: 0.005 ms / frame (2 calls) | |
......Unaccounted: (42.236 %) :: 0.136 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.005 ms) --- | |
.........0 -- texture (60.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (40.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 202.986 ms) --- | |
...0 -- cl2gl_upload (84.05 %) :: 170.608 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (15.95 %) :: 32.373 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.003 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 32.373 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (66.62 %) :: 21.567 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (10.01 %) :: 3.242 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (1.07 %) :: 0.346 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (21.93 %) :: 7.098 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.349 %) :: 0.113 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.567 ms) --- | |
.........0 -- GPU solveContactConstraint (68.25 %) :: 14.720 ms / frame (1 calls) | |
.........1 -- batching (31.16 %) :: 6.721 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.58 %) :: 0.125 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.720 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.89 %) :: 6.755 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.11 %) :: 7.965 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.721 ms) --- | |
............0 -- m_contactToConstraintKernel (27.67 %) :: 1.860 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.09 %) :: 0.006 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.02 %) :: 4.773 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.01 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.21 %) :: 0.014 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.833 %) :: 0.056 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.006 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.006 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.006 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.006 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.125 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (0.80 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (99.200 %) :: 0.124 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 3.242 ms) --- | |
.........0 -- clipHullHullKernel (57.90 %) :: 1.877 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.35 %) :: 0.206 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (21.84 %) :: 0.708 ms / frame (1 calls) | |
.........Unaccounted: (13.911 %) :: 0.451 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 7.098 ms) --- | |
.........0 -- sapKernel (61.33 %) :: 4.353 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.30 %) :: 0.163 ms / frame (1 calls) | |
.........2 -- scatterKernel (3.42 %) :: 0.243 ms / frame (1 calls) | |
.........3 -- gpu radix sort (9.96 %) :: 0.707 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.93 %) :: 0.208 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.92 %) :: 0.136 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (14.86 %) :: 1.055 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (3.254 %) :: 0.231 ms | |
================================== | |
Frame 55: | |
---------------------------------- | |
Profiling: Root (total running time: 204.830 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.08 %) :: 0.158 ms / frame (1 calls) | |
3 -- gui->draw (0.61 %) :: 1.248 ms / frame (1 calls) | |
4 -- renderScene (0.19 %) :: 0.387 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.09 %) :: 202.976 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.030 %) :: 0.061 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.387 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.83 %) :: 0.367 ms / frame (2 calls) | |
...Unaccounted: (5.168 %) :: 0.020 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.367 ms) --- | |
......0 -- glUseProgram(0); (3.00 %) :: 0.011 ms / frame (2 calls) | |
......1 -- glFlush (3.54 %) :: 0.013 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (50.95 %) :: 0.187 ms / frame (6 calls) | |
......3 -- glFlush2 (4.63 %) :: 0.017 ms / frame (2 calls) | |
......4 -- updateCamera (0.82 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (0.82 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (36.240 %) :: 0.133 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (66.67 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (33.333 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 202.976 ms) --- | |
...0 -- cl2gl_upload (85.42 %) :: 173.382 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.58 %) :: 29.590 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.590 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (68.54 %) :: 20.282 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (10.31 %) :: 3.052 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.96 %) :: 0.283 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (19.80 %) :: 5.860 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.355 %) :: 0.105 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.282 ms) --- | |
.........0 -- GPU solveContactConstraint (68.39 %) :: 13.870 ms / frame (1 calls) | |
.........1 -- batching (31.26 %) :: 6.341 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.35 %) :: 0.070 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 13.870 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.77 %) :: 6.348 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.23 %) :: 7.522 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.341 ms) --- | |
............0 -- m_contactToConstraintKernel (28.15 %) :: 1.785 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (70.84 %) :: 4.492 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.14 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.536 %) :: 0.034 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.070 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.43 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.571 %) :: 0.069 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 3.052 ms) --- | |
.........0 -- clipHullHullKernel (54.72 %) :: 1.670 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (8.39 %) :: 0.256 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.17 %) :: 0.707 ms / frame (1 calls) | |
.........Unaccounted: (13.729 %) :: 0.419 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.860 ms) --- | |
.........0 -- sapKernel (71.31 %) :: 4.179 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.61 %) :: 0.153 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.68 %) :: 0.157 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.16 %) :: 0.478 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.13 %) :: 0.066 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.38 %) :: 0.081 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.31 %) :: 0.663 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (1.399 %) :: 0.082 ms | |
================================== | |
Frame 56: | |
---------------------------------- | |
Profiling: Root (total running time: 204.577 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.06 %) :: 0.115 ms / frame (1 calls) | |
3 -- gui->draw (0.60 %) :: 1.233 ms / frame (1 calls) | |
4 -- renderScene (0.14 %) :: 0.290 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.17 %) :: 202.881 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.028 %) :: 0.058 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.290 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.14 %) :: 0.273 ms / frame (2 calls) | |
...Unaccounted: (5.862 %) :: 0.017 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.273 ms) --- | |
......0 -- glUseProgram(0); (1.47 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (4.40 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (45.42 %) :: 0.124 ms / frame (6 calls) | |
......3 -- glFlush2 (3.66 %) :: 0.010 ms / frame (2 calls) | |
......4 -- updateCamera (0.73 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.73 %) :: 0.002 ms / frame (2 calls) | |
......Unaccounted: (43.590 %) :: 0.119 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.002 ms) --- | |
.........0 -- texture (100.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 202.881 ms) --- | |
...0 -- cl2gl_upload (85.35 %) :: 173.166 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.65 %) :: 29.712 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.712 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (69.64 %) :: 20.691 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.51 %) :: 2.826 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.75 %) :: 0.224 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (19.77 %) :: 5.875 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.300 %) :: 0.089 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.691 ms) --- | |
.........0 -- GPU solveContactConstraint (69.53 %) :: 14.387 ms / frame (1 calls) | |
.........1 -- batching (30.14 %) :: 6.236 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.31 %) :: 0.065 ms / frame (1 calls) | |
.........Unaccounted: (0.015 %) :: 0.003 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.387 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.56 %) :: 6.699 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.43 %) :: 7.687 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.236 ms) --- | |
............0 -- m_contactToConstraintKernel (27.82 %) :: 1.735 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.08 %) :: 0.005 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.63 %) :: 4.467 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.064 %) :: 0.004 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.005 ms) --- | |
...............0 -- batch generation (80.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (20.000 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.065 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.54 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.462 %) :: 0.064 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.826 ms) --- | |
.........0 -- clipHullHullKernel (58.14 %) :: 1.643 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (7.86 %) :: 0.222 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.39 %) :: 0.661 ms / frame (1 calls) | |
.........Unaccounted: (10.616 %) :: 0.300 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.875 ms) --- | |
.........0 -- sapKernel (70.47 %) :: 4.140 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.64 %) :: 0.155 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.87 %) :: 0.110 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.85 %) :: 0.461 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.50 %) :: 0.088 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.66 %) :: 0.156 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.56 %) :: 0.679 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.430 %) :: 0.084 ms | |
================================== | |
Frame 57: | |
---------------------------------- | |
Profiling: Root (total running time: 204.876 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.001 ms / frame (1 calls) | |
2 -- window->endRendering (0.08 %) :: 0.162 ms / frame (1 calls) | |
3 -- gui->draw (0.61 %) :: 1.258 ms / frame (1 calls) | |
4 -- renderScene (0.23 %) :: 0.478 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.05 %) :: 202.923 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.026 %) :: 0.054 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.478 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.35 %) :: 0.451 ms / frame (2 calls) | |
...Unaccounted: (5.649 %) :: 0.027 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.451 ms) --- | |
......0 -- glUseProgram(0); (1.11 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.43 %) :: 0.020 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (52.55 %) :: 0.237 ms / frame (6 calls) | |
......3 -- glFlush2 (4.43 %) :: 0.020 ms / frame (2 calls) | |
......4 -- updateCamera (0.44 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (1.11 %) :: 0.005 ms / frame (2 calls) | |
......Unaccounted: (35.920 %) :: 0.162 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.005 ms) --- | |
.........0 -- texture (60.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (40.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 202.923 ms) --- | |
...0 -- cl2gl_upload (85.29 %) :: 173.067 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.71 %) :: 29.853 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.853 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (68.42 %) :: 20.426 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (10.28 %) :: 3.068 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.96 %) :: 0.286 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (20.02 %) :: 5.978 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.01 %) :: 0.004 ms / frame (1 calls) | |
......Unaccounted: (0.305 %) :: 0.091 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.426 ms) --- | |
.........0 -- GPU solveContactConstraint (69.31 %) :: 14.157 ms / frame (1 calls) | |
.........1 -- batching (30.38 %) :: 6.205 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.31 %) :: 0.064 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.157 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.91 %) :: 6.500 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.08 %) :: 7.656 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.205 ms) --- | |
............0 -- m_contactToConstraintKernel (27.51 %) :: 1.707 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.05 %) :: 0.003 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.86 %) :: 4.459 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.00 %) :: 0.000 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.10 %) :: 0.006 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.226 %) :: 0.014 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.003 ms) --- | |
...............0 -- batch generation (66.67 %) :: 0.002 ms / frame (1 calls) | |
...............Unaccounted: (33.333 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.002 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.002 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.064 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.56 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.437 %) :: 0.063 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 3.068 ms) --- | |
.........0 -- clipHullHullKernel (54.07 %) :: 1.659 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (8.96 %) :: 0.275 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.37 %) :: 0.717 ms / frame (1 calls) | |
.........Unaccounted: (13.592 %) :: 0.417 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.978 ms) --- | |
.........0 -- sapKernel (69.96 %) :: 4.182 ms / frame (1 calls) | |
.........1 -- sap2Kernel (3.14 %) :: 0.188 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.19 %) :: 0.131 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.75 %) :: 0.463 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.34 %) :: 0.080 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (1.35 %) :: 0.081 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (13.08 %) :: 0.782 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (1.171 %) :: 0.070 ms | |
================================== | |
Frame 58: | |
---------------------------------- | |
Profiling: Root (total running time: 204.303 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.08 %) :: 0.164 ms / frame (1 calls) | |
3 -- gui->draw (0.60 %) :: 1.232 ms / frame (1 calls) | |
4 -- renderScene (0.16 %) :: 0.335 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.13 %) :: 202.529 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.021 %) :: 0.043 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.335 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.33 %) :: 0.316 ms / frame (2 calls) | |
...Unaccounted: (5.672 %) :: 0.019 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.316 ms) --- | |
......0 -- glUseProgram(0); (0.95 %) :: 0.003 ms / frame (2 calls) | |
......1 -- glFlush (3.80 %) :: 0.012 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (48.10 %) :: 0.152 ms / frame (6 calls) | |
......3 -- glFlush2 (4.43 %) :: 0.014 ms / frame (2 calls) | |
......4 -- updateCamera (0.95 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (1.27 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (40.506 %) :: 0.128 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (50.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 202.529 ms) --- | |
...0 -- cl2gl_upload (85.03 %) :: 172.202 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.97 %) :: 30.324 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.324 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (67.87 %) :: 20.582 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (10.08 %) :: 3.057 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.91 %) :: 0.276 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (20.72 %) :: 6.282 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.392 %) :: 0.119 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.582 ms) --- | |
.........0 -- GPU solveContactConstraint (69.38 %) :: 14.279 ms / frame (1 calls) | |
.........1 -- batching (30.27 %) :: 6.231 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.35 %) :: 0.072 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.279 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.00 %) :: 6.568 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.00 %) :: 7.711 ms / frame (1 calls) | |
............Unaccounted: (-0.000 %) :: -0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.231 ms) --- | |
............0 -- m_contactToConstraintKernel (27.56 %) :: 1.717 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.10 %) :: 0.006 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.83 %) :: 4.476 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.21 %) :: 0.013 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.144 %) :: 0.009 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.006 ms) --- | |
...............0 -- batch generation (83.33 %) :: 0.005 ms / frame (1 calls) | |
...............Unaccounted: (16.667 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.005 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.005 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.072 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.39 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.611 %) :: 0.071 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 3.057 ms) --- | |
.........0 -- clipHullHullKernel (54.60 %) :: 1.669 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (9.19 %) :: 0.281 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (22.90 %) :: 0.700 ms / frame (1 calls) | |
.........Unaccounted: (13.314 %) :: 0.407 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.282 ms) --- | |
.........0 -- sapKernel (66.70 %) :: 4.190 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.82 %) :: 0.177 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.21 %) :: 0.139 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.09 %) :: 0.508 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.56 %) :: 0.161 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.42 %) :: 0.152 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (13.53 %) :: 0.850 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.640 %) :: 0.103 ms | |
================================== | |
Frame 59: | |
---------------------------------- | |
Profiling: Root (total running time: 204.483 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.07 %) :: 0.142 ms / frame (1 calls) | |
3 -- gui->draw (0.60 %) :: 1.221 ms / frame (1 calls) | |
4 -- renderScene (0.22 %) :: 0.459 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.08 %) :: 202.600 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.030 %) :: 0.061 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.459 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.68 %) :: 0.430 ms / frame (2 calls) | |
...Unaccounted: (6.318 %) :: 0.029 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.430 ms) --- | |
......0 -- glUseProgram(0); (0.93 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (3.26 %) :: 0.014 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (41.86 %) :: 0.180 ms / frame (6 calls) | |
......3 -- glFlush2 (4.42 %) :: 0.019 ms / frame (2 calls) | |
......4 -- updateCamera (2.09 %) :: 0.009 ms / frame (2 calls) | |
......5 -- init (3.02 %) :: 0.013 ms / frame (2 calls) | |
......Unaccounted: (44.419 %) :: 0.191 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.013 ms) --- | |
.........0 -- texture (23.08 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (76.923 %) :: 0.010 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 202.600 ms) --- | |
...0 -- cl2gl_upload (85.07 %) :: 172.346 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (14.93 %) :: 30.251 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.251 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (70.23 %) :: 21.245 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.14 %) :: 2.764 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.69 %) :: 0.208 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (19.61 %) :: 5.933 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.006 ms / frame (1 calls) | |
......Unaccounted: (0.314 %) :: 0.095 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 21.245 ms) --- | |
.........0 -- GPU solveContactConstraint (69.85 %) :: 14.839 ms / frame (1 calls) | |
.........1 -- batching (29.83 %) :: 6.337 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.32 %) :: 0.069 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.839 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.60 %) :: 6.767 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.40 %) :: 8.072 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.337 ms) --- | |
............0 -- m_contactToConstraintKernel (27.36 %) :: 1.734 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.05 %) :: 0.003 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.08 %) :: 4.568 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.13 %) :: 0.008 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.21 %) :: 0.013 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.126 %) :: 0.008 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.003 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.003 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.003 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.003 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.069 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.45 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.551 %) :: 0.068 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.764 ms) --- | |
.........0 -- clipHullHullKernel (59.59 %) :: 1.647 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.11 %) :: 0.169 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.02 %) :: 0.664 ms / frame (1 calls) | |
.........Unaccounted: (10.275 %) :: 0.284 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.933 ms) --- | |
.........0 -- sapKernel (69.80 %) :: 4.141 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.63 %) :: 0.156 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.63 %) :: 0.156 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.92 %) :: 0.470 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.48 %) :: 0.088 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.71 %) :: 0.161 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.41 %) :: 0.677 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.382 %) :: 0.082 ms | |
================================== | |
Frame 60: | |
---------------------------------- | |
Profiling: Root (total running time: 204.312 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.06 %) :: 0.131 ms / frame (1 calls) | |
3 -- gui->draw (0.60 %) :: 1.222 ms / frame (1 calls) | |
4 -- renderScene (0.21 %) :: 0.421 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.11 %) :: 202.497 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.020 %) :: 0.041 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.421 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (92.40 %) :: 0.389 ms / frame (2 calls) | |
...Unaccounted: (7.601 %) :: 0.032 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.389 ms) --- | |
......0 -- glUseProgram(0); (1.03 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (4.11 %) :: 0.016 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (48.07 %) :: 0.187 ms / frame (6 calls) | |
......3 -- glFlush2 (4.11 %) :: 0.016 ms / frame (2 calls) | |
......4 -- updateCamera (0.51 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.77 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (41.388 %) :: 0.161 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (66.67 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (33.333 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 202.497 ms) --- | |
...0 -- cl2gl_upload (85.37 %) :: 172.862 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.63 %) :: 29.632 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.632 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (68.32 %) :: 20.246 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.59 %) :: 2.843 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.80 %) :: 0.238 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (20.87 %) :: 6.184 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.03 %) :: 0.008 ms / frame (1 calls) | |
......Unaccounted: (0.381 %) :: 0.113 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.246 ms) --- | |
.........0 -- GPU solveContactConstraint (69.07 %) :: 13.983 ms / frame (1 calls) | |
.........1 -- batching (30.55 %) :: 6.186 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.38 %) :: 0.076 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 13.983 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.83 %) :: 6.409 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.16 %) :: 7.573 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.186 ms) --- | |
............0 -- m_contactToConstraintKernel (27.34 %) :: 1.691 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.34 %) :: 4.413 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.15 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.824 %) :: 0.051 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.076 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.32 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.684 %) :: 0.075 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.843 ms) --- | |
.........0 -- clipHullHullKernel (57.83 %) :: 1.644 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (7.91 %) :: 0.225 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.50 %) :: 0.668 ms / frame (1 calls) | |
.........Unaccounted: (10.763 %) :: 0.306 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.184 ms) --- | |
.........0 -- sapKernel (67.98 %) :: 4.204 ms / frame (1 calls) | |
.........1 -- sap2Kernel (3.10 %) :: 0.192 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.80 %) :: 0.173 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.10 %) :: 0.501 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.34 %) :: 0.145 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.73 %) :: 0.169 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (11.22 %) :: 0.694 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.682 %) :: 0.104 ms | |
================================== | |
Frame 61: | |
---------------------------------- | |
Profiling: Root (total running time: 204.018 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.06 %) :: 0.121 ms / frame (1 calls) | |
3 -- gui->draw (0.60 %) :: 1.228 ms / frame (1 calls) | |
4 -- renderScene (0.18 %) :: 0.359 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.14 %) :: 202.269 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.020 %) :: 0.041 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.359 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (95.26 %) :: 0.342 ms / frame (2 calls) | |
...Unaccounted: (4.735 %) :: 0.017 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.342 ms) --- | |
......0 -- glUseProgram(0); (1.17 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (3.80 %) :: 0.013 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (46.78 %) :: 0.160 ms / frame (6 calls) | |
......3 -- glFlush2 (6.43 %) :: 0.022 ms / frame (2 calls) | |
......4 -- updateCamera (0.58 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.88 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (40.351 %) :: 0.138 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (33.33 %) :: 0.001 ms / frame (2 calls) | |
.........Unaccounted: (66.667 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 202.269 ms) --- | |
...0 -- cl2gl_upload (85.48 %) :: 172.896 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.52 %) :: 29.370 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 29.370 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (69.26 %) :: 20.343 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.10 %) :: 2.672 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.80 %) :: 0.234 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (20.43 %) :: 6.001 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.006 ms / frame (1 calls) | |
......Unaccounted: (0.388 %) :: 0.114 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.343 ms) --- | |
.........0 -- GPU solveContactConstraint (68.67 %) :: 13.969 ms / frame (1 calls) | |
.........1 -- batching (31.01 %) :: 6.308 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.32 %) :: 0.066 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 13.969 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.07 %) :: 6.435 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.92 %) :: 7.532 ms / frame (1 calls) | |
............Unaccounted: (0.014 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.308 ms) --- | |
............0 -- m_contactToConstraintKernel (27.36 %) :: 1.726 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.08 %) :: 0.005 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.24 %) :: 4.494 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.00 %) :: 0.000 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.11 %) :: 0.007 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.22 %) :: 0.014 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.951 %) :: 0.060 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.005 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.005 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.005 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.005 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.066 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.52 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.485 %) :: 0.065 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.672 ms) --- | |
.........0 -- clipHullHullKernel (59.51 %) :: 1.590 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (5.76 %) :: 0.154 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (25.07 %) :: 0.670 ms / frame (1 calls) | |
.........Unaccounted: (9.656 %) :: 0.258 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.001 ms) --- | |
.........0 -- sapKernel (69.29 %) :: 4.158 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.87 %) :: 0.112 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.60 %) :: 0.096 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.18 %) :: 0.491 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.38 %) :: 0.143 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.38 %) :: 0.143 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (13.48 %) :: 0.809 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (0.800 %) :: 0.048 ms | |
================================== | |
Frame 62: | |
---------------------------------- | |
Profiling: Root (total running time: 204.274 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.07 %) :: 0.148 ms / frame (1 calls) | |
3 -- gui->draw (0.62 %) :: 1.270 ms / frame (1 calls) | |
4 -- renderScene (0.20 %) :: 0.412 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.09 %) :: 202.405 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.019 %) :: 0.039 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.412 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (92.96 %) :: 0.383 ms / frame (2 calls) | |
...Unaccounted: (7.039 %) :: 0.029 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.383 ms) --- | |
......0 -- glUseProgram(0); (1.04 %) :: 0.004 ms / frame (2 calls) | |
......1 -- glFlush (3.92 %) :: 0.015 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (40.21 %) :: 0.154 ms / frame (6 calls) | |
......3 -- glFlush2 (9.92 %) :: 0.038 ms / frame (2 calls) | |
......4 -- updateCamera (0.52 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (1.04 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (43.342 %) :: 0.166 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (50.00 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.002 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 202.405 ms) --- | |
...0 -- cl2gl_upload (85.67 %) :: 173.409 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (14.32 %) :: 28.993 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.002 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 28.993 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (70.89 %) :: 20.553 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.21 %) :: 2.671 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.60 %) :: 0.174 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (18.99 %) :: 5.507 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.005 ms / frame (1 calls) | |
......Unaccounted: (0.286 %) :: 0.083 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.553 ms) --- | |
.........0 -- GPU solveContactConstraint (69.90 %) :: 14.366 ms / frame (1 calls) | |
.........1 -- batching (29.83 %) :: 6.130 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.27 %) :: 0.056 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.366 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.16 %) :: 6.631 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.84 %) :: 7.735 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.130 ms) --- | |
............0 -- m_contactToConstraintKernel (27.10 %) :: 1.661 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.07 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.38 %) :: 4.437 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.08 %) :: 0.005 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.23 %) :: 0.014 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.114 %) :: 0.007 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.056 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (0.00 %) :: 0.000 ms / frame (1 calls) | |
............Unaccounted: (100.000 %) :: 0.056 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.671 ms) --- | |
.........0 -- clipHullHullKernel (60.20 %) :: 1.608 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (6.21 %) :: 0.166 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (25.27 %) :: 0.675 ms / frame (1 calls) | |
.........Unaccounted: (8.311 %) :: 0.222 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.507 ms) --- | |
.........0 -- sapKernel (74.60 %) :: 4.108 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.32 %) :: 0.128 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.45 %) :: 0.080 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.90 %) :: 0.435 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (0.98 %) :: 0.054 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (0.94 %) :: 0.052 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (10.70 %) :: 0.589 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.02 %) :: 0.001 ms / frame (1 calls) | |
.........Unaccounted: (1.090 %) :: 0.060 ms | |
================================== | |
Frame 63: | |
---------------------------------- | |
Profiling: Root (total running time: 407.169 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (49.88 %) :: 203.104 ms / frame (1 calls) | |
3 -- gui->draw (0.35 %) :: 1.409 ms / frame (1 calls) | |
4 -- renderScene (0.13 %) :: 0.514 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (49.63 %) :: 202.077 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.016 %) :: 0.065 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.514 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.16 %) :: 0.484 ms / frame (2 calls) | |
...Unaccounted: (5.837 %) :: 0.030 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.484 ms) --- | |
......0 -- glUseProgram(0); (2.48 %) :: 0.012 ms / frame (2 calls) | |
......1 -- glFlush (3.72 %) :: 0.018 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (41.12 %) :: 0.199 ms / frame (6 calls) | |
......3 -- glFlush2 (3.72 %) :: 0.018 ms / frame (2 calls) | |
......4 -- updateCamera (0.62 %) :: 0.003 ms / frame (2 calls) | |
......5 -- init (0.83 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (47.521 %) :: 0.230 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (75.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (25.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 202.077 ms) --- | |
...0 -- cl2gl_upload (84.91 %) :: 171.591 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (15.08 %) :: 30.476 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.001 ms / frame (1 calls) | |
...Unaccounted: (0.004 %) :: 0.008 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 30.476 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (67.76 %) :: 20.652 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.98 %) :: 3.043 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (1.09 %) :: 0.333 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (20.61 %) :: 6.282 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.007 ms / frame (1 calls) | |
......Unaccounted: (0.522 %) :: 0.159 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 20.652 ms) --- | |
.........0 -- GPU solveContactConstraint (69.35 %) :: 14.323 ms / frame (1 calls) | |
.........1 -- batching (30.27 %) :: 6.252 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.37 %) :: 0.076 ms / frame (1 calls) | |
.........Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 14.323 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (46.30 %) :: 6.631 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (53.70 %) :: 7.691 ms / frame (1 calls) | |
............Unaccounted: (0.007 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.252 ms) --- | |
............0 -- m_contactToConstraintKernel (27.75 %) :: 1.735 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (70.79 %) :: 4.426 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.14 %) :: 0.009 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.27 %) :: 0.017 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.03 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (0.928 %) :: 0.058 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.076 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.32 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.684 %) :: 0.075 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 3.043 ms) --- | |
.........0 -- clipHullHullKernel (54.29 %) :: 1.652 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (8.18 %) :: 0.249 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (23.14 %) :: 0.704 ms / frame (1 calls) | |
.........Unaccounted: (14.394 %) :: 0.438 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.282 ms) --- | |
.........0 -- sapKernel (66.97 %) :: 4.207 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.79 %) :: 0.175 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.87 %) :: 0.180 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.37 %) :: 0.526 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.23 %) :: 0.140 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.40 %) :: 0.151 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (12.43 %) :: 0.781 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.910 %) :: 0.120 ms | |
================================== | |
Frame 64: | |
---------------------------------- | |
Profiling: Root (total running time: 47.174 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.81 %) :: 0.380 ms / frame (1 calls) | |
3 -- gui->draw (5.87 %) :: 2.771 ms / frame (1 calls) | |
4 -- renderScene (1.47 %) :: 0.694 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (86.94 %) :: 41.013 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (4.909 %) :: 2.316 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.694 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (91.79 %) :: 0.637 ms / frame (2 calls) | |
...Unaccounted: (8.213 %) :: 0.057 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.637 ms) --- | |
......0 -- glUseProgram(0); (1.73 %) :: 0.011 ms / frame (2 calls) | |
......1 -- glFlush (5.02 %) :: 0.032 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (46.47 %) :: 0.296 ms / frame (6 calls) | |
......3 -- glFlush2 (5.65 %) :: 0.036 ms / frame (2 calls) | |
......4 -- updateCamera (0.63 %) :: 0.004 ms / frame (2 calls) | |
......5 -- init (1.26 %) :: 0.008 ms / frame (2 calls) | |
......Unaccounted: (39.246 %) :: 0.250 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.008 ms) --- | |
.........0 -- texture (62.50 %) :: 0.005 ms / frame (2 calls) | |
.........Unaccounted: (37.500 %) :: 0.003 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 41.013 ms) --- | |
...0 -- cl2gl_upload (5.35 %) :: 2.194 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.01 %) :: 0.003 ms / frame (1 calls) | |
...2 -- stepSimulation (94.64 %) :: 38.813 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.007 %) :: 0.003 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 38.813 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (72.11 %) :: 27.989 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (10.38 %) :: 4.027 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.71 %) :: 0.275 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (16.43 %) :: 6.376 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.06 %) :: 0.023 ms / frame (1 calls) | |
......Unaccounted: (0.317 %) :: 0.123 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 27.989 ms) --- | |
.........0 -- GPU solveContactConstraint (76.55 %) :: 21.426 ms / frame (1 calls) | |
.........1 -- batching (22.88 %) :: 6.405 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.56 %) :: 0.156 ms / frame (1 calls) | |
.........Unaccounted: (0.007 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 21.426 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.66 %) :: 9.783 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.34 %) :: 11.642 ms / frame (1 calls) | |
............Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.405 ms) --- | |
............0 -- m_contactToConstraintKernel (26.99 %) :: 1.729 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.16 %) :: 0.010 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (70.99 %) :: 4.547 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.25 %) :: 0.016 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.50 %) :: 0.032 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.05 %) :: 0.003 ms / frame (1 calls) | |
............Unaccounted: (1.030 %) :: 0.066 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.010 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.010 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.010 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.010 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.156 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.92 %) :: 0.003 ms / frame (1 calls) | |
............Unaccounted: (98.077 %) :: 0.153 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 4.027 ms) --- | |
.........0 -- clipHullHullKernel (46.64 %) :: 1.878 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (15.22 %) :: 0.613 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (19.49 %) :: 0.785 ms / frame (1 calls) | |
.........Unaccounted: (18.649 %) :: 0.751 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.376 ms) --- | |
.........0 -- sapKernel (66.39 %) :: 4.233 ms / frame (1 calls) | |
.........1 -- sap2Kernel (3.18 %) :: 0.203 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.16 %) :: 0.138 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.28 %) :: 0.528 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.90 %) :: 0.121 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.01 %) :: 0.128 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (14.02 %) :: 0.894 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.06 %) :: 0.004 ms / frame (1 calls) | |
.........Unaccounted: (1.992 %) :: 0.127 ms | |
================================== | |
Frame 65: | |
---------------------------------- | |
Profiling: Root (total running time: 201.548 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.09 %) :: 0.173 ms / frame (1 calls) | |
3 -- gui->draw (0.65 %) :: 1.313 ms / frame (1 calls) | |
4 -- renderScene (0.22 %) :: 0.453 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (98.98 %) :: 199.496 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.056 %) :: 0.113 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.453 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (94.48 %) :: 0.428 ms / frame (2 calls) | |
...Unaccounted: (5.519 %) :: 0.025 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.428 ms) --- | |
......0 -- glUseProgram(0); (1.17 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.44 %) :: 0.019 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (46.96 %) :: 0.201 ms / frame (6 calls) | |
......3 -- glFlush2 (7.01 %) :: 0.030 ms / frame (2 calls) | |
......4 -- updateCamera (0.47 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.70 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (39.252 %) :: 0.168 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (66.67 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (33.333 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 199.496 ms) --- | |
...0 -- cl2gl_upload (79.35 %) :: 158.300 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.003 ms / frame (1 calls) | |
...2 -- stepSimulation (20.65 %) :: 41.192 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 41.192 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (69.48 %) :: 28.622 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (9.59 %) :: 3.950 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (1.39 %) :: 0.574 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (18.88 %) :: 7.777 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.03 %) :: 0.012 ms / frame (1 calls) | |
......Unaccounted: (0.624 %) :: 0.257 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 28.622 ms) --- | |
.........0 -- GPU solveContactConstraint (76.07 %) :: 21.773 ms / frame (1 calls) | |
.........1 -- batching (23.37 %) :: 6.688 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.56 %) :: 0.159 ms / frame (1 calls) | |
.........Unaccounted: (0.007 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 21.773 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.48 %) :: 9.903 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.51 %) :: 11.869 ms / frame (1 calls) | |
............Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.688 ms) --- | |
............0 -- m_contactToConstraintKernel (26.81 %) :: 1.793 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.10 %) :: 0.007 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.04 %) :: 4.751 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.21 %) :: 0.014 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.48 %) :: 0.032 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.04 %) :: 0.003 ms / frame (1 calls) | |
............Unaccounted: (1.286 %) :: 0.086 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.007 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.007 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.007 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.007 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.159 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.26 %) :: 0.002 ms / frame (1 calls) | |
............Unaccounted: (98.742 %) :: 0.157 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 3.950 ms) --- | |
.........0 -- clipHullHullKernel (45.19 %) :: 1.785 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (10.91 %) :: 0.431 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (20.71 %) :: 0.818 ms / frame (1 calls) | |
.........Unaccounted: (23.190 %) :: 0.916 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 7.777 ms) --- | |
.........0 -- sapKernel (57.05 %) :: 4.437 ms / frame (1 calls) | |
.........1 -- sap2Kernel (4.98 %) :: 0.387 ms / frame (1 calls) | |
.........2 -- scatterKernel (3.59 %) :: 0.279 ms / frame (1 calls) | |
.........3 -- gpu radix sort (9.00 %) :: 0.700 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (3.21 %) :: 0.250 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (3.16 %) :: 0.246 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (15.80 %) :: 1.229 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.003 ms / frame (1 calls) | |
.........Unaccounted: (3.163 %) :: 0.246 ms | |
================================== | |
Frame 66: | |
---------------------------------- | |
Profiling: Root (total running time: 203.365 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.06 %) :: 0.125 ms / frame (1 calls) | |
3 -- gui->draw (0.62 %) :: 1.253 ms / frame (1 calls) | |
4 -- renderScene (0.21 %) :: 0.418 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.10 %) :: 201.528 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.020 %) :: 0.041 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.418 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (93.78 %) :: 0.392 ms / frame (2 calls) | |
...Unaccounted: (6.220 %) :: 0.026 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.392 ms) --- | |
......0 -- glUseProgram(0); (1.28 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (4.08 %) :: 0.016 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (38.52 %) :: 0.151 ms / frame (6 calls) | |
......3 -- glFlush2 (7.91 %) :: 0.031 ms / frame (2 calls) | |
......4 -- updateCamera (0.51 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.77 %) :: 0.003 ms / frame (2 calls) | |
......Unaccounted: (46.939 %) :: 0.184 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.003 ms) --- | |
.........0 -- texture (66.67 %) :: 0.002 ms / frame (2 calls) | |
.........Unaccounted: (33.333 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 201.528 ms) --- | |
...0 -- cl2gl_upload (81.78 %) :: 164.812 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.001 ms / frame (1 calls) | |
...2 -- stepSimulation (18.22 %) :: 36.714 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.000 %) :: 0.001 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 36.714 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (73.60 %) :: 27.021 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (8.21 %) :: 3.013 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.77 %) :: 0.282 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (17.05 %) :: 6.261 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.006 ms / frame (1 calls) | |
......Unaccounted: (0.357 %) :: 0.131 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 27.021 ms) --- | |
.........0 -- GPU solveContactConstraint (76.77 %) :: 20.744 ms / frame (1 calls) | |
.........1 -- batching (22.98 %) :: 6.210 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.25 %) :: 0.067 ms / frame (1 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 20.744 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.54 %) :: 9.447 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.45 %) :: 11.296 ms / frame (1 calls) | |
............Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.210 ms) --- | |
............0 -- m_contactToConstraintKernel (26.52 %) :: 1.647 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.06 %) :: 0.004 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.93 %) :: 4.529 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.10 %) :: 0.006 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.24 %) :: 0.015 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.113 %) :: 0.007 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.004 ms) --- | |
...............0 -- batch generation (100.00 %) :: 0.004 ms / frame (1 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.004 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.004 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.067 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.49 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.507 %) :: 0.066 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 3.013 ms) --- | |
.........0 -- clipHullHullKernel (55.82 %) :: 1.682 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (8.00 %) :: 0.241 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (22.30 %) :: 0.672 ms / frame (1 calls) | |
.........Unaccounted: (13.873 %) :: 0.418 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 6.261 ms) --- | |
.........0 -- sapKernel (66.81 %) :: 4.183 ms / frame (1 calls) | |
.........1 -- sap2Kernel (2.70 %) :: 0.169 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.27 %) :: 0.142 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.11 %) :: 0.508 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.84 %) :: 0.178 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.43 %) :: 0.152 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (13.30 %) :: 0.833 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (1.501 %) :: 0.094 ms | |
================================== | |
Frame 67: | |
---------------------------------- | |
Profiling: Root (total running time: 203.704 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.000 ms / frame (1 calls) | |
2 -- window->endRendering (0.08 %) :: 0.163 ms / frame (1 calls) | |
3 -- gui->draw (0.65 %) :: 1.320 ms / frame (1 calls) | |
4 -- renderScene (0.22 %) :: 0.444 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (99.03 %) :: 201.733 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (0.022 %) :: 0.044 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.444 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (91.44 %) :: 0.406 ms / frame (2 calls) | |
...Unaccounted: (8.559 %) :: 0.038 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.406 ms) --- | |
......0 -- glUseProgram(0); (1.23 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (3.45 %) :: 0.014 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (35.22 %) :: 0.143 ms / frame (6 calls) | |
......3 -- glFlush2 (6.90 %) :: 0.028 ms / frame (2 calls) | |
......4 -- updateCamera (0.99 %) :: 0.004 ms / frame (2 calls) | |
......5 -- init (1.48 %) :: 0.006 ms / frame (2 calls) | |
......Unaccounted: (50.739 %) :: 0.206 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.006 ms) --- | |
.........0 -- texture (50.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (50.000 %) :: 0.003 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 201.733 ms) --- | |
...0 -- cl2gl_upload (81.64 %) :: 164.703 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.003 ms / frame (1 calls) | |
...2 -- stepSimulation (18.35 %) :: 37.024 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.001 %) :: 0.003 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 37.024 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (75.35 %) :: 27.899 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (7.39 %) :: 2.736 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (0.79 %) :: 0.293 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (16.07 %) :: 5.951 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.02 %) :: 0.006 ms / frame (1 calls) | |
......Unaccounted: (0.375 %) :: 0.139 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 27.899 ms) --- | |
.........0 -- GPU solveContactConstraint (77.24 %) :: 21.550 ms / frame (1 calls) | |
.........1 -- batching (22.42 %) :: 6.256 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.33 %) :: 0.091 ms / frame (1 calls) | |
.........Unaccounted: (0.007 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 21.550 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.53 %) :: 9.812 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.46 %) :: 11.737 ms / frame (1 calls) | |
............Unaccounted: (0.005 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.256 ms) --- | |
............0 -- m_contactToConstraintKernel (26.60 %) :: 1.664 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.10 %) :: 0.006 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (72.51 %) :: 4.536 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.02 %) :: 0.001 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.10 %) :: 0.006 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.22 %) :: 0.014 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.02 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (0.448 %) :: 0.028 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.006 ms) --- | |
...............0 -- batch generation (83.33 %) :: 0.005 ms / frame (1 calls) | |
...............Unaccounted: (16.667 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.005 ms) --- | |
..................0 -- batchingKernel (100.00 %) :: 0.005 ms / frame (1 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.091 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (1.10 %) :: 0.001 ms / frame (1 calls) | |
............Unaccounted: (98.901 %) :: 0.090 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 2.736 ms) --- | |
.........0 -- clipHullHullKernel (59.47 %) :: 1.627 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (5.67 %) :: 0.155 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (24.78 %) :: 0.678 ms / frame (1 calls) | |
.........Unaccounted: (10.088 %) :: 0.276 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 5.951 ms) --- | |
.........0 -- sapKernel (71.30 %) :: 4.243 ms / frame (1 calls) | |
.........1 -- sap2Kernel (1.78 %) :: 0.106 ms / frame (1 calls) | |
.........2 -- scatterKernel (1.71 %) :: 0.102 ms / frame (1 calls) | |
.........3 -- gpu radix sort (7.58 %) :: 0.451 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (1.76 %) :: 0.105 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.00 %) :: 0.119 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (12.92 %) :: 0.769 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.03 %) :: 0.002 ms / frame (1 calls) | |
.........Unaccounted: (0.907 %) :: 0.054 ms | |
================================== | |
Frame 68: | |
---------------------------------- | |
Profiling: Root (total running time: 203.841 ms) --- | |
0 -- castRaysGPU (0.00 %) :: 0.000 ms / frame (0 calls) | |
1 -- glFinish (0.00 %) :: 0.001 ms / frame (1 calls) | |
2 -- window->endRendering (0.10 %) :: 0.198 ms / frame (1 calls) | |
3 -- gui->draw (0.61 %) :: 1.236 ms / frame (1 calls) | |
4 -- renderScene (0.28 %) :: 0.561 ms / frame (1 calls) | |
5 -- clientMoveAndDisplay (45.99 %) :: 93.749 ms / frame (1 calls) | |
6 -- texture (0.00 %) :: 0.000 ms / frame (0 calls) | |
Unaccounted: (53.030 %) :: 108.096 ms | |
...---------------------------------- | |
...Profiling: castRaysGPU (total running time: 0.000 ms) --- | |
...0 -- raycast copyToHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...1 -- raycast launch1D (0.00 %) :: 0.000 ms / frame (0 calls) | |
...2 -- raycast copyFromHost (0.00 %) :: 0.000 ms / frame (0 calls) | |
...Unaccounted: (0.000 %) :: 0.000 ms | |
...---------------------------------- | |
...Profiling: renderScene (total running time: 0.561 ms) --- | |
...0 -- GLInstancingRenderer::RenderScene (92.69 %) :: 0.520 ms / frame (2 calls) | |
...Unaccounted: (7.308 %) :: 0.041 ms | |
......---------------------------------- | |
......Profiling: GLInstancingRenderer::RenderScene (total running time: 0.520 ms) --- | |
......0 -- glUseProgram(0); (0.96 %) :: 0.005 ms / frame (2 calls) | |
......1 -- glFlush (3.46 %) :: 0.018 ms / frame (2 calls) | |
......2 -- glDrawElementsInstanced (47.69 %) :: 0.248 ms / frame (6 calls) | |
......3 -- glFlush2 (4.81 %) :: 0.025 ms / frame (2 calls) | |
......4 -- updateCamera (0.38 %) :: 0.002 ms / frame (2 calls) | |
......5 -- init (0.77 %) :: 0.004 ms / frame (2 calls) | |
......Unaccounted: (41.923 %) :: 0.218 ms | |
.........---------------------------------- | |
.........Profiling: init (total running time: 0.004 ms) --- | |
.........0 -- texture (75.00 %) :: 0.003 ms / frame (2 calls) | |
.........Unaccounted: (25.000 %) :: 0.001 ms | |
...---------------------------------- | |
...Profiling: clientMoveAndDisplay (total running time: 93.749 ms) --- | |
...0 -- cl2gl_upload (56.40 %) :: 52.878 ms / frame (1 calls) | |
...1 -- cl2gl_convert (0.00 %) :: 0.002 ms / frame (1 calls) | |
...2 -- stepSimulation (43.59 %) :: 40.865 ms / frame (1 calls) | |
...3 -- gl2cl (0.00 %) :: 0.000 ms / frame (1 calls) | |
...Unaccounted: (0.004 %) :: 0.004 ms | |
......---------------------------------- | |
......Profiling: stepSimulation (total running time: 40.865 ms) --- | |
......0 -- solveJoints (0.00 %) :: 0.000 ms / frame (0 calls) | |
......1 -- solveContacts (70.75 %) :: 28.910 ms / frame (1 calls) | |
......2 -- computeConvexConvexContactsGPUSAT (8.95 %) :: 3.659 ms / frame (1 calls) | |
......3 -- primitiveContactsKernel (1.13 %) :: 0.460 ms / frame (1 calls) | |
......4 -- GPU 1-axis SAP calculateOverlappingPairs (18.54 %) :: 7.576 ms / frame (1 calls) | |
......5 -- setupGpuAabbs (0.06 %) :: 0.023 ms / frame (1 calls) | |
......Unaccounted: (0.580 %) :: 0.237 ms | |
.........---------------------------------- | |
.........Profiling: solveJoints (total running time: 0.000 ms) --- | |
.........0 -- solveGroupCacheFriendlyFinish (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........1 -- GpuSolveGroupCacheFriendlyIterations (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........2 -- GPU solveGroupCacheFriendlySetup (0.00 %) :: 0.000 ms / frame (0 calls) | |
.........Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: solveGroupCacheFriendlyFinish (total running time: 0.000 ms) --- | |
............0 -- GPU write back velocities and transforms (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- breakViolatedConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GpuSolveGroupCacheFriendlyIterations (total running time: 0.000 ms) --- | |
............0 -- solveJointConstraintRowsKernels (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- batch joints (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: batch joints (total running time: 0.000 ms) --- | |
...............0 -- sortConstraintByBatch3 (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: sortConstraintByBatch3 (total running time: 0.000 ms) --- | |
..................0 -- cpu batch innerloop (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: GPU solveGroupCacheFriendlySetup (total running time: 0.000 ms) --- | |
............0 -- getInfo2Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............1 -- info1 and init batchConstraint (0.00 %) :: 0.000 ms / frame (0 calls) | |
............2 -- m_initSolverBodiesKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
...............---------------------------------- | |
...............Profiling: info1 and init batchConstraint (total running time: 0.000 ms) --- | |
...............0 -- initBatchConstraintsKernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............1 -- getInfo1Kernel (0.00 %) :: 0.000 ms / frame (0 calls) | |
...............Unaccounted: (0.000 %) :: 0.000 ms | |
..................---------------------------------- | |
..................Profiling: initBatchConstraintsKernel (total running time: 0.000 ms) --- | |
..................0 -- init batch constraints (0.00 %) :: 0.000 ms / frame (0 calls) | |
..................Unaccounted: (0.000 %) :: 0.000 ms | |
.........---------------------------------- | |
.........Profiling: solveContacts (total running time: 28.910 ms) --- | |
.........0 -- GPU solveContactConstraint (76.98 %) :: 22.254 ms / frame (1 calls) | |
.........1 -- batching (22.49 %) :: 6.503 ms / frame (1 calls) | |
.........2 -- GPU Sort contact constraints (determinism) (0.52 %) :: 0.151 ms / frame (1 calls) | |
.........Unaccounted: (0.007 %) :: 0.002 ms | |
............---------------------------------- | |
............Profiling: GPU solveContactConstraint (total running time: 22.254 ms) --- | |
............0 -- m_batchSolveKernel iterations2 (45.40 %) :: 10.104 ms / frame (1 calls) | |
............1 -- m_batchSolveKernel iterations (54.60 %) :: 12.150 ms / frame (1 calls) | |
............Unaccounted: (0.000 %) :: 0.000 ms | |
............---------------------------------- | |
............Profiling: batching (total running time: 6.503 ms) --- | |
............0 -- m_contactToConstraintKernel (26.88 %) :: 1.748 ms / frame (1 calls) | |
............1 -- gpu batchContacts (0.14 %) :: 0.009 ms / frame (1 calls) | |
............2 -- gpu m_copyConstraintKernel (71.49 %) :: 4.649 ms / frame (1 calls) | |
............3 -- gpu m_reorderContactKernel (0.03 %) :: 0.002 ms / frame (1 calls) | |
............4 -- gpuBoundSearch (0.25 %) :: 0.016 ms / frame (1 calls) | |
............5 -- gpuRadixSort (0.52 %) :: 0.034 ms / frame (1 calls) | |
............6 -- GPU set cell idx (0.05 %) :: 0.003 ms / frame (1 calls) | |
............Unaccounted: (0.646 %) :: 0.042 ms | |
...............---------------------------------- | |
...............Profiling: gpu batchContacts (total running time: 0.009 ms) --- | |
...............0 -- batch generation (88.89 %) :: 0.008 ms / frame (1 calls) | |
...............Unaccounted: (11.111 %) :: 0.001 ms | |
..................---------------------------------- | |
..................Profiling: batch generation (total running time: 0.008 ms) --- | |
..................0 -- batchingKernel (87.50 %) :: 0.007 ms / frame (1 calls) | |
..................Unaccounted: (12.500 %) :: 0.001 ms | |
............---------------------------------- | |
............Profiling: GPU Sort contact constraints (determinism) (total running time: 0.151 ms) --- | |
............0 -- gpu reorderContactKernel (determinism) (2.65 %) :: 0.004 ms / frame (1 calls) | |
............Unaccounted: (97.351 %) :: 0.147 ms | |
.........---------------------------------- | |
.........Profiling: computeConvexConvexContactsGPUSAT (total running time: 3.659 ms) --- | |
.........0 -- clipHullHullKernel (50.18 %) :: 1.836 ms / frame (1 calls) | |
.........1 -- findCompoundPairsKernel (10.30 %) :: 0.377 ms / frame (1 calls) | |
.........2 -- findSeparatingAxisKernel (21.13 %) :: 0.773 ms / frame (1 calls) | |
.........Unaccounted: (18.393 %) :: 0.673 ms | |
.........---------------------------------- | |
.........Profiling: GPU 1-axis SAP calculateOverlappingPairs (total running time: 7.576 ms) --- | |
.........0 -- sapKernel (57.88 %) :: 4.385 ms / frame (1 calls) | |
.........1 -- sap2Kernel (4.01 %) :: 0.304 ms / frame (1 calls) | |
.........2 -- scatterKernel (2.72 %) :: 0.206 ms / frame (1 calls) | |
.........3 -- gpu radix sort (8.59 %) :: 0.651 ms / frame (1 calls) | |
.........4 -- flipFloatKernel (2.69 %) :: 0.204 ms / frame (1 calls) | |
.........5 -- copyAabbsKernelLarge (2.81 %) :: 0.213 ms / frame (1 calls) | |
.........6 -- GPU compute best variance axis (18.97 %) :: 1.437 ms / frame (1 calls) | |
.........7 -- copyAabbsKernelSmall (0.04 %) :: 0.003 ms / frame (1 calls) | |
.........Unaccounted: (2.284 %) :: 0.173 ms |