@utilForever
Last active April 21, 2018 16:13
CubbyFlow Time Performance Test Result - OpenMP
Run on (12 X 3298 MHz CPU s)
CPU Caches:
L1 Data 32K (x6)
L1 Instruction 32K (x6)
L2 Unified 262K (x6)
L3 Unified 15728K (x1)
--------------------------------------------------------------------------------------------------
Benchmark Time CPU Iterations
--------------------------------------------------------------------------------------------------
BVH3/Nearest 138131 ns 138108 ns 7467
BVH3/RayIntersects 2331 ns 2302 ns 298667
FDMBLAS2/MVM/64 34070 ns 32785 ns 22400
FDMBLAS2/MVM/256 417105 ns 409807 ns 1792
FDMBLAS2/MVM/1024 6851087 ns 6875000 ns 75
FDMBLAS3/MVM/16 89461 ns 85794 ns 7467
FDMBLAS3/MVM/64 4620708 ns 4667208 ns 154
FDMBLAS3/MVM/256 275303505 ns 265625000 ns 2
FDMCompressedBLAS3/MVM/16 10157 ns 10254 ns 74667
FDMCompressedBLAS3/MVM/64 1046321 ns 1000977 ns 640
FDMCompressedBLAS3/MVM/256 72491071 ns 65972222 ns 9
GridFractionalSinglePhasePressureSolver3/Solve/128/128/0 48941607786 ns 48250000000 ns 1
GridFractionalSinglePhasePressureSolver3/Solve/128/128/1 19789400671 ns 19734375000 ns 1
GridFractionalSinglePhasePressureSolver3/Solve/128/64/0 48896036965 ns 48437500000 ns 1
GridFractionalSinglePhasePressureSolver3/Solve/128/64/1 9907709260 ns 9828125000 ns 1
GridFractionalSinglePhasePressureSolver3/Solve/128/32/0 48755542509 ns 48015625000 ns 1
GridFractionalSinglePhasePressureSolver3/Solve/128/32/1 5839234424 ns 5828125000 ns 1
ListQueryEngine3/Nearest 57521476 ns 57091346 ns 26
ListQueryEngine3/RayIntersects 36092563 ns 36305147 ns 34
MatrixMxN/MVM/256 61937 ns 58594 ns 11200
MatrixMxN/MVM/1024 960833 ns 941685 ns 896
MatrixMxN/MVM/4096 16130277 ns 13719512 ns 41
Octree/Nearest 783362 ns 781250 ns 1120
Octree/RayIntersects 19063 ns 19252 ns 37333
Parallel/ParallelFor/256/1/real_time 1375 ns 1387 ns 540619
Parallel/ParallelFor/256/2/real_time 1337 ns 1339 ns 501912
Parallel/ParallelFor/256/4/real_time 1050 ns 1051 ns 654032
Parallel/ParallelFor/256/8/real_time 1029 ns 1021 ns 688899
Parallel/ParallelFor/65536/1/real_time 308315 ns 311255 ns 2259
Parallel/ParallelFor/65536/2/real_time 160188 ns 158163 ns 4248
Parallel/ParallelFor/65536/4/real_time 84177 ns 83185 ns 7889
Parallel/ParallelFor/65536/8/real_time 79440 ns 79928 ns 8797
Parallel/ParallelFor/16777216/1/real_time 83679427 ns 85069444 ns 9
Parallel/ParallelFor/16777216/2/real_time 42511090 ns 42968750 ns 16
Parallel/ParallelFor/16777216/4/real_time 22266351 ns 22460938 ns 32
Parallel/ParallelFor/16777216/8/real_time 20365753 ns 19761029 ns 34
Parallel/ParallelRangeFor/256/1/real_time 1517 ns 1511 ns 434354
Parallel/ParallelRangeFor/256/2/real_time 2095 ns 2079 ns 338146
Parallel/ParallelRangeFor/256/4/real_time 2911 ns 2929 ns 234719
Parallel/ParallelRangeFor/256/8/real_time 4907 ns 4945 ns 148515
Parallel/ParallelRangeFor/65536/1/real_time 239395 ns 235215 ns 2790
Parallel/ParallelRangeFor/65536/2/real_time 238381 ns 239158 ns 2744
Parallel/ParallelRangeFor/65536/4/real_time 235917 ns 236576 ns 2840
Parallel/ParallelRangeFor/65536/8/real_time 236968 ns 237496 ns 2829
Parallel/ParallelRangeFor/16777216/1/real_time 61331019 ns 61079545 ns 11
Parallel/ParallelRangeFor/16777216/2/real_time 66798539 ns 66761364 ns 11
Parallel/ParallelRangeFor/16777216/4/real_time 65240639 ns 64236111 ns 9
Parallel/ParallelRangeFor/16777216/8/real_time 62863450 ns 62500000 ns 11
PointHashGridSearcher3/Build/32 3418753 ns 3445513 ns 195
PointHashGridSearcher3/Build/1024 3853239 ns 3843583 ns 187
PointHashGridSearcher3/Build/1048576 440956549 ns 437500000 ns 2
PointHashGridSearcher3/ForEachNearbyPoints/32 440 ns 443 ns 1659259
PointHashGridSearcher3/ForEachNearbyPoints/1024 442 ns 439 ns 1600000
PointHashGridSearcher3/ForEachNearbyPoints/1048576 2196 ns 2197 ns 298667
PointKdTreeSearcher3/Build/32 6471 ns 6417 ns 112000
PointKdTreeSearcher3/Build/1024 313995 ns 313895 ns 2240
PointKdTreeSearcher3/Build/1048576 980278125 ns 984375000 ns 1
PointKdTreeSearcher3/ForEachNearbyPoints/32 190 ns 190 ns 3446154
PointKdTreeSearcher3/ForEachNearbyPoints/1024 329 ns 328 ns 2240000
PointKdTreeSearcher3/ForEachNearbyPoints/1048576 3749 ns 3770 ns 194783
PointParallelHashGridSearcher3/Build/32 2597056 ns 2572791 ns 249
PointParallelHashGridSearcher3/Build/1024 2814674 ns 2780720 ns 236
PointParallelHashGridSearcher3/Build/1048576 211879242 ns 203125000 ns 3
PointParallelHashGridSearcher3/ForEachNearbyPoints/32 445 ns 446 ns 1120000
PointParallelHashGridSearcher3/ForEachNearbyPoints/1024 438 ns 446 ns 1120000
PointParallelHashGridSearcher3/ForEachNearbyPoints/1048576 1466 ns 1458 ns 407273
TriangleMesh3/ClosestPoint 1779177 ns 1741640 ns 314
VolumeParticleEmitter3/Update 82064677631 ns 81984375000 ns 1
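
The rows above can be read programmatically to compare thread counts. The following is a minimal sketch, not part of the original benchmark run: it parses a few rows copied verbatim from the table and computes the speedup relative to the single-thread row. It assumes the second-to-last component of each `Parallel/...` name is the thread count, which the output itself does not state; `parse_rows` and `speedups` are hypothetical helper names introduced for illustration.

```python
import re

# Rows copied verbatim from the table above (Parallel/ParallelFor, 16777216 elements).
SAMPLE = """\
Parallel/ParallelFor/16777216/1/real_time 83679427 ns 85069444 ns 9
Parallel/ParallelFor/16777216/2/real_time 42511090 ns 42968750 ns 16
Parallel/ParallelFor/16777216/4/real_time 22266351 ns 22460938 ns 32
Parallel/ParallelFor/16777216/8/real_time 20365753 ns 19761029 ns 34
"""

# name, wall time (ns), CPU time (ns), iteration count
ROW = re.compile(r"^(\S+)\s+(\d+) ns\s+(\d+) ns\s+(\d+)$")


def parse_rows(text):
    """Return {benchmark name: wall time in ns} for each well-formed row."""
    times = {}
    for line in text.splitlines():
        match = ROW.match(line.strip())
        if match:
            times[match.group(1)] = int(match.group(2))
    return times


def speedups(times, prefix):
    """Speedup of each row under `prefix` relative to its 1-thread row.

    Assumption: the name has the form <prefix>/<threads>/real_time.
    """
    base = times[f"{prefix}/1/real_time"]
    result = {}
    for name, wall_ns in times.items():
        if name.startswith(prefix + "/"):
            threads = int(name.split("/")[-2])
            result[threads] = base / wall_ns
    return result


if __name__ == "__main__":
    table = speedups(parse_rows(SAMPLE), "Parallel/ParallelFor/16777216")
    for threads, factor in sorted(table.items()):
        print(f"{threads} threads: {factor:.2f}x")
```

Under that assumption, the same helpers can be pointed at the `Parallel/ParallelRangeFor` rows by changing the prefix and pasting in the corresponding lines.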