With no cuda jit cache
========================= 1 run
cubin: real 0m3.553s
fatbin: real 0m4.106s
=========================
========================= 10 runs
cubin real 0m38.732s
fatbin real 0m44.738s
With no cuda jit cache
========================= 1 run
cubin: real 0m3.553s
fatbin: real 0m4.106s
=========================
========================= 10 runs
cubin real 0m38.732s
fatbin real 0m44.738s