CUAEV benchmark: https://github.com/roitberg-group/torchani_sandbox/blob/ed90fa65a7f07e59a95e75c962371a37ffb69ba0/tools/aev-benchmark-size.py
intrinsics on: python setup.py develop --ext --cuaev-opt
use_fast_math need nvcc args: -use_fast_math
File: small.pdb, Molecule size: 264 / 264, Species: [1, 6, 7, 8]