Command line:
blaze-bin/third_party/iree/experimental/runners/mlir-proto-opt -linalg-comprehensive-bufferize-inplace /tmp/a.mlir

Output:
return val does not fold: %0 = tensor.generate %arg0, %arg1, %arg2, %arg3  {
^bb0(%arg4: index, %arg5: index, %arg6: index, %arg7: index):  // no predecessors
  %1 = index_cast %arg4 : index to i32