@marty1885
Created July 11, 2017 15:14
diff --git a/src/kernels/MIOpenConvDirUniC.cl b/src/kernels/MIOpenConvDirUniC.cl
index 6a92a63..9a72f2d 100644
--- a/src/kernels/MIOpenConvDirUniC.cl
+++ b/src/kernels/MIOpenConvDirUniC.cl
@@ -325,9 +325,13 @@ static inline void Conv(uint o_map_base,
uint wei_stg_off = wei_stg_base_off +
o_c * MLO_N_IN_TILES_PERSTACK * MLO_FILTER_SZ +
k_act * MLO_FILTER_SIZE0;
+ if(wei_stg_off + MLO_FILTER_SIZE0 > MLO_WEIGHTS_SZ)
+ printf("%u\n", wei_stg_off);
for(uint i = 0; i < MLO_FILTER_SIZE0; ++i)
{
- pvt_wei_stage[i] = lcl_wei[wei_stg_off + i];
+ // Offset commented out so it doesn't crash on NV/Intel OpenCL
+ // This might not be needed on an AMD system
+ pvt_wei_stage[i] = lcl_wei[/*wei_stg_off + */i];
}
// actual conv
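
The bounds check added in the hunk above can be reasoned about on the host. The following is a minimal sketch in plain C, not the kernel itself: `FILTER_SIZE0` and `WEIGHTS_SZ` are hypothetical stand-ins for `MLO_FILTER_SIZE0` and `MLO_WEIGHTS_SZ` (whose real values are compile-time macros set when the kernel is built), and `row_in_bounds` and `stage_row` are illustrative helpers that do not exist in MIOpen. It shows one way to reject an out-of-range staging offset instead of dropping the offset entirely.

```c
#include <stddef.h>

/* Hypothetical sizes for illustration only; the real MLO_* values are
   macros supplied at kernel build time. */
#define FILTER_SIZE0 3u
#define WEIGHTS_SZ   16u

/* Returns 1 if the row [off, off + FILTER_SIZE0) lies inside the
   weights buffer, 0 otherwise. Written to avoid unsigned overflow
   in off + FILTER_SIZE0. */
static int row_in_bounds(unsigned off)
{
    return off <= WEIGHTS_SZ - FILTER_SIZE0;
}

/* Copies one filter row from lcl_wei into dst.
   Returns 0 on success. On an out-of-range offset it zero-fills dst
   and returns -1 rather than reading past the buffer. */
static int stage_row(float *dst, const float *lcl_wei, unsigned off)
{
    if (!row_in_bounds(off)) {
        for (unsigned i = 0; i < FILTER_SIZE0; ++i)
            dst[i] = 0.0f;
        return -1;
    }
    for (unsigned i = 0; i < FILTER_SIZE0; ++i)
        dst[i] = lcl_wei[off + i];
    return 0;
}
```

Unlike the workaround in the diff, which always reads from index `i` and therefore stages the same row regardless of `o_c` and `k_act`, this sketch keeps the offset and only guards the out-of-range case. Whether zero-filling is the right fallback for the kernel is an assumption; the offsets printed by the debug `printf` would be needed to decide.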