Skip to content

Instantly share code, notes, and snippets.

View GitHubEmploy's full-sized avatar

Mohit Varikuti GitHubEmploy

View GitHub Profile
.version 6.4
.target sm_75
.address_size 64
.reg .u32 %r1, %r2;
.reg .f32 %f1;
.version 6.4
.target sm_75
.address_size 64
// Kernel function: vector addition
.visible .entry vectorAdd(
.param .u64 param0, // pointer to vector A
.param .u64 param1, // pointer to vector B
.param .u64 param2, // pointer to output vector C
.param .u32 param3 // number of elements