Such as 56_hopper_ptr_array_batched_gemm. Also, is the concept of a "pointer array" demonstrated elsewhere in CUTLASS, or in other GPU programming contexts? Thanks!
Answered by
hwu36
Feb 20, 2024
It is the same as in cuBLAS. There are two types of batched GEMM. One is called pointer array: pointers to the input and output matrices are stored in arrays, such as ptr_A[batch_id] and ptr_B[batch_id]. The other is called strided batched GEMM, where the pointer for each batch is computed from a base pointer and a fixed stride: ptr_A[batch_id] = ptr_A[0] + stride * batch_id.
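To make the difference concrete, here is a minimal host-side C++ sketch of the two addressing schemes. It is not CUTLASS or cuBLAS code: the function names (`gemm_reference`, `gemm_batched_ptr_array`, `gemm_batched_strided`, `make_ptr_array`) are hypothetical, the matrices are assumed to be row-major `float`, and every problem in the batch is assumed to share the same m, n, k. In cuBLAS the corresponding entry points are, as far as I know, `cublasSgemmBatched` (pointer array) and `cublasSgemmStridedBatched` (strided).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Reference row-major GEMM for one problem: C (m x n) = A (m x k) * B (k x n).
void gemm_reference(const float* A, const float* B, float* C,
                    int m, int n, int k) {
    for (int i = 0; i < m; ++i) {
        for (int j = 0; j < n; ++j) {
            float acc = 0.0f;
            for (int p = 0; p < k; ++p) {
                acc += A[i * k + p] * B[p * n + j];
            }
            C[i * n + j] = acc;
        }
    }
}

// Pointer-array batched GEMM: each batch entry has its own pointers,
// so the matrices may live anywhere in memory.
void gemm_batched_ptr_array(const float* const* ptr_A,
                            const float* const* ptr_B,
                            float* const* ptr_C,
                            int m, int n, int k, int batch_count) {
    assert(batch_count >= 0);
    for (int b = 0; b < batch_count; ++b) {
        gemm_reference(ptr_A[b], ptr_B[b], ptr_C[b], m, n, k);
    }
}

// Strided batched GEMM: one base pointer per operand plus a fixed stride,
// i.e. the address for batch b is base + stride * b.
void gemm_batched_strided(const float* A, std::ptrdiff_t stride_A,
                          const float* B, std::ptrdiff_t stride_B,
                          float* C, std::ptrdiff_t stride_C,
                          int m, int n, int k, int batch_count) {
    assert(batch_count >= 0);
    for (int b = 0; b < batch_count; ++b) {
        gemm_reference(A + stride_A * b, B + stride_B * b, C + stride_C * b,
                       m, n, k);
    }
}

// Hypothetical helper: expands a strided layout into an explicit pointer
// array, showing that strided batching is a special case of pointer array.
template <typename T>
std::vector<T*> make_ptr_array(T* base, std::ptrdiff_t stride, int batch_count) {
    std::vector<T*> ptrs(static_cast<std::size_t>(batch_count));
    for (int b = 0; b < batch_count; ++b) {
        ptrs[static_cast<std::size_t>(b)] = base + stride * b;
    }
    return ptrs;
}
```

The strided form needs only a base pointer and a stride per operand, while the pointer-array form needs one pointer per operand per batch entry but places no constraint on where each matrix lives. On a GPU the pointer arrays themselves would also have to be accessible from device code, which this host-only sketch does not show.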
Answer selected by
hyhieu