Skip to content

How to correctly use conv2d split-k parallel #401

Answered by manishucsd
masahi asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @masahi , I syned your branch and tried it. It was a small issue. I noticed the Split-k-mode is not set for Conv::Arguments

  typename ImplicitGemm::Arguments arguments{
    problem_size,
      tensor_a.device_ref(),
      tensor_b.device_ref(),
      tensor_c.device_ref(),
      tensor_d.device_ref(),
      {options.alpha, options.beta},
      split_k_mode                                // set split_k_mode to parallel. The ctor takes serial as a default argument.
      };

The full diff here:

(example-wgrad-splitk) $ git diff
diff --git a/examples/26_ampere_wgrad_mainloop_fusion/ampere_wgrad_mainloop_fusion.cu b/examples/26_ampere_wgrad_mainloop_fusion/ampere_wgrad_mainloop_fusion.cu
i…

Replies: 7 comments 8 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@masahi
Comment options

Answer selected by masahi
Comment options

You must be logged in to vote
3 replies
@masahi
Comment options

@masahi
Comment options

@masahi
Comment options

Comment options

You must be logged in to vote
1 reply
@masahi
Comment options

Comment options

You must be logged in to vote
2 replies
@masahi
Comment options

@manishucsd
Comment options

Comment options

You must be logged in to vote
1 reply
@masahi
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants