Skip to content

Commit

Permalink
add reqd_sub_group_size (#1248)
Browse files Browse the repository at this point in the history
  • Loading branch information
sunjiweiswift authored Jan 8, 2025
1 parent ba04a4a commit 28e7c8c
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion src/ATen/native/xpu/sycl/LinearInt4.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -39,7 +39,8 @@ struct LinearInt4KernelFunctor : public __SYCL_KER_CONFIG_CONVENTION__ {
ldc(ldc) {}
void sycl_ker_config_convention(sycl::handler& cgh) {}

void operator()(sycl::nd_item<1> it) const {
[[intel::reqd_sub_group_size(16)]] void operator()(
sycl::nd_item<1> it) const {
int constexpr Unroll = 2;
int constexpr SgSize = 16;
int constexpr blocksize = block_size;
Expand Down

0 comments on commit 28e7c8c

Please sign in to comment.