Fix issue in block sizing for GPU shared basis #1727

Merged: 1 commit, Jan 14, 2025

Conversation

zatkins-dev
Collaborator

This change fixes a crash when applying a CeedBasis with eval type CEED_EVAL_WEIGHT for order 2 elements.

I'm not sure whether the CUDA change is also necessary.

Member

@jeremylt jeremylt left a comment

Odd that P comes into play here since this doesn't really have anything to do with nodes, but 🤷

@zatkins-dev zatkins-dev force-pushed the zach/hip-nontensor-fix branch from 226addc to 97011ea Compare January 14, 2025 22:19
@jeremylt jeremylt merged commit 79881bb into main Jan 14, 2025
28 checks passed
@jeremylt jeremylt deleted the zach/hip-nontensor-fix branch January 14, 2025 22:37
@jeremylt
Member

Figured out why:

extern "C" __launch_bounds__(BASIS_INTERP_BLOCK_SIZE) __global__
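
For readers unfamiliar with `__launch_bounds__`: a kernel compiled with `__launch_bounds__(N)` fails to launch if it is started with more than N threads per block. One plausible reading of the line above is that the kernel is bounded by `BASIS_INTERP_BLOCK_SIZE`, which would tie the permitted block size to P. The host-side sketch below only illustrates that constraint; the constants and helper names are hypothetical and are not libCEED's code or values.

```cpp
#include <cassert>

// Hypothetical block sizes for illustration only; not libCEED's actual values.
constexpr int kInterpBlockSize = 64;   // bound the kernel was compiled with
constexpr int kRequestedBlock  = 128;  // block size chosen independently at launch

// A launch is valid only if the block has at least one thread and no more
// threads than the bound passed to __launch_bounds__ at compile time.
constexpr bool LaunchFits(int block_size, int launch_bound) {
  return block_size > 0 && block_size <= launch_bound;
}

// One possible host-side guard: cap the requested block size at the bound.
constexpr int ClampBlockSize(int requested, int launch_bound) {
  return requested < launch_bound ? requested : launch_bound;
}

// Debug-time check a caller could run just before launching the kernel.
inline void CheckLaunch(int block_size, int launch_bound) {
  assert(LaunchFits(block_size, launch_bound));
  (void)block_size;    // silence unused-parameter warnings when NDEBUG is set
  (void)launch_bound;
}

// The mismatched size would be rejected; the clamped size is accepted.
static_assert(!LaunchFits(kRequestedBlock, kInterpBlockSize),
              "a 128-thread block exceeds a 64-thread launch bound");
static_assert(LaunchFits(ClampBlockSize(kRequestedBlock, kInterpBlockSize),
                         kInterpBlockSize),
              "the clamped block size respects the launch bound");
```

Sizing the block from the same constant that the kernel was compiled with, rather than clamping afterwards, avoids the mismatch entirely.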
