Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Correct issues in our debug build. #4872

Draft
wants to merge 1 commit into
base: branch-25.02
Choose a base branch
from

Conversation

ChuckHastings
Copy link
Collaborator

This PR fixes our debug builds to compile.

We still need to address the issue defined in #4806. I have isolated things to bad kernel launches in per_v_transform_reduce_e.cuh. My guess is that when debug is enabled we use more resources in the kernels and can't launch as many threads concurrently on the GPU as we can in non-debug mode. Kicking this over to @seunghwak to investigate.

Closes #4806

Copy link

copy-pr-bot bot commented Jan 17, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[QST] Error on invalid device ordinal and cuda_memory_resource
2 participants