-
Notifications
You must be signed in to change notification settings - Fork 49
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Add TODO items. * rough, but something like this? * wip - cleaning up some warnings, but more remain * wip - reorganize * wip - missing kernels * wip - replace t1d * fix some kernels * another typo * more * another one * closer * define T_1D * typosgit add .! * WIP: changes to cuda-shared framework for new kernels * fix output writing * buffer fix * buffer sizes * WIP: fixes for 2 and 3D basis kernels * minor * fix weight kernel for 3d * remove debugging output * minor reorg * fix includes * enable collo grad for cuda-shared * move quoted kernels * renaming * missed a rename * small fix * more naming consistency * faster 'useCollograd=false' path in *-gen * more style * one last style fix * clearer collograd condition * Add gen basis kernels to hip-shared * Try some changes to hip-shared basis block sizes for new kernels * cuda - drop extra kernel arg * cuda - fix collograd check logic * update gen comment about parallelization * tidy up fields struct definition * tidy up structs even more * Update hip-gen basis templates use and move other hip-gen device functions to jit-source * Finish hip-gen basis template update; small style updates to match CUDA * missing isStrided * Update block size used in 3D weight for new shared kernels * update release notes Co-authored-by: Jeremy L Thompson <[email protected]> Co-authored-by: nbeams <[email protected]>
- Loading branch information
1 parent
3c60848
commit 9e201c8
Showing
34 changed files
with
3,406 additions
and
3,762 deletions.
There are no files selected for viewing
1,323 changes: 328 additions & 995 deletions
1,323
backends/cuda-gen/ceed-cuda-gen-operator-build.cpp
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.