-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GPU tests fail on GTX970 and P100 #57
Comments
It's not the clearest error but the key line is The package makes poor use of GPU memory at the minute due to the requirements of Zygote, so the 4 GB of a GTX 970 isn't enough to run the 16k atoms in the protein test. Sorry about that, hopefully it will change in future. |
Woops, that's completely my bad. Sorry for the random issue then! |
No it's fine, good to see people are using the software. |
I was just testing this on a P100 with 16 GB of available RAM and ran into a related issue:
It seems to have flooded the available memory pool and cannot allocate more space. This could be a garbage collection issue where the tests run fine independently, but fail when put together because the memory has not been properly deallocated? Re-opening this issue because it's another error on protein.jl. How much memory are we asking the users to have for this test? Maybe we should just check the available memory and cancel the test for certain GPUs. |
GPU memory usage on master is very poor and I honestly don't know the range of hardware it will work on. I'm hoping to switch to the kernel setup within the next couple of months though, which should be much better. At that point it is probably worth doing a survey across different hardware and addressing any issues. |
Yeah, that's fair. I'll just quietly comment out the test for now for #99 |
GPU memory usage should be much improved in v0.15.0. Testing and benchmarking on different GPUs is on the todo list. |
I could not get the tests to work on my GTX970 GPU. Seems like there is an issue with
Specifically when called in
test/protein.jl
I think
permuteddims
doesn't work on a CuArray, so I tried keeping it as an array, but eventually ran into an issue with turningis
into an array for theDistanceVecNeighborFinder
. I tried a bunch of different variations, so I'll just leave the unchanged error here:This could be related to #16 , but I felt it was different enough to warrant a separate issue.
The text was updated successfully, but these errors were encountered: