Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

python train_auto.py --model fno --data_name cavity_prop_bc_geo 内存溢出 #10

Open
PlusOrMinus opened this issue Jan 8, 2025 · 5 comments

Comments

@PlusOrMinus
Copy link

具体报错如下
Traceback (most recent call last):
File "/public/yzk/D/DeepLearning/CFDBench/src/train_auto.py", line 381, in
main()
File "/public/yzk/D/DeepLearning/CFDBench/src/train_auto.py", line 351, in main
train(
File "/public/yzk/D/DeepLearning/CFDBench/src/train_auto.py", line 233, in train
outputs: dict = model(**batch)
File "/public/yzk/D/Anaconda3/envs/CFDBench/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1736, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/public/yzk/D/Anaconda3/envs/CFDBench/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1747, in _call_impl
return forward_call(*args, **kwargs)
File "/public/yzk/D/DeepLearning/CFDBench/src/models/fno/fno2d.py", line 208, in forward
props = props.repeat(1, 1, height, width) # (B, p, H, W)
RuntimeError: CUDA error: out of memory
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.

@chen-yingfa
Copy link
Collaborator

请问你是用什么硬件跑的?

@PlusOrMinus
Copy link
Author

PlusOrMinus commented Jan 9, 2025 via email

@PlusOrMinus
Copy link
Author

PlusOrMinus commented Jan 9, 2025 via email

@chen-yingfa
Copy link
Collaborator

可能显存太小,建议尝试降低batch size或者使用别的常见的降低显存占用的手段

@PlusOrMinus
Copy link
Author

PlusOrMinus commented Jan 10, 2025 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants