feat: [INFRA-2749] change labels to match for gpu to use gpu-based instances for build #119
INFRA-2749 Add GitHub Self-hosted GPU runners
This is required to run CI tests on https://github.com/worldcoin/gpu-iris-mpc. [Screenshot 2024-07-12 at 11.40.41.png] The instance needs to have at least 3 GPUs, e.g. g4dn.12xlarge.
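A quick way to confirm that a runner actually exposes at least 3 GPUs is to count the devices listed by `nvidia-smi -L`. The sketch below is only an illustration, not part of this PR: the function names and the `MIN_GPUS` constant are hypothetical, and it assumes `nvidia-smi` is installed on the runner image.

```python
import re
import subprocess

# Hypothetical constant: the minimum GPU count stated in the PR description.
MIN_GPUS = 3


def count_gpus(listing: str) -> int:
    """Count lines of the form 'GPU <n>: ...' as printed by `nvidia-smi -L`."""
    return sum(
        1 for line in listing.splitlines() if re.match(r"GPU \d+:", line.strip())
    )


def has_enough_gpus(listing: str, minimum: int = MIN_GPUS) -> bool:
    """Return True when the listing shows at least `minimum` GPUs."""
    return count_gpus(listing) >= minimum


def query_gpu_listing() -> str:
    """Run `nvidia-smi -L`; return an empty string if it is missing or fails."""
    try:
        result = subprocess.run(
            ["nvidia-smi", "-L"], capture_output=True, text=True, check=True
        )
    except (FileNotFoundError, subprocess.CalledProcessError):
        return ""
    return result.stdout
```

In a CI step this could be used as `has_enough_gpus(query_gpu_listing())`, failing the job early when the runner was scheduled on an instance with too few GPUs.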
…stances for tests
Seems like the self-hosted runner images do not have the same tooling as the gh-provided ones; I ran into that myself once or twice.
I added a job to run the e2e tests on the GPU runner. The image is missing the CUDA and NCCL libs atm; should they be baked into the image or installed at runtime? Also, the build is pretty slow on these runners and could probably use some form of caching like https://github.com/Swatinem/rust-cache
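Whichever way the libs end up being provided, a small pre-flight check could make the "missing CUDA/NCCL" failure explicit before the slow build starts. This is a rough sketch under assumptions: the helper name `missing_libs` is made up for illustration, the library names `cudart` and `nccl` are assumed to be the ones the tests need, and `ctypes.util.find_library` only searches the standard linker locations, so libs installed under a custom prefix may be reported as missing.

```python
import ctypes.util
from typing import Callable, Iterable, List, Optional

# Assumed library names (libcudart / libnccl); adjust to what the tests link against.
REQUIRED_LIBS = ("cudart", "nccl")


def missing_libs(
    names: Iterable[str] = REQUIRED_LIBS,
    finder: Callable[[str], Optional[str]] = ctypes.util.find_library,
) -> List[str]:
    """Return the names from `names` that `finder` cannot locate."""
    return [name for name in names if finder(name) is None]
```

Calling `missing_libs()` on the runner returns an empty list when both libraries are discoverable; the `finder` parameter exists only so the check can be exercised without a GPU machine.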
… feat/INFRA-2749
After a long battle with this PR, it seems I finally found the issue. Meanwhile on the Nvidia H100, this works without any issues, which is why we didn't find it for so long. This PR now makes sure that … @wojciechsromek what's the deal with this dockerfile? That's the only failing check now.
This reverts commit 898eb13.
https://linear.app/worldcoin/issue/INFRA-2749/add-github-self-hosted-gpu-runners
change labels to match for gpu to use gpu-based instances for build