Experimental device requests #204

Draft
wants to merge 2 commits into
base: main
from

rocketeerbkw
Member

@rocketeerbkw rocketeerbkw commented May 1, 2023

Experimental support for GPU requests. This will eventually go away in favor of Lagoon resource workload management in the API. Leaving it for now, as it is used in PoCs.

@rocketeerbkw rocketeerbkw force-pushed the experimental-device-requests branch from 18eb5ee to aae5606 Compare May 1, 2023 12:54
@@ -1445,6 +1445,9 @@ do
fi
fi

# handle gpu configuration
Member

We could wrap this experimental feature under a flag, like this: https://github.com/uselagoon/build-deploy-tool/blob/main/legacy/build-deploy-docker-compose.sh#L845

ADMIN_LAGOON_FEATURE_FLAG_X variables can only be set on the remote cluster, so that gives us some constraint on where this can be enabled.

if [ "$ADMIN_LAGOON_FEATURE_FLAG_EXPERIMENTAL_GPU_SUPPORT" = enabled ]; then
	. /kubectl-build-deploy/scripts/exec-gpu-generation.sh
fi
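
If more experimental features end up behind admin flags, the check could be factored into a small helper. This is only a sketch: `admin_flag_enabled` is a hypothetical function name, it assumes flags follow the `ADMIN_LAGOON_FEATURE_FLAG_<NAME>` naming and use the literal value `enabled`, and the `echo` lines stand in for sourcing the real generation script.

```shell
#!/bin/sh
# Hypothetical helper (not part of the build-deploy-tool):
# succeeds (exit status 0) only when ADMIN_LAGOON_FEATURE_FLAG_<NAME> is "enabled".
admin_flag_enabled() {
	flag_name="ADMIN_LAGOON_FEATURE_FLAG_$1"
	# Indirect variable lookup; an unset flag is treated as empty (disabled).
	# The argument is a trusted, hard-coded name, so eval is acceptable here.
	eval "flag_value=\${$flag_name:-}"
	[ "$flag_value" = "enabled" ]
}

# Placeholder for: . /kubectl-build-deploy/scripts/exec-gpu-generation.sh
if admin_flag_enabled EXPERIMENTAL_GPU_SUPPORT; then
	echo "gpu support: enabled"
else
	echo "gpu support: skipped"
fi
```

Treating an unset flag as disabled keeps the experimental path opt-in per cluster.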

Member Author

Do we need a flag at all? Any cluster that doesn't have the k8s plugins installed and GPU nodes available will just fail to schedule pods and cause a deployment failure.

Member

Right, I guess that is OK. We can re-evaluate the feature wrapping later. In IO cloud, if we ever want to restrict users somehow, we will need some sort of flagging system, but that's a problem for another day.
