Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Upgrade GPU operator application to a version with NFD >=v0.17.0 #14013

Open
akash-gautam opened this issue Jan 16, 2025 · 0 comments
Open

Upgrade GPU operator application to a version with NFD >=v0.17.0 #14013

akash-gautam opened this issue Jan 16, 2025 · 0 comments
Labels
kind/feature Categorizes issue or PR as related to a new feature.

Comments

@akash-gautam
Copy link
Contributor

akash-gautam commented Jan 16, 2025

Description of the feature you would like to add / User story

The node feature discovery(NFD) is a dependency for the GPU Operator helm chart, it has a known issue which can potentially overwhelm the kube api-server, the issue has been fixed in NFD version v0.17.0. The GPU operator chart should be upgraded to a version with NFD >=v0.17.0 once the GPU operator chart with the updated dependency is available.

Solution details

  • Upgrade GPU operator chart to a version that has NFD>=v0.17.0 once it's available.

Alternative approaches

To mitigate the issue till NFD >=v0.17.0 is available in the GPU operator chart, NodeFeatureAPI can be disable

node-feature-discovery:
  enableNodeFeatureApi: false
@akash-gautam akash-gautam added the kind/feature Categorizes issue or PR as related to a new feature. label Jan 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/feature Categorizes issue or PR as related to a new feature.
Projects
None yet
Development

No branches or pull requests

1 participant