Skip to content

Commit 029b80c

Browse files
author
Nilesh PS
committed
bugfix: limit nvidia-device-plugin to gpu instance types
1 parent dad2e50 commit 029b80c

File tree

1 file changed

+49
-3
lines changed

1 file changed

+49
-3
lines changed

helm_chart/HyperPodHelmChart/values.yaml

Lines changed: 49 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -138,11 +138,57 @@ nvidia-device-plugin:
138138
requiredDuringSchedulingIgnoredDuringExecution:
139139
nodeSelectorTerms:
140140
- matchExpressions:
141-
# nvidia plugin needs at least one node selector. Below label exists for all hyperpod nodes
142-
- key: kubernetes.io/os
141+
- key: node.kubernetes.io/instance-type
143142
operator: In
144143
values:
145-
- "linux"
144+
- ml.g4dn.12xlarge
145+
- ml.g4dn.16xlarge
146+
- ml.g4dn.2xlarge
147+
- ml.g4dn.4xlarge
148+
- ml.g4dn.8xlarge
149+
- ml.g4dn.metal
150+
- ml.g4dn.xlarge
151+
- ml.g5.12xlarge
152+
- ml.g5.16xlarge
153+
- ml.g5.24xlarge
154+
- ml.g5.2xlarge
155+
- ml.g5.48xlarge
156+
- ml.g5.4xlarge
157+
- ml.g5.8xlarge
158+
- ml.g5.xlarge
159+
- ml.g5g.16xlarge
160+
- ml.g5g.2xlarge
161+
- ml.g5g.4xlarge
162+
- ml.g5g.8xlarge
163+
- ml.g5g.metal
164+
- ml.g5g.xlarge
165+
- ml.g6.12xlarge
166+
- ml.g6.16xlarge
167+
- ml.g6.24xlarge
168+
- ml.g6.2xlarge
169+
- ml.g6.48xlarge
170+
- ml.g6.4xlarge
171+
- ml.g6.8xlarge
172+
- ml.g6.xlarge
173+
- ml.g6e.12xlarge
174+
- ml.g6e.16xlarge
175+
- ml.g6e.24xlarge
176+
- ml.g6e.2xlarge
177+
- ml.g6e.48xlarge
178+
- ml.g6e.4xlarge
179+
- ml.g6e.8xlarge
180+
- ml.g6e.xlarge
181+
- ml.gr6.4xlarge
182+
- ml.gr6.8xlarge
183+
- ml.p2.16xlarge
184+
- ml.p2.8xlarge
185+
- ml.p2.xlarge
186+
- ml.p3.16xlarge
187+
- ml.p3.2xlarge
188+
- ml.p3.8xlarge
189+
- ml.p3dn.24xlarge
190+
- ml.p4d.24xlarge
191+
- ml.p5.48xlarge
146192
tolerations:
147193
- key: nvidia.com/gpu
148194
operator: Exists

0 commit comments

Comments
 (0)