-
Notifications
You must be signed in to change notification settings - Fork 40k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AWS Feature: GPU support #23587
Comments
@therc you said you would work on this :-) |
@chengyli to know this. |
Watching this closely, this will definitely help us! Kudos to taking this on for v1.3! |
I'm trying to implement this through Docker volume plugins, since that's already been requested in #16405 and would reduce the amount of custom code (we still need to pass |
@brandoncole what instance types do you use? For v0 I am targeting g2.2xl. |
@therc Answering on behalf of @brandoncole (I work with him). We are using g2.2xl so that works perfectly for us. |
@therc We would be pretty happy with g2.2xl to start with. We're also running Kubernetes on a pretty large scale internally with machines that have multiple NVidia GPUs which we'd love to take advantage of and schedule accordingly as well eventually. @gopinatht is our main guy for this! 👍 |
@therc Docker already support GPU, but the K8s doesn't have the "--device" ability, could you let know the details about "implement this through Docker volume plugins"? I enabled the "--device" in my repo, but the code still need to refine. And so far it works with NVIDIA GPU. Have tested that. |
@Hui-Zhi I'm almost done with the design doc. I'll send it out tomorrow. You're right that |
Automatic merge from submit-queue WIP v0 NVIDIA GPU support ```release-note * Alpha support for scheduling pods on machines with NVIDIA GPUs whose kubelets use the `--experimental-nvidia-gpus` flag, using the alpha.kubernetes.io/nvidia-gpu resource ``` Implements part of kubernetes#24071 for kubernetes#23587 I am not familiar with the scheduler enough to know what to do with the scores. Mostly punting for now. Missing items from the implementation plan: limitranger, rkt support, kubectl support and docs cc @erictune @davidopp @dchen1107 @vishh @Hui-Zhi @gopinatht
@justinsb Now that our v0 GPU support has merged, are there any AWS-specific bits needed to enable it on AWS? |
Moving to 1.4 |
Actually @therc perhaps you can comment on the GPU support that is merged and what (if anything) we need to do additionally for AWS? |
This needs to be triaged as a release-blocker or not for 1.5 |
@justinsb all issues must be labeled either release blocker or non release blocking by end of day 18 November 2016 PST. (or please move it to 1.6) cc @kubernetes/sig-aws |
Can this issue be closed since multi-GPU support is merged with #42116 ? I have a follow-up proposal for some other features kubernetes/community#414 |
Close this one, and please address the rest enhancement separately. Thanks! |
GPU support, both assigning only to nodes that support GPUs & ensuring that we don't overcommit GPUs.
The text was updated successfully, but these errors were encountered: