-
Notifications
You must be signed in to change notification settings - Fork 296
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
✨ vGPU implementation #3025
Merged
Merged
✨ vGPU implementation #3025
Changes from 14 commits
Commits
Show all changes
21 commits
Select commit
Hold shift + click to select a range
0091291
vGPU implementation
puneetkatyal c5d2a78
Update pkg/services/govmomi/vcenter/clone.go
puneetkatyal b85ad40
Handle adding vGPU in reconcilePCIDevices instead of in Clone
birksl 47d7d72
Duplicate PCI device test for vGPU test
birksl 27933fa
Update docs/gpu-vgpu.md
birksl 9f16da1
Update pkg/services/govmomi/vcenter/clone.go
birksl cd250d7
Make VGPU directly part of PCI specs.
birksl 1e9a6e1
Remove outdated vpgu e2e setup
birksl 6e8d105
Fix nil-pointer mistake
birksl c7f4781
Webhook validation for vgpuProfile
birksl 1353b4d
Update internal/webhooks/vspheremachinetemplate.go
birksl a6c0fd8
Update validation webhooks
birksl 8a6a558
Reorder if-statement
birksl cea0016
Run make generate
birksl 9fc17c9
Share device validation logic in webhooks
birksl 0e84953
Fix empty pointer nit
birksl 1f433eb
Update docs/gpu-vgpu.md
birksl 609542b
Apply suggestions from code review
birksl 438c6e5
Generate manifests
birksl f18ba35
Update PCIDevice doc comments
birksl ee5d3ac
Forgot to run make generate
birksl File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change | ||||||
---|---|---|---|---|---|---|---|---|
@@ -0,0 +1,106 @@ | ||||||||
# GPU enabled clusters using vGPU | ||||||||
|
||||||||
## Overview | ||||||||
|
||||||||
You can choose to create a cluster with both worker and control plane nodes having vGPU devices attached to them. | ||||||||
|
||||||||
Before we begin, a few important things to note: | ||||||||
|
||||||||
- [NVIDIA GPU Operator](https://github.com/NVIDIA/gpu-operator) is used to expose the GPU PCI devices to the workloads running on the cluster. | ||||||||
- The OVA templates used for cluster creation should have the VMX version (Virtual Hardware) set to 17 or higher. This is necessary because Dynamic DirectPath I/O was introduced in this version, which enables the Assignable Hardware intelligence for passthrough devices. | ||||||||
- Since we need the VMX version to be >=17, this way of provisioning clusters with PCI passthrough devices works for vSphere 7.0 and above. This is the ESXi/VMX version [compatibility list](https://kb.vmware.com/s/article/2007240). | ||||||||
- UEFI boot mode is recommended for the OVAs used for cluster creation. | ||||||||
- Most of the setup is similar to [GPU enabled clusters via PCI Passthrough](https://github.com/kubernetes-sigs/cluster-api-provider-vsphere/blob/main/docs/gpu-pci.md#create-the-cluster). | ||||||||
|
||||||||
## An example GPU enabled cluster | ||||||||
|
||||||||
Let's create a CAPV cluster with vGPU enabled nodes. | ||||||||
|
||||||||
### Prerequisites | ||||||||
|
||||||||
- Refer the [NVIDIA Virtual GPU Software Quick Start Guide](https://docs.nvidia.com/grid/latest/grid-software-quick-start-guide/index.html) to download and install the vGPU software and configure vGPU licensing. | ||||||||
|
||||||||
- Ensure vGPU compatibility for your vSphere installation and the GPU devices using the [VMware Compatibility Guide - Shared Pass-through Graphics](https://www.vmware.com/resources/compatibility/search.php?deviceCategory=vgpu) | ||||||||
|
||||||||
- Enable Shared Passthrough for the GPU device on the ESXi Host | ||||||||
- Browse to a host in the vSphere Client navigator. | ||||||||
- On the **Configure** tab, expand **Hardware** and click **Graphics**. | ||||||||
- Under **GRAPHICS DEVICES**, select the GPU device to be used for vGPU, click **EDIT...** and select **Shared Direct**. Repeat this for additional GPU devices as needed. | ||||||||
- Select **HOST GRAPHICS**, click **EDIT...** and select **Shared Direct** and select a shared passthrough GPU assignment policy, for example **Group VMs on GPU until full (GPU consolidation)**. | ||||||||
|
||||||||
- Build an OVA template | ||||||||
We can build a custom OVA template using the [image-builder](https://github.com/kubernetes-sigs/image-builder) project. We will build a Ubuntu 20.04 OVA with UEFI boot mode. More documentation on how to use image-builder can be found in the [image-builder book](https://image-builder.sigs.k8s.io/capi/providers/vsphere.html) | ||||||||
- Clone the repo locally and go to the `./images/capi/` directory. | ||||||||
- Create a `packer-vars.json` file with the following content. | ||||||||
|
||||||||
```shell | ||||||||
$ cat packer-vars.json | ||||||||
{ | ||||||||
"vmx_version": 17 | ||||||||
} | ||||||||
``` | ||||||||
|
||||||||
- Run the make file target associated to ubuntu 20.04 UEFI OVA as follows: | ||||||||
|
||||||||
```shell | ||||||||
> PACKER_VAR_FILES=packer-vars.json make build-node-ova-vsphere-ubuntu-2004-efi | ||||||||
``` | ||||||||
|
||||||||
### Source the vGPU profile(s) for the GPU device | ||||||||
|
||||||||
See "2. Choosing the vGPU Profile for the Virtual Machine" at [Using GPUs with Virtual Machines on vSphere](https://blogs.vmware.com/apps/2018/09/using-gpus-with-virtual-machines-on-vsphere-part-3-installing-the-nvidia-grid-technology.html) to see what vGPU profiles are available for your GPU device. | ||||||||
|
||||||||
We are using NVIDIA Tesla V100 32GB cards for this example and will use the `grid_v100d-4c` vGPU profile for this card that allocates 4GB GPU memory to the worker node's vGPU device. | ||||||||
|
||||||||
### Create the cluster template | ||||||||
|
||||||||
```shell | ||||||||
$ make dev-flavors | ||||||||
go run ./packaging/flavorgen --output-dir /home/user/.cluster-api/overrides/infrastructure-vsphere/v0.0.0 | ||||||||
Comment on lines
+58
to
+59
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
We dropped the dev-flavors target in the meantime |
||||||||
``` | ||||||||
|
||||||||
Edit the generated Cluster template (`cluster-template.yaml`) to set the values for the `pciDevices` array. Here we are editing the VSphereMachineTemplate object for the worker nodes. This will create a worker node with a single NVIDIA 16GB vGPU device attached to the VM. | ||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||
|
||||||||
```yaml | ||||||||
--- | ||||||||
apiVersion: infrastructure.cluster.x-k8s.io/v1beta1 | ||||||||
kind: VSphereMachineTemplate | ||||||||
metadata: | ||||||||
name: ${CLUSTER_NAME}-worker | ||||||||
namespace: '${NAMESPACE}' | ||||||||
spec: | ||||||||
template: | ||||||||
spec: | ||||||||
cloneMode: linkedClone | ||||||||
datacenter: '${VSPHERE_DATACENTER}' | ||||||||
datastore: '${VSPHERE_DATASTORE}' | ||||||||
diskGiB: 25 | ||||||||
folder: '${VSPHERE_FOLDER}' | ||||||||
memoryMiB: 8192 | ||||||||
network: | ||||||||
devices: | ||||||||
- dhcp4: true | ||||||||
networkName: '${VSPHERE_NETWORK}' | ||||||||
numCPUs: 2 | ||||||||
os: Linux | ||||||||
powerOffMode: trySoft | ||||||||
resourcePool: '${VSPHERE_RESOURCE_POOL}' | ||||||||
server: '${VSPHERE_SERVER}' | ||||||||
storagePolicyName: '${VSPHERE_STORAGE_POLICY}' | ||||||||
template: '${VSPHERE_TEMPLATE}' | ||||||||
thumbprint: '${VSPHERE_TLS_THUMBPRINT}' | ||||||||
pciDevices: | ||||||||
- vgpuProfile: "grid_t4-1a" # value from above | ||||||||
``` | ||||||||
|
||||||||
Set the required values for the other fields and the cluster template is ready for use. The similar changes can be made to a template generated using clusterctl generate cluster command as well. | ||||||||
birksl marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||||
|
||||||||
### Create the cluster | ||||||||
|
||||||||
Set the size of the GPU nodes appropriately, since the Nvidia gpu-operator requires additional CPU and memory to install the device drivers on the VMs. | ||||||||
|
||||||||
Note: For GPU nodes (PCI Passthrough or vGPU), all memory of the nodes must be reserved. CAPV will automatically do this for nodes that have a PCI Passthrough GPU or a vGPU device in the spec. See "Memory Reservation" at [Using GPUs with Virtual Machines on vSphere](https://blogs.vmware.com/apps/2018/09/using-gpus-with-virtual-machines-on-vsphere-part-2-vmdirectpath-i-o.html) | ||||||||
|
||||||||
Apply the manifest from the previous step to your management cluster to have CAPV create a workload cluster with worker nodes that have vGPUs. | ||||||||
|
||||||||
From this point on, the setup is exactly the same as [GPU enabled clusters via PCI Passthrough](https://github.com/kubernetes-sigs/cluster-api-provider-vsphere/blob/main/docs/gpu-pci.md#create-the-cluster). | ||||||||
birksl marked this conversation as resolved.
Show resolved
Hide resolved
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Same for VendorID & VGPUProfile
(Note: this won't change the CRD YAML as "omitempty" already made the fields optional in the generated YAML)