Adding New Worker Node
Adding the node in k8s
To add a new worker node, we follow the steps outlined by the Kubespray project.
Let's assume we are adding one new worker node, computegpu001.p40.example.com, and adding it to the relevant sections.
- Add the node to your Ansible inventory file
- Ensure the hostname is correctly set and the hosts file has a 127.0.0.1 entry
- Run scale.yaml to add the node to your cluster
Once step 3 (the scale.yaml run) completes successfully, validate that the node is up and running in the cluster.
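The steps above can be sketched as a few small shell functions. This is a minimal sketch, not the project's official procedure: the `INVENTORY` path is a hypothetical placeholder, the function names are made up for illustration, and the `--limit` flag is optional (Kubespray's own documentation describes when to use it and how to refresh the facts cache first).

```shell
#!/bin/sh
# Sketch only. INVENTORY is a hypothetical path; point it at your own inventory file.
NEW_NODE="computegpu001.p40.example.com"
INVENTORY="${INVENTORY:-inventory/inventory.yaml}"

# Run on the new node: print the hostname and any 127.0.0.1 lines in /etc/hosts.
check_host_setup() {
  hostname
  grep '^127\.0\.0\.1' /etc/hosts
}

# Run from the Kubespray checkout: add the node with the scale playbook.
# --limit restricts the run to the new node and is optional.
scale_cluster() {
  ansible-playbook -i "${INVENTORY}" --become --limit "${NEW_NODE}" scale.yaml
}

# Confirm the node registered, then wait until it reports Ready.
validate_node() {
  kubectl get node "${NEW_NODE}" -o wide
  kubectl wait --for=condition=Ready "node/${NEW_NODE}" --timeout=300s
}
```

After sourcing the file, `scale_cluster && validate_node` runs the playbook and only checks the node if the playbook succeeded.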
Adding the node in openstack
Once the node has been added to the k8s cluster, adding it to the OpenStack services is simply a matter of applying the right labels and annotations to the node.
- Export the nodes to add as the NODES environment variable
- For a compute node, add the following labels

    # Label the openstack compute nodes
    kubectl label node computegpu001.p40.example.com openstack-compute-node=enabled
    # With OVN we need the compute nodes to be "network" nodes as well.
    # While they will be configured for networking, they won't be gateways.
    kubectl label node computegpu001.p40.example.com openstack-network-node=enabled
- Add the right annotations to the node

    kubectl annotate \
      nodes \
      ${NODES} \
      ovn.openstack.org/int_bridge='br-int'

    kubectl annotate \
      nodes \
      ${NODES} \
      ovn.openstack.org/bridges='br-ex'

    kubectl annotate \
      nodes \
      ${NODES} \
      ovn.openstack.org/ports='br-ex:bond1'

    kubectl annotate \
      nodes \
      ${NODES} \
      ovn.openstack.org/mappings='physnet1:br-ex'

    kubectl annotate \
      nodes \
      ${NODES} \
      ovn.openstack.org/availability_zones='nova'
- Verify all the services are up and running

  At this point the compute node should be up and running, and the openstack CLI should list the compute node under hosts.
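The export and verification steps do not show their commands, so here is one possible sketch. It assumes `NODES` holds a space-separated list of the node names being added (which is how the annotate commands consume it) and that OpenStack credentials are already loaded in the shell. The function names are hypothetical.

```shell
#!/bin/sh
# Assumption: NODES is a space-separated list of the node names being added.
export NODES="computegpu001.p40.example.com"

# Show the labels and annotations currently set on each node.
show_node_metadata() {
  for node in ${NODES}; do
    kubectl get node "${node}" --show-labels
    kubectl get node "${node}" -o jsonpath='{.metadata.annotations}'
  done
}

# List the compute services and hypervisors as OpenStack sees them.
show_openstack_compute() {
  for node in ${NODES}; do
    openstack compute service list --host "${node}"
  done
  openstack hypervisor list
}
```

The new node should appear in both OpenStack listings once its compute service has registered; if it does not, checking the labels and annotations with `show_node_metadata` is a reasonable first step.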
For PCI passthrough
If the new node is to be a PCI passthrough compute node, for example to expose a GPU to VMs, set up your PCI passthrough configuration at this stage. Follow the steps in: Configuring PCI Passthrough in OpenStack
Once the PCI setup is complete, follow the instructions in: Adding Host Aggregates to set up host aggregates for the group of PCI devices. This lets us control image/flavor/tenant build restrictions on a given aggregate and make better use of the underlying GPU resources.
Once the host aggregate is set up, follow the instructions in: Genestack flavor documentation to set up the right flavor.
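As a rough illustration of where the aggregate and flavor steps end up, the commands might look like the sketch below. The aggregate name, flavor name and PCI alias are all hypothetical; the alias in particular must match whatever was defined during the PCI passthrough configuration, and the flavor is assumed to exist already. The linked documents remain the authoritative procedure.

```shell
#!/bin/sh
# Sketch only. AGGREGATE, FLAVOR and PCI_ALIAS are hypothetical names.
GPU_NODE="computegpu001.p40.example.com"
AGGREGATE="gpu-p40"
FLAVOR="gpu.p40.small"
PCI_ALIAS="p40"

setup_gpu_aggregate() {
  # Group the GPU hosts into one aggregate.
  openstack aggregate create "${AGGREGATE}"
  openstack aggregate add host "${AGGREGATE}" "${GPU_NODE}"
  # Request one device matching the alias for instances of this flavor.
  openstack flavor set --property "pci_passthrough:alias=${PCI_ALIAS}:1" "${FLAVOR}"
}
```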