This repository has been archived by the owner on Oct 21, 2020. It is now read-only.

[local-volume] New controller to handle node deletion #817

Closed
msau42 opened this issue Jun 18, 2018 · 5 comments

Comments

@msau42
Contributor

msau42 commented Jun 18, 2018

Extension to kubernetes/community#1484

In cloud environments, nodes can be deleted and recreated fairly often. When nodes are deleted, the local disks are deleted along with them; however, local PVs remain, and pods get stuck in scheduling because they are bound to a node that no longer exists.

For workloads that tolerate data loss and can recover with a brand new disk, the user can delete and recreate the PVC, which will cause the new PVC to be bound to a new disk. If using StatefulSets, the StatefulSet controller will automatically recreate a PVC if it doesn't exist.
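As a concrete illustration of that manual recovery, here is a minimal sketch using a recent client-go; the namespace, pod name `mysts-0`, and PVC name `data-mysts-0` are hypothetical placeholders for a StatefulSet workload:

```go
package main

import (
	"context"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	// Build a client from the default kubeconfig location.
	cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	cs, err := kubernetes.NewForConfig(cfg)
	if err != nil {
		panic(err)
	}

	ctx := context.Background()
	ns, pvcName, podName := "default", "data-mysts-0", "mysts-0" // hypothetical workload

	// Delete the PVC that is bound to the disk that vanished with the node...
	if err := cs.CoreV1().PersistentVolumeClaims(ns).Delete(ctx, pvcName, metav1.DeleteOptions{}); err != nil {
		panic(err)
	}
	// ...then delete the stuck pod. The StatefulSet controller recreates the pod
	// and its missing PVC, and the new PVC binds to a fresh local PV on a live node.
	if err := cs.CoreV1().Pods(ns).Delete(ctx, podName, metav1.DeleteOptions{}); err != nil {
		panic(err)
	}
}
```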

The process of detecting node deletion and deleting the PVC could be automated by a controller. There are a few things to consider:

  • Workload needs to opt in to this controller. Not all workloads want this behavior.
  • Node deletion detection can be tricky in some environments. I know in GCE, the managed instance group recreates nodes with the same name. And in K8s 1.11, I think the Node object is no longer recreated by kubelet if the instance ID changes.
  • There are two scenarios: 1) The PVC is already bound to a local PV. When the local PV is released, there may be additional cleanup needed too, since the daemonset provisioner no longer runs on that node. 2) The local PV is unbound and available (but not actually usable, since the node is gone). A sketch of handling both scenarios follows this list.
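
A rough sketch of what such a controller's cleanup could look like for both scenarios, assuming a recent client-go; the helper names and the `local.storage/auto-recreate` opt-in annotation are hypothetical, not an existing API:

```go
package localpv

import (
	"context"

	v1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// cleanupLocalPVsForNode handles both scenarios for a node that was deleted:
// bound PVs (delete the opted-in PVC so the workload can re-bind elsewhere)
// and available/released PVs (delete them, since the backing disk is gone and
// the daemonset provisioner no longer runs on that node).
func cleanupLocalPVsForNode(ctx context.Context, cs kubernetes.Interface, nodeName string) error {
	pvs, err := cs.CoreV1().PersistentVolumes().List(ctx, metav1.ListOptions{})
	if err != nil {
		return err
	}
	for i := range pvs.Items {
		pv := &pvs.Items[i]
		if pv.Spec.Local == nil || !pvOnNode(pv, nodeName) {
			continue // not a local PV, or not affined to the deleted node
		}
		switch pv.Status.Phase {
		case v1.VolumeBound:
			// Scenario 1: only act if the owning workload opted in.
			ref := pv.Spec.ClaimRef
			if ref == nil {
				continue
			}
			pvc, err := cs.CoreV1().PersistentVolumeClaims(ref.Namespace).Get(ctx, ref.Name, metav1.GetOptions{})
			if err != nil {
				return err
			}
			if pvc.Annotations["local.storage/auto-recreate"] != "true" { // hypothetical opt-in annotation
				continue
			}
			if err := cs.CoreV1().PersistentVolumeClaims(ref.Namespace).Delete(ctx, ref.Name, metav1.DeleteOptions{}); err != nil {
				return err
			}
		case v1.VolumeAvailable, v1.VolumeReleased:
			// Scenario 2: the PV is unbound (or released) but its disk no longer
			// exists and nothing on the node can clean it up; delete the PV.
			if err := cs.CoreV1().PersistentVolumes().Delete(ctx, pv.Name, metav1.DeleteOptions{}); err != nil {
				return err
			}
		}
	}
	return nil
}

// pvOnNode reports whether the PV's required node affinity pins it to nodeName.
func pvOnNode(pv *v1.PersistentVolume, nodeName string) bool {
	if pv.Spec.NodeAffinity == nil || pv.Spec.NodeAffinity.Required == nil {
		return false
	}
	for _, term := range pv.Spec.NodeAffinity.Required.NodeSelectorTerms {
		for _, req := range term.MatchExpressions {
			if req.Key == "kubernetes.io/hostname" && req.Operator == v1.NodeSelectorOpIn {
				for _, val := range req.Values {
					if val == nodeName {
						return true
					}
				}
			}
		}
	}
	return false
}
```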

As for implementation ideas, I think metacontroller would be a cool framework to try out for this.

@msau42
Contributor Author

msau42 commented Jun 18, 2018

/area local-volume

@msau42
Contributor Author

msau42 commented Jun 18, 2018

Thinking about it a little more, the scenario where the node is recreated with the same name could be handled by kubernetes/community#1484, which can detect that the path/disk no longer exists on the node.
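
That detection could be as simple as the per-node daemonset stat-ing each local PV's path; a minimal sketch, assuming the daemonset pod has the relevant host paths mounted:

```go
package localmonitor

import (
	"os"

	v1 "k8s.io/api/core/v1"
)

// localPathMissing reports whether a local PV's backing path no longer exists
// on this node, e.g. because the node was recreated and its disks were wiped.
// Intended to run inside the per-node daemonset, which must have the relevant
// host paths mounted into its pod.
func localPathMissing(pv *v1.PersistentVolume) (bool, error) {
	if pv.Spec.Local == nil {
		return false, nil // not a local PV
	}
	_, err := os.Stat(pv.Spec.Local.Path)
	if os.IsNotExist(err) {
		return true, nil
	}
	return false, err
}
```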

So maybe there are actually 3 controllers involved here:

  1. Daemonset that monitors disks on each node: PV monitoring proposal kubernetes/community#1484
  2. Single controller that monitors deletion of Node objects (a skeleton is sketched after this list)
  3. Single controller that manages workloads using local PVs
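
A skeleton for controller 2, watching Node deletions with a shared informer and handing off to a cleanup routine like the hypothetical `cleanupLocalPVsForNode` sketched above; this is a minimal sketch, not an actual implementation:

```go
package localpv

import (
	"context"
	"time"

	v1 "k8s.io/api/core/v1"
	"k8s.io/client-go/informers"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/cache"
)

// runNodeDeletionWatcher triggers local-PV cleanup whenever a Node object is deleted.
func runNodeDeletionWatcher(cs kubernetes.Interface, stopCh <-chan struct{}) {
	factory := informers.NewSharedInformerFactory(cs, 10*time.Minute)
	nodeInformer := factory.Core().V1().Nodes().Informer()

	nodeInformer.AddEventHandler(cache.ResourceEventHandlerFuncs{
		DeleteFunc: func(obj interface{}) {
			// The deleted object may arrive wrapped in a DeletedFinalStateUnknown tombstone.
			node, ok := obj.(*v1.Node)
			if !ok {
				tombstone, ok := obj.(cache.DeletedFinalStateUnknown)
				if !ok {
					return
				}
				node, ok = tombstone.Obj.(*v1.Node)
				if !ok {
					return
				}
			}
			// A real controller would enqueue this on a workqueue with retries.
			_ = cleanupLocalPVsForNode(context.TODO(), cs, node.Name)
		},
	})

	factory.Start(stopCh)
	factory.WaitForCacheSync(stopCh)
	<-stopCh
}
```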

@NickrenREN
Contributor

1 and 2 are being implemented here: https://github.com/caicloud/kube-storage-monitor

@msau42
Contributor Author

msau42 commented Dec 19, 2018

Migrating to new repo: kubernetes-sigs/sig-storage-local-static-provisioner#10
/close

@k8s-ci-robot
Contributor

@msau42: Closing this issue.

In response to this:

Migrating to new repo: kubernetes-sigs/sig-storage-local-static-provisioner#10
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
