The Wire · Showcase
GCP CLOUD PROVIDER ROLLBACK REVEALS RBAC REGRESSION—NODE DELETION BROKEN IN GKE
By RepoJournal · Filed · About Kubernetes
The cloud-provider-gcp team reverted a critical library update after discovering it broke node lifecycle permissions, leaving orphaned GCE instances stranded in clusters.
A v1.34.0 library bump [1] shipped yesterday but was reverted within hours [2] after exposing a cascading permission failure. The root cause: NodeLifecycleController was using a hardcoded client identity instead of respecting configured RBAC roles, preventing nodes from being deleted after their backing GCE instances vanished [3]. This regression was introduced by an earlier permission refactor [3], so the team is now carefully unwinding that work. In parallel, the cluster directory removal [4] continues as planned—kubetest2-gce dependency is gone, so that testing infrastructure is being cleaned up. Meanwhile, test-infra is tightening its own house: 14 periodic jobs with zero green builds in 90+ days are getting axed [5], and Windows Azure tests are coming back online after a capz provider fix [6].
Action items
- → Do not upgrade cloud-provider-gcp to v1.34.0—revert to previous if already deployed kubernetes/cloud-provider-gcp [immediate]
- → If running GKE with node autoscaling, verify node-controller RBAC permissions are correctly bound kubernetes/cloud-provider-gcp [plan]
- → Watch for the fixed NodeLifecycleController patch—expect it in the next cloud-provider-gcp release kubernetes/cloud-provider-gcp [monitor]
References
- [1] Update GoogleCloudPlatform/k8s-cloud-provider to v1.34.0 ↗ kubernetes/cloud-provider-gcp
- [2] Revert "Update GoogleCloudPlatform/k8s-cloud-provider to v1.34.0" ↗ kubernetes/cloud-provider-gcp
- [3] fix: Ensure NodeLifecycleController has correct permissions (#1118) kubernetes/cloud-provider-gcp
- [4] Remove the cluster directory ↗ kubernetes/cloud-provider-gcp
- [5] config/jobs: delete 14 periodics with no green build in 90+ days ↗ kubernetes/test-infra
- [6] test(cloud-provider-azure): re-add Windows periodic jobs after capz fix ↗ kubernetes/test-infra
FAQ
- What changed in Kubernetes on May 12, 2026?
- The cloud-provider-gcp team reverted a critical library update after discovering it broke node lifecycle permissions, leaving orphaned GCE instances stranded in clusters.
- What should Kubernetes teams do about it?
- Do not upgrade cloud-provider-gcp to v1.34.0—revert to previous if already deployed • If running GKE with node autoscaling, verify node-controller RBAC permissions are correctly bound • Watch for the fixed NodeLifecycleController patch—expect it in the next cloud-provider-gcp release
- Which Kubernetes repositories shipped on May 12, 2026?
- kubernetes/cloud-provider-gcp, kubernetes/test-infra