At the start of the incident, Janar discovered that the Kubernetes cluster has no nodes servicing it. This means all workloads on the Kubernetes cluster were stopped.
Google Cloud Console showed all the nodes that were supposed to service the cluster as online, leading us to believe that they had somehow disconnected / deauthorized from the cluster.
We spun down all the disconnected nodes and spun up new servers to start servicing the workloads.
The cluster immediately started self-healing to restore all workloads to operational condition.