
Doing this at anything > 1k nodes is a pain in the butt. We decided to run many <100-node clusters rather than a few big ones.




Same here. Control plane components that didn't originate from the Kubernetes project itself - your ingress controllers, service meshes, etc. - start failing beyond a certain limit. So I don't usually take node numbers from these benchmarks seriously for our kind of workloads. We run a bunch of sub-1k-node clusters.

Same. The control plane and various controllers just aren't up to the task.

Meh, I've had clusters with close to 1k nodes (w/ Cilium as CNI) and didn't have major issues.

When I was involved about a year ago, Cilium fell apart at around a few thousand nodes.

One of the main issues with Cilium is that the BPF maps scale with the number of nodes/pods in the cluster, so you get exponential memory growth as you add more nodes running the Cilium agent. https://docs.cilium.io/en/stable/operations/performance/scal...
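A rough back-of-envelope sketch of that scaling (all constants here are hypothetical illustration values, not numbers from the Cilium docs): if each node's agent keeps map state per node/pod in the cluster, per-agent memory grows linearly with cluster size, and the total across all agents grows roughly with the square of the node count.

```python
# Back-of-envelope sketch: per-agent state that tracks every pod in the
# cluster grows linearly with cluster size; summed over all agents the
# cluster-wide total grows ~N^2. Constants below are made up for illustration.

BYTES_PER_ENTRY = 64      # assumed size of one map entry
PODS_PER_NODE = 50        # assumed pod density

def per_agent_bytes(nodes: int) -> int:
    # one entry per pod in the cluster (assumption for illustration)
    return nodes * PODS_PER_NODE * BYTES_PER_ENTRY

def cluster_bytes(nodes: int) -> int:
    # every node runs an agent holding a map of that size
    return nodes * per_agent_bytes(nodes)

for n in (100, 1_000, 5_000):
    print(f"{n:>5} nodes: {per_agent_bytes(n) / 2**20:8.1f} MiB/agent, "
          f"{cluster_bytes(n) / 2**30:8.1f} GiB cluster-wide")
```

Per-agent memory stays linear in cluster size; it's the sum over all agents that blows up much faster as you add nodes.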


That's true, and I definitely had to "tune" the BPF map limits, but it wasn't really that difficult to do.

Wouldn't that be quadratic rather than exponential?


