Grasshopper ----------> Master ( D 2 ): September 2022

When it comes to Kubernetes resource allocation, CPU allocation is considered as a critical factor for application/pod performance.

According to Kubernetes spec, there is a way to do this using Limits and Requests.

But how do we make sure or derive a value for both CPU requests and limits.

I found below article:

Here is the FUN part :) .

Do pods always get the CPU requested by their CPU request ? Is it guaranteed?

In CFS, every running entity, a process or a task group, has a virtual runtime (vruntime) which accounts for the entity's CPU usage.

The scheduling goal of CFS is to keep the vruntime of all running entities to be the same.

Each entity gets a portion of cpu.shares proportional to the task group's running load on the CPU

CPU requests can't be guaranteed since CPU time depends on process load.

Set CPU requests and limits to the same value
Add an HPA (Horizontal Pods Auto scaling) that allows Pods to automatically scale up and down based on load.
Use circuit breaker pattern.