English

Distributedness based scheduling

Distributed, Parallel, and Cluster Computing 2025-06-04 v1

Abstract

Efficient utilization of computing resources in a Kubernetes cluster is often constrained by the uneven distribution of pods with similar usage patterns. This paper presents a novel scheduling strategy designed to optimize the distributedness of Kubernetes resources based on their usage magnitude and patterns across CPU, memory, network, and storage. By categorizing resource usage into labels such as "cpu high spike" or "memory medium always," and applying these to deployed pods, the system calculates the variance or distributedness factor of similar resource types across cluster nodes. A lower variance indicates a more balanced distribution. The Kubernetes scheduler is enhanced to consider this factor during scheduling decisions, placing new pods on nodes that minimize resource clustering. Furthermore, the approach supports redistribution of existing pods through simulated scheduling to improve balance. This method is adaptable at the cluster, namespace, or application level and is integrated within the standard Kubernetes scheduler, providing a scalable, label-driven mechanism to improve overall resource efficiency in cloud-native environments.

Keywords

Cite

@article{arxiv.2506.02581,
  title  = {Distributedness based scheduling},
  author = {Paritosh Ranjan and Surajit Majumder and Prodip Roy and Bhuban Padhan},
  journal= {arXiv preprint arXiv:2506.02581},
  year   = {2025}
}

Comments

7 pages