The Performance of the Czech National Grid Infrastructure after Major Reconfiguration of Job Scheduling System

Logo poskytovatele

Varování

Publikace nespadá pod Fakultu sportovních studií, ale pod Fakultu informatiky. Oficiální stránka publikace je na webu muni.cz.
Autoři

KLUSÁČEK Dalibor TÓTH Šimon

Rok publikování 2014
Druh Článek ve sborníku
Konference Cracow Grid Workshop 2014
Fakulta / Pracoviště MU

Fakulta informatiky

Citace
Obor Informatika
Klíčová slova queue reconfiguration; multi-resource fairness; plan-based scheduling
Popis This work describes the outcomes of a large reconfiguration of the job scheduling system used in the Czech National Grid MetaCentrum which has been done in January and July 2014. MetaCentrum serves to various users and research groups. It is very important to guarantee that computational resources are used efficiently and in a fair fashion with respect to different users. With the significant growth of MetaCentrum (1,500 CPU cores in 2009 vs. 10,000 CPU cores in 2014) we recently had to revise our scheduling approaches to better reflect the increased size of the system and the growing heterogeneity of hardware resources and users' workloads. This revision took place in three major steps through the year 2014. First of all, new multi-resource aware fair-sharing algorithm was deployed, in order to improve fairness with respect to growing heterogeneity of resources and users demands. Second, large queue reconfiguration was done, in order to decrease resource fragmentation and improve utilization. Finally, new plan-based job scheduler enabling schedule optimization has been deployed in July 2014, currently managing 5 large computer clusters with 4500 CPU cores.
Související projekty:

Používáte starou verzi internetového prohlížeče. Doporučujeme aktualizovat Váš prohlížeč na nejnovější verzi.

Další info