Reinforcement Learning applied to Meta-scheduling in grid environments
Bernardo Costa, Inês Dutra, Marta Mattoso


ISPA 2008 APDCT Workshop 1

Reinforcement Learning applied to Meta-scheduling in grid environments

Bernardo Costa

Inês Dutra

Marta Mattoso

ISPA 2008 APDCT Workshop 2

Outline

Introduction
Algorithms
Experiments
Conclusions and Future work

ISPA 2008 APDCT Workshop 3

Introduction, Algorithms, Experiments, Conclusions and Future work

ISPA 2008 APDCT Workshop 4

Introduction

Relevance:
- Available grid schedulers usually do not employ a strategy that may benefit a single user or multiple users.
- Some strategies employ performance-information-dependent algorithms (PIDA).
- Most works are evaluated only in simulation.

Difficulty: monitoring information is not reliable due to network latency.

ISPA 2008 APDCT Workshop 5

Introduction, Algorithms, Experiments, Conclusions and Future work

ISPA 2008 APDCT Workshop 6

Study of two algorithms:

(AG) A. Galstyan, K. Czajkowski, and K. Lerman. Resource allocation in the grid using reinforcement learning. In AAMAS, pages 1314–1315. IEEE, 2004.

(MQD) Y. C. Lee and A. Y. Zomaya. A grid scheduling algorithm for bag-of-tasks applications using multiple queues with duplication. In ICIS-COMSAR (5th IEEE/ACIS International Conference on Computer and Information Science and 1st IEEE/ACIS International Workshop on Component-Based Software Engineering, Software Architecture and Reuse), pages 5–10, 2006.

ISPA 2008 APDCT Workshop 7

What is reinforcement learning?

A machine learning technique used to learn behaviours from a series of temporal events. It is non-supervised learning, based on the idea of rewards and punishments.

ISPA 2008 APDCT Workshop 8

Algorithms

AG and MQD use reinforcement learning to associate an efficiency rank with each RMS.

Reinforcement learning is native to AG; MQD was modified to use the technique to estimate the computational power of an RMS.

AG allocates RMSs in a greedy, probabilistic way; MQD allocates RMSs associatively and deterministically.

ISPA 2008 APDCT Workshop 9

Algorithms

Calculating efficiency:
- A reward is assigned to an RMS whose performance is better than average.
- The reward can be negative (a punishment).
- An RMS may keep its efficiency value unchanged.

ISPA 2008 APDCT Workshop 10

Algorithms

Calculating efficiency, parameters:
- one parameter controls how much the time spent executing a task affects the reward;
- l is a learning parameter.
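To make the update concrete, here is a minimal Python sketch of an efficiency update of this general shape. The exponential-moving-average form, the reward definition (deviation of the task time from the average), and the weight beta are illustrative assumptions, not the exact formulas of the AG or MQD papers.

def update_efficiency(e, task_time, avg_time, l=0.1, beta=1.0):
    """Sketch of an RMS efficiency update after a task finishes.

    e         -- current efficiency value of the RMS
    task_time -- time the RMS spent executing the task
    avg_time  -- average execution time over all RMSs
    l         -- learning parameter (speed of adaptation)
    beta      -- assumed weight of the execution-time term
    """
    # Better (faster) than average -> positive reward;
    # worse than average -> negative reward (punishment).
    reward = beta * (avg_time - task_time) / avg_time
    # Blend the old value with the reward; if reward == e,
    # the efficiency value stays unchanged.
    return (1 - l) * e + l * reward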

ISPA 2008 APDCT Workshop 11

Algorithms

AG: with high probability, associates a job with the best available RMS; otherwise selects an RMS at random.

MQD: groups of jobs, sorted by execution time, are associated with RMSs; the most efficient RMS executes the heaviest jobs. An initial allocation is used to estimate the RMSs' efficiency. Both selection rules are sketched below.
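As an illustration, a minimal Python sketch of the two allocation styles; the eps parameter, the job representation as (name, time) pairs, and the equal-size grouping are assumptions made for the example, not details taken from the papers.

import random

def ag_select(efficiency, eps=0.1):
    """AG style: with high probability (1 - eps) pick the best-ranked
    RMS; otherwise pick one at random."""
    if random.random() < eps:
        return random.choice(list(efficiency))
    return max(efficiency, key=efficiency.get)

def mqd_assign(jobs, efficiency):
    """MQD style: deterministically map groups of jobs, sorted by
    execution time, onto RMSs ranked by efficiency, so the most
    efficient RMS receives the heaviest group."""
    ranked = sorted(efficiency, key=efficiency.get, reverse=True)
    ordered = sorted(jobs, key=lambda job: job[1], reverse=True)
    size = -(-len(ordered) // len(ranked))  # ceiling division
    return {rms: ordered[i * size:(i + 1) * size]
            for i, rms in enumerate(ranked)}

With the efficiencies of the AG example below (R1 E = 0.3, R2 E = 0.057, R3 E = 0.51), ag_select would pick R3 most of the time.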

ISPA 2008 APDCT Workshop 12

Algorithm AG

ISPA 2008 APDCT Workshop 13

Pending jobs: J1 J2 J3 J4 J5 J6 J7 J8 J9
R1 E = 0; R2 E = 0; R3 E = 0

ISPA 2008 APDCT Workshop 14

Pending jobs: J4 J5 J6 J7 J8 J9
R1 E = 0; R2 E = 0.3; R3 E = -0.3

ISPA 2008 APDCT Workshop 15

Pending jobs: J7 J8 J9
R1 E = 0.3; R2 E = 0.057; R3 E = 0.51

ISPA 2008 APDCT Workshop 16

Algorithm MQD

ISPA 2008 APDCT Workshop 17

Jobs (execution times): J1 (40), J3 (50), J2 (15), J4 (30), J5 (10), J6 (70), J7 (20), J8 (20), J9 (40)
R1 E = 0; R2 E = 0; R3 E = 0


ISPA 2008 APDCT Workshop 19

Pending jobs: J1 (40), J3 (50), J4 (30), J6 (70), J8 (20), J9 (40)
R1 E = 0.3; R2 E = -0.3; R3 E = 0

ISPA 2008 APDCT Workshop 20

Pending jobs: J1 (40), J3 (50), J8 (20)
R1 E = 0.09; R2 E = -0.09; R3 E = -0.3

ISPA 2008 APDCT Workshop 21-24

[Chart slides: average per processor vs. global average]

ISPA 2008 APDCT Workshop 25

Introduction, Algorithms, Experiments, Conclusions and Future work

ISPA 2008 APDCT Workshop 26

Experiments

GridbusBroker:
- No need to install it on other grid sites.
- Only requirement: ssh access to a grid node.
- Provides a round-robin scheduler (RR).

Limitations:
- Does not support job duplication.
- Imposes a limit on the number of active jobs per RMS.

ISPA 2008 APDCT Workshop 27

Experiments

Resources in 6 grid sites:
- LabIA: 24 (Torque/Maui)
- LCP: 28 (SGE)
- Nacad: 16 (PBS PRO)
- UERJ: 144 (Condor)
- UFRGS: 4 (Torque)
- LCC: 44 (Torque)

ISPA 2008 APDCT Workshop 28

Experiments

Objective: study the performance of the algorithms in a real grid environment.

Application: bag-of-tasks, CPU-intensive, with task durations between 3 and 8 minutes.

ISPA 2008 APDCT Workshop 29

Experiments

Evaluation criterion: makespan, normalized with respect to RR.
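In other words, presumably normalized makespan = makespan(AG or MQD) / makespan(RR) for the same job set, so values below 1 indicate an improvement over RR.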

ISPA 2008 APDCT Workshop 30

Experiments

Phase I: tuning of the two parameters (the reward weight and the learning parameter l), with 500 jobs.

Phase II: performance of re-scheduling; the load was later increased to 1000 jobs.

ISPA 2008 APDCT Workshop 31

Experiments

One experiment is a run of consecutive executions of RR, AG and MQD.

A scenario is a set of experiments with fixed parameters.

For each scenario: 15 runs. T-tests verify the statistical difference between AG/MQD and RR at 95% confidence (the results follow a normal distribution), as sketched below.
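A minimal sketch of such a test with SciPy, assuming one makespan measurement per run and paired samples (RR, AG and MQD are executed consecutively within each experiment); the variable names are hypothetical.

from scipy import stats

def differs_from_rr(makespans_alg, makespans_rr, alpha=0.05):
    """Paired t-test: do the algorithm's makespans differ from RR's
    at 95% confidence? Assumes, as stated above, that the results
    are normally distributed."""
    t_stat, p_value = stats.ttest_rel(makespans_alg, makespans_rr)
    return p_value < alpha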


ISPA 2008 APDCT Workshop 33

Experiments (Phase I)

[Results chart]

ISPA 2008 APDCT Workshop 34

Experiments (Phase II)

[Results chart]

ISPA 2008 APDCT Workshop 35

Introduction, Algorithms, Experiments, Conclusions and Future work

ISPA 2008 APDCT Workshop 36

Conclusions and Future work

Results showed that it was possible to achieve improvements with both AG and MQD with respect to RR.

Experiments validate MQD simulation results found in the literature.

Reinforcement learning is a promising technique to classify resources in real grid environments.

ISPA 2008 APDCT Workshop 37

Conclusions and Future work

Study the behavior of AG and MQD with other kinds of applications, e.g., data-intensive applications or applications with dependencies.

ISPA 2008 APDCT Workshop 38

Questions?

ISPA 2008 APDCT Workshop 39

Annex

ISPA 2008 APDCT Workshop 40

Definitions

Resource manager: a system that manages the submission and execution of jobs within a specific domain.

Resource Management System (RMS): a synonym for resource manager.

Batch job scheduler: the typical scheduler of an RMS. Examples: SGE, PBS/Torque.

ISPA 2008 APDCT Workshop 41

Definitions

Meta-scheduler: a scheduler that has no direct access to the resources, only to the RMSs that manage them.

Reinforcement learning: a technique that induces an agent to make decisions by means of offered rewards.

Makespan: the total time a meta-scheduler takes to finish the execution of a set of jobs assigned to it.
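For example, if a set of jobs is handed to the meta-scheduler at t = 0 and its last job finishes at t = 42 minutes, the makespan is 42 minutes.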

ISPA 2008 APDCT Workshop 42

Definitions

Job: an application submitted to the grid by a user, generally executed by an RMS. Examples of job types:
- Bag-of-Tasks: jobs that have no explicit dependency or precedence relation among themselves.
- Parameter sweep (APST): jobs of the same executable that differ only in an input value that varies between executions.

