Budapest University of Technology and Economics, Department of Measurement and Information Systems, Fault Tolerant Systems Research Group
Ágnes Salánki, Gergő Kincses, László Gönczy, Imre Kocsis

Capacity Planning and Performance Management for VCL Clouds


Page 1: Capacity Planning and Performance Management for VCL Clouds

Budapest University of Technology and Economics
Department of Measurement and Information Systems
Fault Tolerant Systems Research Group

Capacity Planning and Performance Management for VCL Clouds

Ágnes Salánki, Gergő Kincses, László Gönczy, Imre Kocsis

Page 5: Capacity Planning and Performance Management for VCL Clouds

Motivation

Lab

Private university cloud

Enterprise cloud

Purchased CPU time

Page 6: Capacity Planning and Performance Management for VCL Clouds

Our VCL cloud

Maintained by our research group

5 semesters

o 2 courses/semester

9 hosts

~20 000 reservations

o Only 22 rejected

Page 7: Capacity Planning and Performance Management for VCL Clouds

Reservation Workflow in VCL

Request

o VM type

o Length

o Immediately or later

Page 10: Capacity Planning and Performance Management for VCL Clouds

Reservation Workflow in VCL

Request

o VM type

o Length

o Immediately or later

Hard reservation limit

Load time
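
The workflow above can be sketched as a simple admission check. This is a minimal illustration, not VCL's actual implementation: the `Reservation` fields, the `HARD_LIMIT` and `LOAD_TIME_MIN` values, and the `admit` function are hypothetical, and the sketch assumes that the hard reservation limit caps the number of concurrent reservations and that loading starts at the requested start time.

```python
from dataclasses import dataclass

# Hypothetical values, chosen only for illustration.
HARD_LIMIT = 3        # assumed cap on concurrent reservations
LOAD_TIME_MIN = 10    # assumed time needed to load a VM, in minutes


@dataclass
class Reservation:
    vm_type: str      # requested VM type
    start_min: int    # requested start, minutes from now (0 = immediately)
    length_min: int   # requested length in minutes

    @property
    def busy_from(self) -> int:
        # Assumption: resources are occupied as soon as loading starts.
        return self.start_min

    @property
    def busy_until(self) -> int:
        # Assumption: the load time is added on top of the requested length.
        return self.start_min + LOAD_TIME_MIN + self.length_min


def admit(request, accepted, hard_limit=HARD_LIMIT):
    """Return True if accepting the request never exceeds the hard limit."""
    # Concurrency can only rise at the start of a reservation, so it is
    # enough to check the request start and every start inside its interval.
    points = [request.busy_from] + [
        r.busy_from
        for r in accepted
        if request.busy_from <= r.busy_from < request.busy_until
    ]
    for t in points:
        active = sum(1 for r in accepted if r.busy_from <= t < r.busy_until)
        if active + 1 > hard_limit:
            return False
    return True
```

With three reservations already running, a fourth overlapping request is rejected under these assumptions, while a request that starts after they have ended is accepted.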

Page 17: Capacity Planning and Performance Management for VCL Clouds

Capacity Planning in VCL

Can I start solving my homework now?

Do we have spare capacity for my research next week?

I am responsible for a course with 250 students next September. Can we handle this workload?

[Figure: people labeled Aron, Imre, Agnes, Gergo, Laszlo]

Page 18: Capacity Planning and Performance Management for VCL Clouds

Capacity Planning in VCL

Support for hard limit estimation

Spare capacity prediction

Long-term capacity planning/scheduling

Page 19: Capacity Planning and Performance Management for VCL Clouds

The Available Dataset

[Figure: Host1 and Host2, each running VM1 and VM2, annotated with the available data]

o reservation type, time to load, etc.

o cpu usage, memory usage, etc. (labeled twice in the figure)

o deadlines, #students

Page 20: Capacity Planning and Performance Management for VCL Clouds

Data Analysis Steps

[Figure: Host1 and Host2, each running VM1 and VM2]

Workload prediction

Resource utilization prediction

Page 24: Capacity Planning and Performance Management for VCL Clouds

Workflow

Workload prediction

Resource utilization prediction

Capacity planning

Use case 1

Use case 2

Use case 3

[Figure: Host1 and Host2, each running VM1 and VM2; a host running VMs, with question marks marking the open capacity questions]

Page 26: Capacity Planning and Performance Management for VCL Clouds

Workload prediction

Deadline

Page 28: Capacity Planning and Performance Management for VCL Clouds

Workload prediction

Daily workload follows a Gaussian-like distribution

Page 29: Capacity Planning and Performance Management for VCL Clouds

Model fitting

Page 30: Capacity Planning and Performance Management for VCL Clouds

Workload prediction

Daily workload follows a Gaussian-like distribution

Exponential increase in peak numbers

Location of the maximum: between 7 PM and 11 PM

Standard deviation: ~4 hours
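
The fitted shape can be written down as a small function. This is a sketch only: the slides give neither the fitted parameter values nor the exact model form, so the peak hour of 21 (9 PM, inside the reported range), the 4-hour standard deviation used as defaults, the wrap-around at midnight, the day index for the exponential trend, and all function names are assumptions made for illustration.

```python
import math


def daily_profile(hour, peak, mu=21.0, sigma=4.0):
    """Gaussian-like workload at a given hour of the day (0-24)."""
    # Assumption: distance to the peak wraps around midnight,
    # so 1 AM is 4 hours away from a 9 PM peak.
    d = abs(hour - mu) % 24
    d = min(d, 24 - d)
    return peak * math.exp(-d ** 2 / (2 * sigma ** 2))


def peak_on_day(day, first_peak, growth_rate):
    """Exponentially increasing daily peak: first_peak * exp(growth_rate * day)."""
    return first_peak * math.exp(growth_rate * day)


def fit_growth_rate(peaks):
    """Least-squares slope of log(peak) against the day index 0, 1, 2, ..."""
    n = len(peaks)
    xs = list(range(n))
    ys = [math.log(p) for p in peaks]
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    num = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    den = sum((x - x_mean) ** 2 for x in xs)
    return num / den
```

Fitting the growth rate on the logarithm of the daily peaks turns the exponential trend into a straight line, which keeps the sketch free of external libraries.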

Page 35: Capacity Planning and Performance Management for VCL Clouds

Workload prediction

Students work even in the night

They have lunch and dinner

They skip their lectures

Page 36: Capacity Planning and Performance Management for VCL Clouds

Workload prediction

Changes in students’ behavior?

Page 38: Capacity Planning and Performance Management for VCL Clouds

Resource Utilization Prediction

$f(\mathit{Workload})$??

Page 39: Capacity Planning and Performance Management for VCL Clouds

Challenges

It is a cloud

o Statistical multiplexing

o Workload is not uniformly distributed

Warm spare

Different response to the same load, e.g. for memory

[Figure panels by semester: 2012/2013/2, 2013/2014/1, 2013/2014/2, 2014/2015/1]

Page 44: Capacity Planning and Performance Management for VCL Clouds

Challenges

It is a cloud

Hosts show different behavior

o Warm spare

o Different user behavior

o ???

[Figure panels: Host1, Host2]

Page 48: Capacity Planning and Performance Management for VCL Clouds

Resource utilization analysis: memory

Linear model

oπ‘€π‘’π‘š(𝑉𝑀1) + π‘€π‘’π‘š(𝑉𝑀2) + … + π‘€π‘’π‘š(π‘šπ‘”π‘šπ‘‘)

o Weighted by the workload

Very good at following drastic changes

Within 5% by the 97% of time
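
The linear memory model and the accuracy figure above can be illustrated as follows. This is a minimal sketch with made-up numbers: the per-VM-type memory footprints, the management overhead, and the function names are hypothetical. It reads "weighted by the workload" as multiplying each VM type's footprint by the number of running VMs of that type, and it measures the 5% band relative to the measured value; both readings are assumptions.

```python
def predict_host_memory(vm_counts, mem_per_vm, mem_mgmt):
    """Host memory = management overhead + sum of per-VM footprints."""
    return mem_mgmt + sum(
        count * mem_per_vm[vm_type] for vm_type, count in vm_counts.items()
    )


def fraction_within(predicted, measured, tolerance=0.05):
    """Share of samples whose prediction is within `tolerance` of the measurement."""
    hits = sum(
        1 for p, m in zip(predicted, measured) if abs(p - m) <= tolerance * m
    )
    return hits / len(measured)


# Hypothetical footprints in GB, for illustration only.
mem_per_vm = {"VM1": 2.0, "VM2": 4.0}
example = predict_host_memory({"VM1": 3, "VM2": 1}, mem_per_vm, mem_mgmt=1.5)
```

A result such as the one reported on the slide would correspond to `fraction_within(...)` returning about 0.97 over the observed time series.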

Page 51: Capacity Planning and Performance Management for VCL Clouds

Resource utilization analysis: CPU

Linear model

o πΆπ‘ƒπ‘ˆ(𝑉𝑀1) + πΆπ‘ƒπ‘ˆ(𝑉𝑀2) + … + πΆπ‘ƒπ‘ˆ(π‘šπ‘”π‘šπ‘‘)

o Weighted by the workload CPU is much more sensitive than memory

Page 54: Capacity Planning and Performance Management for VCL Clouds

Resource utilization analysis: CPU

Linear model

o πΆπ‘ƒπ‘ˆ(𝑉𝑀1) + πΆπ‘ƒπ‘ˆ(𝑉𝑀2) + … + πΆπ‘ƒπ‘ˆ(π‘šπ‘”π‘šπ‘‘)

o Weighted by the workload

The students usethe CPU more

intensively beforethe deadline

Page 55: Capacity Planning and Performance Management for VCL Clouds

Resource utilization analysis: CPU

Linear model

o πΆπ‘ƒπ‘ˆ(𝑉𝑀1, π’˜π’) + πΆπ‘ƒπ‘ˆ(𝑉𝑀2, π’˜π’) + …

o Weighted by the workload
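
One way to read the revised formula is that each VM's CPU contribution depends on the workload level as well as on the VM type. The sketch below follows that reading only as an illustration: the two workload levels, the coefficient table, the management term carried over from the earlier formula, and `predict_host_cpu` are all hypothetical, and the numbers are invented.

```python
# Hypothetical CPU cost (percent of a host) per VM type and workload level.
CPU_PER_VM = {
    ("VM1", "normal"): 5.0,
    ("VM1", "pre_deadline"): 12.0,
    ("VM2", "normal"): 8.0,
    ("VM2", "pre_deadline"): 20.0,
}
CPU_MGMT = 3.0  # assumed constant management overhead


def predict_host_cpu(vm_types, workload_level):
    """Sum workload-dependent CPU terms for the VMs running on one host."""
    return CPU_MGMT + sum(
        CPU_PER_VM[(vm_type, workload_level)] for vm_type in vm_types
    )


normal = predict_host_cpu(["VM1", "VM2"], "normal")
busy = predict_host_cpu(["VM1", "VM2"], "pre_deadline")
```

The same set of VMs yields a higher predicted CPU load close to a deadline, which matches the observation that students use the CPU more intensively before the deadline.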

Page 57: Capacity Planning and Performance Management for VCL Clouds

Summary

Data-driven static capacity planning

o "user behavior" analysis

o resource fingerprint estimation

Conclusions:

o student behavior can be modelled easily

o we were sometimes (too) strict

Dynamic capacity planning?

o Long loading time → failed reservations soon

o When to burst out to a public cloud?