INDIGO-DataCloud (RIA-653549)

Improving IaaS resources to accommodate scientific applications

Alvaro Lopez Garcia (1), Patrick Fuhrmann, Giacinto Donvito, Andrea Chierici

(1) [email protected], Spanish National Research Council - CSIC

October 9, 2015

Outline

INDIGO-DataCloud project

Resource Virtualization Task
– Architecture and architectural choices
– Container execution support
– Scheduling improvements
– TOSCA-based orchestration

Progress

INDIGO-DataCloud Project

• H2020 project approved in January 2015 in the EINFRA-1-2014 call.

• Who: 26 European partners from 11 EU countries.

• What: develop an open source cloud platform for computing and data (DataCloud).

• For: multi-disciplinary communities.

• Where: deployable on hybrid (public or private) cloud infrastructures.

• Why: requirements coming from 11 different communities.

http://indigo-datacloud.eu

INDIGO key challenges

• (The very hard task of) Collecting and consolidating evolving user requests.

• Creation of a new sustainable cloud competence in Europe for PaaS, for both the scientific and industrial sectors, similar to what OpenStack and OpenNebula have done for IaaS.

• Many technology gaps, for example: storage QoS, PaaS standardization, distributed AuthZ, static allocation of hardware resources, data sharing, customizable application portals.

(some) High-level IaaS technology gaps

• Existing lock-ins.

– Vendor lock-in.
– Hypervisor or technology lock-in.

• Naive resource allocation strategies (i.e. scheduling).

– Prevents users from exploiting the resources effectively.
– Makes it difficult for providers to partition the resources.
– Leads to under-utilization of resources.

• Inflexible way of distributing applications.

– Difficult to port applications between providers.
– Different interfaces, different APIs.

• Lack of access to specialized hardware (e.g. InfiniBand).

Project organization

• WP2: Community requirements.

• WP3: SW management and pilot services.

• WP4: Enhanced IaaS.

• WP5: PaaS design (leverages WP4).

• WP6: Portal and user access (leverages WP5).

WP4.1 Description

From the DoW

Extend the features and functionality of IaaS cloud middleware to support the reliable management of a performant virtual computing infrastructure for users (WP6) and the PaaS layer (WP5).

• Enable the use of containers for executing the user workloads on the e-infrastructure.

• Provide local site orchestration, simplifying lifecycle management of the virtual computing infrastructure.

• Improve the performance of the cloud management platforms by designing and implementing novel scheduling mechanisms and policies at the site level.

• Integrate novel existing solutions from the field of authentication and authorization.

• Provide a discovery, availability and accounting service for the virtualized infrastructure.

WP4.1 tasks and areas overview

• Subtasks T4.1.1/2/3: Providing support for containers.
1. Develop/extend container support.
2. Integration of trusted repositories for containers.
3. Extend relevant IaaS standard interfaces.

• Subtask T4.1.4: Improve the existing cloud schedulers.
a. Fair-share scheduling.
b. Spot instances.

• Subtask T4.1.5: Integration of container execution in batch systems and explore access to InfiniBand and GPGPUs.

• Subtask T4.1.6: Provide local IaaS site orchestration using standards (i.e. TOSCA).

WP4.1 architecture

General architectural choices

Interfaces

• Standards-based whenever possible (OCCI, OASIS TOSCA).

• Native interfaces may be used in the short term (so that a gap does not hinder the development).

• In the long term standards should be enforced.

Cloud Middleware Frameworks

• Support for OpenStack and OpenNebula.

Containers

• Initial work will leverage Docker.

• Open to consider any other technology in the mid-term.

Container execution support

Work on three interlinked topics for IaaS container support:

1. Container support in Cloud Management Frameworks (CMFs): OpenStack and OpenNebula.

2. Integration of repositories.

3. IaaS standard interfaces extension.

Plus two extra topics, also regarding container execution, but in a different area:

1. Execution of containers in Batch Systems.

2. Explore access to GPGPUs and InfiniBand resources.

IaaS Container Execution support I

Container support in CMFs

• Docker is the container technology chosen.

• Execute containers as first-class resources.

• OpenStack
– Existing nova-docker driver, follow up development.
– Evaluate Magnum project: deploy containers on top of OpenStack (still under development, implies an API change).

• OpenNebula
– Driver available only for LXC.
– No Docker driver, we will develop one, similar to nova-docker.
– Extend ONE where needed to support containers (e.g. datastore, onetemplate, etc.)
– Evaluate Megam project, similar to OpenStack Magnum.

IaaS Container Execution support II

Integration of container repositories

• Automatic sync with INDIGO container repository (DockerHub).

• Same baseline of images in all resource providers.

• Integration with OpenStack Glance and OpenNebula oneimage.

IaaS standard interfaces extension

• OCCI is being considered as the IaaS standard interface.

• Develop needed extensions regarding container support.

Batch System container integration I

• Let users run containers in existing batch facilities.

• Approach 1: Integration of execution in batch systems

– A bit intrusive.
– Based on prologue and epilogue scripts, plus a server and client for actually launching the containers.

• Approach 2: Running in user-space

– Less intrusive.
– Implementing proof of concept.

• Exploring the possibility of accessing GPGPUs and InfiniBand resources within a container.
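As a rough illustration of Approach 1, a prologue script could assemble a `docker run` command line for the job. The slides do not describe the actual launcher, so the sketch below is only an assumption: the function name, image name, scratch path and device path are hypothetical, while `--rm`, `--user`, `--volume`, `--workdir` and `--device` are standard `docker run` options.

```python
def docker_run_argv(image, command, uid, gid, workdir, devices=()):
    """Build (but do not execute) a docker run command for a batch job.

    Hypothetical helper: the job runs as the submitting user, sees only its
    own working directory, and optionally gets host devices passed through.
    """
    argv = [
        "docker", "run", "--rm",              # remove the container when the job ends
        "--user", f"{uid}:{gid}",             # run as the batch user, not root
        "--volume", f"{workdir}:{workdir}",   # expose the job's working directory
        "--workdir", workdir,
    ]
    for device in devices:
        # Pass a host device (e.g. an InfiniBand or GPU device node) into the container.
        argv += ["--device", device]
    argv.append(image)
    argv += list(command)
    return argv


argv = docker_run_argv(
    image="indigo/app:latest",                # hypothetical image name
    command=["./run.sh"],
    uid=1000,
    gid=1000,
    workdir="/scratch/job42",                 # hypothetical job scratch directory
    devices=["/dev/infiniband/uverbs0"],      # example device node, site dependent
)
print(" ".join(argv))
```

A prologue would hand such a list to the launching service; the matching epilogue would clean up anything the container left behind.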

Scheduling problems

• Current cloud scheduling is focused on satisfying industry needs.

• Schedulers are too simple: no priorities, static partitioning, static quotas.

• From a resource provider’s point of view:
– Difficult to partition resources.
– Compromise between using the infrastructure as much as possible and giving enough share to each group.
– Difficult to get a good utilization of the infrastructure (e.g. backfilling not possible).
– Difficult to prioritize workloads from different users/groups.

• From a user’s point of view:
– Difficult to exploit resources: limited quota even if there are free resources.
– Opportunistic usage not possible.

• Two complementary mechanisms: fair-sharing and spot or preemptible instances.

Fair-share scheduling I

• Currently, cloud instances are executed without a defined duration.

• Scheduling is on a first-come, first-served basis.

• Difficult for resource providers to prioritize tasks.

• An instance is scheduled or fails: it cannot be queued.

• A user has a fixed static quota that does not take into account past usage.

• Develop a new service: Synergy.

Fair-share scheduling II

Synergy

• Implemented as an external service that interacts with the CMF.
– No need to modify the middleware.
– No new states are introduced.
– Non-fairshare users are not aware of it.

• Based on the Slurm multifactor priority strategy.

• Synergy intercepts users’ requests.

• Instances are:
– associated with a fixed wall time.
– queued according to previous usage (i.e. fair-share).
– terminated when the wall time expires.

• Aiming for the integration of Synergy into the OpenStack ecosystem.
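The queueing idea can be sketched in a few lines. This is not Synergy's code: the class and method names are hypothetical, and only the fair-share term of a multifactor priority is modelled, using the classic Slurm-style factor 2^(-usage/share) with usage and share expressed as normalized fractions of the site.

```python
import heapq


def fair_share_factor(usage, share):
    """Classic Slurm-style fair-share factor: 2 ** (-usage / share).

    Both arguments are normalized fractions (0..1) of the whole site.
    A project that used nothing gets 1.0; one that used exactly its share gets 0.5.
    """
    if share <= 0:
        return 0.0
    return 2.0 ** (-usage / share)


class FairShareQueue:
    """Hypothetical queue: requests wait here instead of failing when the site is full."""

    def __init__(self, shares, usage):
        self.shares = shares    # project -> normalized share
        self.usage = usage      # project -> normalized past usage
        self._heap = []
        self._seq = 0           # tie-breaker keeps submission order

    def submit(self, project, name, wall_time_s):
        # Priority is computed once at submission; a real service would
        # recompute it periodically as usage decays.
        priority = fair_share_factor(self.usage[project], self.shares[project])
        heapq.heappush(self._heap, (-priority, self._seq, name, wall_time_s))
        self._seq += 1

    def pop(self):
        """Return (name, wall_time_s) of the highest-priority queued request."""
        _, _, name, wall_time_s = heapq.heappop(self._heap)
        return name, wall_time_s


queue = FairShareQueue(
    shares={"astro": 0.5, "bio": 0.5},
    usage={"astro": 0.8, "bio": 0.1},   # astro has consumed far more than bio
)
queue.submit("astro", "a1", wall_time_s=3600)
queue.submit("bio", "b1", wall_time_s=3600)
first = queue.pop()
print(first)
```

Although the astro request was submitted first, the bio request is dequeued first because that project has consumed less of its share. The wall time travels with the request so that the instance can be terminated once it expires.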

Fair-share scheduling III

Spot or preemptible instances support

• A spot instance is a special kind of instance that can be terminated by a higher priority task.

• Priority can be determined by a bid price (spot market), a fair-sharing algorithm or simply considering spot vs. non-spot.

• Useful for opportunistic usage: get as much as you want, but be aware that you can be killed.

• Aiming to introduce basic support for spot instances into OpenStack core (blueprint submitted).

• Policies implemented using external services (e.g. bid price) and scheduler weighers.
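The simplest policy mentioned above, spot vs. non-spot, can be illustrated with a small selection routine. It is a sketch under stated assumptions, not the blueprint's design: the `Instance` type and `select_victims` function are hypothetical, only vCPUs are considered, and terminating the largest spot instances first is just one arbitrary choice.

```python
from dataclasses import dataclass


@dataclass
class Instance:
    name: str
    vcpus: int
    spot: bool


def select_victims(instances, host_vcpus, requested_vcpus):
    """Pick spot instances to terminate so that a normal request fits on a host.

    Returns [] if the request already fits, a list of victims if terminating
    them frees enough vCPUs, or None if even removing every spot instance
    would not be enough.
    """
    free = host_vcpus - sum(i.vcpus for i in instances)
    if free >= requested_vcpus:
        return []
    victims = []
    spot_instances = [i for i in instances if i.spot]
    # Largest first, to keep the number of terminated instances small.
    for inst in sorted(spot_instances, key=lambda i: i.vcpus, reverse=True):
        victims.append(inst)
        free += inst.vcpus
        if free >= requested_vcpus:
            return victims
    return None


running = [
    Instance("n1", vcpus=4, spot=False),
    Instance("s1", vcpus=1, spot=True),
    Instance("s2", vcpus=3, spot=True),
]
victims = select_victims(running, host_vcpus=8, requested_vcpus=2)
print([v.name for v in victims])
```

A bid-price or fair-share policy would only change the sort key used to order the candidate victims.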

Spot instances flowchart

Partition Director

• Resource providers operate different infrastructures.

• Shares and quotas have to be configured independently.

• Develop a new service to adjust them dynamically, so that the share takes into account the whole infrastructure (cloud, batch).
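The slides do not say how the adjustment is computed, so the following is only one possible reading: a group is entitled to a fraction of the whole site, and that entitlement is split between cloud and batch according to where the group currently has demand. The function name, parameters and numbers are all hypothetical.

```python
def rebalance_quotas(global_share, capacity, demand):
    """Split one group's site-wide entitlement across infrastructures.

    global_share: fraction (0..1) of the whole site the group is entitled to.
    capacity:     cores per infrastructure, e.g. {"cloud": 400, "batch": 600}.
    demand:       cores the group currently asks for on each infrastructure.
    Returns the cores granted on each infrastructure.
    """
    total_capacity = sum(capacity.values())
    entitlement = round(global_share * total_capacity)
    total_demand = sum(demand.values())
    quotas = {}
    for name, cores in capacity.items():
        if total_demand == 0:
            # No demand anywhere: fall back to a split proportional to capacity,
            # which is what independent static configuration would give.
            quota = entitlement * cores // total_capacity
        else:
            # Follow the demand, so an idle batch share can be spent on cloud.
            quota = entitlement * demand.get(name, 0) // total_demand
        quotas[name] = min(quota, cores)
    return quotas


capacity = {"cloud": 400, "batch": 600}
static = rebalance_quotas(0.2, capacity, demand={})
dynamic = rebalance_quotas(0.2, capacity, demand={"cloud": 150, "batch": 50})
print(static, dynamic)
```

With independent static configuration the group is capped at 80 cloud cores; when the share is computed over the whole infrastructure, the same 200-core entitlement can follow the demand.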

Orchestration architecture

IaaS orchestration

• Provide IaaS orchestration using the OASIS TOSCA language (used at different levels within INDIGO).

• OpenStack
– OpenStack Orchestration (Heat) as the IaaS orchestration engine.
– TOSCA-translator at the CLI level.
– Aiming to make Heat accept TOSCA requests directly: discussions with OpenStack devs, blueprint being drafted.

• OpenNebula
– No native orchestration component available.
– Use the Infrastructure Manager (IM) [1].
– Adapt TOSCA-translator to produce RADL templates (IM native language).

• The IM makes it possible to use any other existing CMF that speaks OCCI, as well as commercial providers.

[1] http://www.grycap.upv.es/im/index.php
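To give a feel for what a translation step does, the sketch below maps a TOSCA `tosca.nodes.Compute` node, given as an already parsed dictionary, to a Heat `OS::Nova::Server` resource. It is not the TOSCA-translator's API: the function names, the flavor catalogue and the image mapping are hypothetical, and only the `num_cpus`, `mem_size` and OS `distribution` properties are handled.

```python
# Hypothetical site catalogue: (flavor name, vCPUs, RAM in MB), smallest first.
FLAVORS = [
    ("m1.small", 1, 2048),
    ("m1.medium", 2, 4096),
    ("m1.large", 4, 8192),
]
# Hypothetical mapping from TOSCA OS distribution to an image name.
IMAGES = {"ubuntu": "ubuntu-14.04"}


def mem_to_mb(mem_size):
    """Convert a TOSCA scalar-unit such as '4 GB' to MB (TOSCA GB = 1000 MB)."""
    value, unit = mem_size.split()
    return int(value) * {"MB": 1, "GB": 1000}[unit]


def pick_flavor(num_cpus, mem_mb):
    """Return the smallest flavor that satisfies the requested CPU and memory."""
    for name, vcpus, ram_mb in FLAVORS:
        if vcpus >= num_cpus and ram_mb >= mem_mb:
            return name
    raise ValueError("no flavor large enough")


def tosca_to_hot(tosca):
    """Translate the Compute nodes of a parsed TOSCA template into a HOT dict."""
    resources = {}
    for name, node in tosca["topology_template"]["node_templates"].items():
        if node["type"] != "tosca.nodes.Compute":
            continue  # this sketch ignores every other node type
        host = node["capabilities"]["host"]["properties"]
        os_props = node["capabilities"]["os"]["properties"]
        resources[name] = {
            "type": "OS::Nova::Server",
            "properties": {
                "flavor": pick_flavor(host["num_cpus"], mem_to_mb(host["mem_size"])),
                "image": IMAGES[os_props["distribution"]],
            },
        }
    return {"heat_template_version": "2013-05-23", "resources": resources}


tosca = {
    "topology_template": {
        "node_templates": {
            "server": {
                "type": "tosca.nodes.Compute",
                "capabilities": {
                    "host": {"properties": {"num_cpus": 2, "mem_size": "4 GB"}},
                    "os": {"properties": {"distribution": "ubuntu"}},
                },
            }
        }
    }
}
hot = tosca_to_hot(tosca)
print(hot["resources"]["server"])
```

Targeting RADL for the Infrastructure Manager would follow the same pattern with a different output structure.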

Where we are now I

Mainly finalizing the design phase for all areas, and starting implementation.

Container support

• Starting implementation of ONE Docker support.

• Deploying and evaluating nova-docker sites.

• Evaluating extension requirements for OCCI interface.

Where we are now II

Scheduling

• Integrating Synergy as an external OpenStack project.

• Submitted spot-instances blueprint for discussion in OpenStack [2].

Orchestration

• Discussions regarding TOSCA support with the OpenStack community.

• Drafting blueprint for submission in OpenStack.

[2] https://review.openstack.org/#/c/104883/

Thanks

Indigo Web: http://indigo-datacloud.eu

[email protected]
