SLA data management criteria presentation

Scalable Data Management Workshop, IEEE BigData Conference, Santa Clara, US, 6-9 October 2013

Page 1: SLA data management criteria presentation

SLA data management criteria Katerina Stamou, Verena Kantere, Jean-Henry Morin

Institute of Services Science, University of Geneva, Switzerland

Scalable Cloud Data Management workshop, IEEE BigData Conference, Santa Clara, US

Page 2: SLA data management criteria presentation

In a nutshell…

The systematic management of SLA data is required, as it increases SLA and service manipulation opportunities in the cloud computing setting. Thus, it contributes to additional business value in a service-oriented economy.

The term SLA data management covers data operations that may take place before, during or after SLA/service execution.

We propose that the systematic management of SLAs can be efficiently achieved using a digraph data model that perceives SLA elements and their data relations as an operational pipeline.

Page 3: SLA data management criteria presentation

Agenda

Systematic SLA data management

Current SLA role in virtual economies

SLA data complexity

SLA data analysis

SLA digraph data model

Ongoing work

Page 4: SLA data management criteria presentation

Definitions and assumptions

Service Level Agreements (SLAs) express mutually agreed service levels between providers and customers [1].

SLAs define quality of service (QoS) criteria, along with functional service properties.

The definition and structure of SLAs for cloud computing services are not yet standardized.

The term “systematic SLA data management” describes the process of SLA formulation, storage and processing by a backend supporting data-store or DBMS.

SLAs are automated and cloud providers use automated processing systems for the management of their offered services.

SLA templates can be used as what-you-see-is-what-you-get (WYSIWYG) artifacts that customers use to negotiate and finalize their service selection.

Page 5: SLA data management criteria presentation

Systematic SLA data management

Automated formulation: using a modular and adaptive data structure that addresses SLA data intricacies.

Storage: finding the optimal storage mode for, typically, short-term data.

Processing: SLA information contains internal dependencies and internal functions that take place during service execution.

SLA information is not BigData; it is about managing and processing complex information that may result in or involve operations on massive data-sets.

SLAs represent semi-structured or even unstructured data, where no rigid schema applies. Thus, an efficient data model is required to allow for dynamic data processing.

Page 6: SLA data management criteria presentation

SLA anatomy - Web Service Level Agreement (WSLA), IBM [2]

Signatories, third parties: customer-provider pair and their connections to third party support for the service execution.

Service description: decomposition and hierarchical classification of service objects, whose accumulation or combination constitutes the service definition.

Guarantees: obligations, typically on the provider's part, to fulfill agreed and promised levels of service provisioning. IBM distinguishes between measurable targets (objectives) and predefined actions that occur during the service up-time.

Page 7: SLA data management criteria presentation

Challenges for SLA manipulation in a cloud service economy

The SLA definition provides an explicit view on how the service provisioning is planned. It indicates precise bounds on service levels that a provider can afford.

1. SLAs as automated processes versus static documents that currently appear in cloud marketplaces.

2. Diversified service offerings, various vocabularies of service descriptions => SLA semantic and structural heterogeneity.

3. SLA formulation depends on resource availability and is typically subject to customer-provider variations. Given their heterogeneity and unbounded length, SLAs are a good example of semi-structured information that needs concurrent processing over distributed computing settings.

Page 8: SLA data management criteria presentation

SLA data complexity stems from:

Heterogeneity of data format and structure

Service dependencies between internal SLA components. Service dependencies within an SLA lifecycle can be thought of as actions that have to occur when a predefined condition is triggered.

Real time measurement/updates: internal SLA components may be used for the definition and computation of other SLA components that typically reside within the same SLA instance. Data relationships may deal with monitoring and measurement of values that are described by data end-point sources. Data connections may also deal with updates of SLA component values that depend on the values of neighboring SLA components. A persisted SLA instance needs to be accessible to both external sources and DBMS internal processes.

Page 9: SLA data management criteria presentation

SLA data analysis I

The term SLA data management covers all data operations that may take place before, during or after SLA/service execution. Such operations can be classified according to pre-instantiated, active and terminated SLAs. They typically include fine-grained SLA elements that need dynamic processing.

Unlike other types of service contracts (e.g. terms-and-conditions, software licences), the values of SLA terms need to be monitored and measured during service execution to verify that SLOs are met and that no service violations have occurred.

The requirement for real-time data updates particularly applies in the cloud computing setting, where services are exchanged on demand and business relationships may enclose financial responsibilities.

Nested SLA information may include dependencies between diverse components or component sets (e.g. a change in an SLA parameter value may affect respective SLO values).
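As a rough illustration of the points above, the sketch below checks measured values against SLOs for an active SLA. It is plain Python written for this note, not the authors' implementation: the class names, metric names and thresholds are all hypothetical.

```python
from dataclasses import dataclass
from enum import Enum


class SlaState(Enum):
    """Lifecycle classes named on the slide."""
    PRE_INSTANTIATED = "pre-instantiated"
    ACTIVE = "active"
    TERMINATED = "terminated"


@dataclass
class Slo:
    """A service level objective on one metric (hypothetical structure)."""
    metric: str
    threshold: float
    higher_is_better: bool = True

    def is_met(self, measured: float) -> bool:
        if self.higher_is_better:
            return measured >= self.threshold
        return measured <= self.threshold


def find_violations(state, slos, measurements):
    """Return the metrics whose SLO is violated; only active SLAs are monitored."""
    if state is not SlaState.ACTIVE:
        return []
    return [
        slo.metric
        for slo in slos
        if slo.metric in measurements and not slo.is_met(measurements[slo.metric])
    ]


# Hypothetical objectives and monitored values.
slos = [
    Slo("availability_pct", 99.9),
    Slo("response_ms", 200.0, higher_is_better=False),
]
measurements = {"availability_pct": 99.95, "response_ms": 250.0}

violations = find_violations(SlaState.ACTIVE, slos, measurements)
print(violations)  # ['response_ms']
```

The check has to be repeated whenever a monitored value changes, which is where the real-time update requirement comes from.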

Page 10: SLA data management criteria presentation

SLA data analysis II

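Data criteria            | SLA parameter | Metrics | Measurable objectives | Action guarantees | Complete SLA doc
accessibility, integrity | ✔             | ✔       | ✔                     | ✔                 | ✔
velocity rate            | high          | high    | high                  | low               | ~
cleanness                | ✔             | ✔       | ✔                     | ✔                 | ✔
accuracy                 | ✔             | ✔       | ✔                     | ✔                 | ✔
ownership, authenticity  | ✔             | ✔       | ✔                     | ✔                 | ✔
heterogeneity            | ✔             | ✔       | ✔                     | ✔                 | ✔

replication, staging: ✔ ✔ ✔ (three of the five columns are marked; the column positions are not recoverable from the extracted text)

dependencies: ✔ ✔ ✔ ✔ (four of the five columns are marked; the column positions are not recoverable from the extracted text)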

Page 11: SLA data management criteria presentation

SLA digraph formalization

Page 12: SLA data management criteria presentation

SLA into property graph

According to [5], a property graph G=(V, E, ) represents a directed, attributed, edge-labeled graph that contains multi-relations, which are expressed as key/value pairs on the graph elements. The computing structure is the graph and the computing process consists of the graph traversals.

The SLA digraph representation includes only unidirectional edges to denote the flow of dependencies within any SLA “pipeline”.

Three immediate advantages:

Modular: decomposable, flexible structure, extensible

Adaptive: with respect to diversified service environments, inclusion/exclusion of additional elements

Dynamic: concurrent execution of operations and transactions within the same or multiple graphs.
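The property-graph idea described above can be sketched in a few lines of plain Python. This is a minimal illustration, not the NetworkX or Titan implementation mentioned in the conclusions: the class `PropertyDigraph`, the property keys and the `SLO -> CompositeMetric` edge are assumptions made for the example.

```python
from collections import defaultdict


class PropertyDigraph:
    """Directed, attributed, edge-labeled graph with key/value properties."""

    def __init__(self):
        self.nodes = {}                     # node id -> property dict
        self.out_edges = defaultdict(list)  # node id -> [(label, target, props)]

    def add_node(self, node_id, **props):
        self.nodes[node_id] = props

    def add_edge(self, source, label, target, **props):
        if source not in self.nodes or target not in self.nodes:
            raise KeyError("both endpoints must exist before adding an edge")
        self.out_edges[source].append((label, target, props))

    def traverse(self, start):
        """Depth-first traversal along outgoing edges; yields each reachable node once."""
        seen, stack = set(), [start]
        while stack:
            node = stack.pop()
            if node in seen:
                continue
            seen.add(node)
            yield node
            # reversed() so that edges are followed in insertion order
            stack.extend(t for _, t, _ in reversed(self.out_edges[node]))


g = PropertyDigraph()
g.add_node("ActionGuarantee", kind="guarantee")
g.add_node("SLO", kind="objective", threshold=99.9)
g.add_node("CompositeMetric", kind="metric")
g.add_node("ResourceMetric", kind="metric", source="monitoring endpoint")

# Edges point from the dependent element to the element it depends on.
g.add_edge("ActionGuarantee", "uses", "SLO")
g.add_edge("SLO", "uses", "CompositeMetric")  # assumed link, for illustration
g.add_edge("CompositeMetric", "uses", "ResourceMetric")

order = list(g.traverse("ActionGuarantee"))
print(order)  # ['ActionGuarantee', 'SLO', 'CompositeMetric', 'ResourceMetric']
```

Adding or removing an SLA element touches only its own node and edges, which is the modular and adaptive behaviour listed above.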

Page 13: SLA data management criteria presentation

SLA dependencies

According to [3], service dependencies represent customer/provider relationships that are reflected in the various cooperating components within a distributed service management system.

A dependency denotes the directed relationship between a dependent service or application component that requires an operation performed by an antecedent component in order for the former to execute its function.

SLA data elements are connected according to structural or operational dependencies, where satisfactory dependency conditions are defined as edge-property triggers.

SLA dependency examples: <ActionGuarantee, SLO>_uses, <CompositeMetric, ResourceMetric>_uses, <SupportParty, ActionGuarantee>_obliged, where for every pair of SLA nodes the following relationship holds:

<Dependent, Antecedent>_rel, Dependent_value -> function(Antecedent_value)

while a predefined set of conditions holds, and 'rel' represents an outgoing edge from the dependent to the antecedent component.
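The relationship Dependent_value -> function(Antecedent_value) can be illustrated by recomputing values in dependency order. The sketch below uses Python's standard `graphlib.TopologicalSorter` rather than a graph database. The functions, units and the 0.2 s threshold are invented for the example, and the `SLO -> CompositeMetric` link is an assumption.

```python
from graphlib import TopologicalSorter

# dependent -> (antecedents, function of the antecedent values)
dependencies = {
    "CompositeMetric": (["ResourceMetric"], lambda ms: ms / 1000.0),    # ms -> s
    "SLO": (["CompositeMetric"], lambda seconds: seconds <= 0.2),       # objective met?
    "ActionGuarantee": (["SLO"], lambda met: "none" if met else "notify customer"),
}


def recompute(values, dependencies):
    """Recompute every dependent value after its antecedents, in topological order."""
    graph = {dep: set(ants) for dep, (ants, _) in dependencies.items()}
    result = dict(values)
    for node in TopologicalSorter(graph).static_order():
        if node in dependencies:
            antecedents, fn = dependencies[node]
            result[node] = fn(*(result[a] for a in antecedents))
    return result


ok = recompute({"ResourceMetric": 120.0}, dependencies)
late = recompute({"ResourceMetric": 250.0}, dependencies)
print(ok["SLO"], ok["ActionGuarantee"])      # True none
print(late["SLO"], late["ActionGuarantee"])  # False notify customer
```

A cycle among the dependencies would make `static_order()` raise `graphlib.CycleError`, which fits a pipeline that is expected to be acyclic.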

Page 14: SLA data management criteria presentation

Query expressiveness, clear information flow, SLA questions:

Provider aspect:

1. reach resource_x and get the value of metric_y; return the value and update all relations where the value is used.

2. update SLA_xy; add new branch ServiceDefinition and in Obligations add SLO branches and ActionGuarantees; update the dependencies/relations between the newly added components.

3. update SLA_qw23; delete SupportParty_old with name 'someCompany' and update all obligations of SupportParty_old to SupportParty_new.

Customer aspect:

4. reach SupportParty_new; ask to return monitored values from a given list of metrics

5. how can I add a new SLA to my currently running one(s)?

6. which service is best for me? what are my service criteria?
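Provider question 3 above amounts to a small graph rewrite. The sketch below shows one way to express it over a plain edge list in Python. It is not the Gremlin query used by the authors, and every node name other than 'someCompany' is hypothetical.

```python
def replace_party(nodes, edges, old, new):
    """Delete node `old` and re-point all of its edges to `new`."""
    new_nodes = {n for n in nodes if n != old} | {new}
    new_edges = [
        (new if source == old else source, label, new if target == old else target)
        for source, label, target in edges
    ]
    return new_nodes, new_edges


# Hypothetical fragment of one SLA graph: (source, edge label, target).
nodes = {"someCompany", "ActionGuarantee_1", "ActionGuarantee_2", "SLO_1"}
edges = [
    ("someCompany", "obliged", "ActionGuarantee_1"),
    ("someCompany", "obliged", "ActionGuarantee_2"),
    ("ActionGuarantee_1", "uses", "SLO_1"),
]

new_nodes, new_edges = replace_party(nodes, edges, "someCompany", "newCompany")
for edge in new_edges:
    print(edge)
```

The function returns new collections instead of mutating its inputs, so the original SLA graph stays available until the update is accepted.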

Page 15: SLA data management criteria presentation

Conclusions

The SLA digraph was initially implemented in memory using NetworkX [4], a Python graph library.

Next, the data model was re-implemented using the Titan distributed graph database [6], with Gremlin [7] as the primary DSL.

Query comparison between Graph DSL, XQuery and MySQL.

Currently, we are testing the digraph efficiency using Cassandra as the persistence backend behind Titan. We exercise the scenario where a massive number of HTTP requests reach the SLA information concurrently and request information retrieval and filtered operations.

Actual SLA data is required, to avoid the use of fictitious information. A TPC benchmark is to be used for further testing.

Page 16: SLA data management criteria presentation

Thank you :)

Questions? -> http://www.cui.unige.ch/~stamou/slides, full paper: http://www.slideshare.net/kat_slides/scdm

Page 17: SLA data management criteria presentation

References

1. A. Dan, H. Ludwig, and G. Pacifici, “Web service differentiation with service level agreements,” White Paper IBM Corporation, 2003.

2. H. Ludwig, A. Keller, A. Dan, R. King, and R. Franck, “Web Service Level Agreement (WSLA) Language Specification,” IBM Corporation, 2003.

3. A. Keller, U. Blumenthal, and G. Kar, “Classification and Computation of Dependencies for Distributed Management,” in Proc. of the Fifth IEEE Symposium on Computers and Communications (ISCC 2000), ser. ISCC ’00. IEEE Computer Society, 2000.

4. A. Hagberg, D. Schult, and P. Swart, “NetworkX,” http://networkx.github.io/, accessed: March, 2013.

5. M. Rodriguez, “Property Graph Algorithms,” http://markorodriguez.com/2011/02/08/property-graph-algorithms/, accessed: July, 2013.

6. ThinkAurelius, “Titan Distributed Graph Database,” http://thinkaurelius.github.io/titan/, accessed: July, 2013.

7. ThinkAurelius team, “Gremlin graph query language,” https://github.com/tinkerpop/gremlin/wiki, accessed: July, 2013.