Main Effects Screening: A Distributed Continuous Quality Assurance Process for Monitoring Performance Degradation in Evolving Software Systems

Cemal Yilmaz, Arvind S. Krishna, Atif Memon, Adam Porter, Douglas C. Schmidt, Aniruddha Gokhale, Balachandran Natarajan

Presented By: Walaa El-Din M. Moustafa

Objective

As software systems change, developers often run regression tests to detect unintended functional side effects.

QA efforts can be confounded by the enormous configuration space.

Time and resource constraints severely limit the number of configurations that can be examined.

Main Effects Screening

Screening designs are:

Highly economical.

Able to reveal important low-order effects that strongly affect performance.

Low Order Effects

Low-order effects are first-, second-, or third-order effects.

An nth-order effect is an effect caused by the simultaneous interaction of n factors.

Low Order Effects Example

For certain web server applications:

A 1st-order effect might be that performance slows considerably when logging is turned on and another might be that it also slows when few server threads are used.

A 2nd-order effect involves the interaction of two options, e.g., web server performance may slow down when caching is turned off and the server performs blocking reads.

Factorial Designs

Full factorial design: involves k binary factors and exhaustively tests all 2^k combinations.

Fractional factorial design: uses only a carefully selected fraction (such as 1/2 or 1/4) of a full factorial design.

Fractional Factorial Designs

Reduce cost by requiring fewer runs than a full factorial design.

Do so by giving up the ability to measure some higher-order effects.

Fractional Factorial Design Example

To show how screening designs are computed, we present a hypothetical example of a performance-intensive software system:

4 binary configuration options, A through D.

No inter-option constraints.

The full configuration space therefore has 2^4 = 16 configurations.

The fractional design uses 8 of the 16 configurations.

It focuses only on 1st-order effects.

This design is referred to as a 2^(4-1) design, where 4 refers to the total number of options we will examine and the -1 (2^-1 = 1/2) indicates the fraction of the full factorial over which we will collect data.

We can extend the design to estimate the effect of option D without going to a 2^4 full factorial design.

Design generator: specifies the aliasing patterns used to build the design.

For this example, we select the design generator D = ABC. That is, with the settings - and + coded as -1 and +1, the setting of D in each run is the product of the settings of A, B, and C.
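The construction implied by the generator D = ABC can be sketched in a few lines of Python. This is a minimal illustration, not code from the original study: the function name `half_fraction_design` and the -1/+1 coding of the - and + settings are assumptions made for the example.

```python
from itertools import product

def half_fraction_design():
    """Build the 8-run 2^(4-1) design with generator D = ABC.

    Settings are coded as -1 (the '-' level) and +1 (the '+' level).
    """
    runs = []
    # Full factorial over the three base options A, B, C (2^3 = 8 runs).
    for a, b, c in product((-1, 1), repeat=3):
        d = a * b * c  # the generator D = ABC fixes D in every run
        runs.append({"A": a, "B": b, "C": c, "D": d})
    return runs

design = half_fraction_design()
for run in design:
    # Print each configuration as a row of -/+ settings for A, B, C, D.
    print(" ".join("+" if run[k] > 0 else "-" for k in "ABCD"))
```

Each option appears at its - level in four runs and at its + level in the other four, so every main effect is estimated from a balanced split of the 8 runs.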

The design described above is a resolution IV design.

In a resolution R design, no effect involving i factors is aliased with an effect involving fewer than R - i factors. In a resolution IV design, main effects (i = 1) are therefore not aliased with other main effects or with 2nd-order effects.
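One common way to check the resolution of a design is to compute the length of the shortest word in its defining relation. The sketch below does this from a set of generators; `design_resolution` is a hypothetical helper written for this illustration, and the input format (a dict from the generated option to the options it is aliased with) is an assumption.

```python
from itertools import combinations

def design_resolution(generators):
    """Return the length of the shortest word in the defining relation.

    `generators` maps each generated option to the interaction it is
    aliased with, e.g. {"D": "ABC"} for the generator D = ABC.
    """
    # Each generator X = W contributes the defining word W + X (e.g. ABCD).
    words = [frozenset(rhs) | {lhs} for lhs, rhs in generators.items()]
    shortest = None
    # Multiplying words cancels repeated letters: a symmetric difference.
    for size in range(1, len(words) + 1):
        for subset in combinations(words, size):
            word = frozenset()
            for w in subset:
                word = word ^ w
            if shortest is None or len(word) < shortest:
                shortest = len(word)
    return shortest

print(design_resolution({"D": "ABC"}))  # 4, i.e. resolution IV
```

For D = ABC the only defining word is ABCD, which has four letters, matching the resolution IV claim. The search enumerates every product of generators, so it is only practical for a modest number of generators.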

Run the experiment.

For binary options (with settings - or +), the main effect of option A, ME(A), is

ME(A) = z(A-) - z(A+)

where z(A-) and z(A+) are the mean values of the observed data over all runs where option A is (-) and where option A is (+), respectively.

If desired, 2nd-order effects can be calculated in a similar way. The interaction effect of options A and B, INT(A, B), is

INT(A, B) = ½ (ME(B|A+) - ME(B|A-)) = ½ (ME(A|B+) - ME(A|B-))

where ME(B|A+) is the main effect of option B computed over only the runs where option A is (+), and similarly for the other conditional terms.
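The two formulas can be applied directly to a table of runs. The sketch below is illustrative only: the function names are hypothetical, the -1/+1 coding is an assumption, and the latency values are made-up data for a small 2^2 example, not measurements from the study.

```python
from statistics import mean

def main_effect(runs, option):
    """ME(option) = z(option-) - z(option+), using the sign convention above."""
    low = mean(y for cfg, y in runs if cfg[option] == -1)
    high = mean(y for cfg, y in runs if cfg[option] == +1)
    return low - high

def interaction_effect(runs, a, b):
    """INT(a, b) = 1/2 * (ME(b | a+) - ME(b | a-))."""
    a_high = [(cfg, y) for cfg, y in runs if cfg[a] == +1]
    a_low = [(cfg, y) for cfg, y in runs if cfg[a] == -1]
    return 0.5 * (main_effect(a_high, b) - main_effect(a_low, b))

# Hypothetical latencies for a 2^2 full factorial over options A and B.
runs = [
    ({"A": -1, "B": -1}, 10.0),
    ({"A": +1, "B": -1}, 14.0),
    ({"A": -1, "B": +1}, 11.0),
    ({"A": +1, "B": +1}, 19.0),
]
print(main_effect(runs, "A"))              # -6.0
print(main_effect(runs, "B"))              # -3.0
print(interaction_effect(runs, "A", "B"))  # -2.0
```

Swapping the roles of A and B in `interaction_effect` gives the same value, which mirrors the second equality in the INT(A, B) formula.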

The Experiment

ACE v5.4 + TAO v1.4 + CIAO v0.4.

14 binary run-time options, giving 2^14 = 16,384 different configurations.

For each task, measure:

Message latency.

Throughput between the client and the server.

The Full Data Set

Evaluating Screening Designs

Can screening designs correctly identify the important options discovered during the analysis of the full data set?

Scr32 is a 2^(14-9) resolution IV design (32 runs) with design generators F = ABC, G = ABD, H = ACD, I = BCD, J = ABE, K = ACE, L = BCE, M = ADE, N = BDE.

Scr64 is a 2^(14-8) resolution IV design (64 runs) with design generators G = ABC, H = ABD, I = ABE, J = ACDE, K = ABF, L = ACDF, M = ACEF, N = ADEF.

Scr128 is a 2^(14-7) resolution IV design (128 runs) with design generators H = ABC, I = ABDE, J = ABDF, K = ACEF, L = ACDG, M = ABEFG, N = BCDEFG.

Screening designs can detect important options at a small fraction of the cost of exhaustive testing.

The smaller the effect, the larger the run size needed to identify it.
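To put numbers on the cost comparison, the run size of each screening design can be set against the 2^14 configurations of the full space. This is a small arithmetic sketch; the variable names are illustrative.

```python
from fractions import Fraction

TOTAL = 2 ** 14  # 16,384 configurations in the full space

# Run sizes of the three screening designs: 2^(14-9), 2^(14-8), 2^(14-7).
designs = {
    "Scr32": 2 ** (14 - 9),
    "Scr64": 2 ** (14 - 8),
    "Scr128": 2 ** (14 - 7),
}

# Fraction of the exhaustive-testing cost that each design requires.
cost = {name: Fraction(runs, TOTAL) for name, runs in designs.items()}
for name, fraction in cost.items():
    print(name, designs[name], "runs,", fraction, "of the full space")
```

Scr32, Scr64, and Scr128 need 1/512, 1/256, and 1/128 of the runs of exhaustive testing, respectively.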

Estimating Performance with Screening Suites

The estimates are generated by examining all combinations of the most important options, while defaulting the settings of the unimportant options.

The estimates are based on benchmarking either 4 configurations (all combinations of options B and J) or 32 configurations (all combinations of options B, J, C, I, and F).
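A screening suite of this kind can be generated mechanically. The sketch below is an illustration under stated assumptions: `screening_suite` is a hypothetical helper, the 14 options are labeled A through N, and the default level of every option is taken to be - (coded -1), which the slides do not specify.

```python
from itertools import product

OPTIONS = "ABCDEFGHIJKLMN"  # labels for the 14 binary options

def screening_suite(important, defaults):
    """Enumerate every combination of the important options.

    All other options keep the value given in `defaults`.
    """
    suite = []
    for settings in product((-1, 1), repeat=len(important)):
        config = dict(defaults)
        config.update(zip(important, settings))
        suite.append(config)
    return suite

# Assumption: every option defaults to the '-' level (coded -1).
defaults = {opt: -1 for opt in OPTIONS}
top2 = screening_suite("BJ", defaults)
top5 = screening_suite("BJCIF", defaults)
print(len(top2), len(top5))  # 4 32
```

Benchmarking only the configurations in `top2` or `top5` is what keeps the suite small enough to rerun as the system evolves.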

The distributions of the top-5 and top-2 screening suites closely track the overall performance data.

Screening Suites vs. Random Sampling

These graphs suggest the obvious weakness of random sampling.

Dealing with Evolving Systems

Detect performance degradations in evolving systems quickly.

Developer records and problem reports indicate that problems were noticed on 12/14/03.

Higher Order Effects

Resolution VI design.

Increased the run size to 2,048 to capture the second-order effects.

The important interaction effects involve only options that are already considered important by themselves.

The screening design correctly identifies the 5 most important pairwise interactions at 1/8th the cost of exhaustive testing.

Thank you
