28
Measures of Central Tendency & Spread CCGPS Coordinate Algebra Unit 4: Describing Data

CCGPS Coordinate Algebra Unit 4: Describing Data

Embed Size (px)

Citation preview

Page 1: CCGPS Coordinate Algebra Unit 4: Describing Data

Measures of Central Tendency

& Spread

CCGPS Coordinate AlgebraUnit 4: Describing Data

Page 2: CCGPS Coordinate Algebra Unit 4: Describing Data

A measure of central tendency is a measure

that tells us where the middle of a bunch of data lies.

The three most common measures of central tendency are the mean, the median, and the mode.

Definition of Measures of Central Tendency

Page 3: CCGPS Coordinate Algebra Unit 4: Describing Data

The average value of a data set,

found by summing all values and dividing by the number of data points

Mean

Page 4: CCGPS Coordinate Algebra Unit 4: Describing Data

Median is the number present in

the middle when the numbers in a set of data are arranged in ascending or descending order. If the number of numbers in a data set is even, then the median is the mean of the two middle numbers.

Median

Page 5: CCGPS Coordinate Algebra Unit 4: Describing Data

The middle-most value of a data

set; 50% of the data is less than this value, and 50% is greater than it

Example:

Median

Page 6: CCGPS Coordinate Algebra Unit 4: Describing Data

Mode is the value that occurs

most frequently in a set of data.

Mode

Page 7: CCGPS Coordinate Algebra Unit 4: Describing Data

Measures of spread describe how similar or

varied the set of observed values are for a particular variable (data item). Measures of spread include the range, quartiles and the interquartile range, variance and standard deviation.

It is usually used in conjunction with a measure of central tendency, such as the mean or median, to provide an overall description of a set of data.

Measures of Spread

Page 8: CCGPS Coordinate Algebra Unit 4: Describing Data

There are many reasons why the measure of the spread of data values is important, but one of the main reasons regards its relationship with measures of central tendency. A measure of spread gives us an idea of how well the mean, for example, represents the data. If the spread of values in the data set is large, the mean is not as representative of the data as if the spread of data is small. This is because a large spread indicates that there are probably large differences between individual scores. Additionally, in research, it is often seen as positive if there is little variation in each data group as it indicates that the similar.

Why is it important to measure the spread of data?

Page 9: CCGPS Coordinate Algebra Unit 4: Describing Data

The range is the difference

between the highest and lowest scores in a data set and is the simplest measure of spread. So we calculate range as:

Range = maximum value - minimum value

Range

Page 10: CCGPS Coordinate Algebra Unit 4: Describing Data

Quartiles tell us about the spread

of a data set by breaking the data set into quarters, just like the median breaks it in half.

Quartiles are the values that divide a list of numbers into quarters.

Quartiles

Page 11: CCGPS Coordinate Algebra Unit 4: Describing Data

First put the list of

numbers in order Then cut the list into four

equal parts The Quartiles are at the

"cuts"

To Create Quartiles…

Page 12: CCGPS Coordinate Algebra Unit 4: Describing Data

First QuartileThe value that identifies the lower 25% of the

data; it is the median of the lower half of the data set; written as

Example: 1Q

Page 13: CCGPS Coordinate Algebra Unit 4: Describing Data

Third QuartileValue that identifies the upper 25% of the

data; it is the median of the upper half of the data set; 75% of all data is less than this value; written as

Example: 3Q

Page 14: CCGPS Coordinate Algebra Unit 4: Describing Data

Example: 5, 8, 4, 4, 6, 3, 81. Put them in order: 3, 4, 4, 5, 6, 8, 82. Cut the list into quarters:

Page 15: CCGPS Coordinate Algebra Unit 4: Describing Data

Quartile 1 (Q1) = 4

Quartile 2 (Q2), which is also the Median, = 5

Quartile 3 (Q3) = 8

And the result is:

Page 16: CCGPS Coordinate Algebra Unit 4: Describing Data

The numbers are already in order Cut the list into quarters:

Example: 1, 3, 3, 4, 5, 6, 6, 7, 8, 8

In this case Quartile 2 is half way between 5 and 6:Q2 = (5+6)/2 = 5.5

Page 17: CCGPS Coordinate Algebra Unit 4: Describing Data

Quartile 1 (Q1) = 3

Quartile 2 (Q2) = 5.5

Quartile 3 (Q3) = 7

And the result is:

A common way of expressing quartiles is as an interquartile range.

Page 18: CCGPS Coordinate Algebra Unit 4: Describing Data

The interquartile range describes the

difference between the third quartile (Q3) and the first quartile (Q1), telling us about the range of the middle half of the scores in the distribution.

The "Interquartile Range" is from Q1 to Q3

Interquartile Range

Page 19: CCGPS Coordinate Algebra Unit 4: Describing Data
Page 20: CCGPS Coordinate Algebra Unit 4: Describing Data

To calculate it just subtract Quartile 1 from Quartile 3.

The Interquartile Range is: Q3 - Q1 = 8 - 4 = 4

Page 21: CCGPS Coordinate Algebra Unit 4: Describing Data

Outlier

)(5.11 IQRQ

)(5.13 IQRQ

A data value that is much greater than or much less than the rest of the data in a data set; mathematically, any data less than

or greater than is an outlier

Example:

Page 22: CCGPS Coordinate Algebra Unit 4: Describing Data

Another measure of variability is called the

mean absolute deviation - the average of the absolute values of the differences between each data value in a data set and the set's mean. In other words, it is the average distance that each value is away from the mean.

Mean Absolute Deviation (MAD)

Page 23: CCGPS Coordinate Algebra Unit 4: Describing Data

If a data set has a small mean absolute

deviation, then this means that the data values are relatively close to the mean.

If the mean absolute deviation is large, then the values are spread out and far from the mean.

Mean Absolute Deviation (MAD)

Page 24: CCGPS Coordinate Algebra Unit 4: Describing Data

1. Find the mean ()2. Subtract the mean from each data

value (x )3. Take the absolute value of each value

from step #2. ()4. Add up all values from step #3.5. Divide by the number of data values.

To find the MAD:

Page 25: CCGPS Coordinate Algebra Unit 4: Describing Data

The numbers below represent the number of homeruns hit by players of the Wheelr baseball team.

2, 3, 5, 7, 8, 10, 14, 18, 19, 21, 25, 28

Q1 = 6

Q3 = 20

Interquartile Range: 20 – 6 = 14

Do the same for Harrison: 4, 5, 6, 8, 9, 11, 12, 15, 15, 16, 18, 19, 20

Page 26: CCGPS Coordinate Algebra Unit 4: Describing Data

The numbers below represent the number of homeruns hit by players of the Wheeler baseball team.

2, 3, 5, 7, 8, 10, 14, 18, 19, 21, 25, 28

Q1 = 6

Q3 = 20

Interquartile Range: 20 – 6 = 14

12 206

Page 27: CCGPS Coordinate Algebra Unit 4: Describing Data

4, 17, 7, 14, 18, 12, 3, 16, 10, 4, 4, 11

1. Put the numbers in order:

3, 4, 4, 4, 7, 10, 11, 12, 14, 16, 17, 18

2. Cut it into quarters: 3, 4, 4 | 4, 7, 10 | 11, 12, 14 | 16, 17,

18

Page 28: CCGPS Coordinate Algebra Unit 4: Describing Data

3, 4, 4 | 4, 7, 10 | 11, 12, 14 | 16, 17, 18

In this case all the quartiles are between numbers:

Quartile 1 (Q1) = (4+4)/2 = 4Quartile 2 (Q2) = (10+11)/2 = 10.5Quartile 3 (Q3) = (14+16)/2 = 15

Also: The Lowest Value is 3, The Highest Value is 18