30
Chapter 3-Describing Data: Numerical Measures Jie Zhang, Ph.D. Student Account and Information Systems Department College of Business Administration The University of Texas at El Paso [email protected] Spring, 2014 Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 1

Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Embed Size (px)

Citation preview

Page 1: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Chapter 3-Describing Data: Numerical MeasuresJie Zhang, Ph.D. Student

Account and Information Systems Department

College of Business Administration

The University of Texas at El Paso

[email protected]

Spring, 2014

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 1

Page 2: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Learning Objectives• LO1 Identify and compute the arithmetic mean.

• LO2 Determine the median

• LO3 Identify the mode.

• LO4.Explain and apply measures of dispersion.

• LO5 Compute and interpret the standard deviation.

• LO6 Empirical Rule.

• L7 Compute the mean and standard deviation of

grouped data.

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 2

Page 3: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Characteristics of the Mean

• The arithmetic mean is the most widely used measure of location.

Major characteristics:

• All values are used.

• It is unique.

• The sum of the deviations from the mean is 0.

• It is calculated by summing the values and dividing by the number of values.

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 3

Page 4: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Population Mean

• The population mean is the sum of all the population values dividedby the total number of population values:

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 4

Page 5: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

EXAMPLE

• There are 42 exits on I-75 through the state of Kentucky. Listed below are the distances between exits (in miles).

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 5

Why is this information a population? What is the mean number of miles between exits?

Page 6: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

• This is a population because we are considering all the exits in Kentucky.

• What is the mean number of miles between exits?

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 6

Page 7: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Parameter Versus Statistics

• PARAMETER - A measurable characteristic of a population.

• STATISTIC - A measurable characteristic of a sample.

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 7

Page 8: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Sample Mean

• The sample mean is the sum of all the sample values divided by the number of sample values:

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 8

Page 9: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

EXAMPLE – Sample Mean

• SunCom is studying the number of minutes used by clients in a particular call phone rate plan. A random sample of 12 clients showed the following number of minutes used last month.

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 9

What is the arithmetic mean number of minutes used?

Page 10: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

The Median

• MEDIAN - The midpoint of the values after they have been ordered from the smallest to the largest, or the largest to the smallest.

PROPERTIES OF THE MEDIAN

• There is a unique median for each data set.

• It is not affected by extremely large or small values

• It can be computed for ratio-level, interval-level, and ordinal-level data.

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 10

Page 11: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

EXAMPLES - Median

The ages for a sample of five college students are:

21, 25, 19, 20, 22

Arranging the data in ascending order gives:

19, 20, 21, 22, 25.

Thus the median is 21

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 11

The heights of four basketball players, in inches, are:

76, 73, 80, 75

Arranging the data in ascending order gives:

73, 75, 76, 80.

Thus the median is 75.5

Page 12: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

The Mode

• MODE-The value of the observation that appears most frequently.

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 12

Page 13: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Example - Mode

• Using the data regarding the distance in miles between exits on I-75 through Kentucky. The information is repeated below. What is the modal distance?

• Organize the distances into a frequency table.

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 13

Page 14: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

The Relative Positions of the Mean, Median and the Mode

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 14

Page 15: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Dispersion• A measure of location, such as the mean or the median, only describes the center of the

data. It is valuable from that standpoint, but it does not tell us anything about the spreadof the data.

• For example, if your nature guide told you that the river ahead averaged 3 feet in depth, would you want to wade across on foot without additional information? Probably not. You would want to know something about the variation in the depth.

• A second reason for studying the dispersion in a set of data is to compare the spread in two or more distributions.

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 15

Page 16: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Measures of Dispersion

• Range

• Variance and Standard Deviation

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 16

Page 17: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

EXAMPLE – Range

• The number of cappuccinos sold at the Starbucks location in the Orange Country Airport between 4 and 7 p.m. for a sample of 5 days last year were 20, 40, 50, 60, and 80. Determine the range for the number of cappuccinos sold.

Range = Largest – Smallest value

= 80 – 20 = 60

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 17

Page 18: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Variance and Standard Deviation• VARIANCE - The arithmetic mean of the squared deviations from the mean.

• STANDARD DEVIATION - The square root of the variance.

• The variance and standard deviations are nonnegative and are zero only if all observations are the same.

• For populations whose values are near the mean, the variance and standard deviation will be small.

• For populations whose values are dispersed from the mean, the population variance and standard deviation will be large.

• The variance overcomes the weakness of the range by using all the values in the population

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 18

Page 19: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Variance – Formula and Computation

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 19

Steps in Computing the Variance.

Step 1: Find the mean.Step 2: Find the difference between each observation and the mean, and

square that difference.Step 3: Sum all the squared differences found in step 2Step 4: Divide the sum of the squared differences by the number of items in

the population.

Page 20: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

EXAMPLE – Variance and Standard Deviation• The number of traffic citations issued during the last five months in

Beaufort County, South Carolina, is reported below:

• What is the population variance?

• Step 1: Find the mean.

• Step 2: Find the difference between each observation and the mean, and square that difference.

• Step 3: Sum all the squared differences found in step 3

• Step 4: Divide the sum of the squared differences by the number of items in the population.

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 20

2912

348

12

1034...1719

N

x

Page 21: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 21

12412

488,1)( 2

2

N

X

Page 22: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Sample Variance

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 22

sample the in nsobservatio of number the is

sample the of mean the is

sample the in nobservatio each of value the is

variance sample the is

:Where

2

n

X

X

s

Page 23: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

EXAMPLE – Sample Variance

The hourly wages for a sample of part-time employees at Home Depot are: $12, $20, $16, $18, and $19.

What is the sample variance?

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 23

Page 24: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Sample Standard Deviation

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 24

sample the in nsobservatio of number the is

sample the of mean the is

sample the in nobservatio each of value the is

variance sample the is

:Where

2

n

X

X

s

Page 25: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

The Empirical Rule

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 25

Page 26: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

The Arithmetic Mean of Grouped Data

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 26

Page 27: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

The Arithmetic Mean of Grouped Data -Example• Recall in Chapter 2, we constructed a frequency distribution for

Applewood Auto Group profit data for 180 vehicles sold. The information is repeated on the table. Determine the arithmetic mean profit per vehicle.

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 27

Page 28: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 28

Page 29: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Standard Deviation of Grouped Data -Example• Refer to the frequency distribution for the Applewood Auto Group

data used earlier. Compute the standard deviation of the vehicle profits.

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 29

Page 30: Chapter 3-Describing Data: Numerical Measures - …jzhang6.weebly.com/uploads/2/5/2/2/25221768/lecture4-chapter_3... · Chapter 3-Describing Data: Numerical Measures ... •LO4.Explain

Finally, let’s go back to R!

• http://jzhang6.weebly.com/qmb2301.html

Jie Zhang, QMB 2301 Fundamentals of Business Statistics, UTEP 30