38
Outline of Today’s Discussion 1. Displaying the Order in a Group of Numbers: 2. The Mean, Variance, Standard Deviation, & Z-Scores 3. SPSS: Data Entry, Definition, & Frequency Analysis

Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Embed Size (px)

DESCRIPTION

Two Branches of Statistical Methods Descriptive statistics –Summarize and describe a group of numbers such as the results of a research study Inferential statistics –Allow researchers to draw conclusions and inferences that are based on the numbers from a research study, but go beyond these numbers

Citation preview

Page 1: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Outline of Today’s Discussion1. Displaying the Order in a Group of Numbers:

2. The Mean, Variance, Standard Deviation, & Z-Scores

3. SPSS: Data Entry, Definition, & Frequency Analysis

Page 2: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Part 1

Displaying the OrderIn a Group of Numbers:

Page 3: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Two Branches of Statistical Methods

• Descriptive statistics– Summarize and describe a group of numbers

such as the results of a research study

• Inferential statistics– Allow researchers to draw conclusions and

inferences that are based on the numbers from a research study, but go beyond these numbers

Page 4: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Basic concepts

• Variables– Characteristics that can take on any number of different

values

• Values (think of this as ‘levels’)– Possible numbers or categories that of a variable can

have

• Scores– A particular person’s value on a variable

Page 5: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Kinds of Variables

• Numeric variables– Scores are numbers that tell you how much there is of the thing

being measured

– Equal-interval variables• Values are numbers that stand for equal amounts of what is

being measured• e.g., a height of 6 feet, an IQ of 138

– Rank-order variables• Values are ranks• e.g., 2nd in the Olympic ski jump, 6th in your graduating class

Page 6: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Kinds of Variables

• Nominal variables– Values are categories– e.g., Hair color of brown, American citizenship

Page 7: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Frequency Tables

• Frequency table– Shows how many times each value was used

for a particular variable– Percentage of scores of each value

• Grouped frequency table– Range of scores in each of several equally sized

intervals

Page 8: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Steps for Making a Frequency Table

1. Make a list of each possible value, from highest to lowest2. Go one by one through the scores, making a mark for each score next

to its value on the list3. Make a table showing how many times each value on your list was

used4. Figure the percentage of scores for each value

Page 9: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

A Frequency Table

Travel Time Frequency Percent10 14 9.3

9 15 9.9

8 26 17.2

7 31 20.5

6 13 8.6

5 18 11.9

4 16 10.6

3 12 7.9

2 3 2.0

1 1 0.7

0 2 1.3

Page 10: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

A Grouped Frequency Table

Travel Time (min) Frequency Percent

0-5 14 9

8-9 41 27

6-7 44 29

4-5 34 23

2-3 15 10

0-1 3 2

Page 11: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Frequency Graphs• Histogram

– Depicts information from a frequency table or a grouped frequency table as a bar graph

Page 12: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Frequency Graphs• Frequency Polygon

– Depicts the same information by plotting a point to show the frequency of each value or range of values

– Points are joined with a line

Page 13: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Frequency Graphs• Histogram and frequency polygon for the same data

Page 14: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Shapes of Frequency Distributions

• Unimodal– Having one peak

• Bimodal– Having two peaks

• Multimodal– Having two or more peaks

• Rectangular– Having no peaks

Page 15: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Symmetrical vs. Skewed Frequency Distributions

• Symmetrical distribution– Approximately equal numbers of observations above and below

the middle

• Skewed distribution– One side is more spread out that the other, like a tail– Direction of the skew

• Right or left (i.e., positive or negative)• Side with the fewer scores • Side that looks like a tail

Page 16: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Skewed Frequency Distributions

• Skewed right (b)– Fewer scores right of the peak– Positively skewed– Can be caused by a floor effect

• Skewed left (c)– Fewer scores left of the peak– Negatively skewed– Can be caused by a ceiling effect

Page 17: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Potential Pop Quiz Item• Using your imagination, identify a phenomenon that would be well

described by a positively skewed frequency distribution. Draw the positively skewed distribution, and be sure to label your axes. Then briefly explain why the distribution would be positively skewed.

• Repeat the exercise above, but now for a negatively skewed frequency distribution.

Page 18: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Ceiling and Floor Effects

• Ceiling effects– Occur when scores can go no

higher than an upper limit and “pile up” at the top

– e.g., scores on an easy exam, as shown on the right

– Causes negative skew

• Floor effects– Occur when scores can go no

lower than a lower limit and pile up at the bottom

– e.g., household income– Causes positive skew

Page 19: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Kurtosis• Degree to which tails of the distribution are “heavy” or “light”

– heavy tails = many scores at the extreme ends (b)– Light tails = few scores at the extreme ends (c)

leptokurtic

platykurtic

Page 20: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Kurtosis• A perfect, normal distribution zero kurtosis irrespective of its mean or

standard deviation. It is said to be mesokurtic.

• If a random variable’s kurtosis is greater than 0, it is said to be leptokurtic. (peaked, and fat tails i.e., more frequent extreme values than a Gaussian.)

• If its kurtosis is less than 0, it is said to be platykurtic. (flat-ish, and thin tails, i.e., less frequent extreme values than a Gaussian)

• Can you think of a mnemonic for those terms?

Page 21: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

The Normal Curve• Often seen in social and behavioral science research and in nature

generally• Also Called “Gaussian” Distribution or Curve• Particular characteristics

– Bell-shaped– Unimodal– Symmetrical– Average tails

Page 22: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

The Normal Curve

• Here’s the equation for the Bell Curve:

• Do the equations below look familiar?Y = mX + BY = aX2 + bX + c

• How many parameters in each equation?

                      

Page 23: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Part 2

The Mean, Variance,Standard Deviation,

&Z-Scores

(I LOVE Z-Scores!!!!)

Page 24: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Measures of Central Tendency

• Central tendency = representative or typical value in a distribution

• Mean– Same thing as an average– Computed by

• Summing all the scores (sigma, )• Dividing by the number of scores (N)

M

XN

Page 25: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Measures of Central Tendency

• Mean– Often the best measure of central tendency – Most frequently reported in research articles

• Think of the mean as the “balancing point” of the distribution

Page 26: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Measures of Central Tendency

• Examples of means as balancing points of various distributions– Does not have to be a score exactly at the mean– Note that a score’s distance from the balancing point matters in

addition to the number of scores above or below it

Page 27: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Measures of Central Tendency

• Mode– Most common single number in a distribution– If distribution is symmetrical and unimodal, the

mode = the mean– Typical way of describing central tendency of a

nominal variable

Page 28: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Measures of Central Tendency

• Examples of means and modes

Page 29: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Measures of Central Tendency

• Median– Middle value in a group of scores– Point at which

• half the scores are above• half the scores are below

– Unaffected by extremity of individual scores• Unlike the mean• Preferable as a measure of central tendency when a

distribution has some extreme scores

Page 30: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Measures of Central Tendency

• Steps to computing the median1. Line up scores from highest to lowest2. Figure out how many scores to the middle

• Add 1 to number of scores• Divide by 2

3. Count up to middle score• If there is 1 middle score, that’s the median• If there are 2 middle scores, median is their average

Page 31: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Measures of Variation

• Variation = how spread out scores are• Distributions to the right all have same mean but different

amounts of spread• Variance

– Measure of variation– Average of each score’s squared deviations (differences) from the

mean

Page 32: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Measures of Variation

• Steps to computing the variance– 1. Subtract the mean from each score– 2. Square each deviation score– 3. Add up the squared deviation scores– 4. Divide sum by the number of scores

• Thus, variance is the average (steps 3 & 4) of the squared (step 2) deviations from the mean (step 1)

Page 33: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Measures of Variation

• Standard deviation– Another measure of variation, roughly the average

amount that scores differ from the mean– Used more widely than variance– Abbreviated as “SD”

• To compute standard deviation– Compute variance– Simply take the square root

• SD is square root of variance• Variance is SD squared

Page 34: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Z Scores

• Allow you to describe a particular score in terms of where it fits into the overall group of scores in a distribution– If positive, indicates the number of SDs a score is

above the mean of its own distribution– If negative, indicates the number of SDs a score is

below the mean of its own distribution• Z score of 1.0 is one SD above the mean• Z score of -2.5 is two-and-a-half SDs below the mean• Z score of 0 is at the mean

Page 35: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Z Scores

• When values in a distribution are converted to Z scores, the distribution will have – Mean of 0– Standard deviation of 1

• Useful– Allows variables to be compared to one another even

when they are measured on different scales, have very different distributions, etc.

– Provides a generalized standard of comparison

Page 36: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Z Scores

• To compute a Z score, subtract the mean from a raw score and divide by the SD

• To convert a Z score back to a raw score, multiply the Z score by the SD and then add the mean

Page 37: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Part 3

SPSS: Data Entry, Definition &

Frequency Analysis

Page 38: Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,