15
STATISTICAL ANALYSIS Figure 1 .H istogram offinalexam scores. Frequency Distribution # Indi vidu als Media n Mean Media n Mean Mean Media n Figure 2.Frequency distributions of three different samples. A B C negative normal positive

STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

Embed Size (px)

Citation preview

Page 1: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

STATISTICAL ANALYSIS

Figure 1. Histogram of final exam scores.

Frequency Distribution# In

divi

dual

s

MedianMean

MedianMean MeanMedian

Figure 2.Frequency distributions of three different samples.

A B C

negative normal positive

Page 2: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

Descriptive Statistics: used to describe, simplify, and summarize a collection of data in a clear understandable way.Ex. Mean, standard deviation, frequency

Inferential Statistics: allows you to make inferences about a population from a sample. It’s used to test the Ho.Ex. t-test, ANOVA, ANCOVA

Raw Data: the data you collect directly from the organisms or environment you are studying.

Population (N): the total # of individuals in the population of interest.

Sample (n): the number of observations or individuals measured

Page 3: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

Mean: n

XX

_X = 1.2+3.0+0.5+2.3+1.5 5

= 1.7 m

Median: Middle number in a data set. Order from smallest to largest. 0.5 1.2 1.5 2.3 3.0

Range: Difference between the largest and smallest data points in a sample. 3.0 - 0.5 = 2.5 m

Fern Height (m)

1.2 3.0 0.5 2.3 1.5

Hand calculations

Page 4: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

Sample Variance1

)( 22

n

XXs

_ _ X X-X (X-X)2 0.5 -1.2 1.441.2 -0.5 0.251.5 -0.2 0.042.3 0.6 0.363.0 1.3 1.69

3.78

n = 5_X = 1.7

3.78 4

S2 = = 0.945

Standard Deviation1

)( 2

n

XXs S = 0.945 = 0.972

Standard Errorn

s

n

sSE

2435.

5

972.

5

945.SE

Page 5: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

Confidence Interval

)(CI df95 SEtX

95% confidence interval

mean

Mean = 1.7SE = 0.435

Standard Error

df = n-1

df = 5-1 = 4

CI95 = 1.7 ± (2.78*0.435)

CI95 = 1.7 ± 1.21 m

df α (2-tail) α (1-tail)

.10

.05 .05 .025

.02

.01 .01 .005

.005 .0025

.002

.001 .001

.0005

1 6.314 12.71 31.82 63.66 127.3 318.3 636.6 2 2.920 4.303 6.965 9.925 14.09 22.33 31.60 3 2.353 3.182 4.541 5.841 7.453 10.21 12.92 4 2.132 2.776 3.747 4.604 5.598 7.173 8.610 5 2.015 2.571 3.365 4.032 4.773 5.893 6.869

Page 6: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

df = n-1

_X - u s n

t =

Testing Ho using t-test

Distribution must be normalUse when n > 30

Ho = 0Ha > 0

t = 1.7 - 00.972/ 5 = 0.39 df=5-1=4

Ignore “-” signs

df α (2-tail) α (1-tail)

.10

.05 .05 .025

.02

.01 .01 .005

.005 .0025

.002

.001 .001

.0005

1 6.314 12.71 31.82 63.66 127.3 318.3 636.6 2 2.920 4.303 6.965 9.925 14.09 22.33 31.60 3 2.353 3.182 4.541 5.841 7.453 10.21 12.92 4 2.132 2.776 3.747 4.604 5.598 7.173 8.610 5 2.015 2.571 3.365 4.032 4.773 5.893 6.869 6 1.943 2.447 3.143 3.707 4.317 5.208 5.959

t-Table

two tail is more strict

one-tail two-tail2.7760.39

Your t-value is less than t-critical. Fail to reject Ho.

Page 7: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

Comparing two sample means

All right. It is time to use Excel!

X1 X2

5 64 73 55 94 62 85 7

t =

_ _X 1 - X2

s12 + s2

2

n1 n2

df = n1+ n2-2

Variances must be equal

Ho: X1=X2

Ha: X1=X2

Page 8: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

Using Excel

Click Tools, Data Analysis, t-test: Two-sample Assuming Unequal Variances

Enter datax1 x2

5 64 73 55 94 62 85 7

Data Set

0

Page 9: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

x1 x2 t-Test: Two-Sample Assuming Unequal Variances5 64 7 x1 x23 5 Mean 4 6.865 9 Variance 1.33 1.814 6 Observations 7 72 8 Hypothesized Mean Difference 05 7 df 12

t Stat -4.26P(T<=t) one-tail 0.00t Critical one-tail 1.78P(T<=t) two-tail 0.00t Critical two-tail 2.18

t-critical Value

Your Value

Reporting results in the literature: Sample X1 and sample X2 are significantly different, t(12) = -4.26, p < 0.05, 2-tail.

df

Do you accept or reject the Ho?

Lets the reader know that you set alpha to 0.05. The p-value is less than 0.05, therefore the results are significant.

Page 10: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

When you compare more than 2 samples, you need to do an Analysis of Variance (ANOVA).

The relationship between an ANOVA and t-test: F=t2

An ANOVA is the similar to a t-test in the sense that you compare the F value to the critical F value.

You also need to look at the p-value. If you set alpha to 0.05 and your p value is less that alpha, you can reject the Ho; if greater than alpha then fail to reject the Ho.

ANOVA

Page 11: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

Descriptive Stats

Page 12: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

Correlation & Regression

Correlation: describe the strength of association between two variables. The correlation coefficient is designated as r. r can range from -1 to +1.

+1 = a strong positive correlation 0 = no correlation

-1 = a strong negative correlation

r = +1 r = -1

r = 0

Page 13: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

This relationship can be represented by a regression equation.

Table 1. Sample data of tree fern age, height, and basal area. Age (years) Height (m) Basal Area

(m2) 10 0.5 .025 25 1.2 .053 30 1.5 .052 40 2.3 .110 60 3.0 .180

Equation for a straight line:y = bx + a y = DV, b = slope, x = IV, a = intercept

Simple Linear Regression: shows relationship between variables.

Page 14: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

Correlation of Tree Fern Height & Basal Area

r = .967, p < .01

0

0.5

1

1.5

2

2.5

3

3.5

0 0.05 0.1 0.15 0.2

Basal Area ~ Meters Sqr

Hei

gh

t ~

Met

ers

Page 15: STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC

Tree Fern Height vs. Age

0

0.5

1

1.5

2

2.5

3

3.5

0 20 40 60 80

Age ~ Years

Hei

gh

t ~

Met

ers

Tree Fern Height vs. Age

y = 0.0518(± .02) x - 0.0098R2 = 0.98, p<.01

0

0.5

1

1.5

2

2.5

3

3.5

0 20 40 60 80

Age ~ Years

Hei

gh

t ~

Met

ers