28
Introduction to Sampling Theory Lecture 5 Simple Random Sampling Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur 1 Slides can be downloaded from http://home.iitk.ac.in/~shalab/sp

Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Introduction to Sampling Theory

Lecture 5Simple Random Sampling

ShalabhDepartment of Mathematics and  Statistics

Indian Institute of Technology Kanpur

1

Slides can be downloaded from http://home.iitk.ac.in/~shalab/sp

Page 2: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Probability of Selection of a UnitLet the size of the population is N.

One out of N sampling unit is to be chosen.

SRSWOR

The probability of drawing a sampling unit = 𝟏𝑵

SRSWR

The probability of drawing a sampling unit = 𝟏𝑵

2

Page 3: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Probability of Selection of a Unit by SRSWOR or SRSWR

3

1 2 3 4 5 6 7 8 9 10

1

2

Probability of drawing ball 1= 1/10

10

Probability of drawing ball 2= 1/10

Probability of drawing ball 10= 1/10

Page 4: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Proof: Probability of Selection of a Unit: SRSWORLet Al : Event that a particular jth unit is not selected at the  ith

draw. 

The probability of selecting, say, jth unit at kth draw is 

4

1 2 1

1 2 1 3 1 2 1 1 2 2 1 2 1

( )

( ... )

( ) ( | ) ( | )... ( | , ... ) ( | ... )

1 1 1 1 1 1 1 1 ... 1

1 2 2 1

th

j

k k

k k k k

P u k

P A A A A

P A P A A P A A A P A A A A P A A A A

N N N N k N k

selection of at draw     

        

      

      

   1 2 1 1

. ... .1 2 1

1 .

N N N k

N N N k N k

N

         

      

Page 5: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Probability of Selection of a SampleLet the size of the population is N.

Let the size of the sample is n.

A sample of n sampling units out of N sampling units is to be chosen.

5

Page 6: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Probability of Selection of a SampleSRSWOR

Total number of combinations to choose n sampling units out of N

sampling unit = 𝑵𝒏

The probability of drawing a sample = 𝟏𝑵𝒏

6

Page 7: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

7

Probability of Selection of a SampleSRSWOR

Suppose N =3, n = 2

Total samples = 𝟑𝟐 𝟑

Sample 1

Sample 2

Sample 3

Probability of drawing a sample = 𝟏𝟑

1 2 3

1 2

2 3

1 3

Page 8: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

8

Probability of Selection of a SampleSRSWOR

Suppose N =3, n = 2

Total samples = 𝟑𝟐 𝟑

Sample 1

Sample 2

Sample 3

Probability of drawing a sample = 𝟏𝟑

Page 9: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Proof: Probability of Selection of a Sample: SRSWOR

A unit can be selected at any one of the n draws.

Let ui be the ith unit selected in the sample.

This unit can be selected in the sample either at first draw, second

draw, …, or nth draw.

Let Pj(i) denotes the probability of selection of ui at the jth draw, j =

1,2,...,n. Then

9

1 2( ) ( ) ( ) ... ( )

1 1 1 ... ( )

.

j nP i P i P i P i

n timesN N NnN

Page 10: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Proof: Probability of Selection of a Sample: SRSWOR

Let u1, u2,…,un are the n unit selected in the sample.

The probability of their selection is

P(u1, u2,…, un) = P(u1). P(u2). . .P(un)

When the first unit is to be selected, then there are n units left to be

selected in the sample from the population of N units.

So P(u1)=𝒏𝑵

10

Page 11: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Proof: Probability of Selection of a Sample: SRSWOR

When the second unit is to be selected, then there are (n – 1) units

left to be selected in the sample from the population of (N – 1) units.

So P(u2)=𝒏 𝟏𝑵 𝟏

When the third unit is to be selected, then there are (n – 2) units left

to be selected in the sample from the population of (N – 2) units and

so on.

So P(u3)=𝒏 𝟐𝑵 𝟐

And so on, P(un)=𝟏

𝑵 𝒏 𝟏11

Page 12: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Proof: Probability of Selection of a Sample: SRSWOR

Thus probability of their selection is

12

1 2 1 2( , ,..., ) ( ). ( )... ( )1 2 1 = . . ...1 2 1

1 .

n nP u u u P u P u P un n nN N N N n

Nn

Page 13: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Probability of Selection of a Sample

SRSWR

Total number of combinations to choose n sampling units out of N

sampling unit = Nn

The probability of drawing a sample = 𝟏Nn

13

Page 14: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

14

Probability of Selection of a SampleSRSWR

Suppose N =3,

Total samples N=3, n=2, Nn = 32 = 9

Sample 1 Sample 4 Sample 7

Sample 2 Sample 5 Sample 8

Sample 3 Sample 6 Sample 9

Probability of drawing a sample = 𝟏𝟗

1 2 3

1 1

1 2

1 3

2 1

2 2

2 3

3 1

3 2

3 3

Page 15: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

15

Probability of Selection of a SampleSRSWR

Suppose N =3,

Total samples N=3, n=2, Nn = 32 = 9

Sample 1 Sample 4 Sample 7

Sample 2 Sample 5 Sample 8

Sample 3 Sample 6 Sample 9

Probability of drawing a sample = 𝟏𝟗

Page 16: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Proof: Probability of Selection of a Sample: SRSWR

Let ui be the ith unit selected in the sample.

This unit can be selected in the sample either at 1st draw, 2nd draw,

…, or nth draw.

At any stage, there are always N units in the population in case of

SRSWR, so the

probability of selection of ui at any stage = 1/N for all i = 1,2,…,n.

16

Page 17: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Proof: Probability of Selection of a Sample: SRSWR

Then the probability of selection of n units u1, u2,…,un in the

sample is

17

1 2 1 2( , ,..., ) ( ) . ( )... ( )1 1 1 . ...

1 .

n n

n

P u u u P u P u P u

N N N

N

Page 18: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Notations:

Following notations will be used:

N  : Number of sampling units in the population (Population size).

n  : Number of sampling units in the sample (Sample size)

Y : The characteristic under consideration

Yi : Value of characteristic for the ith unit of the population 

(i = 1,2,… 2,…,N)

yi : Value of the characteristic for the ith unit of the sample 

(i = 1, 2,…,n)18

Page 19: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Notations: ExampleY: Height of students in a class

N = 10 : Number of students in the class (Population size)

n = 3 : Number of students in the sample (Sample size)

Yi : Height of ith student in the population

19

Page 20: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

ExampleY: Height of students in a class

N = 10 : Number of students in the class (Population size)

n = 3 : Number of students in the sample (Sample size)

20

Name of Student Yi= Height of students (in Centimeters)

A Y1= 151B Y2= 152C Y3 = 153D Y4= 154E Y5 = 155F Y6= 156G Y7 = 157H Y8= 158I Y9 = 159J Y10= 160

Page 21: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Notations: ExampleSuppose

Y1 = 151 cms., Y2 = 152 cms., Y3 = 153 cms., Y4 = 154 cms., Y5 = 155 cms.,

Y6 = 156 cms., Y7 = 157 cms., Y8 = 158 cms., Y9 = 159 cms., Y10 = 160 cms.,

yi : Height of ith student in the sample

Selected sample = 3rd , 7th and 9th student

y1 = Y3 = 153 cms., y2 = Y7 = 157 cms., y3 = Y9 = 159 cms.

21

Page 22: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Drawing of sampleSuppose we want to select the name of student or Height of the

student.

The data in R will usually be given in a data frame, CSV file or any

other format.

Suppose the data is stored in a data frame heightdata by using

the following commands:

height=c(151,152,153,154,155,156,157,158,159,160)

name=c("A","B","C","D","E","F","G","H","I","J")

heightdata=data.frame(name,height)22

Page 23: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Drawing of sample using R> heightdata

name height1 A 1512 B 1523 C 1534 D 1545 E 1556 F 1567 G 1578 H 1589 I 15910 J 160

> names=heightdata$name> names

[1] A B C D E F G H I JLevels: A B C D E F G H I J

> heights=heightdata$height> heights

[1] 151 152 153 154 155 156 157 158 159 160 23

Page 24: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Drawing of sample using R

24

Page 25: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Drawing of sample using R : SRSWORSuppose we want this sample in terms of names of persons.

sample(names, size=5, replace = FALSE)

> sample(names, size=5, replace = FALSE)[1] G F A B HLevels: A B C D E F G H I J

Suppose we want this sample in terms of heights of persons.

sample(heights, size=5, replace = FALSE)

> sample(heights, size=5, replace = FALSE)[1] 152 156 154 155 158

25

Page 26: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Drawing of sample using R : SRSWOR

26

Page 27: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Drawing of sample using R : SRSWRSuppose we want this sample in terms of names of persons.

Sample of size 5

> sample(names, size=5, replace = TRUE)

[1] F F I E A

Levels: A B C D E F G H I J

Sample of size 8

> sample(names, size=8, replace = TRUE)

[1] C C D D J H G E

Levels: A B C D E F G H I J

27

Page 28: Introduction to Sampling Theoryhome.iitk.ac.in/~shalab/swayamprabha/samp/sp-sampling-lect-5.pdf · Probability of Selection of a Unit by SRSWOR or SRSWR 3 1 234 5 6 7 8 9 10 1 2 Probability

Estimation of population mean: Notations

28

1

1

1 2

1 2

1

1:

...,

...,

N

ii

n

ii

N

n

Y YN

y yn

Y Y Y

y y y

opulation 

 

opulation mean

   Sample mean

:   P

:  Sample

    :   P

    

, ,

, ,