18
Paradata on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

Paradata on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

  • Upload
    waylon

  • View
    98

  • Download
    0

Embed Size (px)

DESCRIPTION

Paradata on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012. Agenda. Review of what Paradata is Paradata collection on NHANES Analysis of NHANES Paradata Conclusions and Recommendations. Paradata – Definition. - PowerPoint PPT Presentation

Citation preview

Page 1: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

Paradata on the NHANES Survey

George ZipfNational Center for Health Statistics

August 7, 2012

Page 2: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

Agenda

• Review of what Paradata is

• Paradata collection on NHANES

• Analysis of NHANES Paradata

• Conclusions and Recommendations

Page 3: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

Paradata – Definition

• Paradata are data about the data collection process.

• Examples include:• Time and date of data collection• Interviewer• Wave of data collection, when samples are

sequentially released

Page 4: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

Paradata – Current Collectors

• U.S. Census Bureau

• Bureau of Labor Statistics

• National Center for Health Statistics

Paradata has entered the lexicon and is entering field operations

Page 5: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

Paradata – Means of Collection

• Data Entry

• Digital Pen

• Teleform Scanning

• Electronic Record of Contact (eROC)

Page 6: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

Paradata – Value

• Cost per Response

• Responsive Design

• Correlation between paradata and survey values

Page 7: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

NHANES Overview – Picture• Obtain national medical prevalence estimates,

1999 – present

Page 8: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

NHANES Paradata – Teleform

Page 9: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

NHANES Paradata – Teleform

Page 10: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

NHANES Paradata Data Quality

Completeness:• Missing data

Validity• Out-of range• Vague

Three variables will be discussed here:• Day of Week• Time of Day• Disposition Code

Page 11: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

NHANES Paradata Data Quality

Day of Week:• Missing data: 12 obs.• Out-of-Range: 0 obs.• Vague: 4 obs. (1 “S” and 3 “T”)

Given that there were 47,659 contact attempts in 2011, post-QC day of week data quality was extremely high.

Page 12: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

NHANES Paradata Data Quality

Time of Day:• Missing data: 43 obs. (0.1% of total)• Out-of-Range: 2 obs. (both “15 PM”)• Missing AM/PM: 397 obs. (0.8% of total)• Other: a few values recorded a contact attempt

at 1:00 AM, 1:25 AM, 3:10 AM, etc.

Page 13: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

NHANES Paradata Data Quality

Disposition Code:• Missing data: 3 observations• Out-of-Range: 13 observations• Vague: 0

Page 14: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

NHANES 2011 Paradata ResultsPreviously available information:• 13,244 addresses selected for screening in 2011• 6,555 survey participants (SP) identified• 2,975 addresses with eligible SPs

New information provided by paradata:• 47,659 contact attempts• 3.6 average number of contacts per address• 7.3 average number of contacts per SP• 39% Screeners closed on 1st contact

Page 15: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

NHANES Paradata Results – 2011

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 200

20

40

60

80

100

Cumulative Percent Screeners Closed

Number of Contact Attempts

• How many contacts is enough?

Page 16: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

NHANES Paradata Results – PSUs

1 2 3 4 5 60%

20%

40%

60%

80%

100%

Best and Worst PSU

Number of Contacts

Cum

. % S

cree

ners

Clo

sed

• We love small rural stands• We do not love large, congested, urban areas

Rural

Urban

Page 17: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

NHANES Paradata Results – “Not at Home”• NHANES contacted 13,244 households in 2011• “Not at Home” is the key factor in field operations

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 200

1000

2000

3000

4000

5000

6000

7000

0%

20%

40%

60%

80%

100%

Stand Progress: Disposition = "Not at Home"

# HH % Not at Home

# H

ouse

hold

s Not

at H

ome

for

each

Con

tact

Atte

mpt

% N

ot a

t Hom

e fo

r ea

ch C

onta

ct A

ttem

pt

Page 18: Paradata  on the NHANES Survey George Zipf National Center for Health Statistics August 7, 2012

Conclusion

Teleforms:• Variables – acceptable• Data Quality – acceptable• Cost – not cheap when data clean-up and training are

included• Timeliness – slow, not actionable in field

Next steps:• Consider use paradata to set contact attempt limits

(e.g. NHANES should probably be at 12)• Refine analysis on time of day, day of week, and

merge with demographic information, census data