Recap of basic SPSS and statistics, 5th - 9th December 2011, Rome


Manage the database

• Import / export file
• Import variable from another database / merge files
• Restructure cases to variables

Merging datasets

• For each level of investigation in a survey, there is typically a separate dataset.
• For example, if a survey asks questions at the household level, measures the anthropometry of children under 5 and of women of reproductive age, and includes a community-level questionnaire, we would expect four separate datasets to be created.
• To analyse a case in relation to its context, the datasets must be merged.

Merging datasets

• For example, the education level of the household head is recorded in the household dataset. We may want to find out whether the nutritional status of a child is related to the education of the household head, but the child data is in a separate dataset.
• In order to merge the datasets, a common variable must exist in each dataset. In this case, a household identifier must be present in both datasets.

[Figure: the household dataset (Household ID, education level of household head) and the child dataset (Household ID, weight-for-age z-score (WAZ)) share the Household ID variable.]

Merging datasets

• In each dataset, the cases must be sorted on the household identifier.
• In SPSS, select Data > Merge Files > Add Variables, then select the datasets and the variable to match them on.
• The new dataset will include the variable of interest. In our example, we will now have a child dataset that also contains the education level of the household head, and we can do our analysis.

[Figure: household dataset + child dataset, matched on Household ID, give the merged child dataset.]
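The matching logic behind this merge can be sketched outside SPSS. The following Python example is only an illustration under stated assumptions: the variable names (hh_id, head_education, waz), the records, and the function add_household_variables are all hypothetical and do not come from the training material.

```python
# Hypothetical records; variable names and values are illustrative only.
households = [
    {"hh_id": 1, "head_education": "primary"},
    {"hh_id": 2, "head_education": "secondary"},
]
children = [
    {"hh_id": 2, "child_id": 1, "waz": -1.2},
    {"hh_id": 1, "child_id": 1, "waz": -2.4},
    {"hh_id": 2, "child_id": 2, "waz": 0.3},
]


def add_household_variables(children, households, key="hh_id"):
    """Attach household-level variables to every child record sharing the key."""
    lookup = {}
    for hh in households:
        # The household file acts as a lookup table, so each ID must be unique.
        if hh[key] in lookup:
            raise ValueError(f"duplicate household identifier: {hh[key]}")
        lookup[hh[key]] = hh
    merged = []
    for child in children:
        # Children without a matching household keep only their own variables.
        hh = lookup.get(child[key], {})
        merged.append({**hh, **child})
    return merged


merged = add_household_variables(children, households)
for row in merged:
    print(row["hh_id"], row["child_id"], row["waz"], row.get("head_education"))
```

Because this sketch looks households up in a dictionary, the records do not need to be sorted first; the sorting step above is a requirement of the SPSS procedure, not of the matching idea itself.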

Data cleaning

• Unique ID
• Missing data
• Define variable properties
• Scatterplot / histograms
• Frequency sorting
• Outliers

Missing values and data cleaning

• Cleaning data can be a painful process.
• Being systematic about cleaning data from the beginning of the process can save hours of work later in the analysis.
• There are a few key tools to use in SPSS to clean data:
  • Sorting cases – allows you to quickly see whether a variable contains problematic cases
  • Identify duplicate cases – shows cases that have the same unique identifier
  • Histograms and scatterplots – visually identify problematic variables and cases
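The duplicate check listed above amounts to counting how often each identifier occurs. A minimal Python sketch of that idea follows; the identifiers and the function name find_duplicate_ids are made up for illustration, and this is not what SPSS runs internally.

```python
from collections import Counter

# Hypothetical case identifiers; in practice these come from the dataset's ID variable.
case_ids = [101, 102, 103, 102, 104, 101]


def find_duplicate_ids(ids):
    """Return the identifiers that occur more than once, in sorted order."""
    counts = Counter(ids)
    return sorted(i for i, n in counts.items() if n > 1)


duplicates = find_duplicate_ids(case_ids)
print(duplicates)  # [101, 102]
```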

Missing values and data cleaning

• The data cleaning process will also reveal cases where values are missing for certain variables.
• This is often expected (though in some cases it may have been an error).
• Handling missing values in SPSS is a simple matter of telling the software, in the Variable View, which values to treat as missing.
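Declaring missing values means that certain codes are excluded from calculations instead of being treated as real answers. The Python sketch below illustrates the effect; the codes 99 and -1 are an assumption made for this example, since the actual missing codes depend on the questionnaire.

```python
# Assumption: 99 and -1 are the codes this hypothetical survey uses for "missing".
MISSING_CODES = {99, -1}


def mark_missing(values, missing_codes=MISSING_CODES):
    """Replace declared missing codes with None so they drop out of statistics."""
    return [None if v in missing_codes else v for v in values]


def mean_of_valid(values):
    """Mean of the non-missing values, or None if every value is missing."""
    valid = [v for v in values if v is not None]
    return sum(valid) / len(valid) if valid else None


ages = [34, 99, 27, -1, 41]  # made-up ages of household heads
cleaned = mark_missing(ages)
print(cleaned)                 # [34, None, 27, None, 41]
print(mean_of_valid(cleaned))  # 34.0
```

Without the missing-value declaration, the mean of the raw list would be distorted by the codes 99 and -1.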

Getting ready for analysis

• Weight file
• Split file
• Select cases

Analysis

• Create new variables: Recode, Count, Compute, Rank cases (quintiles), Aggregate
• Frequencies
• Compare means
• Crosstabs

Create new variables using recode

• Recoding is most commonly used to take a categorical variable and re-categorize its values.
• For example, source of drinking water is a standard question in household surveys, with several options that are adapted to the local context.
• When describing water sources in analysis, we usually compare improved vs. unimproved water sources.
• In the example below, the first box represents a module in the household questionnaire and the second box represents the categorization of improved vs. unimproved water sources. If we want to recode the question responses into a binary (two-category) variable, how do we do so in SPSS?

4.2 What is the main source of drinking water for your household? (Circle one)

• 1 = Piped water
• 2 = Well (protected)
• 3 = Well (unprotected)
• 4 = River, stream or pond
• 5 = Collecting rainwater
• 6 = Tanker truck water

Improved source          Unimproved source
Piped water              Well (unprotected)
Well (protected)         River, stream or pond
Collecting rainwater     Tanker truck water
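The recode described by the two boxes can be written as a simple mapping from the six response codes to two categories. This Python sketch follows the categorization above; the function name and the 1/0 coding of the new variable are assumptions chosen for illustration.

```python
# Response codes follow question 4.2; the 1/0 output coding is an assumption.
IMPROVED_CODES = {1, 2, 5}    # piped water, protected well, collecting rainwater
UNIMPROVED_CODES = {3, 4, 6}  # unprotected well, river/stream/pond, tanker truck


def recode_water_source(code):
    """Return 1 for an improved source, 0 for an unimproved one, None otherwise."""
    if code in IMPROVED_CODES:
        return 1
    if code in UNIMPROVED_CODES:
        return 0
    return None  # missing or out-of-range response stays missing


responses = [1, 3, 5, 6, 2, 4, None]  # made-up answers from seven households
improved = [recode_water_source(r) for r in responses]
print(improved)  # [1, 0, 1, 0, 1, 0, None]
```

Writing the result into a new variable rather than overwriting the original keeps the detailed responses available for later checks.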

Creating a new variable using compute

• Computing a new variable is usually done when a mathematical formula is used to derive it.
• A number of circumstances in a household questionnaire require computation.
• For example, a commonly used indicator when discussing demographics in assessments is the percentage of dependents in a household.
• Given the household questionnaire roster below, how can we create a variable for the percentage of dependents (where dependents are people under 15 or aged 65 and over)?

1.5 Please complete the household demographics table on the right. Record the number of individuals in each age category, differentiated by males and females.

Age                      Male       Female
a. 0-5 years             |__|__|    |__|__|
b. 6-14 years            |__|__|    |__|__|
c. 15-64 years           |__|__|    |__|__|
d. 65 years or older     |__|__|    |__|__|
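One way to derive the indicator from this roster is to add up the members in rows a, b and d (both sexes), divide by the total of all four rows, and multiply by 100. The Python sketch below illustrates that arithmetic; the dictionary keys, the function name and the example household are hypothetical.

```python
def percent_dependents(roster):
    """Percentage of household members in the dependent age groups.

    roster maps each age group to a (males, females) pair of counts.
    Returns None for a household with no recorded members.
    """
    dependents = sum(roster["0-5"]) + sum(roster["6-14"]) + sum(roster["65+"])
    total = dependents + sum(roster["15-64"])
    if total == 0:
        return None  # avoid dividing by zero when the roster is empty
    return 100 * dependents / total


# Made-up household: 4 dependents out of 6 members.
household = {"0-5": (1, 0), "6-14": (1, 1), "15-64": (1, 1), "65+": (0, 1)}
result = percent_dependents(household)
print(round(result, 1))  # 66.7
```

Returning None for an empty roster mirrors how a computed variable is left missing when the formula cannot be evaluated.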

Type of variables

We work with two types of variables:

Continuous (Scale)
• Interval, e.g. age, 1 to n
• Ratio, e.g. percentage of expenditure, 0% to 100%

Categorical
• Nominal: the categories are not ranked, e.g. 1 = female, 2 = male
• Ordinal: the categories are ranked, e.g. 1 = poor, 2 = medium, 3 = good

Type of variables

                                                  Quantitative    Qualitative: Ordinal    Qualitative: Nominal
Do arithmetic operations on values make sense?    Yes             No                      No
Are values ordered?                               Yes             Yes                     No
Types of values                                   Numeric         Alphanumeric codes      Alphanumeric codes

Descriptive statistics

• Continuous: Range, Mean, Median, Mode
• Categorical: Frequencies, Crosstabs
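As a rough illustration of what these statistics compute, the Python sketch below uses only the standard library. All values are invented for the example, and the output is not formatted like SPSS tables.

```python
from collections import Counter
from statistics import mean, median, mode

# Continuous variable: made-up ages of five respondents.
ages = [23, 35, 35, 41, 52]
summary = {
    "range": max(ages) - min(ages),
    "mean": mean(ages),
    "median": median(ages),
    "mode": mode(ages),
}
print(summary)

# Categorical variables: made-up sex and water source for the same respondents.
sex = ["female", "male", "female", "female", "male"]
water = ["improved", "improved", "unimproved", "improved", "unimproved"]

frequencies = Counter(sex)           # one-way frequency table
crosstab = Counter(zip(sex, water))  # two-way table: count per (sex, water) pair
print(frequencies)
print(crosstab)
```

Here the crosstab is simply a count of each combination of categories, which is the same information a two-way table displays in rows and columns.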

Best practices

• Using syntax
• Export files/outputs
• Data file comments
