A STRUCTURAL ANALYSIS OF THE SOCIAL SKILLS …

The Pennsylvania State University

The Graduate School

College of Education

A STRUCTURAL ANALYSIS OF THE SOCIAL SKILLS IMPROVEMENT SYSTEM

RATING SCALES, PARENT FORM: MEASUREMENT INVARIANCE ACROSS RACE

AND LANGUAGE FORMAT

A Dissertation in

School Psychology

by

Brian P. Schneider

2012

Submitted in Partial Fulfillment

of the Requirements

for the Degree of

Doctor of Philosophy

December 2012

ii

The dissertation of Brian Schneider was reviewed and approved* by the following:

James C. DiPerna

Associate Professor of Education

Dissertation Advisor

Chair of Committee

Professor in Charge of the Program of School Psychology

Robert L. Hale

Professor of Education

Jonna M. Kulikowich


Keith B. Wilson

Special Member


Southern Illinois University

*Signatures are on file in the Graduate School

iii

ABSTRACT

The purpose of the current study was to evaluate aspects of structural validity for the

Social Skills Improvement System Rating Scales, Parent Form (SSIS-PF). Data were obtained

from the SSIS-PF standardization sample. Confirmatory factor analysis (CFA) was applied to

examine the instrument’s first-order and higher-order measurement structures. Resulting baseline

measurement models were subsequently analyzed for invariance across two variables.

Specifically, measurement invariance was examined as a function of race/ethnicity using

subsamples of African American, Latino, and Caucasian children. Invariance then was examined

as a function of the language in which rating scales were written (i.e., English or Spanish). For

both analyses, multi-sample CFA procedures were used to examine invariance at the configural,

metric, and structural levels. Initial analyses provided support for the first-order measurement

structure of the SSIS-PF, though there was some evidence of a lack of discriminant validity

between select subscales (Cooperation and Responsibility). The instrument’s higher-order

measurement structure showed evidence of reduced fit to standardization data relative to the first-

order model. Follow-up analysis of the higher-order measurement structure of the SSIS-PF was

conducted, and an alternative structure was identified. Results of the invariance analyses with

first-order baseline models suggested that the SSIS-PF demonstrates configural, metric, and

structural invariance as a function of race/ethnicity. Configural and metric invariance were

supported in the language format analysis, but structural invariance was not observed across

English and Spanish language groups. Results are discussed in reference to implications for the

use of the SSIS-PF as well as broader considerations for cross-cultural social skills assessments.

iv

TABLE OF CONTENTS

LIST OF FIGURES ................................................................................................................. vi

LIST OF TABLES ................................................................................................................... vii

ACKNOWLEDGEMENTS ..................................................................................................... ix

Chapter 1 Introduction ............................................................................................................ 1

The Need for Cross-cultural Social Skills Research ........................................................ 1 Demographic Trends: The U.S. Hispanic/Latino Population........................................... 2

Goals and Objectives ....................................................................................................... 4

Chapter 2 Literature Review ................................................................................................... 5

Perspectives on Social Competence ................................................................................. 6 Divergent Perspectives on Social Competence ........................................................ 7

Joining Perspectives: Comprehensive Models of Social Competence ..................... 7

The Issue of Context ................................................................................................ 8 Emphasizing the Social Skills Dimension of Social Competence ................................... 9

Defining Social Skills............................................................................................... 9

Toward a Taxonomy of Social Skills ....................................................................... 11 Social Skills in Context ............................................................................................ 13

Cultural Influences on Social Skill Development ............................................................ 15

Traditional Latino Parenting Practices ..................................................................... 16

Within-group Differences: The Effects of Acculturation ......................................... 17

Cultural Interaction and the Effects of Schooling .................................................... 19

Social Skills Assessment .................................................................................................. 21

Cross-cultural Social Skills Assessment .................................................................. 23

Social Skills Improvement System .................................................................................. 23

Social Skills Rating System ..................................................................................... 25

Structural Investigations of the SSRS ...................................................................... 26

Cross-cultural Applications of the SSRS ................................................................. 27

Research Questions .......................................................................................................... 28

Chapter 3 Method ................................................................................................................... 30

Participants ....................................................................................................................... 30

Measures .......................................................................................................................... 30

SSIS Rating Scales ...................................................................................................... 30

SSIS Rating Scales, Spanish Format .......................................................................... 33

Procedures ........................................................................................................................ 34

Design and Analysis ......................................................................................................... 34

Structural Analyses ..................................................................................................... 34

Parceling Technique ................................................................................................... 35

Assessing Model Fit .................................................................................................... 43

Analysis of Invariance ................................................................................................ 44

v

Chapter 4 Results .................................................................................................................... 46

Preliminary Analyses and Data Preparation .................................................................... 46

Analysis of SSIS-PF Invariance by Race/Ethnicity ......................................................... 59

Analysis of SSIS-PF Higher-Order Factor Structure ....................................................... 71 Analysis of SSIS-PF Invariance by Language Format..................................................... 80 Analysis of the Higher-Order Factor Structure for the SSIS-PF English Language

Format and Spanish Language Format ..................................................................... 86

Chapter 5 Discussion .............................................................................................................. 90

Overview .......................................................................................................................... 90

Primary Findings .............................................................................................................. 90

Factor Structure of the SSIS-PF ............................................................................... 90

SSIS-PF Measurement Invariance: Race/Ethnicity .................................................. 92

SSIS-PF Measurement Invariance: Language Format ............................................. 93

Interpretation of Primary Findings in the Context of Prior Research .............................. 94

SSIS-PF Structure and the Social Skills Construct .................................................. 94

Measuring Social Skills across Cultural Groups ...................................................... 96

Limitations and Future Directions ................................................................................... 98

Data Limitations ....................................................................................................... 98

Design Limitations ................................................................................................... 100

Implications for the Use of the SSIS-PF in Research and Practice .................................. 103

Implicaitons for Cross-Cultural Social Skills Assessment ............................................... 104

Conclusions ...................................................................................................................... 105

References ................................................................................................................................ 107

vi

LIST OF FIGURES

Figure 1: First-order baseline model for the analysis of invariance by race/ethnicity,

Model 1.1. ........................................................................................................................ 63

Figure 2: Higher-order basleline model for the analysis of invariance by race/ethnicity,

Model 1.2. ........................................................................................................................ 64

Figure 3: Revised baseline model for the analysis of invariance by race/ethnicity,

Model 1.3. ........................................................................................................................ 68

Figure 4: Modified SSIS-PF higher-order factor structure, Model 2.2 ................................... 74


Figure 6: Modified SSIS-PF higher-order factor structure, Model 2.4. .................................. 77


Figure 8: Revised baseline model for the analysis of invariance by language format,

Model 3.2 ......................................................................................................................... 82

Figure 9: Revised SSIS-PF higher-order factor structure, Model 4.1, fit to data from

English langauge-format subsample ................................................................................ 88

Figure 10: Revised SSIS-PF higher-order factor structure, Model 4.1, fit to data from

Spanish language format subsample ................................................................................ 89

vii

LIST OF TABLES

Table 1: Frequency Distributions for Demographic Variables from the SSIS Rating

Scales Standardization Sample that were Included in the Current Study ........................ 31

Table 2: Summary of Item-Parcel Assignment for Analysis of SSIS-PF Measurement

Invariance by Race/Ethnicity and Language Format ....................................................... 36

Table 3: Means and Standard Deviations for Item-Parcels and Control Variables used in

Analysis of Invariance by Race/Ethnicity ........................................................................ 47

Table 4: Means and Standard Deviations for Item-Parcels and Control Variables used in

Analysis of Invariance by Language Format ................................................................... 48

Table 5: Bivariate Correlations of SSIS Rating Scale Latent Factor Indicators and Control

Variables for Full Race/Ethnicity Invariance Sample and African American

Subsample ........................................................................................................................ 49

Table 6: Bivariate Correlations of SSIS Rating Scale Latent Factor Indicators and

Control Variables for Caucasian and Latino Subsamples in Race/Ethnicity

Invariance Analysis. ......................................................................................................... 51

Table 7: Bivariate Correlations of SSIS Rating Scale Latent Factor Indicators and Control

Variables for Full Language Format Invariance Sample. ................................................ 53

Table 8: Bivariate Correlations of SSIS Rating Scale Latent Factor Indicators and

Control Variables for English Language Format and Spanish Language Format

Subsamples in Language Format Invariance Analysis ................................................... 55

Table 9: Internal Consistency Coefficients and Intercorrelations for SSIS-PF Social Skills

Subscales based on Race/Ethnicity Item Parcels and Manual Reported Item-Level

Data .................................................................................................................................. 57

Table 10: Internal Consistency Coefficients and Intercorrelations for SSIS-PF Social

Skills Subscales based on Language Format Item Parcels and Manual Reported

Item-Level Data .............................................................................................................. 58

Table 11: Global Fit Statistics for SSIS-PF Baseline Models fitted to Samples with and

without Multivariate and Univariate Outliers .................................................................. 60

Table 12: Global Fit Statistics for SSIS-PF Measurement Model at Successive Stages of

Invariance Analysis by Race/Ethnicity ............................................................................ 62

Table 13: Estimated Latent Factor Correlations for SSIS-PF 7-Factor Baseline

Measurement Model by Race/Ethnicity Group ................................................................ 66

viii

Table 14: Unstandardized and Standardized Factor Loadings by Race/Ethnicity for

Configural Invariance Model ........................................................................................... 70

Table 15: Global Fit Statistics for SSIS-PF Higher-Order Measurement Model at

Successive Stages of Post-Hoc Model Fitting ................................................................. 72

Table 16: Global Fit Statistics for SSIS-PF Measurement Model at Successive Stages of

Invariance Analysis by Language Format ........................................................................ 84

Table 17: Unstandardized and Standardized Factor Loadings by Language Format for

Configural Invariance Model ........................................................................................... 85

Table 18: Global Fit Statistics for Proposed SSIS-PF Higher-Order Measurement Model

with Language Format Samples ....................................................................................... 87

ix

ACKNOWLEDGEMENTS

This project would not have been possible without the support of many individuals. In

particular, I would like to thank my dissertation advisor and committee chair, Dr. James DiPerna.

Your guidance, advice, and encouragement have been invaluable. I would also like to thank the

other members of my committee, Dr. Robert Hale, Dr. Jonna Kulikowich, and Dr. Keith Wilson,

for your expertise, thoughtful consideration, and constructive feedback. In addition, special

thanks to Ms. Becky Holter and Dr. Shirley Woika for all that you do for the Penn State School

Psychology program and its students.

I would like to express my sincerest gratitude and appreciation to Dr. Frank Gresham and

Dr. Stephen Elliott for supporting my request to use the SSIS standardization data for this project.

Additional thanks to Pearson, Inc. for approving that request and preparing the SSIS Rating

Scales standardization dataset for use in this project. It is my hope that the current research will

serve to promote further investigations and applications of this already widely used assessment

instrument.

Finally, heartfelt thanks to my parents, Greg and Nancy Schneider, for your constant love

and encouragement; to Chris, Steve, and Melissa, for being there, always; and to all of the friends

and family members who have helped and supported me along the way.

1

Chapter 1

Introduction

Outcomes associated with children and adolescents’ development and exhibition of social

skills have been well documented. Deficits in social competence and social skills have been

linked with a number of negative outcomes including elevated rates of problem behavior (Lane,

Carter, & Piers, 2006; Skoulos & Shick Tryon, 2007; Webster-Stratton, Reid, & Hammond,

2001), increased risk for depression (Bell-Dolan, Reaven, & Peterson, 1993; Ward, Sylva, &

Gresham, 2010), difficulty with the development and maintenance of peer relationships (Ladd,

1999), and future need for psychiatric services (Cowen, Pederson, Babigan, Izzo, & Trost, 1978).

In contrast, children and adolescents who demonstrate appropriate social skills tend to have better

outcomes. For example, prosocial behavior has been linked with greater peer acceptance, and

positive peer relationships have been linked with a variety of benefits for individual development

(Gifford-Smith & Brownell, 2003; Ladd, 1999; Ladd, Karchenderfer, & Coleman, 1996; Parker

& Asher, 1993; Weiner, 2004). Research also has documented the positive effects of social skills

training interventions for different at-risk populations (Ang & Hughes, 2002; Barrera & Schulte,

2010; Gresham, Cook, Crews, & Kern, 2004; Gresham, Van, & Cook, 2006; Najaka, Gottfredson,

& Wilson, 2001; Reichow & Volkmar, 2010; Webster-Stratton, Reid, & Hammond, 2001).

Finally, in the school setting, social skills have been shown to bear positive influence on students’

learning-related behavior and academic achievement (Wentzel, 1993; Zins, Bloodworth,

Weissberg, & Walberg, 2007).

The Need for Cross-cultural Social Skills Research

Social skills are generally considered to be an important protective factor for all

individuals. Still, cross-cultural social skills research is relatively limited. As such, the degree to

2

which previously cited findings generalize across different cultural groups, even those within the

United States, remains unclear. What is clear is that racial and ethnic minority group

representation within the U.S. school-age population is increasing at a steady rate (KewalRamani,

Gilbertson, Fox, & Provasnik, 2007). Moreover, students from racial and ethnic minority groups

continue to be overrepresented in various demographic risk categories and continue to show

chronic academic underachievement, elevated rates of school dropout, and poorer long-term

outcomes (KewalRamani et al., 2007).

Exploring possible ways to improve academic and developmental outcomes for minority

students is an important research objective. As such, the benefits associated with the acquisition

and use of social skills need to be more closely examined from a cross-cultural perspective.

Research aimed at better understanding the social skills construct within specific groups is also

needed. And, as a prerequisite for accomplishing these larger research goals, methods of

assessing social skills within specified populations must be shown to produce scores that are

appropriately reliable and valid (American Educational Research Association, American

Psychological Association, and National Council on Measurement in Education, 1999).

Developing new and distinctive assessment models for different groups is one alternative that

may be considered in attempting to comply with established measurement standards when

working with diverse populations. However, a more parsimonious alternative is to first examine

the validity of scores produced by existing social skills assessment models when used with

specific subgroups from the larger population.

Demographic Trends: The U.S. Hispanic/Latino Population

A significant portion of recent U.S. population growth has taken place within the

Hispanic or Latino1 subpopulation (KewalRamani et al., 2007). As a heterogeneous group, the

1 In order to remain consistent, the term Latino will be used throughout the document when referring to individuals who self-identify

as either Hispanic or Latino. Though sometimes used interchangeably, the term Latino has been applied more frequently in recent scholarly literature.

3

nativity of Latino Americans can be traced back to a number of different countries and

geographic regions including Mexico, Central and South America, and the Caribbean.

Collectively, Latinos comprise the largest minority population within the United States,

representing approximately 15.8% of the total U.S. population (U.S. Census Bureau, 2009). A

significant proportion of U.S. Latinos are foreign born, with 11% of the under-18 Latino

population and 40% of the total Latino population having been born somewhere outside the

United States (KewelRamani et al., 2007). Moreover, roughly 77% of Latinos in the U.S. report

speaking a language other than English in the home (KewelRamani et al., 2007). These

demographic data suggest that many Latino Americans maintain, to varying degrees, a number of

the traditional values, beliefs, and practices of their native cultures. Being embedded in the larger

and, at times, incongruous ‘American’ majority culture, unique cultural influences may affect

outcomes for Latino students in U.S. schools.

Additional characteristics of the Latino American population, independent of ethnically

influenced ‘culture’, also must be considered when comparing outcomes across groups. For

example, Latinos are overrepresented in risk categories defined by poverty status (25% of all

Latino families) and low parental education (41% of Latino mothers and fathers did not complete

high school; KewelRamani et al., 2007). It is often difficult to separate out effects that are

influenced by cultural factors from those that are driven by other factors that simply covary with

group membership. Nevertheless, recent contributions to cross-cultural and culture specific

research have prompted theorists to replace the once commonly accepted developmental risk

framework for minority children with more nuanced ecocultural systems frameworks that

recognize the positive, negative, and neutral influences of cultural group membership on

individual outcomes (Fuller & Garciá-Coll, 2010; Rogoff, 2003).

4

Goals and Objectives

In light of the demographic trends highlighted above, it is clear that U.S. school systems

must work to accommodate increasing diversity within the student body they stand to serve. In

reference to the Latino population(s) in particular, schools must be aware of cultural and

linguistic variation among students and recognize that existing models of instruction, assessment,

and intervention may need to be modified in order to best serve these groups. Educational

researchers also have a responsibility to facilitate this process by prioritizing cross-cultural and

group-specific research.

The primary objective of the current study is to evaluate the structural validity of the

Social Skills Improvement System Rating Scales (Gresham & Elliott, 2008). Specifically,

structural invariance of the measurement model is examined as a function of both student

race/ethnicity and the language in which the scales are written (i.e., language format;

Spanish/English). Through this process two broader research objectives also are addressed. The

first is to investigate the degree to which the social skills construct can be operationalized in a

way that is not bound by cultural norms. The second is to evaluate the appropriateness of using

standardized behavior rating scales and norm-referenced scores within culturally diverse

populations. Before outlining the methodology used for the current investigation, a review of

pertinent literatures is presented in the following chapter.

5

Chapter 2

Literature Review

Motivated by findings linking skilled social behavior with a variety of positive outcomes,

the late 1970s and early 1980s saw a notable increase in the amount of research pertaining to

social skill development (Gresham, 1986). As an emerging research domain, initial attempts to

conceptualize social skill yielded two competing theoretical perspectives: a trait model and a

molecular model (McFall, 1982). The trait model was described as emphasizing an individual’s

underlying capacity for performing in socially relevant contexts. In contrast, the molecular model

emphasized linkages between discrete, observable behaviors and identifiable social outcomes.

Recognizing that the two models were not entirely incompatible, Gresham (1986) argued for a

model that incorporated both perspectives.

According to Gresham (1986), a trait perspective is useful for understanding the higher

order construct of social competence, which he cited as an evaluative term to be used broadly for

the purpose of describing an individual’s social behavior as appropriate/inappropriate or

successful/unsuccessful. In defining the components of social competence, however, Gresham

(1986) opted for a more molecular approach. Specifically, he asserted the importance of the

behavioral construct of social skills, defined functionally as behaviors that maximize the

likelihood of reinforcement and decrease the likelihood of punishment within a given situation

(Gresham, 1981; Gresham & Reschly, 1987).

Gresham’s (1986) focus on the behavioral components of social competence was driven

by practical considerations. Behavioral constructs are particularly useful for the applied practices

of assessment and intervention. It follows that Gresham and colleagues’ early work led to the

development of a detailed assessment for intervention paradigm (Gresham, 1981; Gresham, 1986;

6

Gresham & Reschly, 1986, 1987; Elliott & Gresham, 1987), and ultimately to the publication of

the Social Skills Rating System (SSRS; Gresham & Elliot, 1990). Recently, the SSRS was

revised and incorporated into the comprehensive Social Skills Improvement System (SSIS;

Gresham & Elliott, 2008), which includes assessment and intervention components at the

universal, selected, and targeted levels. The current study examines specific measurement

properties of the newly published SSIS Rating Scales across race/ethnicity and language format

groups. However, before describing the study objectives in more detail, a review of the broader

literatures on social competence, social skills, and cultural influences on social skill development

is provided.

Perspectives on Social Competence

Social competence is widely recognized as a critically important developmental

construct. In general, social competence is viewed broadly as referring to one’s ability to

function successfully within social contexts (Gresham, 1986; McFall, 1982; Merrell, 1999;

Odom, McConnell, & Brown, 2008). The most appropriate means of conceptualizing social

competence as a psychological construct has been a long-standing topic of debate (see Gresham,

1986; McFall, 1982; Merrell & Gimpel, 1998). Greenspan (1981) offered a tripartite model of

social competence in an attempt to consolidate three divergent approaches that had been

frequently applied to conceptualize the construct. As described by Greenspan (1981; see also

Gresham 1986), skill-oriented approaches are those that prioritize process variables associated

with social interaction. Individuals who have developed an intuitive understanding of the rules

and scripts for appropriate interpersonal interaction are, from a skill-oriented perspective, deemed

socially competent. Outcomes-oriented approaches view social competence retroactively on the

basis of important social achievements. Positive achievements (i.e., outcomes) are taken as an

indication that an individual is socially competent. Finally, content-oriented approaches focus on

the exhibition of specific behaviors presumed to be predictive of successful social functioning.

7

From this perspective, individuals who demonstrate appropriate behaviors relative to situational

demands are said to demonstrate social competence.

Divergent Perspectives on Social Competence. In the fields of social and cognitive

psychology, child development, special education, and school psychology, divergent strands of

research continue to reflect the three social competence orientations identified by Greenspan

(1981). Researchers adopting a social-cognitive perspective have expounded on the nature of

cognitive processes that preempt social behavior. For example, social information-processing

(SIP) theory (Crick & Dodge, 1994) corresponds with a skill-oriented perspective in

conceptualizing behavior as a product of the interaction between the processing of situational

information and pre-existing cognitive structures that have developed over time through prior

experience. Developmental researchers have tended to align with outcomes-oriented

conceptualizations of social competence, emphasizing the importance of outcomes such as peer

acceptance and peer relationships as indicators of present levels of social competence and

predictors of long term adjustment (Ladd, 1999; Ladd, Herald, & Andrews, 2005). Finally,

researchers in applied fields such as special education and school psychology have tended to

adopt content-oriented approaches, focusing on the acquisition and performance of specific skills

that promote positive interactions and reinforcement in social contexts (Gresham & Elliott, 1990;

Merrell, 2001).

Joining Perspectives: Comprehensive Models of Social Competence. The

aforementioned distinctions are not rigid, and across research fields, comprehensive models of

social functioning that incorporate aspects of all three conceptual perspectives tend to have the

most explanatory power. For example, a recent study in the field of child development

investigated a three-component developmental cascade model, which analyzed the cyclical and

reciprocal relations between SIP, peer rejection, and aggression across the early school years

(Lansford, Malone, Dodge, Pettit, & Bates, 2010). The model provides a very useful illustration

8

of interactions between skill (e.g., SIP), outcome (e.g., peer rejection), and content (e.g.,

aggression) factors, and shows how all three function together to contribute to patterns of social

development and the emergence of social dysfunction. In a recently published book chapter,

Odom, McConnell, and Brown (2008) also advocate for a combination of conceptual approaches,

arguing that social competence involves both the selection of appropriate behavioral strategies

and the subsequent attainment of social goals. Neither appropriate strategy selection, nor social

goal attainment, in isolation would suffice an indicator of social competence. Moreover, as a

means of attaining social goals, individuals draw on complex repertoires of cognitive, emotional,

and behavioral competencies (Greenspan, 1981; Merrell, 1999; Odom, McConnell, & Brown,

2008), which suggests that skill and content factors contribute to individual outcomes. In sum,

the compatibility of all three social competence perspectives is evident, and inter-disciplinary

distinctions appear to be an artifact of the specific goals of researchers in different fields.

The Issue of Context. Though social competence is often framed as a characteristic of

the individual, comprehensive conceptualizations of social competence also recognize the

influence that contextual factors may have on an individual’s behavior. For example, Odom,

McConnell and Brown (2008) have categorized factors contributing to social competence across

two broad dimensions: those originating within the child (e.g., neurology, temperament,

cognition), and those influencing the child from the outside (e.g., family, school, peers, culture).

These latter factors represent features of the ecological context in which individuals are situated.

In reference to specific events, features of the immediate context are also relevant, as

these may dictate an individual’s selection of alternate behavioral strategies. A skilled, effective

strategy in one context may be considered unskilled or ineffective in a different context.

Therefore, socially competent individuals may not enact uniform behavioral solutions in pursuit

of social goals across unique contexts (McFall, 1982). In fact, noting that situational variables

represent a critical dimension that must be considered within any valid assessment of social

9

behavior, McFall (1982) advocated for an assessment approach that emphasized analysis of tasks

as opposed to analysis of individuals. Though task-based/role-play assessment research and

practice has faded over the past few decades (Matson & Wilkins, 2009), contextual

considerations in the conceptualization of social behavior remain important. For example,

Sheridan and Walker (1999) outlined an ecological-contextual model of social behavior, rooted in

social cognitive theory, which contends that social behaviors are expressed as a product of the

interaction between three sets of factors: characteristics of the child, characteristics of the

individual(s) with whom the child interacts, and features of the context in which interactions take

place. In sum, a general consensus suggests that contextual factors are influential in

understanding social competence and, more narrowly, in determining the expression social

behavior. A more thorough examination of contextual considerations in the assessment of social

skills is presented later.

Emphasizing the Social Skills Dimension of Social Competence

As noted, social competence is a complex multi-dimensional construct, the assessment of

which presents a number of obstacles. All facets of social competence cannot be readily

observed. However, the discrete behaviors demonstrated by individuals in social situations

represent one facet of social competence that researchers have been able to operationally define

and reliably measure. The proliferation of social skills research over the past several decades

attests to this fact (Matson & Wilkins, 2009; Merrell & Gimpel, 1998).

Defining Social Skills. Although a number of different conceptual approaches have been

applied in attempts to define the social skills construct (Gresham, 1986), several common

construct features are generally agreed upon by researchers in the field (Merrell & Gimpel, 1998).

For one, social skills are fundamentally interactive and include behaviors associated with

interaction initiation and interaction response (Merrell & Gimpel, 1998). Social skills are also

understood to be situation specific, as dictated by the context in which they are employed

10

(Sheridan & Walker, 1999). Finally, from a behavioral perspective – which is the perspective

that has been most frequently applied in the scholarly literature – social skills are discrete,

observable, learned behaviors that maximize reinforcement in the social setting (Gresham, 1986).

The distinctive features listed above have provided a useful structure for research in the

social skills domain. However, on the basis of these features alone, the social skills construct

remains somewhat broad and unfocused. For this reason, a social validity approach to social

skills research has become increasingly popular. The social validity approach maintains the same

basic definitional premises, but further stipulates that social skills predictive of important social

outcomes should be the primary focus of investigation (Gresham, 1986; Merrell & Gimpel, 1998;

Sheridan & Walker, 1999). Social validity approaches prioritize those specific skills that are

shown to be important in particular contexts. Although traditional behavioral definitions of social

skills have been described as optimal for the purpose of assessment (Caldarella & Merrell, 1997),

social validity definitions in particular have been identified as most useful in the assessment-for-

intervention process (Elliott, Gresham, Frank, & Beddow, 2008).

As it relates to the current study, the SSIS Rating Scales manual defines social skills as

“learned behaviors that promote positive interactions while simultaneously discouraging negative

interactions when applied to appropriate social situations” (Gresham & Elliott, 2008, p. 1). Thus,

from a conceptual standpoint, the SSIS adopts a relatively straightforward behavioral definition

of social skills as identified by their situation-specific function. In application, however, certain

key features of the SSIS Rating Scales are clearly oriented to the social validity perspective. For

example, the categorical structure of the social skills scale is designed to tap into specific domains

of behavior (e.g., communication, cooperation, responsibility) that are important across a variety

of settings. The SSIS Rating Scales also call for ratings of both behavior frequency and behavior

importance. The importance ratings, specifically, allow for the prioritization of intervention

efforts as informed by the ratings of the adult figures (e.g., parents, teachers) who are most aware

11

of the different types of social skills required in specific settings. More detailed information

regarding the design and structure of the SSIS Rating Scales will be provided at the end of the

current review.

Toward a Taxonomy of Social Skills. Theoretical understanding of psychological

constructs is often aided by the identification and clarification of taxonomic structures.

Taxonomies derived through empirical analysis are particularly useful in refining theory

(Achenbach, 1995; Caldarella & Merrell, 1997). For example, in reference to the study of

problem behavior, Achenbach (1995) outlined an empirically based paradigm for assessment and

taxonomy predicated on the quantitative, multivariate analysis of large-sample assessment data.

In application, Achenbach was able to use objective data to demonstrate the existence of various

“syndromes of co-occurring problems” (p. 262), thus contributing to the refinement of theory in

reference to childhood problem behavior. Unfortunately, and despite continued research in the

field, consensus regarding a uniform taxonomy of social skills remains elusive.

In recognition of the need for a taxonomy of positive behaviors, Caldarella and Merrell

(1997) conducted a meta-analytic review to inform the development of a common social skills

taxonomy. Their meta-analytic procedure involved identifying studies in which unique

dimensions of social skills had been derived through quantitative analysis (e.g., factor analysis,

cluster analysis, etc.). Having identified 21 relevant works, reviewers then looked for

commonalities across studies in terms of social skills factors and the specific items that had been

used as indicators of each factor. Results revealed five commonly occurring dimensions of social

skills.

A peer relationship dimension was represented by items referencing pro-social peer

interactions, friendship initiation and maintenance, and sensitivity to social cues. A self-

management dimension included behaviors indicative of self-control, compliance, and tolerance.

An academic skills dimension was found to include behaviors showing compliance and task

12

orientation. A compliance dimension was represented by items referencing rule-following and

good citizenship. Finally, an assertion skills dimension consisted of behaviors showing proper

interaction initiation and situation appropriate assertiveness (Caldarella & Merrell, 1997).

Collectively, the five social skills dimensions reported by Caldarella and Merrell (1997)

along with the behaviors that comprise each dimension provide a useful blueprint for research,

assessment, and intervention purposes. However, these findings are not without limitations.

First, only those dimensions that had been previously identified through published quantitative

research investigations were eligible for consideration. Moreover, only those dimensions that had

appeared in numerous studies made the final listing. Factors existing in less than one-third of the

reviewed studies were automatically excluded. Thus, while the resulting taxonomy accurately

describes trends in previous research, it cannot be said to represent a definitive listing of all

pertinent domains of social behavior.

A bigger limitation of Caldarella and Merrell’s (1997) study stems from the qualitative

nature of the meta-analysis. All reviewed studies were looked at independently, and qualitative

descriptions of the emergent social skills dimensions were compared. This process produced

taxonomic distinctions that were not fully differentiated. That is, upon comparison of the

behavioral characteristics of each social skills dimension put forth in the taxonomy, considerable

overlap across posited dimensions was observed. For example, items listed on the compliance

dimension (e.g., “follows rules”, “follows instructions/directions”) were virtually identical with

items listed on the self-management (e.g., “follows rules, accepts imposed limits”) and academic

dimensions (e.g., “listens to and carries out teacher directions”). Thus, although each of the

individual studies included in the review applied appropriate multivariate analyses in order to

extract clear factors/clusters of social skills behaviors, the more subjective meta-analytic

procedure did not retain clear distinctions between the various factors/clusters.

13

Empirical taxonomies generated through multivariate analysis of assessment data are

clearly helpful in terms of promoting the advancement of scientific knowledge. The process by

which such taxonomies are developed and refined illustrates the reciprocal contributions of theory

development and measurement design. As explained by Achenbach (1995), the process typically

starts with the operationalization of theoretical hypotheses into an initial measurement model.

Once the measurement model is specified, data can be collected and analyzed, allowing for an

examination of the degree to which patterns in the observed data are consistent with original

theory. Subsequent iterations of the process allow for further refinement of both the

measurement model and guiding theory. This methodology has already been applied successfully

within the study of childhood problem behaviors (Achenbach, 1995). Still, it remains to be seen

how well such an approach would serve to inform the study of social skills. At present, there

does not appear to be a commonly posited structure defining the scope of the social skills

construct/domain. As, such, assessment instruments have been created in broad and varied ways

(Matson & Wilkins, 2009). There are general similarities in the structure and content of

commonly measured social skills domains (Gresham et al., 2004; Gresham, 2011). However, the

vast collection of contemporary social skills assessment instruments also show variability in

reference to a number of key features including: target population (e.g., broad versus specific age

groupings), scope and description of behavior domains, and the degree to which behaviors are

defined as context-specific versus context-free (Matson & Wilkins, 2009).

Social Skills in Context. From a behavioral perspective, social skills are identified on

the basis of function. The molecular function of specific social skills is likely to vary, but

collectively social skills have been defined as behaviors that promote positive interactions

(Gresham & Elliott, 2008) and lead to desirable social outcomes (Merrell, 1999). Drawing from

this functional approach, it is clear that social skills should not be identified on the basis of the

topography of behavior, but rather on the probability that the exhibition of a particular behavior

14

will be associated with reinforcement in the social context. Sheridan and Walker (1999)

suggested that any accurate conceptualization of social skills must allow for the potential effects

of contextual factors on the expression of behavior. Contextual variation in the expression of

social skills can occur as a function of ecological factors, the presence/absence of particular social

actors, or at different points in the reciprocal process of social interaction (Sheridan & Walker,

1999).

The effect of contextual factors on the topography of behavior clearly complicates social

skills assessment. Certain methods of assessment can be used to counteract context-specific

effects on behavior. For example, qualifying direct behavioral observations within the context of

the environment in which they are taken promotes the reliable interpretation of data collected in

this manner (Norton, Washington, Peters, and Hayes, 2010). It is also plausible that certain

behaviors and categories of behavior function as social skills across a multitude of contexts. For

example, the aforementioned Caldarella and Merrell (1997) meta-analysis identified five primary

domains2 of social skills that were frequently represented across reviewed studies: peer

relationships, self-management, academic, compliance, and assertion. These five categories were

observed across multiple independent samples. Similarly, qualitative research has reported

“substantial overlap” in the types of behaviors identified as important social skills by teachers,

parents, and student respondents from the second and fifth grades (Warnes, Sheridan, Geske, &

Warnes, 2005). Specifically, behaviors associated with compromise, empathy, assistance, trust,

loyalty, and social engagement were consistently identified as important social behaviors by all

groups of respondents. Still, other reviews have pointed out variation in the social skills construct

as a function of gender, developmental status, and cultural group membership (see Merrell &

Gimpel, 1998).

2 Caldarella and Merrell (1997) use the term ‘dimensions.’ However, given that the observed categories do

not appear to represent a set of unitary, latent constructs, the term domain is applied in the current

discussion.

15

In light of cited findings, the effect of contextual factors on the exhibition of social skills

cannot merely be assumed or dismissed through subjective analysis. Rather, context-behavior

interactions must be empirically examined, and their effects parsed out. At present, research

examining relationships between cultural context (e.g., cultural routines, traditions, beliefs,

expectations) and the development of social skills within specific groups is limited (Matson &

Wilkins, 2009). Cultural factors, by definition, influence norms for social behavior. As such, and

given that socially skilled behavior has been widely viewed as a context-dependent construct,

investigations of the ways in which variation in cultural context affect the composition and form

of social skill behaviors are clearly warranted.

Cultural Influences on Social Skill Development

From an ecological perspective, individual development occurs within contexts framed

by complex and interactive systems (Bronfrenbrenner, 1986). Cultural researchers have applied

ecological frameworks as a means of exploring the many ways in which culture influences

individual development (Rogoff, 2003; Weisner, 2002). Most straightforward, perhaps, are broad

trends observed in reference to group membership. However, the fluid nature of cultural identity

and cultural affiliation within groups represents a more subtle dimension of cultural influence that

also must be considered (Fuller & García Coll, 2010; Halgunseth, Ispa, & Ruddy, 2006). Finally,

influences that emerge as individuals come into contact with various social institutions (e.g.,

schools) that may or may not operate according to the same cultural frameworks governing

interactions in the home or local community settings also warrant consideration (Fuller & García

Coll, 2010; Warzon & Ginsburg-Block, 2008). As noted previously, the body of research on

social skills as they exist within and across culturally diverse populations is relatively limited

(Matson & Wilkins, 2009). As such, the following sections review research in reference to

emerging themes in culturally focused developmental research (see Fuller & Garcia Coll, 2010)

16

in an attempt to establish links between eco-cultural developmental factors and the process of

social skill acquisition and maintenance for Latino children in the U.S.

Traditional Latino Parenting Practices. The developmental experiences of Latino

children and adolescents are directly influenced by the culturally bound practices of their parents

(Fuller & García Coll, 2010; Halgunseth et al., 2006). In a review of research on parenting

practices within Latino families, Halgunseth et al. (2006) identified three traits that Latino parents

often strive to cultivate in their children: familismo, respeto, and educación. Familismo refers to

the prioritization of family interests over those of the individual. Respeto refers to the expectation

that individuals will act in accordance with their own social roles and show respect for the roles

of others with whom they interact. Finally, educación refers to the cultivation of social

responsibility and emotional maturity. Halgunseth et al. (2006) argue that these traits represent

goals that often dictate the parenting strategies selected and employed by Latino parents.

It is worth noting that traits such as familismo, respeto, and educación are not

intrinsically unique to Latino cultures. Instead, it is the degree of emphasis placed on such traits

and the culturally bound parenting practices designed to cultivate their development that render

them uniquely Latino. For example, Okagaki and Frensch (1998) found that Latino parents

prioritized their children’s development of autonomous and conforming behaviors, and monitored

these aspects of child development more closely than did European-American or Asian-American

parents. By contrast, Asian American parents were found to place more emphasis on cognitive

development and educational achievement than were parents from the other two groups (Okagaki

& Frensch, 1998). These results do not necessarily suggest that certain cultures devalue

particular aspects of child development (e.g., setting social development in opposition to

academic development). However, there does appear to be a limit on the degree to which parents

can emphasize all of the various aspects of child development. As such, differential competence

orientations emerge to reflect the specific developmental features prioritized by members of

17

different cultural groups (Rogoff, 2003). For Latino populations, research suggests that

individual competencies, including interpersonal/social competence, are emphasized in

proportion to their perceived importance within a broader interdependent, community-oriented

context (Fuller & García-Coll, 2010; Halgunseth et al., 2006; López, Correa-Chávez, Rogoff, &

Gutiérez, 2010; Okagaki & Frensch, 1998).

In terms of specific socialization practices, research findings showing frequent use of

parenting strategies such physical guidance, verbal direction, and rule setting, have been cited as

evidence that Latino American parents tend to adopt more authoritarian roles than do their Euro-

American counterparts (Halgunseth et al., 2006; Livas-Dlott, Fuller, Stein, Bridges, Mangual

Figueroa, and Mireles, 2010). However, authoritarian parenting in the Latino context does not

carry the same stigma that it has traditionally held within more Eurocentric contexts. Rather,

many of the Latino parenting practices that have been classified under the ‘authoritarian’

archetype include a distinctly positive qualitative feature termed cariño (i.e., caring; Livas-Dlott

et al., 2010). The presence of cariño distinguishes these Latino parenting practices from the

traditional Eurocentric view of authoritarian parenting as cold and lacking compassion

(Baumrind, 1989, as cited in Livas-Dlott et al., 2010). At present, much of the research on Latino

parenting practices has been descriptive and ethnographic, and more research is needed in order

to clarify the effects of Latino parenting practices on various child outcomes. Given the traits that

Latino parents often strive to cultivate within their children (e.g., familismo, respeto, educación),

research examining the discrete social skill behaviors that children employ in demonstration of

such traits is one specific area where additional research is needed.

Within-group Differences: The Effects of Acculturation. Individual differences in

cultural identity and cultural affiliation are influenced by a variety of factors including personal

history, nativity, generational status, language status, geographic locale, community

demographics, and the availability and composition of social support networks. The Latino

18

population of the United States shows considerable within-group variability across a number of

such factors (Fuller & García Coll, 2010). Moreover, cultural identity and cultural affiliation are

not static traits; rather they fluctuate across the lifespan and as a function of developmental

experience. Researchers interested in understanding cultural change as it takes place for

individuals immersed within novel environments (e.g., first generation Latino immigrants in the

United States) have begun to more closely examine the construct of acculturation, defined as “the

process of adopting goals and practices due to exposure to a new culture” (Halgunseth et al.,

2006, p. 1282). Thus, as a caveat to the previous section’s descriptions of Latino parenting

practices, it is important to note that the degree to which Latino children and youth are actually

exposed to such ‘traditional’ cultural practices may vary significantly as a function of different

factors including familial levels of acculturation.

Consider, for example, the results of the ethnographic study conducted by Livas-Dlott et

al. (2010) which were broadly interpreted in support of the hypothesis that Latina mothers tend to

adopt power-assertive strategies when attempting to gain compliance from their children (as

opposed to inductive strategies). Interestingly, when data were disaggregated according to groups

defined by the specific repertoires of parenting strategies employed, trends suggested that Latina

mothers who did incorporate inductive strategies were more likely to be second generation and

more likely to have graduated high school. Sample size limitations precluded testing such trends

for statistical significance, but the implication is that more acculturated Latina mothers may

expand their parenting repertoires to include practices commonly employed in the culturally

diverse communities in which they reside (Livas-Dlott et al., 2010).

In a separate study, statistically significant positive relationships (concurrent and

predictive) were found between immigrant Latino parents’ ratings of positive parenting practices

and family cohesion and their children’s self-rated social problem-solving skills and social self-

efficacy (Leidy, Guerra, & Toro, 2010). As a follow-up to the larger study, 12 immigrant Latina

19

mothers participated in a focus group in order to discuss barriers they identified as impediments

to family cohesion and positive parenting practices. Not surprisingly, barriers included

generational differences in level of acculturation, parents’ inability to actively partake in their

children’s education, limited availability of social support due to immigration history, and

perceptions of vulnerability and discrimination due to residency status. Taken collectively,

results from the pair of studies conducted by Leidy et al. (2010) suggest that traditional Latino

parenting practices do serve to cultivate social competence within Latino children, but that such

practices are inevitably affected by factors related to acculturation and exposure to a novel

culture.

Cultural Interaction and the Effects of Schooling. As children enter school, their

ability to utilize learned skills in order to achieve success in the academic setting varies according

to both the level to which their skills have been developed and the degree to which those skills are

recognized as functional assets in the new setting (Phelan, Davidson, & Cao, 1991). For many

Latino students, school entry represents entry into a novel cultural context. Thus, in addition to

traditional academic and social requirements, Latino students often have the added task of

acclimating to a culturally unique interactive setting. A lack of continuity between home and

school contexts places these students at-risk for poorer outcomes, particularly when the

functionality of acquired skills is compromised.

Language minority status represents one obvious cultural factor that inhibits the

functionality of students’ skills in the school setting. Research findings indicate poorer academic

achievement outcomes for English language learner (ELL) students (see Genesee, Lindholm-

Leary, Saunders, & Christian, 2006; Suárez-Orozco, Gaytán, Bang, Pakes, O’Connor, & Rhodes,

2010). Though research is somewhat limited, language minority status also has been identified as

a risk factor for students’ social-emotional outcomes (e.g., Dawson & Williams, 2008). A study

conducted by Spomer and Cowen (2001) found that Latino students exhibit unique profiles of

20

social competence as a function of language status. Specifically, teacher ratings of a clinic

referred sample showed non-ESL students to have higher overall competence scores than their

ESL peers, with strengths identified in the domains of Assertive and Peer Social Skills.

Interestingly, ESL students evidenced higher levels of teacher-rated Frustration Tolerance,

showing that, although language minority status often represents a barrier to be overcome within

the school setting, it cannot be solely defined as a risk factor. Rather, the ‘risk’ associated with

language minority status develops as a product of the interaction between students’ skills and

their functionality within specific settings (e.g., school).

Another study, conducted by Edl, Jones, and Estell (2008), compared teacher rated

academic and social competence across groups of European American regular education students,

Latino regular education students, and Latino bilingual education students. Though results varied

across time points with respect to differences on specific academic and social competence

outcomes, discriminant function analysis and follow-up statistical contrasts consistently showed

the greatest differences to exist between European American regular education students and

Latino bilingual education students, with the European American students rated as more

competent. Fewer differences were observed between European American and Latino regular

education students (Edl et al., 2008). Collectively, these results offer support for the hypothesis

that the degree of continuity between students’ home and school cultures (e.g., ethnicity,

language use) is an important predictor of teacher rated interpersonal competence in the school

setting.

More research examining the differential effects of cultural consonance and mismatch

across the home and school settings is needed (Galindo & Fuller, 2010). Still, schools that are

better able to adapt to students’ cultural needs remain more likely to promote positive outcomes

for diverse student populations. A recent study of pre-kindergarten students’ adjustment

outcomes found that the frequency and quality of classroom based Spanish language interactions

21

predicted significantly better teacher-rated social skills for Spanish speaking children (Chang et

al., 2007). Similarly, an intervention study conducted with at-risk Latino children and their

parents showed statistically significant social skills gains and concurrent reductions in problem

behavior for students who participated in a mentoring program that included educational

components attended by both students and parents (Barron-McKeagney, Woody, & d’Souza,

2001). Though specific effects were not parsed out in the analyses, the joint participation of

students and parents likely contributed to student gains at post-test, with parental involvement

promoting social skill development in the home setting as well. For both studies (i.e., Barron-

McKeagney et al., 2001; Chang et al., 2007), positive outcomes were associated with processes

that functioned to bridge the gap between home and school cultures.

As evidenced throughout the current review, the dynamic nature of cultural influences on

the social development of Latino students must be considered when evaluating existing research

with this diverse population. Additional consideration must also be made in reference to the

measurement properties of instruments used for the collection of research data. One of the

studies cited above (i.e., Spomer & Cowen, 2001) included an independent analysis of the

validity of scores obtained in reference to their research sample. Other studies tended to utilize

measurements that had not been validated for use with their sample populations. In order to

promote the interpretability of future cross-cultural research, studies examining the reliability and

validity of scores from measurement instruments with specific populations are needed.

Social Skills Assessment

Common methods of assessing social skills include direct behavior observations,

behavior rating scales, clinical interviews, teacher nomination procedures, sociometric

techniques, and self-report measures. From a clinical perspective, multi-method and multi-

informant assessment of students’ social skills is recommended as best practice (see Merrell,

2001; Sheridan & Walker, 1999). Assessment information from multiple informants allows for

22

an examination of an individual’s social skills across contexts, in the presence of various

audiences, and from a variety of perspectives. Similarly, the use of multiple methods of

assessment provides a means of cross-validating information yielded through any single method.

In addition, there is also research to suggest that multi-method assessment promotes a more

complete assessment of the broader construct of social competence (see Odom, McConnell, &

Brown, 2008).

Advocating for a multi-method approach to social skills assessment does not reduce the

need, from a measurement perspective, for gathering evidence to support the validity of data

obtained from individual methods. The current study is specifically concerned with measurement

validity as it relates to the use of standardized behavior rating scales in the assessment of

children’s social skills. Behavior rating scales have been recommended as a cornerstone of social

skills assessment (Elliott, Gresham, Frank, & Beddow, 2008; Elliott, Malecki, & Demaray, 2001;

Merrell, 2001). Social skills rating scales are recognized for convenience of administration and

strong psychometric properties, while at the same time offering a means of collecting data across

settings, from multiple informants, and about a variety of behaviors that are directly applicable

for intervention planning (Elliott, et al., 2008; Merrell, 2001). However, various critiques of

social skills rating scales have also been offered including their lack of sensitivity to small

changes in behavior, the need to qualify ratings based on the perspective of the rater, and the

related complexities involved in aggregating ratings from multiple raters (Elliott et al., 2001;

Elliott et al., 2008; Gresham, 2011). It is also important to note that social skills rating scales’

sensitivity to contextual factors is influenced by the level of specificity with which operational

definitions are applied at the item level (Elliott et al., 2008; Matson & Wilkins, 2009). Given the

widespread use of social skills rating scales, there is a continual need for new investigations

designed to examine evidence supporting (or refuting) the validity of scores produced by such

instruments.

23

Cross-cultural Social Skills Assessment. In a recent chapter discussing diversity

characteristics within the scope of social skills assessment, Norton et al. (2010) continue to

recommend a multi-method/multi-informant assessment approach as a means of identifying the

effects of diversity characteristics on the expression of social behaviors. Specifically, these

authors recommend a combination of clinical interview, direct observation, and self-/other-report

behavior ratings. They also noted that, in the context of cross-cultural assessment, it is important

that clinicians guard against misinterpretation of assessment data by gathering additional

information relative to the cultural norms of the individual and the local norms of the

environment in which behaviors are exhibited. Such information gathering is likely to assist in

guarding against misinterpretation of directly observed behaviors and those reported through

clinical interviews.

In reference to behavior rating scales, clinicians typically make interpretations on the

basis of norm-referenced scores. As such, when using behavior rating scales, clinicians must

consider the design features and technical properties of each instrument in relation to the

individual being evaluated. Specific design features that should be evaluated when considering

an instrument for cross-cultural use include appropriateness of content, language and dialect, and

reading level required of respondents (Norton et al., 2010). Additional technical properties that

must be considered include representativeness of the standardization sample and whether there is

evidence to support measurement equivalence/invariance (Knight & Hill, 1998), which refers to

the degree to which a measurement instrument operates in consistent ways for individuals from

different cultural groups (Vandenberg & Lance, 2000).

Social Skills Improvement System (SSIS)

The Social Skills Improvement System (SSIS; Gresham & Elliott, 2008) Rating Scales

represent a comprehensive revision to the original Social Skills Rating System (SSRS). In an

effort to update, improve, and expand the SSRS, several areas were targeted for revision.

24

Specific revision goals included the addition of new subscales, improved alignment across forms

(i.e., teacher, parent, student), and new procedures for linking assessment results with

intervention procedures. Similar to the SSRS, the SSIS Rating Scales are available in three rater

versions (parent, teacher, and student) and for three age ranges (3-5, 5-12, 12-18)3. After

revision, the SSIS Rating Scales include a Social Skills scale comprised of seven independent

subscales (Communication, Cooperation, Assertion, Responsibility, Empathy, Engagement, and

Self-Control), a Problem Behaviors scale comprised of five semi-overlapping subscales

(Externalizing, Bullying, Hyperactivity/Inattention, Internalizing, and Autism Spectrum), and a

nine-item Academic Competence scale (teacher version only). Parent and teacher forms are

scored on a 4-point scale for frequency (Never, Seldom, Often, Almost Always) and a 3-point scale

for importance (Not Important, Important, Critical; Gresham & Elliott, 2008). More specific

information on the psychometric properties of the SSIS Rating Scales is provided in the Method

section.

The SSIS Rating Scales also feature Spanish language parent and student versions, which

were not available for the SSRS. The Spanish translations were developed in three stages

(Gresham & Elliott, 2008). First, three Spanish speaking psychologists completed independent

item translations. The independent translations were then submitted to a psychological testing

company specializing in test translation, and item retention decisions were made in reference to

content consistency and reading level. Finally, through the standardization process, item-total

correlations and internal consistency reliabilities of Spanish form scores were compared with

English form scores to establish evidence of equivalence across form language (Gresham &

Elliott, 2008). Again, a more thorough description of the psychometric properties of the SSIS

Spanish forms is provided in the Method section.

3 The SSIS Rating Scales, Student Form is only available for 8-12 and 12-18 age ranges.

25

Given its relatively recent publication, independent analysis of the measurement

properties of scores from the SSIS Rating Scales are limited. However, a large body of research

has been conducted with its predecessor, the SSRS. Though differing substantially in terms of

format and content, previous analyses of the SSRS may provide insight into areas that should be

closely examined when evaluating the SSIS. As such, the following section reviews the extant

literature findings referencing the SSRS. A brief review of basic psychometric properties of the

scale is provided first. Results from a group of studies which looked more specifically at aspects

of structural validity in reference to SSRS scores are then presented. Finally, research looking at

the validity of SSRS scores as a function of cultural and language group membership is

examined.

Social Skills Rating System (SSRS). The SSRS (Gresham & Elliott, 1990) was widely

cited as one of the most comprehensive and technically adequate instruments available for the

assessment of children’s social skills (Bracken, Keith, & Walker, 1998; Demaray & Ruffalo,

1995; Merrell & Gimpell, 1998). Through the development and standardization process, the

SSRS authors amassed evidence to support the internal consistency and short-term stability of

scores across the domains of social skills, problem behavior, and academic competence (Gresham

& Elliott, 1990). Validity evidence for SSRS scores was also provided in the form of moderate

correlations with other rating scales designed to measure similar and related constructs (Gresham

& Elliott, 1990). Additional studies conducted by independent researchers have supported the

reliability (e.g., Pedersen, Worrell, & French, 2001) and validity (e.g., Flanagan, Alfonso,

Primavera, Povall, & Higgins, 1996) of SSRS scores with independent samples. The SSRS has

also been used as a criterion measure when seeking to establish concurrent validity evidence for

different instruments (e.g., Crowley & Merrell, 2000; Merydith, 2001). In addition to evidence

supporting appropriate psychometric properties, reviewers have recognized the SSRS for its

integrated multi-rater assessment format (i.e., teacher, parent, student; Merrell, 1999) and

26

straightforward application for intervention planning (Bracken et al., 1998; Merrell & Gimpell,

1998). In sum, the collection of available research suggests that the SSRS is a psychometrically

sound rating scale instrument, which produces reliable scores that are valid for interpretation and

application within an assessment for intervention framework.

Structural Investigations of the SSRS. Although a majority of studies have tended to

evaluate the psychometric properties of the SSRS by looking at global indicators of score

reliability and validity, several recent studies have applied more sophisticated exploratory and

confirmatory analytic methods in order to examine the structural integrity of the SSRS

measurement model. Results of these investigations have offered mixed support for the rating

scales’ proposed measurement structure, with variation occurring as a function of respondent

(teacher v. parent) and developmental grouping (preschool v. school-age). For example, the

factor structure of the teacher and parent versions of the SSRS failed to replicate for a sample of

African American preschoolers attending Head Start (Fantuzzo, Manz, & McDermott, 1998;

Manz, Fantuzzo, & McDermott, 1999). In a separate study, the factor structure of the SSRS

teacher version did replicate for a clinical sample of Dutch children with ADHD, although the

factor structure of the parent version was not supported with data from the same sample (Van der

Oord, Van der Meulen, Prins, Oosterlaan, Buitelaar, & Emmelkamp, 2005). Through

confirmatory factor analysis (CFA) Walthall, Konold, and Pianta (2005) found evidence to

support the SSRS teacher version measurement model with an independent sample of school-age

children. However, CFA results in two separate investigations of the SSRS parent version again

failed to replicate the rating scales’ original factor structure with pre-school (Whiteside,

McCarthy, & Miller, 2007) and school-age samples (Van Horn, Atkins-Burnett, Karlin, Ramey,

& Snyder, 2007), respectively. Though the collection of independent studies of the SSRS cited

here does not speak directly to the validity of the revised and expanded SSIS Rating Scales, it

does underline the need for evidence supporting the new rating scales’ structural adequacy.

27

Cross-cultural Applications of the SSRS. Despite its status as one of the most

frequently recommended and widely used instruments in the domain of social skills assessment,

questions have been raised regarding the validity of the SSRS when used with various

racial/ethnic groups (Fantuzzo et al., 1998; Manz et al., 1999; Van der Oord et al., 2005;

Whiteside et al., 2007). Researchers have suggested that cultural differences between study

participants and the SSRS standardization sample may have contributed to observed structural

differences (Manz et al., 1999). The same researchers went on to suggest that “conventional test

construction methods often do not sufficiently represent economically and ethnically diverse

populations” (p. 305), and therefore may not be appropriate for use with such populations.

Two of the previously cited SSRS studies (Van Horn et al., 2007; Walthall et al., 2005)

applied multi-group confirmatory factor analytic methods to objectively examine measurement

invariance of SSRS scores across groups defined by race/ethnicity. In their analysis of the SSRS

teacher elementary form, Walthall et al. (2005) found that the general form of the SSRS

measurement model exhibited invariance across groups of White and non-White students,

providing tentative support for invariance of the SSRS teacher form measurement model as a

function of students’ racial status. Van Horn et al.’s (2007) evaluation of measurement

invariance for the SSRS parent elementary form was even more comprehensive in that the authors

examined invariance at multiple levels and across more clearly identified racial/ethnic groups:

White, non-Hispanic; African American; and Hispanic. Multi-group CFA results indicated that a

modified version of the original SSRS showed both configural and item-level invariance across

groups. However, the invariant model differed substantially from the original measurement

model proposed by the authors of the SSRS. As such, Van Horn et al. (2007) cautioned against

the use of SSRS normative scores with children from different racial and ethnic groups. Such

results further underscore the need for evidence supporting the structural integrity of the revised

28

SSIS Rating Scales, and suggest that specific attention should be given to the examination of

invariance across groups defined by race/ethnicity.

As it relates to the current study, it is also important to note that the Hispanic sample in

the Van Horn et al. (2007) study was comprised of parent participants who provided data through

English language interviews. Although a substantial proportion of the study’s sample (7%)

elected to complete interviews in Spanish, the authors reported that such data “were eliminated

because later analyses showed measurement differences for those interviews” (p. 172, Van Horn

et al., 2007). Specifics regarding the nature of measurement differences as a function of

interview language (i.e., English vs. Spanish) were not provided. Nevertheless, the fact that

differences were found to exist calls into question the adequacy of SSRS scores obtained from a

Spanish translation of the original rating scales. Given that the SSIS Rating Scales include a

published Spanish version, invariance as a function of language format also requires examination.

Research Questions

The SSIS Rating Scales represent a new and promising instrument for the assessment of

students’ social skills. However, in light of previous findings questioning the viability of the

SSRS measurement model when applied with minority group populations (e.g., Fantuzzo et al.,

1998; Manz et al., 1999; Van Horn et al., 2007), the cross-cultural utility of the SSIS Rating

Scales cannot merely be assumed. Focused, multi-group confirmatory factor analyses are needed

to evidence the structural integrity of the measurement model as a function of race/ethnicity.

With the creation of the new SSIS Rating Scale Spanish language forms (parent and child

versions), invariance as a function of language format must also be examined. In reference to

both grouping factors (i.e., race/ethnicity, language format), structural analysis of the

measurement model is needed in order to examine invariance in the full model and the viability of

individual subscales. The current study aims to address this need with respect to the parent

version of the SSIS Rating Scales. Specific research questions include:

29

1. Does the SSIS Rating Scale, Parent Form (SSIS-PF) measurement model demonstrate

adequate fit with standardization data?

2. Does the best fitting measurement model for the SSIS-PF demonstrate invariance across

race/ethnicity groups?

3. Does the best fitting measurement model for the SSIS-PF demonstrate invariance across

English and Spanish language formats?

4. Are norm-based scores produced by the SSIS-PF appropriate for use with individuals across

race/ethnicity groups?

5. Are norm-based scores produced by the SSIS-PF appropriate for use with individuals across

English and Spanish language formats?

30

Chapter 3

Method

Participants

Data examined in the current study were collected during the SSIS Rating Scale

development and standardization process.4 The full SSIS-PF standardization sample consisted of

N = 4,368 (English Version, n = 3,882; Spanish Version, n = 486). However, in order to control

for possible developmental differences, only data referencing children from the middle age group

(i.e., 5- to 12-year-olds) were considered. Through the SSIS standardization process, a stratified

norm sample was developed in alignment with March 2006 U.S. population estimates for

racial/ethnic group membership across the following groups: African American, Caucasian,

Latino,5 and Other (Gresham & Elliott, 2008). For the current study, data were obtained from the

combination of SSIS-PF English and Spanish language format samples. Demographic data for

the study sample is presented in Table 1.6

Measures

SSIS Rating Scales. The current study examined various measurement properties of

scores from the SSIS-PF (Gresham & Elliott, 2008). SSIS Rating Scales were developed as a

means of evaluating the behavior of children and adolescents across three interrelated domains:

social skills, problem behavior, and academic competence. The current investigation focuses on

the Social Skills domain only, which is comprised of seven subscales: Communication,

Cooperation, Assertion, Responsibility, Empathy, Engagement, and Self-Control. The SSIS–PF

4 Standardization data from the Social Skills Improvement System (SSIS). Copyright © 2007 NCS Pearson,

Inc. Used with permission. All rights reserved. 5 The terms Caucasian and Latino are used here to maintain consistency throughout the document. The

SSIS authors use the terms White and Hispanic, respectively, to refer to these race/ethnicity categories. 6 In reference to the distribution of participants across race/ethnicity categories, all groups are mutually

exclusive. Participants classified as ‘Other’ were removed from the sample prior to analysis.

31

Table 1.

Frequency Distributions for Demographic Variables among Participants from the SSIS Rating

Scales Standardization Sample that were Included in the Current Study.

AA

CA

LA

Total

(English)

Spanish

n (%) n (%) n (%) n (%) n (%)

Sexa

Girls 157 (16) 578 (58) 200 (20) 1000 (50) 164 (51)

Boys 155 (16) 586 (59) 200 (20) 1000 (50) 156 (49)

SESb

1 33 (11) 77 (7) 20 (14) 127 (8) 112 (36)

2 117 (38) 285 (25) 52 (36) 462 (28) 106 (34)

3 135 (43) 396 (34) 52 (36) 548 (34) 45 (15)

4 27 (9) 404 (35) 21 (15) 482 (30) 46 (15)

Region

NE 30 (10) 269 (23) 23 (16) 325 (20) 13 (4)

NC 125 (40) 266 (23) 21 (15) 404 (25) 28 (9)

SO 100 (32) 434 (37) 33 (23) 602 (37) 129 (42)

WS 57 (18) 193 (17) 68 (47) 288 (18) 139 (45)

Note. AA = African American; CA = Caucasian; LA = Latino; SES = socioeconomic status; Region = Geographic

Region; NE = Northeast; NC = North Central; SO = South; WS = West. a Data referencing participant ‘sex’ were not included in the dataset used for analyses. Information reported in the

table was taken directly from the SSIS Rating Scales Manual (Gresham & Elliott, 2008). b Maternal education was used as a proxy for SES during the SSIS Rating Scales standardization process. The

following scaling procedure was used to quantify SES via maternal education: 1 = 11th

grade or less; 2 = 12th

grade

or GED; 3 = 1 to 3 years of college; 4 = 4 or more years of college.

32

items are scored on a 4-point scale for frequency (Never, Seldom, Often, Almost Always) and a 3-

point scale for importance (Not Important, Important, Critical; Gresham & Elliott, 2008).

Analyses for the current study were conducted using frequency data only.

Through the standardization process, the authors of the SSIS gathered evidence to

support the reliability and validity of scores (see Gresham & Elliott, 2008).7 Score reliability was

examined via internal consistency, stability, and inter-rater reliability. Internal consistency

coefficient alpha estimates for the Social Skills subscales ranged from .83 - .92 (Social Skills

Scale, α = .97). Two month test-retest stability coefficients ranged from .68 - .86 across the seven

Social Skills subscales (Social Skills Total Scale, r = .84). Finally, interrater reliability

coefficients ranged from .35 - .70 (Social Skills Scale, r = .62).

Multiple forms of evidence supporting the validity of scores produced by SSIS Rating

Scales also were provided in the technical manual. Construct validity was examined through the

analysis of internal structure. Consistent with guiding theory, moderate to large negative

correlations were observed between scores on the Social Skills and Problem Behaviors scales and

subscales. The Social Skills and Problem Behaviors scales showed a large negative correlation (r

= -.49). Large positive correlations between Social Skills subscales also were observed (r = .42 -

.78). Confirmatory factor analysis (CFA) was conducted during the scale development phase of

the standardization process. CFA results were not thoroughly explained in the technical manual.

Those that were presented indicated “modest overall fit” (Gresham & Elliott, 2008, p. 31) with

the standardization data.

Additional validity evidence in the form of correlations with scores from other rating

scales designed to measure similar constructs also was provided. For Parent Form samples

7 Comprehensive reliability and validity data are offered in the technical manual (Gresham & Elliott, 2008).

Unless otherwise noted, all reliability and validity statistics reported in text refer to data collected from the

5- to 12-year-old Parent Form subsample.

33

specifically, concurrent relationships were examined between scores on the SSIS Social Skills

scale and scores on the SSRS Social Skills scale (r = .73), the Behavior Assessment System for

Children, Second Edition (BASC-2) Adaptive Skills scale (r = .62), the Vineland-II Socialization

scale (r = .44), and the Home and Community Social Behavior Scales (HCSBS) Social

Competence scale (r = .77). Finally, SSIS scores were shown to accurately differentiate between

non-clinical and clinical samples (e.g., Autistic, Attention Deficit/Hyperactivity Disorder,

Emotional Disturbance, and Intellectual Disability groups). Given the focus of the current study,

it should be noted that the validity studies which examined the SSIS Rating Scales’ relationships

with other measures and ability to differentiate between clinical and non-clinical groups were

conducted in reference to predominantly Caucasian student samples. Thus, the generalizability of

such validity evidence across race/ethnicity groups is unknown.

SSIS Rating Scales, Spanish Format. The Spanish language versions of the SSIS

Rating Scales were developed through the application of systematic translation procedures. After

translation was complete, preliminary analyses were conducted to examine the reliability of

scores produced from the Spanish version of the instrument. Item-total correlations and internal

consistency reliability coefficients were calculated and compared with those observed for scores

from the English version. For the SSIS-PF Spanish format, item-total correlations for Social

Skills subscales ranged from .32 - .70, and internal consistency coefficient alpha estimates ranged

from .75 - .84 (Social Skills Scale, α = .95). Though not tested for statistical significance, the

authors concluded that item-total correlations and internal consistency coefficients were similar

across language format. Thus, the reliability of scores produced on the Spanish format SSIS

Ratings Scales was tentatively supported. Specific examinations of Spanish format score validity,

however, were not conducted.

34

Procedures

SSIS Rating Scales standardization data were collected from a national sample of 4,700

children ages 3 through 18 years. Data collection was conducted from September 2006 through

October 2007. Teacher participants were recruited by site coordinators at 115 data collection

sites across 36 states. Participating teachers distributed data collection packets to all students in

their respective classes. Consent forms, demographic information, and Parent Form data were

collected first. Upon obtaining informed consent and Parent Form data, additional Teacher and

Student Form data were collected in alignment with targets for demographic group

representation. Known Spanish-speaking parents were provided with English and Spanish

versions of all data collection forms. Additional Spanish forms were distributed to individuals

identified as Spanish-speaking via returned English language consent forms. For more specific

details regarding SSIS Rating Scale standardization data collection please refer to the technical

manual (Gresham & Elliott, 2008).

Design and Analysis

The current study examined measurement invariance of the SSIS-PF as a function of

student race/ethnicity and rating scale language format (i.e., English/Spanish). Multiple stages of

data analysis were applied to investigate different forms of measurement invariance. The same

procedures were used for investigating invariance across both sets of grouping variables (i.e.,

race/ethnicity and language format). Due to sampling restrictions (i.e., only Latino participants

completed Spanish-language SSIS Rating Scales), the studies of invariance by race/ethnicity and

language format were conducted separately.

Structural Analyses. Primary analyses examined invariance in the SSIS-PF

measurement model across groups defined by race/ethnicity and language format, respectively.

Specifically, iterative CFA procedures were used to examine invariance in the SSIS-PF

35

measurement model across race/ethnicity and language format groups. The CFAs were

conducted using MPlus 6.11 software (Muthén & Muthén, 2011). Item-level data (i.e., 4-point

frequency ratings) were not available for analysis.8 Instead, item parcels were used as indicators

of latent factors. Though not ideal for the analysis of measurement invariance, item-parcels have

the advantage of approximating continuously scaled and normally distributed indicators (Hau &

Marsh, 2004). Modified maximum likelihood (MLM) estimation methods were used for all CFA

analyses.

Parceling Technique. Two unique sets of item parcels were created for use in the

current study. In both cases, items were assigned to parcels according to item-total subscale

correlations, and all parcels were created with as few items as possible (i.e., two items where

possible, three where necessary).9 However, slightly different procedures were used to create

parcels according to the parameters of the two separate invariance studies.

Because three groups were involved in the analysis of invariance as a function of

race/ethnicity, specifying parcels on the basis of systematic differences in item-total correlations

across groups was not feasible. As such, the parcels used in the race/ethnicity invariance analyses

were created by pairing items according to the magnitude of item-total correlations on each SSIS-

PF subscale for the full standardization sample. This strategy applies a rationale similar to that

adopted for Cattell’s radial item parceling technique (1956, as cited in Bandalos & Finney, 2001).

Essentially, items with the strongest relationships to the underlying factor (subscale) were paired

together first, and the process continued such that the final parcel was created with items

demonstrating the weakest presumed factor relationships. Results of the systematic item-

parceling process are presented in Table 2.

8 Citing company policy, Pearson, Inc., publisher of the SSIS Rating Scales, agreed to release

standardization data in parcel format only. 9 For 7-item subscales, one parcel was comprised of three items.

36

Table 2

Summary of Item-Parcel Assignment for Analysis of SSIS-PF Measurement Invariance by Race/Ethnicity and Language Format

Item-Total Correlation Parcel Assignment

Item English Spanish Difference Language Race

Communication

Item #4 .50 .53 -.03 2 2

Item #10 .40 .48 -.08 3 3

Item #14 .50 .55 -.05 3 1

Item #20 .40 .32 .08 1 3

Item #24 .46 .51 -.05 2 2

Item #30 .50 .53 -.03 1 1

Item #40 .43 .48 -.05 3 3

(continued)

37

Table 2 (continued)




Cooperation

Item #2 .64 .64 .00 4 4

Item #7 .66 .66 .00 5 4

Item #12 .56 .61 -.05 6 6

Item #17 .62 .63 -.01 5 5

Item #27 .58 .45 .13 4 5

Item #37 .55 .60 -.05 6 6

(continued)

38

Table 2 (continued)




Assertion

Item #1 .51 .41 .10 7 7

Item #5 .43 .41 .02 8 9

Item #11 .46 .38 .08 7 8

Item #15 .47 .55 -.08 9 8

Item #25 .43 .44 -.01 8 9

Item #35 .43 .46 -.03 9 9

Item #45 .52 .41 .11 7 7

(continued)

39

Table 2 (continued)




Responsibility

Item #6 .65 .63 .02 10 11

Item #16 .55 .54 .01 11 12

Item #22 .63 .67 -.04 12 11

Item #26 .69 .70 -.01 12 10

Item #32 .51 .51 .00 11 12

Item #42 .67 .65 .02 10 10

(continued)

40

Table 2 (continued)




Empathy

Item #3 .61 .49 .12 13 15

Item #8 .70 .65 .05 14 13

Item #13 .54 .60 -.06 15 15

Item #18 .65 .58 .07 13 14

Item #28 .68 .61 .07 14 14

Item #38 .70 .66 .04 15 13

(continued)

41

Table 2 (continued)




Engagement

Item #9 .57 .51 .06 16 17

Item #19 .70 .72 -.02 18 16

Item #23 .60 .59 .01 17 17

Item #29 .50 .45 .05 17 18

Item #33 .64 .56 .08 16 16

Item #39 .57 .60 -.03 18 17

Item #43 .52 .49 .03 17 18

(continued)

42

Table 2 (continued)




Self-Control

Item #21 .64 .58 .06 20 19

Item #31 .62 .53 .09 19 20

Item #34 .62 .53 .09 19 20

Item #36 .55 .50 .05 20 21

Item #41 .57 .54 .03 21 21

Item #44 .49 .48 .01 21 21

Item #46 .63 .56 .07 20 19

Note. Item content has been omitted in compliance with terms of the license agreement with Pearson, Inc. Item numbers reported in table, however, correspond

with the actual numbering of items on the SSIS-PF. Item-Total Correlations refer to those observed for the 5-12 age group of the SSIS-PF standardization

sample, as reported the SSIS Rating Scales Manual (Gresham & Elliott, 2008). Parcel assignment for Language Format (Language) was determined according to

the consistency of item-total correlations across English and Spanish forms. Parcel assignment for Race/Ethnicity (Race) was determined according to the

magnitude of item-total correlations for aggregated English form standardization data.

43

For the invariance analysis across language format groups, item-parcels were

systematically developed in order to minimize the likelihood that measurement noninvariance

would be obscured at the level of individual latent factor indicators. To do this, item-total

correlation difference scores were calculated for each item across the two groups (i.e., English

and Spanish). Once each item’s difference score had been calculated, parcels were created by

grouping together the two (or three) items with the closest difference scores. A statistical

simulation study conducted by Meade and Kroustalis (2006) showed that measurement

noninvariance was least likely to be obscured when differentially functioning items were grouped

within a single parcel. Thus, the process of grouping items into parcels according to the

magnitude of item-total correlation difference scores was done intentionally to isolate potential

sources of noninvariance. In effect, items that were most dissimilar across groups in terms of the

magnitude of their relationship to the presumed latent factor (i.e., subscale) were targeted as

potential sources of noninvariance and grouped together. Finally, in instances when difference

score parceling was inconclusive (e.g., 3+ difference scores of equal magnitude), qualitative

analysis was used to assign items to specific parcels on the basis of item-content (see Table 2).

Assessing Model Fit. Model fit was evaluated through examination of global fit

statistics and modification indices. Fit indices and criteria for determining fit were as follows:

root mean square error of approximation (RMSEA) and 90% confidence interval (RMSEA ≤ .06;

90% CI upper limit ≤ .10); comparative fit index (CFI > .95); Tucker-Lewis Index (TLI > .95);

and standardized root mean squared residual (SRMR < .08; Byrne, 2012; Hu & Bentler, 1999).

In addition to assessing the adequacy of global model fit, the statistical and practical significance

of estimated model parameters also was assessed. T-tests were used to examine statistical

significance of parameter estimates. Practical significance was assessed through a review of

standardized model parameters (i.e., factor loadings and factor correlations). Where necessary,

44

model respecificaiton was guided by an inspection of modification indices and other pertinent

technical output. The need for model respecification was informed by three separate

considerations: (a) the adequacy of initial global fit statistics, (b) the projected significance of

newly estimated parameters, and (c) the substantive meaningfulness of new parameters.

Analysis of Invariance. The multi-step process outlined by Byrne (2012) was applied in

separate sets of analyses in order to systematically test for invariance in the SSIS-PF

measurement model as a function of race/ethnicity (African American, Caucasian, Latino) and

language format (English, Spanish), respectively. Recommendations offered by Vandenberg and

Lance (2000) also were used as a reference during analysis. As a preliminary step in the analysis

of invariance, the viability of the implied measurement model for the SSIS Rating Scales10

was

examined with data from a randomly selected subsample from each dataset. Through this first set

of structural analyses, a best fitting measurement model was specified and retained as a baseline.

The baseline model was then independently fitted to datasets for each group, and the need for

group-specific model respecification was examined. Next, increasingly restrictive constraints

were applied to the baseline model in order to test for configural, metric, and structural

invariance, respectively. The baseline model was first specified for configural invariance (i.e.,

model parameters constrained to match baseline configuration, magnitude of individual loadings

free to vary across groups) across all groups. Borrowing from procedures used by Van Horn et

al. (2007), the magnitude of unconstrained factor loadings for all indicators were compared at this

stage to examine potential loci of measurement invariance. Next, further model constraints were

applied, specifying invariance for the individual indicator-loadings across groups. Finally,

10

The term implied is used here to acknowledge the fact that the SSIS Rating Scales Manual does not

explicitly define/depict a measurement model for the instrument. CFA conducted during the scale

development process used a first-order model with all seven first-order factors specified to covary with

each other. However, the use of a global Social Skills score also implies the presence of a unitary higher-

order social skills factor. As such, both first-order and higher-order models were examined in the current

study.

45

structural components of the model (i.e., factor variances and covariances) were constrained for

equality. At each stage, global fit statistics, individual model parameters, and modification

indices were examined. The statistical significance of differences in model fit between

constrained (nested) and unconstrained models was examined through chi-square difference

testing. Model improvements were made where appropriate. All resulting modifications were

carried through to each successive stage of analysis.

46

Chapter 4

Results

Preliminary Analyses and Data Preparation

Means, standard deviations, and correlations for all variables are presented in Tables 3 -

8. Prior to the initiation of invariance testing, data were screened to check the assumptions of

normality, linearity, heteroscedasticity, and multicollinearity. Bivariate scatterplots supported

linear relations between all variables, and an examination of the bivariate correlation matrix

supported the relative uniqueness of all indicators. Skew and kurtosis did not yield evidence of

extreme non-normality. However, moderate negative skew was observed for all indicator

distributions. The negative skew of indicator distributions also contributed to a moderate level of

heteroscedasticity. As such, the robust MLM estimator was used for all SEM analyses to control

for potential deviations from the assumption of multivariate normality.

Given that item-parcels were used as latent factor indicators for all models, preliminary

comparisons were performed in order to assess the degree to which the use of item-parcels, as

compared to single-item indicators, might influence results. First, internal consistency

reliabilities of the seven social skills subscales were calculated with item-parcel data and then

compared with the corresponding internal consistency reliability statistics reported in the SSIS

Rating Scales Manual. Second, subscale total-scores were calculated with item-parcel data.

Subscale total-score correlations were then compared with manual reported subscale

intercorrelations. As shown in Tables 9 and 10, differences in subscale internal consistency

reliabilities and subscale correlations were generally small, which provides some justification for

the use of item parcels as latent factor indicators in the current analysis. Bivariate correlations

47

Table 3

Means and Standard Deviations for Item-Parcels and Control Variables used in Analysis of Invariance by Race/Ethnicity.

Race/Ethnicity Groups

Full Sample African American Caucasian Latino

M SD M SD M SD M SD

P1 2.21 0.52 2.25 0.57 2.20 0.49 2.22 0.59

P2 2.42 0.55 2.43 0.59 2.41 0.54 2.51 0.60

P3 2.23 0.48 2.13 0.51 2.26 0.46 2.23 0.54

P4 2.15 0.57 2.17 0.62 2.13 0.55 2.26 0.60

P5 2.08 0.55 2.07 0.61 2.07 0.53 2.20 0.6

P6 2.31 0.52 2.29 0.59 2.30 0.50 2.43 0.52

P7 2.20 0.60 2.19 0.69 2.21 0.57 2.21 0.66

P8 2.11 0.58 2.09 0.64 2.11 0.56 2.09 0.62

P9 2.13 0.50 2.15 0.55 2.11 0.48 2.20 0.57

P10 1.95 0.67 1.92 0.74 1.95 0.64 2.08 0.74

P11 2.30 0.60 2.21 0.66 2.51 0.58 2.43 0.62

P12 2.24 0.55 2.13 0.62 2.26 0.52 2.29 0.58

P13 2.36 0.58 2.30 0.63 2.37 0.57 2.42 0.59

P14 2.10 0.64 2.03 0.68 2.11 0.62 2.17 0.67

P15 2.22 0.56 2.27 0.61 2.20 0.55 2.29 0.60

P16 2.09 0.65 2.10 0.72 2.10 0.62 2.10 0.75

P17 2.22 0.55 2.24 0.57 2.21 0.54 2.22 0.60

P18 2.20 0.54 2.20 0.56 2.21 0.52 2.17 0.62

P19 1.73 0.63 1.78 0.68 1.71 0.61 1.83 0.66

P20 1.50 0.65 1.38 0.74 1.51 0.61 1.60 0.74

P21 1.83 0.55 1.71 0.63 1.86 0.51 1.86 0.64

Age

107.28

27.38

107.66

28.09

106.47

27.39

112.94

25.09

SES 2.86 0.94 2.59 0.90 2.97 0.93 2.51 0.91 Note. P1-P21 are latent factor indicator parcels developed for the analysis of invariance by race/ethnicity (see Table 2). Age is child-age in months. SES is

socioeconomic status. Sample sizes are: Full Sample (N = 1619); African American (n = 312); Caucasian (n = 1162); Latino (n = 145).

48

Table 4

Means and Standard Deviations for Item-Parcels and Control Variables used in Analysis of Invariance by Language Format.

Language Format Groups

Full Sample English Spanish

M SD M SD M SD

P1 2.33 0.56 2.34 0.54 2.28 0.66

P2 3.44 0.56 2.42 0.55 2.53 0.58

P3 2.17 0.51 2.15 0.49 2.30 0.58

P4 2.14 0.59 2.10 0.48 2.35 0.63

P5 2.17 0.55 2.13 0.53 2.37 0.60

P6 2.33 0.53 2.31 0.52 2.45 0.57

P7 2.25 0.55 2.20 0.54 2.45 0.53

P8 2.16 0.57 2.15 0.55 2.21 0.66

P9 2.08 0.61 2.04 0.60 2.25 0.65

P10 2.12 0.63 2.09 0.61 2.28 0.68

P11 2.24 0.56 2.24 0.55 2.28 0.60

P12 2.20 0.60 2.16 0.58 2.39 0.66

P13 2.12 0.64 2.07 0.63 2.34 0.63

P14 2.25 0.63 2.24 0.62 2.27 0.68

P15 2.38 0.55 2.37 0.54 2.42 0.60

P16 2.04 0.64 2.01 0.63 2.16 0.66

P17 2.26 0.53 2.26 0.52 2.27 0.57

P18 2.25 0.58 2.22 0.57 2.38 0.61

P19 1.55 0.68 1.50 0.65 1.81 0.74

P20 1.81 0.61 1.75 0.59 2.11 0.64

P21 1.88 0.63 1.86 0.62 1.97 0.70

Age

108.31

26.85

107.28

27.38

113.45

23.46

SES 2.73 1.00 2.86 0.94 2.07 1.05 Note. P1-P21 are latent factor indicator parcels developed for the analysis of invariance by language format (see Table #). Age is child-age in months. SES is

socioeconomic status. Sample sizes are: Full Sample (N = 1941); English Format (n = 1619); Spanish Format (n = 320).

49

Table 5

Bivariate Correlations of SSIS Rating Scale Latent Factor Indicators and Control Variables for Full Race/Ethnicity Invariance

Sample and African American Subsample.

Parcel Indicators – Race/Ethnicity Invariance Analysis

P1 P2 P3 P4 P5 P6 P7 P8 P9 P10 P11 P12

P1 -- .360 .516 .461 .472 .526 .446 .413 .436 .484 .519 .454

P2 .375 -- .370 .390 .332 .476 .318 .345 .346 .425 .363 .407

P3 .560 .361 -- .394 .451 .497 .360 .438 .363 .390 .392 .484

P4 .505 .391 .444 -- .658 .562 .384 .363 .347 .587 .593 .520

P5 .510 .358 .498 .644 -- .591 .326 .398 .355 .566 .490 .572

P6 .554 .422 .534 .562 .595 -- .448 .359 .432 .611 .547 .587

P7 .351 .201 .329 .269 .250 .298 -- .512 .582 .393 .384 .328

P8 .384 .344 .418 .336 .343 .401 .523 -- .550 .467 .387 .421

P9 .441 .327 .378 .325 .338 .394 .555 .522 -- .370 .451 .400

P10 .482 .374 .424 .568 .533 .544 .264 .411 .343 -- .644 .544

P11 .485 .364 .454 .559 .535 .539 .266 .375 .366 .584 -- .564

P12 .445 .350 .508 .544 .594 .561 .235 .366 .317 .523 .531 --

P13 .446 .425 .431 .393 .363 .454 .328 .504 .414 .467 .427 .394

P14 .435 .408 .435 .377 .402 .481 .350 .544 .414 .538 .429 .426

P15 .502 .418 .431 .497 .442 .527 .331 .505 .440 .541 .457 .432

P16 .459 .311 .365 .260 .247 .317 .431 .443 .483 .333 .298 .237

P17 .487 .330 .420 .313 .329 .412 .448 .499 .519 .380 .343 .315

P18 .528 .332 .412 .330 .307 .394 .388 .438 .485 .359 .320 .321

P19 .493 .346 .447 .524 .550 .525 .187 .339 .312 .557 .450 .481

P20 .421 .276 .412 .410 .476 .466 .208 .342 .271 .528 .407 .408

P21 .524 .351 .489 .472 .503 .529 .299 .414 .388 .561 .499 .491

Age .023 -.001 .087 .038 .059 .072 -.077 -.024 -.040 .102 .147 .129

SES .091 .010 .160 .065 .041 .073 .026 .023 -.010 -.003 .040 .098

50

Table 5 continued

Parcel Indicators – Race/Ethnicity Invariance Analysis Control Variables

P13 P14 P15 P16 P17 P18 P19 P20 P21 Age SES

P1 .435 .422 .499 .480 .481 .578 .452 .392 .517 -.004 .043

P2 .479 .413 .414 .299 .331 .314 .344 .260 .335 -.087 .089

P3 .426 .406 .435 .316 .433 .415 .410 .403 .478 .003 .185

P4 .412 .367 .478 .264 .337 .291 .543 .459 .490 .033 .031

P5 .326 .362 .413 .286 .348 .325 .571 .473 .465 .065 .017

P6 .487 .486 .476 .364 .452 .424 .509 .448 .521 .025 .116

P7 .409 .457 .467 .496 .527 .475 .318 .293 .377 -.051 -.010

P8 .504 .508 .593 .412 .533 .473 .378 .377 .445 -.092 .051

P9 .453 .452 .531 .524 .546 .541 .320 .314 .390 -.070 -.009

P10 .524 .576 .542 .363 .425 .395 .562 .556 .628 .046 .110

P11 .409 .474 .511 .354 .434 .332 .447 .569 .496 .068 .047

P12 .447 .407 .453 .237 .356 .341 .532 .426 .541 .127 .125

P13 -- .741 .651 .445 .521 .438 .460 .424 .567 .050 0.87

P14 .728 -- .585 .425 .527 .427 .546 .428 .545 -.001 0.69

P15 .626 .635 -- .432 .526 .464 .454 .405 .547 -.040 .101

P16 .437 .426 .396 -- .675 .639 .321 .309 .384 -.101 .019

P17 .489 .485 .452 .652 -- .595 .363 .379 .423 -.072 -.017

P18 .465 .438 .451 .625 .609 -- .341 .335 .432 -.141 .075

P19 .405 .463 .503 .281 .342 .348 -- .630 .686 .082 .070

P20 .342 .388 .398 .294 .355 .312 .616 -- .584 .144 .049

P21 .475 .527 .511 .363 .403 .411 .665 .585 -- .039 .121

Age .036 .050 .007 -.009 -.068 -.057 .099 .120 .050 -- -.138

SES .027 .032 -.024 .027 -.016 .023 .042 .074 .112 -- Note. Correlations for the full sample (n = 1619) are presented below the diagonal. Correlations for the African American subsample (n = 312) are presented

above the diagonal. P1-P21 are latent factor indicator parcels developed for the analysis of invariance by race/ethnicity. Age is child-age in months. SES is

socioeconomic status.

51

Table 6

Bivariate Correlations of SSIS Rating Scale Latent Factor Indicators and Control Variables for Caucasian and Latino Subsamples in

Race/Ethnicity Invariance Analysis.


P1 P2 P3 P4 P5 P6 P7 P8 P9 P10 P11 P12

P1 -- .392 .464 .492 .484 .515 .350 .217 .410 .458 .490 .389

P2 .378 -- .466 .474 .411 .527 .315 .423 .327 .356 .416 .331

P3 .603 .347 -- .315 .493 .459 .453 .348 .414 .482 .429 .492

P4 .523 .375 .489 -- .672 .552 .284 .219 .391 .563 .591 .501

P5 .530 .355 .521 .633 -- .620 .335 .229 .376 .575 .548 .608

P6 .573 .385 .566 .561 .591 -- .367 .354 .426 .531 .583 .534

P7 .316 .141 .298 .224 .207 .230 -- .462 .556 .367 .324 .343

P8 .402 .333 .423 .346 .344 .426 .537 -- .518 .335 .297 .367

P9 .448 .319 .385 .304 .324 .374 .547 .515 -- .384 .423 .363

P10 .488 .357 .429 .561 .542 .519 .198 .405 .325 -- .644 .484

P11 .479 .357 .476 .544 .547 .528 .214 .384 .328 .552 -- .496

P12 .460 .336 .514 .565 .603 .557 .180 .346 .284 .523 .519 --

P13 .438 .401 .413 .390 .364 .428 .282 .506 .390 .437 .386 .365

P14 .439 .407 .431 .391 .414 .471 .292 .556 .393 .513 .400 .426

P15 .493 .398 .432 .502 .434 .535 .265 .483 .396 .538 .435 .432

P16 .467 .306 .361 .260 .246 .303 .401 .450 .477 .321 .280 .228

P17 .496 .332 .418 .310 .332 .394 .401 .479 .496 .357 .311 .303

P18 .525 .339 .412 .356 .313 .395 .337 .422 .461 .355 .317 .317

P19 .495 .344 .473 .523 .548 .533 .124 .336 .293 .555 .44 .458

P20 .438 .273 .427 .399 .481 .470 .157 .320 .238 .508 .383 .391

P21 .542 .353 .483 .499 .536 .546 .241 .403 .389 .534 .495 .461

Age .023 .014 .109 .015 .053 .079 -.095 .006 -.047 .096 .155 .139

SES .127 -.017 .150 .117 .081 .076 .046 .013 .012 -.008 .041 .089

52

Table 6 continued



P1 .550 .471 .563 .380 .450 .459 .564 .432 .509 .085 .025

P2 .472 .405 .539 .376 .318 .341 .346 .336 .414 .044 .137

P3 .540 .500 .488 .497 .449 .417 .429 .315 .503 .144 .068

P4 .380 .312 .484 .258 .285 .269 .470 .378 .320 .187 -.118

P5 .434 .404 .549 .174 .279 .253 .508 .454 .424 .046 -.102

P6 .563 .538 .592 .311 .461 .356 .500 .485 .484 .096 .068

P7 .444 .470 .446 .459. 584 .509 .302 .304 .448 -.005 -.041

P8 .493 .548 .473 .470 .565 .462 .296 .403 .425 -.084 -.002

P9 .506 .477 .509 .439 .608 .526 .399 .383 .420 .048 0.076

P10 .532 .609 .566 .344 .439 .326 .555 .576 .588 .249 -.166

P11 .547 .502 .515 .299 .396 .339 .517 .374 .515 .253 011

P12 .434 .442 .441 .306 .348 .320 .570 .426 .509 .067 .021

P13 -- .763 .683 .456 .546 .484 .508 .441 .553 .081 .028

P14 .717 -- .614 .544 .563 .505 .497 .490 .643 .044 .032

P15 .615 .660 -- .440 .484 .408 .597 .526 .560 .032 -.060

P16 .434 .410 .378 -- .614 .644 .313 .313 .410 .076 .065

P17 .475 .464 .424 .653 -- .646 .359 .440 .448 -.023 .047

P18 .473 .433 .456 .618 .609 -- .319 .222 .495 -.013 .034

P19 .375 .464 .502 .263 .332 .358 -- .625 .641 .140 -.078

P20 .293 .352 .381 .288 .340 .322 .619 -- .570 .064 -.106

P21 .427 .499 .504 .349 .398 .391 .677 .583 -- .036 .008

Age .025 .065 .013 .009 -.073 -.037 .094 .119 .057 -- -.093

SES .004 .017 -.038 .026 -.018 .002 .069 .102 .107 -.003 -- Note. Correlations for the Caucasian subsample (n = 1162) are presented below the diagonal. Correlations for the Latino subsample (n = 145) are presented

above the diagonal. P1-P21 are latent factor indicator parcels developed for the analysis of invariance by race/ethnicity. Age is child-age in months. SES is

socioeconomic status.

53

Table 7

Bivariate Correlations of SSIS Rating Scale Latent Factor Indicators and Control Variables for Full Language Format Invariance

Sample.


P1 P2 P3 P4 P5 P6 P7 P8 P9 P10 P11 P12

P1 --

P2 .325 --

P3 .479 .406 --

P4 .426 .394 .542 --

P5 .385 .411 .534 .689 --

P6 .466 .433 .581 .606 .598 --

P7 .373 .314 .407 .342 .360 --

P8 .370 .256 .300 .238 .242 .296 .512 --

P9 .401 .361 .406 .383 .349 .426 .529 .443 --

P10 .438 .409 .520 .625 .587 .586 .367 .303 .439 --

P11 .374 .368 .528 .581 .584 .565 .315 .225 .378 .562 --

P12 .452 .423 .523 .663 .589 .589 .350 .298 .429 .788 .553 --

P13 .405 .411 .472 .499 .484 .532 .436 .337 .531 .565 .466 .569

P14 .425 .425 .393 .385 .350 .423 .387 .354 .542 .460 .366 .469

P15 .500 .426 .441 .447 .403 .503 .398 .370 .497 .501 .404 .516

P16 .438 .326 .364 .307 .285 .348 .447 .440 .487 .372 .264 .367

P17 .504 .346 .426 .318 .319 .418 .458 .425 .483 .367 .306 .374

P18 .483 .345 .407 .305 .301 .411 .481 .429 .540 .386 .297 .378

P19 .346 .289 .469 .487 .420 .461 .286 .206 .358 .503 .399 .506

P20 .403 .372 .545 .574 .528 .536 .331 .280 .431 .566 .494 .577

P21 .434 .328 .489 .496 .428 .491 .330 .283 .378 .490 .465 .513

Age .015 -.003 .092 .065 .057 .071 -.106 -.003 .043 .121 .123 .142

SES .151 .000 .053 .017 -.028 .031 -.026 .003 -.038 -.043 .056 -.017

54

Table 7 continued



P1

P2

P3

P4

P5

P6

P7

P8

P9

P10

P11

P12

P13 --

P14 .639 --

P15 .618 .672 --

P16 .438 .468 .439 --

P17 .421 .474 .481 .634 --

P18 .451 .492 .477 .670 .666 --

P19 .443 .336 .385 .343 .345 .339 --

P20 .568 .444 .487 .372 .375 .360 .606 --

P21 .462 .410 .438 .346 .382 .382 .558 .580 --

Age .071 .003 .020 -.037 -.078 -.012 .130 .116 .049 --

SES -.072 .013 .025 -.032 .016 .002 -.002 -.020 .074 -.075 -- Note. Correlations for the full sample (n = 1941). P1-P21 are latent factor indicator parcels developed for the analysis of invariance by language format. Age is

child-age in months. SES is socioeconomic status.

55

Table 8

Bivariate Correlations of SSIS Rating Scale Latent Factor Indicators and Control Variables for English Language Format and

Spanish Language Format Subsamples in Language Format Invariance Analysis.


P1 P2 P3 P4 P5 P6 P7 P8 P9 P10 P11 P12

P1 -- .329 .464 .432 .347 .410 .395 .365 .418 .398 .300 .501

P2 .330 -- .492 .460 .493 .463 .508 .282 .361 .456 .441 .531

P3 .492 .380 -- .581 .559 .634 .481 .309 .434 .609 .536 .611

P4 .439 .372 .522 -- .684 .626 .394 .209 .359 .670 .621 .678

P5 .412 .385 .518 .680 -- .672 .414 .176 .317 .656 .632 .662

P6 .490 .422 .561 .595 .574 -- .427 .254 .441 .602 .584 .621

P7 .384 .268 .378 .311 .327 .363 -- .413 .530 .399 .403 .401

P8 .374 .249 .294 .242 .256 .304 .537 -- .480 .272 .187 .286

P9 .407 .355 .388 .373 .341 .412 .518 .432 -- .366 .382 .430

P10 .458 .393 .489 .607 .561 .576 .346 .307 .446 -- .571 .759

P11 .396 .350 .527 .574 .476 .561 .296 .233 .375 .560 -- .530

P12 .452 .392 .491 .651 .559 .575 .321 .299 .417 .792 .560 --

P13 .417 .404 .449 .477 .462 .527 .407 .333 .520 .555 .466 .549

P14 .423 .419 .380 370 .330 .420 .375 .344 .542 .463 .361 .462

P15 .501 .419 .438 .443 .391 .505 .381 .371 .487 .395 .416 .507

P16 .451 .307 .332 .285 .261 .322 .438 .431 .484 .362 .242 .351

P17 .516 .339 .427 .330 .337 .409 .465 .430 .491 .372 .324 .373

P18 .509 .341 .404 .309 .300 .410 .483 .435 .542 .384 .316 .367

P19 .349 .276 .449 .487 .404 .466 .256 .202 .349 .501 .408 .500

P20 .400 .352 .518 .559 .510 .535 .281 .255 .406 .561 .488 .564

P21 .448 .325 .471 .491 .413 .486 .322 .284 .378 .499 .457 .520

Age .019 -.001 .088 .050 .048 .072 -.133 -.006 .037 .111 .129 .152

SES .153 .010 .110 .069 .037 .073 .029 -.007 .002 .002 .098 .035

56

Table 8 continued



P1 .419 .442 .501 .420 .464 .419 .393 .502 .397 .013 .116

P2 .417 .460 .464 .393 .384 .340 .307 .428 .332 -.057 .074

P3 .533 .446 .453 .468 .430 .387 .496 .610 .539 .060 .012

P4 .540 .457 .471 .357 .281 .227 .417 .568 .499 .069 .045

P5 .508 .443 .453 .335 .262 .245 .398 .516 .467 .024 -.039

P6 .520 .438 .497 .428 .459 .383 .396 .504 .493 .010 .021

P7 .496 .459 .482 .453 .457 .421 .303 .425 .328 -.061 .002

P8 .347 .391 .359 .470 .402 .400 .203 .366 .267 -.006 .095

P9 .529 .554 .543 .472 .462 .495 .318 .459 .349 .013 -.014

P10 .571 .453 .524 .383 .353 .355 .467 .548 .437 .121 -.067

P11 .474 .381 .352 .351 .230 .206 .366 .543 .491 .076 -.056

P12 .611 .509 .56i4 .396 .390 .376 .469 .577 .473 .033 -.011

P13 -- .617 .613 .480 .440 .441 .484 .564 .494 .064 -.059

P14 .651 -- .652 .519 .478 .470 .392 .514 .403 -.081 .020

P15 .623 .676 -- .474 .481 .482 .408 .518 .370 .016 -.006

P16 .420 .457 .430 -- .579 .607 .391 .494 .412 -.022 .018

P17 .422 .473 .480 .649 -- .651 .408 .431 .311 -.128 .072

P18 .443 .498 .475 .680 .673 -- .321 .395 .316 -.068 .032

P19 .416 .324 .380 .321 .334 .329 -- .539 .551 .104 -.044

P20 .552 .436 .486 .333 .370 .336 .604 -- .579 .153 -.010

P21 .450 .411 .453 .326 .399 .392 .556 .580 -- .063 .054

Age .058 .017 .018 -.049 -.070 -.013 .120 .090 .039 -- -.091

SES -.022 .019 .046 -.012 .006 .035 .074 .060 .109 -.046 -- Note. Correlations for the English Language Format subsample (n = 1612) are presented below the diagonal. Correlations for the Spanish Language Format

subsample (n = 320) are presented above the diagonal. P1-P21 are latent factor indicator parcels developed for the analysis of invariance by language format.

Age is child-age in months. SES is socioeconomic status.

57

Table 9

Internal Consistency Coefficients and Intercorrelations for SSIS-PF Social Skills Subscales

based on Race/Ethnicity Item Parcels and Manual Reported Item-level Data.

COMM

COOP

ASRT

RESP

EMP

ENG

SC

Coefficient

Alpha

Communication -- .69 .55 .66 .63 .58 .63 .74

Cooperation .704 -- .47 .78 .57 .42 .67 .83

Assertion .525 .455 -- .48 .59 .65 .46 .75

Responsibility .658 .770 .470 -- .62 .44 .67 .84

Empathy .602 .548 .577 .591 -- .60 .60 .86

Engagement .586 .422 .626 .435 .587 -- .48 .83

Self-Control .593 .645 .413 .674 .577 .430 -- .84

Chronbach’s

Alpha

.704

.828

.767

.772

.852

.825

.819

Note. COMM = Communication; COOP = Cooperation; ASRT = Assertion; RESP = Responsibility; EMP =

Empathy; ENG = Engagement; SC = Self-Control. SSIS-PF Internal Consistency Coefficients and Intercorrelations

based on Race/Ethnicity item parcels are presented below the diagonal. Statistics reported above the diagonal are

taken directly from SSIS Rating Scales Manual.

58

Table 10

Internal Consistency Coefficients and Intercorrelations for SSIS-PF Social Skills Subscales

based on Language Format Item Parcels and Manual Reported Item-level Data.

COMM

COOP

ASRT

RESP

EMP

ENG

SC

Coefficient

Alpha

Communication -- .69 .55 .66 .63 .58 .63 .74

Cooperation .673 -- .47 .78 .57 .42 .67 .83

Assertion .548 .456 -- .48 .59 .65 .46 .75

Responsibility .640 .803 .469 -- .62 .44 .67 .84

Empathy .630 .607 .580 .649 -- .60 .60 .86

Engagement .576 .436 .656 .450 .577 -- .48 .83

Self-Control .600 .661 .434 .660 .596 .463 -- .84

Chronbach’s

Alpha

.659

.831

.754

.848

.848

.851

.789

Note. COMM = Communicaiton; COOP = Cooperation; ASRT = Assertion; RESP = Responsibility; EMP =

Empathy; ENG = Engagement; SC = Self-Control. SSIS-PF Internal Consistency Coefficients and Intercorrelations

based on Language Format item parcels are presented below the diagonal. Statistics reported above the diagonal are

taken directly from SSIS Rating Scales Manual.

59

between latent factor indicators (i.e., item parcels) and select control variables (i.e., child age and

socio-economic status) also were examined prior to model testing (see Tables 5-8). Given the

large sample sizes, statistically significant correlations were observed between a number of the

latent factor indicators and control variables. The magnitude of observed indicator-control

correlations tended to be small. Nevertheless, expanded models with pathways controlling for the

effects of SES and age on the indicator parcels were examined following the completion of

invariance analyses.

No missing values were observed in the dataset. Data were screened for the presence of

univariate and multivariate outliers. Univariate outliers were identified through the calculation of

z-scores for all indicator variables. Several cases contained one or more outlying data points, all

of which were observed at the negative end of indicator distributions (z-scores < -3.5).

Multivariate outliers were identified through the calculation of Mahalanobis distance statistics

(D), which were examined for significance with a Chi-square test (df = 21, p = .001). Initial

baseline models were then fit to three different datasets. A first dataset was comprised of the

random sample for that particular analysis. To create a second dataset, all cases identified as

multivariate outliers were removed from the random sample. Finally, to create a third dataset, all

cases identified as univariate outliers also were removed from the sample. Results were

compared to determine whether the removal of outliers significantly influenced model fit. A

comparison of global fit statistics for initial models fitted to the three datasets is presented in

Table 11. As can be seen in the table, changes in global fit statistics associated with the different

datasets were relatively minor. As such, the following sections describe only those analyses for

which models were fitted to the complete datasets.

Analysis of SSIS-PF Invariance by Race/Ethnicity

As the first step in the analysis of invariance by race/ethnicity, a random sample of cases

(n = 845) was selected to establish a baseline measurement model for the SSIS-PF. A first-order

60

Table 11

Global Fit Statistics for SSIS-PF Baseline Invariance Models fitted to Samples with and without Multivariate and Univariate Outliers.

MLM 2 df Scaling Correction

Factor for MLM

RMSEA

[90% C.I.]

CFI TLI SRMR

SSIS-PF Invariance by Race/Ethnicity

Model 1.1

Random Sample 494.637 168 1.242 0.048

[0.043 – 0.053]

0.959 0.949 0.039

Multivariate Outliers Removed 579.146 168 1.114 0.055

[0.05 – 0.060]

.954 .942 0.041

All Outliers Removed 569.436 168 1.116 0.055

[0.050 – 0.060]

0.951 0.939 0.043

SSIS-PF Invariance by Language Format

Model 3.1

Random Sample 560.275 168 1.201 0.050

[0.045 – 0.054]

0.959 0.949 0.036

Multivariate Outliers Removed 585.872 168 1.121 0.052

[0.048 – 0.057]

0.959 0.948 0.037

All Outliers Removed 559.915 168 1.120 0.051

[0.046 – 0.055]

0.959 0.949 0.036

Note: RMSEA [90% C.I.] = Root Mean Squared Error of Approximation with 90% Confidence Interval; CFI = Confirmatory Fit Index; TLI = Tucker-Lewis

Index; SRMR = Standardized Root Mean Squared Residual. Sample Sizes for Race/Ethnicity Samples are: Random Sample (N = 845); Multivariate Outliers

Removed (N = 807); All Outliers Removed (N = 800). Sample Sizes for Language Format Samples are: Random Sample (N = 949); Multivariate Outliers

Removed (N = 913); All Outliers Removed (N = 906).

61

measurement model with seven distinct social skills domains, Model 1.1,11

was supported by

global fit statistics (MLM 2[168] = 494.637; RMSEA = 0.048, 90% C.I. 0.043 – 0.053; CFI =

0.959; TLI = 0.949; SRMR = 0.039; Figure 1). All freely estimated model pathways were

statistically significant and demonstrated large effect sizes.12

Modification indices did suggest a

number of plausible modifications to further improve model fit, many of which proposed

secondary factor-loadings for indicator parcels. Such a pattern suggests a potential lack of

unidimensionality for SSIS-PF Social Skills factors. This issue is considered more directly in the

analysis of group specific baseline models. However, given evidence of strong global fit, and in

an effort to maintain parsimony and adherence to the original measurement model, further

modification was not performed at this stage.

After completing the first analysis, a second baseline model was specified to test the

plausibility of the unitary Social Skills factor for the SSIS Rating Scales (see Figure 2). Model

1.2, which includes a second-order factor subsuming all seven first-order social skills domains,

showed evidence of reduced fit to the sample data (MLM 2

[182] = 836.604; RMSEA = 0.065,

90% C.I. 0.061 – 0.070; CFI = .918; TLI = .905; SRMR = 0.062; Figure 2). Direct comparison of

Models 1.1 and 1.2 showed a statistically significant reduction in global fit associated with the

more restricted Model 1.2 (ΔMLM 2[14] = 350.967, p < .01). Again, results suggested several

model modifications with the potential to improve the fit of a second-order factor solution. Given

that criteria for global fit were not met, modifications to Model 1.2 were applied in step-wise

fashion. A detailed account of the modification process is provided in the ‘Analysis of SSIS-PF

11

To aid the reader, all models are labeled with two numbers. The first number indicates the set of

analyses (e.g., 1 = Analysis of Invariance by Race/Ethnicity). The second number indicates chronological

sequence. Tables 12, 15, 16, and 18 provide a summary of global fit statistics for each model in each of the

four analyses, respectively. 12

The default settings for MPlus specify unit loading identification (ULI) constraints on the first indicator

of each latent factor in a measurement model. In order to examine the significance/effect (and invariance)

of pathways that were initially fixed, secondary analyses were run with ULI constraints specified for the

second indicator of each factor.

62

Table 12

Global Fit Statistics for SSIS-PF Measurement Model at Successive Stages of Invariance Analyses by Race/Ethnicity.

Model Description/Modification Sample MLM 2 df Scaling

Correction

for MLM

RMSEA

[90% C.I.]

CFI TLI SRMR

Model 1.1 7-Factor, First Order

Random 494.637 168 1.242 0.048

[0.043 – 0.053]

0.959 0.949 0.039

Model 1.2 7-Factor, Second Order

Random 836.604 182 1.239 0.065

[0.061 – 0.070]

0.918 0.905 0.062


(Final Baseline)

Random 520.472 174 1.243 0.049

[0.044 – 0.053]

0.957 0.948 0.040

Model 1.3 Group Baseline AA 311.38 174 1.238 0.050

[0.041 – 0.059]

0.956 0.947 0.041

Model 1.3 Group Baseline

CA 712.524 174 1.178 0.052

[0.048 – 0.056]

0.953 0.944 0.042


LA 284.402 174 1.117 0.066

[0.052 – 0.080]

0.933 0.919 0.051

Model 1.4 Configural Invariance Full 1309.836 522 1.177 0.053

[0.049 – 0.056]

0.952 0.942 0.043

Model 1.5 Metric Invariance Full 1346.006 552 1.165 0.053

[0.048 – 0.055]

0.952 0.945 0.047

Model 1.6 Structural Invariance Full 1394.409 594 1.165 0.050

[0.047 – 0.053]

0.951 0.945 0.076

Model 1.7 Structural Invariance and

Control Variables

Full 1416.140 594 1.142 0.051

[0.047 – 0.054]

0.953 0.940 0.069

Note. RMSEA [90% C.I.] = Root Mean Squared Error of Approximation with 90% Confidence Interval; CFI = Confirmatory Fit Index; TLI = Tucker-Lewis

Index; SRMR = Standardized Root Mean Squared Residual. Sample sizes are: Random Sample (N = 845); African American (n = 312); Caucasian (n = 1162);

Latino (n = 145); Full Sample (N = 1619).

63

Figure 1. First-order baseline model for the analysis of invariance by race/ethnicity, Model 1.1.

Standardized factor loadings resulting from MLM estimation with random sample are reported. P1 – P21

are parcels used in the race/ethnicity analysis.

Communication

P1

Assertion

Responsibility

Cooperation

Empathy

Engagement

Self-Control

P2

P3

P4

P5

P6

P8

P7

P9

P11

P12

P10

P15

P14

P13

P16

P18

P17

P19

P21

P20

.762

.555

.721

.776

.786

.792

.660

.763

.745

.765

.720

.710

.833

.855

.756

.782

.804

.772

.789

.737

.807

.898

.587

.625

.713

.696

.536

.960

.724

.710

.548

.691

.636

.868

.794

.849

.723

.518

.552

.765

.785

.782

64

Figure 2. Higher-order baseline model for the analysis of invariance by race/ethnicity, Model 1.2.

Standardized factor loadings resulting from MLM estimation with random sample are reported. P1 – P21

are parcels used in the race/ethnicity analysis.

Communication

P1

Assertion

Responsibility

Cooperation

Empathy

Engagement

Self-Control

P2

P3

P4

P5

P6

P8

P7

P9

P11

P12

P10

P15

P14

P13

P16

P18

P17

P19

P21

P20

.747

.560

.732

.769

.779

.805

.661

.769

.737

.766

.725

.705

.832

.851

.762

.775

.809

.773

.784

.737

.810

Social Skills

.956

.913

.737

.947

.789

.703

.846

65

Higher-Order Factor Structure’ section that follows the current section. A viable second-order

factor solution was ultimately produced. However, the new model reflects a significant revision

to the model implied by the published version of the SSIS-PF. Given the exploratory nature of

post-hoc model fitting, this revised model requires independent validation before firm assertions

may be made regarding its adequacy. Therefore, as it relates to the current study, the decision

was made to retain the first-order measurement model, Model 1.1, as the initial baseline model

for the analysis of SSIS-PF measurement invariance across racial groups.

Next, independent baseline models were tested for each of the three race/ethnicity

subgroups. Model 1.1 showed adequate fit for both the Caucasian and African American

samples. Fit statistics for the Latino sample were not as strong when compared against those for

the other two groups.13

In addition to comparatively weaker global model fit, the estimated

baseline solution for the Latino sample also failed to produce a positive definite factor covariance

matrix. Technical output indicated a problem with the Empathy factor for the Latino group.

Upon further inspection, results for the Latino group did show stronger correlations between

Empathy and several of the other social skills domains when compared with the same correlations

for African American and Caucasian groups. However, there was no clear rationale for

implementing specific modifications to address the Empathy factor directly.

Estimated factor correlations for Model 1.1 are presented in Table 13. Note that even

though the estimated factor covariance matrices for the African American and Caucasian groups

were shown to be positive definite, several potentially excessive factor correlations were

observed for all three racial subgroups. Factor correlations of substantial magnitude suggest a

13

Small sample size (n = 145) likely hindered model estimation for the Latino group.

66

Table 13

Estimated Latent Factor Correlations for SSIS-PF 7-Factor Baseline Measurement Model by

Race/Ethnicity Group.

African American

COMM COOP ASRT RESP EMP ENG SC

Communication --

Cooperation .872 --

Assertion .794 .657 --

Responsibility .860 .946 .701 --

Empathy .793 .657 .786 .787 --

Engagement .796 .557 .854 .596 .787 --

Self-Control .782 .802 .603 .855 .736 .575 --

Caucasian


Communication --

Cooperation .909 --



Empathy .731 .681 .691 .732 --

Engagement .752 .526 .764 .537 .673 --

Self-Control .792 .828 .506 .818 .659 .585 --

Latino


Communication --

Cooperation .879 --



Empathy .904 .697 .805 .829 --

Engagement .776 .462 .911 .597 .748 --

Self-Control .848 .716 .669 .895 .815 .605 -- Note. COMM = Communication; COOP = Cooperation; ASRT = Assertion; RESP = Responsibility; EMP =

Empathy; ENG = Engagement; SC = Self-Control. Sample sizes are: African American (n = 312); Caucasian (n =

1162); Latino (n = 145).

67

lack of discriminant validity between subscales.14

Given that the estimated correlation between

the Cooperation and Responsibility factors was highest across all groups (African American =

.946; Caucasian = .980; Latino = .952), a subsequent modification was applied such that the six

indicators of Cooperation and Responsibility were specified to load onto a single latent factor (see

Figure 3). The resulting 6-factor model, Model 1.3, was tested as a new baseline with the original

random sample (MLM 2[174] = 520.472, RMSEA = 0.049, 90% C.I. 0.044 – 0.053; CFI = 0.957;

TLI = 0.948; SRMR = 0.040; Figure 3), and then with each of the racial group subsamples.

Differences in global fit statistics for racial group baseline models based on Model 1.1 and Model

1.3 were negligible. Moreover, the Model 1.3 solutions yielded positive definite factor

covariance matrices for all three groups.

The Latino model continued to show the weakest fit of the three group-specific baseline

models. Two modification indices suggested that fit could be improved with the specification of

additional parameters for the Latino baseline model. The first involved a secondary loading for

Parcel 10 on the Empathy factor. The second involved a correlation between residuals for Parcel

18 and Parcel 20. Neither proposed modification was justified from a theoretical perspective.15

Also, given the relatively small predicted improvement in model fit associated with proposed

modifications and the goal of retaining fully invariant baseline models for the initiation of

invariance testing, no modifications were applied to the Latino baseline model at this stage in the

analyses.

14

Evidence supporting the discriminant validity of SSIS-PF subscales was included in the SSIS Rating

Scales’ Manual, with subscale intercorrelations ranging from .42 - .78. However, such values reflect

correlations based on subscale scores that are measured with less than perfect reliability. In SEM/CFA,

latent factors are presumed to be measured without error. Thus, observed factor intercorrelations reflect

‘true’ domain overlap. 15

Although SSIS-PF items are not reproduced in text, item content for specific parcels can be determined

by referencing Table 2. Paraphrasing of item content is used, where necessary, in order to provide an

appropriate description of analyses.

68

Figure 3. Revised baseline model for the analysis of invariance by race/ethnicity, Model 1.3. All

indicators for Cooperation and Responsibility (i.e., Parcels 4-6 and 10-12) assigned to a single latent

factor . Standardized factor loadings resulting from MLM estimation with random sample are reported.

P1 – P21 are parcels used in the language format analysis.

Communication

P1

Assertion

Cooperation and Responsibility

Empathy

Engagement

Self-Control

P2

P3

P4

P5

P6

P11

P10

P12

P8

P9

P7

P15

P14

P13

P16

P18

P17

P19

P21

P20

.759

.552

.726

.762

.772

.788

.747

.716

.713

.660

.763

.745

.832

.854

.759

.782

..805

.771

.793

.737

.803

.897

.611

.725

.696

.535

.678

.723

.794

.691

.537

.550

.764

.821

.781

.710

69

Next, Model 1.3 was respecified for configural invariance across groups. The resulting

Model 1.4 includes parameter constraints specifying that the pattern of variable relationships

remain consistent across African American, Caucasian, and Latino subsamples. The magnitudes

of individual model parameters were freely estimated across groups at this stage. Results of the

multi-group analysis suggested adequate fit for Model 1.4 (MLM 2

[522] = 1309.836; RMSEA =

0.053, 90% C.I. 0.049 – 0.056; CFI = 0.952; TLI = 0.942; SRMR = 0.043). Thus, the general

structure of the measurement model was found to be invariant across racial groups. A follow-up

examination of the standardized and unstandardized path coefficients for Model 1.4 indicated

relative consistency across groups even though the paths had not been constrained for

equivalence (see Table 14). There was only one pathway for which the absolute difference in

unstandardized factor loadings across groups was > 0.20.16

Perhaps more instructively, none of

the standardized factor loadings exhibited absolute differences > 0.10.

Given evidence supporting the configural invariance of the measurement model, further

constraints were applied specifying the magnitude of factor loadings as equal across the three

racial groups. The resulting Model 1.5 continued to demonstrate relatively good fit to sample

data (MLM 2

[552] = 1346.006; RMSEA = 0.052, 90% C.I. 0.048 – 0.055 ; CFI = 0.952; TLI =

0.945; SRMR = 0.047). None of the equality-constrained factor loadings were identified as

contributing to misfit of the model (i.e., all corresponding M.I. values < 10.0). Moreover, a direct

comparison of Models 1.4 and 1.5 found a non-significant difference in global fit (ΔMLM 2[30]

= 27.630, p > .05), which supports retention of the more parsimonious model (Model 1.5). Thus,

results support metric invariance for the modified 6-factor SSIS-PF measurement model across

African American, Caucasian, and Latino race/ethnicity groups.

16

Unstandardized factor loadings for Parcel 8 on the Assertion factor were 1.148, 0.931, and 0.922 for

Caucasian, African American, and Latino subsamples, respectively.

70

Table 14

Unstandardized and Standardized Factor Loadings by Race/Ethnicity for Configural Invariance

Model.

African American

Caucasian

Latino

Scale/Parcel

Estimate

Std

Estimate

Std

Estimate

Std

Communication

Parcel 1 -- .741 -- .787 -- .706

Parcel 2 .774 .555 0.719 .522 .866 .600

Parcel 3 .784 .653 0.871 .731 .887 .679

Cooperation

/Responsibility

Parcel 4 -- .755 -- .762 -- .737

Parcel 5 .967 .744 0.993 .784 1.093 .792

Parcel 6 .978 .772 0.921 .766 .909 .768

Parcel 10 1.252 .797 1.127 .730 1.296 .764

Parcel 11 1.053 .745 .993 .707 1.068 .760

Parcel 12 .978 .737 .901 .725 .923 .699

Assertion

Parcel 7 -- .726 -- .666 -- .706

Parcel 8 .931 .727 1.148 .771 .922 .694

Parcel 9 .843 .770 .945 .737 .917 .753

Empathy

Parcel 13 -- .851 -- .809 -- .860

Parcel 14 1.049 .820 1.149 .855 1.102 .835

Parcel 15 .883 .778 .934 .787 .959 .802

Engagement

Parcel 16 -- .795 -- .792 -- .756

Parcel 17 .820 .821 .889 .812 .877 .837

Parcel 18 .768 .777 .814 .771 .859 .788

Self-Control

Parcel 21 -- .823 -- .830 -- .814

Parcel 22 .974 .729 .863 .719 .996 .729

Parcel 22 .950 .835 .829 .824 .951 .802 Note. Results based on technical output for Model 1.4.

71

As a final step in the analysis of invariance, the structural elements of the model (i.e.,

factor variances and covariances) also were constrained to be equal across racial groups. The

resulting model, Model 1.6, continued to demonstrate adequate fit to the sample data (MLM

2

[594] = 1394.409; RMSEA = 0.050, 90% C.I. 0.047 – 0.053 ; CFI = 0.951; TLI = 0.949; SRMR

= 0.076), though an increase in the value of SRMR is noteworthy, given that this fit index is

particularly sensitive to misspecification of factor covariances (Hu and Bentler, 1998). Still, none

of the newly constrained parameters were identified as contributing to model misfit, and a direct

comparison of Models 1.5 and 1.6 indicated a non-significant difference in global fit (ΔMLM

2

[42] = 48.403, p > .05). Therefore, based on the full set of analyses, the 6-factor model shown in

Figure 3 was found to demonstrate measurement invariance across African American, Caucasian,

and Latino subgroups.

At the conclusion of the analysis, pathways controlling for the effects of SES and age

were added to the final invariance model. Each of these pathways were permitted to vary freely

across groups (i.e., paths were not constrained for invariance). A handful of corresponding

standardized path coefficients reached the minimum threshold for a small effect size (r = .10;

Cohen, 1992). Still, there was negligible change in global model fit when control variable

pathways were included in the structural equation (MLM 2[594] = 1416.140; RMSEA = 0.051,

90% C.I. 0.047 – 0.054; CFI = 0.953; TLI = 0.940; SRMR = 0.069).

Analysis of SSIS-PF Higher-Order Factor Structure

After completing the planned analysis of SSIS-PF measurement invariance as a function

of racial group membership, a set of follow-up analyses were conducted. The primary objective

of the follow-up analyses was to investigate the higher-order factor structure of the SSIS-PF.

Using the implied second-order measurement model for the SSIS Rating Scales (Model 1.2/2.1)

as a starting point, post-hoc model fitting was carried out in order to generate a ‘best-fitting’

model for a random sample of cases (see Table 15). Modifications were performed in a step-wise

72

Table 15

Global Fit Statistics for SSIS-PF Higher Order Measurement Model at Successive Stages of Post-Hoc Model Fitting.


Correction

Factor for MLM

RMSEA

[90% C.I.]

CFI TLI SRMR

Model 2.1 7-Factor, Second Order

Random 836.604 182 1.239 0.065

[0.061 – 0.070]

0.918 0.905 .062

Model 2.2 Assertion/Engagement

residual covariance

Random 715.450 181 1.238 0.059

[0.055 – 0.064]

0.933 0.922 0.056

Model 2.3 Assertion/Engagement on

Proactive Social Skills

Random 715.450 181 1.238 0.059

[0.055 – 0.064]

0.933 0.922 0.056

Model 2.4 Empathy alone; Rename

Responsive Social Skills

Random 618.605 180 1.237 0.054

[0.049 – 0.058]

0.945 0.936 0.047

Model 2.5 Crossloading for

Communication

Random 559.564 179 1.238 0.050

[0.045 – 0.055]

0.952 0.944 0.042

Model 2.5 Cross-validation

Holdout 515.011 179 1.194 0.049

[0.044 – 0.054]

0.957 0.949 0.040

Model 2.5 Group Baseline AA 319.966 179 1.233 0.050

[0.041 – 0.059]

0.955 0.947 0.042

Model 2.5 Group Baseline CA 751.172 179 1.178 0.052

[0.049 – 0.056]

0.951 0.942 0.044

Model 2.5 Group Baseline* LA 291.329 179 1.120 0.066

[0.052 – 0.079]

0.931 0.920 0.054


Index; SRMR = Standardized Root Mean Squared Residual. Sample sizes are: Random Sample (n = 845); Holdout (n = 774); African American (n = 312);

Caucasian (n = 1162); Latino (n = 145). Descriptions marked with an asterisk (*) indicate those analyses for which the factor covariance matrix was not positive

definite.

73

fashion. Once a ‘best-fitting’ model was determined, data from a holdout sample (i.e., those

cases not selected for the initial random sample) were used for cross validation. Finally, given

successful cross-validation, the model was independently fitted to sample data from each of the

three racial subgroups as a preliminary step in the assessment of invariance.17

As a first step, all modification indices produced during the initial analysis of Model 2.1

were examined. Output suggested that the fit of Model 2.1 could be substantially improved

through the inclusion of a freely estimated residual covariance between the Assertion and

Engagement factors. Review of item content for both factors provided conceptual justification

for the proposed residual covariance, as items for both Assertion and Engagement reflect social

skills in the form of proactive behaviors. Given empirical and conceptual support, the

modification was applied (see Figure 4). The resulting Model 2.2 showed evidence of

significantly improved model fit relative to Model 2.1 (ΔMLM 2[1] = 106.215, p < .01), although

global fit statistics still did not meet a priori criteria (MLM 2

[181] = 715.450; RMSEA = 0.059,

90% C.I. 0.055 – 0.064; CFI = .933; TLI = .922; SRMR = 0.056; Figure 5). Next, Model 2.2 was

re-specified in an equivalent format to enhance the overall interpretability of the model. In the

resulting Model 2.3 (see Figure 5), the Assertion and Engagement factors were dropped as

indicators of the original second-order Social Skills factor, and subsequently specified as

indicators of a new second-order factor, Proactive Social Skills. Being equivalent, global fit for

Model 2.3 was identical to that of Model 2.2.

In examining Model 2.3, results suggested the inclusion of a covariance between the

residual for the first-order Empathy factor and the second-order Social Skills factor. Notably, the

standardized estimated parameter change for the proposed covariance was large and negative

17

Given that the revised second-order measurement model was developed through post-hoc model fitting

and has not been independently validated, a full assessment of measurement invariance with the new model

was not appropriate.

74

Figure 4. Modified higher-order factor structure of the SSIS-PF, Model 2.2. Baseline model respecified

to include residual covariance between first-order factors of Assertion and Engagement. Standardized

factor loadings resulting from MLM estimation with random sample are reported. P1 – P21 are parcels

used in the race/ethnicity analysis.

Communication

P1

Assertion

Responsibility

Cooperation

Empathy

Engagement

Self-Control

P2

P3

P4

P5

P6

P8

P7

P9

P11

P12

P10

P15

P14

P13

P16

P18

P17

P19

P21

P20

.745

.559

.734

.769

.780

.804

.666

.751

.753

.765

.723

.707

.832

.852

.761

.778

.815

.764

.788

.739

.806

Social Skills

.948

.931

.694

.966

.770

.659

.853

.624

75

Figure 5. Modified higher-order factor structure of the SSIS-PF, Model 2.3. Specification of a second

higher-order factor, Proactive Social Skills. Standardized factor loadings resulting from MLM estimation

with random sample are reported. P1 – P21 are parcels used in the race/ethnicity analysis.

Communication

P1

Assertion

Responsibility

Cooperation

Empathy

Engagement

Self-Control

P2

P3

P4

P5

P6

P8

P7

P9

P11

P12

P10

P15

P14

P13

P16

P18

P17

P19

P21

P20

.745

.559

.734

.769

.780

.804

.666

.751

.753

.765

.723

.707

.832

.852

.761

.778

.815

.764

.788

.739

.806

Social Skills


.948

.931

.966

.770

.853

.915

.869

.759

76

(- 0.849). A large, negative covariance between the residual for Empathy and its higher-order

factor suggests that the Empathy domain is somewhat unique when compared with the other four

domains that contribute to Social Skills. Based on item content, Communication, Cooperation,

Responsibility, and Self-Control domains all reference behaviors through which an individual

demonstrates compliance with norms for social exchange (e.g., following rules, assuming

personal responsibilities, responding appropriately to the actions of others). Although not

completely unrelated, the Empathy items primarily reflect the act of being perceptive to the

feelings of others (i.e., making efforts to both understand and respond to others’ feelings). Given

this distinction, a new Model 2.4 was specified by dropping Empathy as an indicator of Social

Skills, and subsequently renaming the second-order Social Skills factor as Responsive Social

Skills (see Figure 6). Again, global fit statistics showed evidence of improved fit (MLM 2

[180] =

618.605; RMSEA = 0.054, 90% C.I. 0.049 – 0.058; CFI = .945; TLI = .936; SRMR = 0.047),

which was found to be statistically significant (ΔMLM 2

[1] = 84.99, p < .01).

In examining the need for further model re-specification, output for Model 2.4 suggested

a potential cross-loading for the first-order Communication factor on the second-order Proactive

Social Skills factor. The cross-loading makes conceptual sense when considering that

communication is a primary means of initiating and participating in social interactions.

Moreover, items contributing to the other two Proactive Social Skills domains also contain

elements of communicative behavior. As such, the cross-loading was specified in Model 2.5 (see

Figure 7). Resulting fit statistics indicated strong global fit for the new model (MLM 2

[179] =

559.564; RMSEA = 0.050, 90% C.I. 0.045 – 0.055; CFI = .952; TLI = .944; SRMR = 0.042),

which represented a significant improvement over Model 2.4 (ΔMLM 2

[1] = 68.50, p < .01).

Given these findings and a lack of substantive rationale for further model modification, Model 2.5

77

Figure 6. Modified higher-order factor structure of the SSIS-PF, Model 2.4. Empathy dropped as an

indicator of higher-order Social Skills factor. Standardized factor loadings resulting from MLM

estimation with random sample are reported. P1 – P21 are parcels used in the race/ethnicity analysis.

Communication

P1

Assertion

Responsibility

Cooperation

Empathy

Engagement

Self-Control

P2

P3

P4

P5

P6

P8

P7

P9

P11

P12

P10

P15

P14

P13

P16

P18

P17

P19

P21

P20

.745

.556

.738

.770

.782

.801

.659

.765

.743

.762

.722

.711

.835

.852

.758

.776

.818

.762

.792

.742

.800



.948

.937

.980

.852

.913

.870

.797

.734

.716

78

Figure 7. Modified higher-order factor structure of the SSIS-PF, Model 2.5. Communication specified to

have a cross-loading on Proactive Social Skills factor. Standardized factor loadings resulting from MLM

estimation with random sample are reported. P1 – P21 are parcels used in the race/ethnicity analysis.

Communication

P1

Assertion

Responsibility

Cooperation

Empathy

Engagement

Self-Control

P2

P3

P4

P5

P6

P8

P7

P9

P11

P12

P10

P15

P14

P13

P16

P18

P17

P19

P21

P20

.749

.560

.730

.773

.784

.797

.665

.756

.749

.762

.720

.712

.837

..850

.758

.778

.812

.767

.795

.745

.794



.958

.655

.993

.851

.893

.899

.781

.708

.660

.382

79

was retained as the ‘best fitting’ second-order model for the SSIS-PF based on the randomly

selected sample of standardization data.

Model 2.5 represents a higher-order structural model for the SSIS-PF that appears to be

both conceptually defensible and empirically justified. However, given the exploratory nature of

the post-hoc model fitting process, further validation of this substantially revised model is

needed. As a first step in the validation process, a cross-validation analysis was carried out using

data from the holdout sample (n = 774) that had not been used during the model modification

process. When fitted to data from the holdout sample, Model 2.5 continued to show evidence of

strong global model fit (MLM 2[179] = 515.011; RMSEA = 0.049, 90% C.I. 0.044 – 0.054; CFI =

.957; TLI = .949; SRMR = 0.040). Thus, results tentatively support Model 2.5 as an appropriate

representation of the higher-order structure for the SSIS-PF.

Finally, given the current investigation’s focus on determining measurement invariance,

an initial assessment of the adequacy of Model 2.5 with each of the three racial subgroups also

was performed. Consistent with results from the analysis of first-order factors, Model 2.5 showed

adequate fit for both the African American and Caucasian subsamples, but weaker fit for the

Latino subsample (see Table 15). Moreover, results of the CFA for Model 2.5 with the Latino

subsample yielded a factor covariance matrix that was not positive definite. Specifically, results

indicated a negative residual variance for the Responsibility factor, further implicating lack of

discriminant validity between SSIS-PF Social Skills domains – and between Cooperation and

Responsibility in particular. Though the factor covariance matrices for the African American and

Caucasian groups were positive definite, results for these groups also indicated a high degree of

overlap between the two domains in question. Thus, even though Model 2.5 demonstrated strong

fit when tested with two independent and randomly selected samples, the need for further

examination and potential revision of the higher-order factor structure of the SSIS-PF is

indicated.

80

Analysis of SSIS-PF Invariance by Language Format

Before presenting the results of the analysis of invariance by language format, it is

important to note three key differences that distinguish the ‘language format’ invariance analysis

from the ‘race/ethnicity’ invariance analysis. First, the language format sample includes both

English format and Spanish format cases. The race/ethnicity sample included English format

cases only. Second, the parcel indicators for latent factors were constructed independently for the

two sets of analyses (see Method). As such, parcels are not necessarily comprised of the same

items across the two invariance studies. Third, given that the race/ethnicity invariance analysis

suggested a need for substantial revision to the implied higher-order factor structure of the SSIS-

PF, the analysis of measurement invariance by language format was restricted to an examination

of first-order models only. Aside from these noted differences, both studies were carried out

according to the same set of procedures.

For the initial step in the analysis of invariance by language format, a baseline

measurement model for the SSIS-PF was tested with a random sample of cases (n = 949). The 7-

factor first order measurement model18

, Model 3.1, was supported by global fit statistics (MLM

2

[168] = 560.275; RMSEA = 0.050, 90% C.I. 0.044 – 0.054 ; CFI = 0.959; TLI = 0.949; SRMR =

0.036), although potential for improvement through the specification of additional freely

estimated parameters was indicated. In particular, modification indices supported the re-

specification of Parcel 11 as an indicator of Cooperation rather than Responsibility. The two

items from Parcel 11 appear to be similar in content to several of the items already included on

the Cooperation scale (i.e., behaviors referencing rule following and compliance/task-

completion). Moreover, items from the two remaining parcel indicators of Responsibility also

appear to comprise a cohesive domain (i.e., behaviors referencing awareness and ownership of

18

Model 3.1 is structurally identical to Model 1.1 from the race/ethnicity invariance analysis. However,

item-parcel indicators of latent factors were created differently for the two sets of analyses, resulting in

variation of observed model fit.

81

responsibility for one’s own actions). As such, the modification was applied (see Figure 8). The

resulting Model 3.2 showed evidence of strong model fit (MLM 2

[168] = 481.104; RMSEA =

0.044, 90% C.I. 0.040 – 0.049 ; CFI = 0.967; TLI = 0.959; SRMR = 0.034). In addition, all

freely estimated model pathways were statistically significant and demonstrated large effect sizes.

Further model re-specification was not considered necessary at this stage in the analysis.

The group-specific baseline models for English language and Spanish language

subsamples were analyzed next. Model 3.2 showed good fit for both the English language

sample (MLM 2[168] = 756.605; RMSEA = 0.047, 90% C.I. 0.043 – 0.050 ; CFI = 0.964; TLI =

0.955; SRMR = 0.035) and the Spanish language sample (MLM 2[168] = 269.892; RMSEA =

0.044, 90% C.I. 0.034 – 0.054; CFI = 0.968; TLI = 0.960; SRMR = 0.044). Despite evidence of

strong global fit, technical output for Model 3.2 reported a factor covariance matrix for the

Spanish language sample that was not positive definite. An error message suggested that the

problem involved the Empathy factor.19

However, upon further inspection, no substantive

rationale was found to support making additional modifications to the baseline model. As such,

and given the strong global fit for both English language and Spanish language samples, the

decision was made to retain Model 3.2 as the primary baseline model for both groups. The non-

positive definite factor covariance matrix for the Spanish language subsample was acknowledged

as a significant limitation moving forward.

Multi-group analysis of the baseline model specified for configural invariance between

English language and Spanish language groups, Model 3.3, continued to show evidence of strong

global model fit (MLM 2[336] = ; 1026.602; RMSEA = 0.046, 90% C.I. 0.043 – 0.049 ; CFI =

0.965; TLI = 0.956; SRMR = 0.036). Thus, configural invariance of the SSIS-PF as a function of

19

The Empathy factor also was identified as the source of error when a non-positive definite factor

covariance matrix was observed for the Latino baseline model in the analysis of invariance by

race/ethnicity.

82

Figure 8. Revised baseline model for the analysis of invariance by language format, Model 3.2. Parcel

11 reassigned to Cooperation factor. Standardized factor loadings resulting from MLM estimation are

reported. P1 – P21 are parcels used in the language format analysis.

Communication

P1

Assertion

Responsibility

Cooperation

Empathy

Engagement

Self-Control

P2

P3

P4

P5

P6

P8

P7

P9

P11

P12

P10

P15

P14

P13

P16

P18

P17

P19

P21

P20

.659

.528

.724

.820

.787

.765

.724

.671

.740

.730

.882

.905

.811

.805

.808

.812

.800

.825

.696

.798

.738

.892

.572

.568

.730

.679

.563

.880

.727

.767

.522

.744

.722

.786

.816

.767

..815

.504

.568

.746

.822

.843

83

language format was supported. In the subsequent analysis, specification of cross-group equality

constraints for factor loadings in a metric invariance model, Model 3.4, also indicated strong

global model fit (MLM 2[350] = 1044.487; RMSEA = 0.045, 90% C.I. 0.042 – 0.049; CFI =

0.964; TLI = 0.957; SRMR = 0.037). None of the equality constrained parameters were

identified as contributing to model misfit, and a direct comparison of Models 3.3 and 3.4

indicated a non-significant difference in global fit (ΔMLM 2

[14] = 14.481, p > .05).

For the final step in the assessment of measurement invariance by language format, cross-

group equality constraints were specified for all structural parameters. The resulting Model 3.5

continued to show strong fit to the sample data (MLM 2[379] = 1095.640; RMSEA = 0.044, 90%

C.I. 0.041 – 0.047; CFI = 0.963; TLI = 0.959; SRMR = 0.050). However, a direct comparison of

Models 3.4 and 3.5 indicated significantly weaker global fit for the more restricted model

(ΔMLM 2[28] = 50.8107, p < .01). Stated differently, the fit of Model 3.5 is significantly

improved through the introduction of additional parameters allowing for the unconstrained

estimation of factor variances and covariances across language format groups (i.e., Model 3.4).

Thus, while the modified SSIS-PF baseline Model 3.2 demonstrated evidence of configural

invariance (Model 3.3) and metric invariance (Model 3.4), the structural components of the model

were not shown to be invariant as a function of language format (see Tables 16 and 17).

Finally, having concluded the analysis of invariance by language format, unconstrained

pathways controlling for the effects of SES and age were added to Model 3.4. Again, several of

the newly introduced parameters reached the minimum threshold for a small effect size (r = .10;

Cohen, 1992). Still, there was negligible change in global model fit when control variable

pathways were included in the structural equation (MLM 2[350] = 1034.441; RMSEA = 0.045,

90% C.I. 0.042 – 0.048; CFI = 0.967; TLI = 0.952; SRMR = 0.034).

84

Table 16

Global Fit Statistics for SSIS-PF Measurement Model at Successive Stages of Invariance Analyses by Language Format.


Correction for

MLM

RMSEA

[90% C.I.]

CFI TLI SRMR


Random 560.275 168 1.201 0.050

[0.045 – 0.054]

0.959 0.949 0.036

Model 3.2 Parcel 11 to load on

Cooperation (Baseline)

Random 481.104 168 1.202 0.044

[0.040 – 0.049]

0.967 0.959 0.034


English 756.605 168 1.21 0.047

[0.043 – 0.050]

0.964 0.955 0.035

Model 3.2 Group Baseline* Spanish 269.892 168 1.201 0.044

[0.034 – 0.054]

0.968 0.960 0.044

Model 3.3 Configural Invariance*

Full 1028.289 336 1.205 0.046

[0.043 – 0.049]

0.965 0.956 0.036

Model 3.4 Metric Invariance* Full 1044.487 350 1.197 0.045

[0.042 – 0.049]

0.964 0.957 0.037

Model 3.5 Structural Invariance Full 1095.640

378 1.196 0.044

[0.041 – 0.047]

0.963 0.959 0.050

Model 3.6 Metric Invariance and

Control Variables*

Full 1034.441 350 1.187 0.045

[0.042 – 0.048]

0.967 0.952 0.034


Index; SRMR = Standardized Root Mean Squared Residual. Sample sizes are: Random Sample (n = 949); English Format (n = 1619); Spanish Format (n = 320);

Full Sample (N = 1941). Descriptions marked with an asterisk (*) indicate those analyses for which the factor covariance matrix was not positive definite.

85

Table 17

Unstandardized and Standardized Factor Loadings by Language Format for Configural

Invariance Model.

English Format

Spanish Format

Scale/Parcel

Estimate

Std

Estimate

Std

Communication

Parcel 1 -- .671 -- .599

Parcel 2 .833 .540 .917 .621

Parcel 3 .962 .705 1.128 .759

Cooperation

Parcel 4 -- .814 -- .821

Parcel 5 .866 .771 .965 .823

Parcel 6 .852 .768 .912 .805

Parcel 11 .849 .731 .880 .740

Assertion

Parcel 7 -- .707 -- .725

Parcel 8 .912 .635 1.007 .585

Parcel 9 1.175 .756 1.265 .751

Responsibility

Parcel 11 -- .884 -- .837

Parcel 12 .965 .896 1.010 .879

Empathy

Parcel 13 -- .804 -- .796

Parcel 14 .984 .806 1.089 .792

Parcel 15 .869 .809 .985 .796

Engagement

Parcel 16 -- .796 -- .774

Parcel 17 .839 .810 .868 .788

Parcel 18 .971 .844 .942 .789

Self-Control

Parcel 21 -- .726 -- .681

Parcel 22 1.008 .815 1.052 .829

Parcel 22 .963 .740 .975 .711 Note. Results based on technical output for Model 3.3.

86

Analysis of the Higher-Order Factor Structure for the SSIS-PF English Language Format

and Spanish Language Format

After completing the planned analysis of SSIS-PF measurement invariance as a function

of language format, follow-up analyses were carried out to investigate the higher-order factor

structure of the SSIS-PF for language format groups (see Table 18). The model to be tested,

Model 4.1, was constructed by combining the first-order structure of the language format

invariance baseline model (Model 3.2)20

and the higher-order structure of the ‘best fitting’ model

produced through the first set of follow-up analyses (Model 2.5). Model 4.1 was then

independently fitted to both the English language format and Spanish language format datasets.

Results indicate that Model 4.1 shows strong global fit for both the English language format

(MLM 2[179] = 844.132; RMSEA = 0.048, 90% C.I. 0.045 – 0.051; CFI = 0.959; TLI = 0.952;

SRMR = 0.034; see Figure 9) and Spanish language format (MLM 2

[179] = 309.340; RMSEA =

0.045, 90% C.I. 0.039 – 0.058; CFI = 0.959; TLI = 0.952; SRMR = 0.049; see Figure 10).

However, results for the Spanish language format again produced a factor covariance matrix that

was not positive definite, identifying a specific issue with the Communication factor (i.e., a

negative residual variance). As stated previously, such results suggest a need for further

investigation and refinement of the SSIS-PF higher-order factor structure, particularly as it relates

to the scales’ Spanish language format.

20

The first order structure of Model 3.2 was used given that Model 4.1 was tested using the same parcel

indicators for latent factors that had been used in the language format invariance analysis.

87

Table 18

Global Fit Statistics for Proposed SSIS-PF Higher Order Measurement Model with Language Format Samples.


Correction

Factor for MLM

RMSEA

[90% C.I.]

CFI TLI SRMR

Model 4.1 Modified 7-Factor, Second

Order

Random 528.591 179 1.205 0.045

[0.041 – 0.050]

0.965 0.957 0.037

Model 4.1 Cross-validation*

Holdout 549.564 179 1.236 0.046

[0.041 – 0.050]

0.963 0.956 0.039

Model 4.1 Group Baseline English 844.132 179 1.213 0.048

[0.045 – 0.051]

0.959 0.952 0.037

Model 4.1 Group Baseline* Spanish 309.340 179 1.196 0.049

[0.039 – 0.058]

0.959 0.952 0.049


Index; SRMR = Standardized Root Mean Residual. Sample sizes are: Random Sample (n = 949); Holdout (n = 992); English Format (n = 1619); Spanish Format

(n = 320). Descriptions marked with an asterisk (*) indicate those analyses for which the factor covariance matrix was not positive definite.

88

Figure 9. Revised higher-order factor structure adapted to include first-order structure from the language

format invariance analysis, Model 4.1. Standardized factor loadings for MLM estimation with English

language format subgroup are reported. P1 – P21 are parcels used in the language format analysis.

Communication

P1

Assertion

Responsibility

Cooperation

Empathy

Engagement

Self-Control

P2

P3

P4

P5

P6

P8

P7

P9

P10

P12

P11

P15

P14

P13

P16

P18

P17

P19

P21

P20

.671

.545

.701

.813

.769

.770

.711

.639

.751

.730

.884

.896

.803

.805

.811

.728

.816

.738

.728

.816

.738



.947

.659

.885

.878

.928

.884

.794

.766

.669

.408

89

Figure 10. Revised higher-order factor structure adapted to include first-order structure from the

language format invariance analysis, Model 4.1. Standardized factor loadings for MLM estimation with

Spanish language format subgroup are reported. P1 – P21 are parcels used in the language format

analysis

Communication

P1

Assertion

Responsibility

Cooperation

Empathy

Engagement

Self-Control

P2

P3

P4

P5

P6

P8

P7

P9

P10

P12

P11

P15

P14

P13

P16

P18

P17

P19

P21

P20

.598

.631

.750

.814

.820

.816

.716

.596

.753

.738

.835

.881

.792

.794

.798

.776

.784

.791

.680

.829

.712



.934

.740

.942

.877

.984

.861

.875

.808

.703

.353

90

Chapter 5

Discussion

Overview

The primary objectives for the current study were to examine: (a) the measurement model

of the SSIS-PF, and, if necessary, alternative measurement models for the instrument; (b)

invariance of the SSIS-PF measurement model across race/ethnicity subgroups; and (c)

invariance in the SSIS-PF measurement model across language format subgroups. Results

provided mixed support for the general measurement model of the SSIS-PF. Analysis of the

higher-order structure, in particular, revealed discrepancies between the original model and the

best-fitting model from the current study. However, first-order measurement models were

supported and demonstrated invariance across groups defined by race/ethnicity and language

format.

Primary Findings

Factor Structure of the SSIS-PF. Both first-order and higher-order factor structures

were tested for the SSIS-PF. The full seven-factor first-order structure was upheld through two

iterations of the same analysis (i.e., one for each of the invariance studies). However, results

from both iterations indicated a lack of discriminant validity between several social skills

domains, most significantly the Cooperation and Responsibility domains. Discriminant validity

concerns were mitigated through model modification. In the race/ethnicity analysis, the

Cooperation and Responsibility indicators were collapsed onto a single factor. In the language

format analysis, reassignment of one indicator from the Responsibility factor to the Cooperation

91

factor produced an optimal first-order model.21

Thus, with minor modification, the proposed

first-order measurement structure of the SSIS-PF was supported.

By contrast, the higher-order factor structure of the SSIS-PF showed clear evidence of

reduced fit relative to its first-order counterpart. Thus, the hypothesis of a unitary Social Skills

construct subsuming all seven first-order domains was not supported. Systematic respecification

of the original higher-order model, guided by a combination of empirical and theoretical

considerations, did ultimately yield a revised higher-order structure that explained the data quite

well. The observed adequacy of the alternate model must be qualified with an acknowledgement

of the exploratory nature of post-hoc model fitting (i.e., potentially capitalizing on random

variation in the sample data), although successful cross-validation with an independent holdout

sample counter-balances such concerns.

The alternate higher-order factor structure of the SSIS-PF frames the social skills

construct in a more complex, multi-dimensional conceptualization than that implied by the

published version of the rating scales. In the revised model, two separate higher-order social

skills factors were indicated. A first higher-order factor, Responsive Social Skills, was shown to

encompass behaviors that reflect an awareness of and adherence to accepted norms for social

exchange. Primary domains under the Responsive Social Skills factor included Cooperation,

Responsibility, and Self-Control. A second higher-order factor, Proactive Social Skills, was

reflected in behaviors used to initiate, redirect, or otherwise influence social context. Primary

domains under the Proactive Social Skills factor included Assertion and Engagement. Two first-

order domains, Communication and Empathy, did not align directly with either of the newly

posited higher-order factors, albeit for different reasons. The Communication domain actually

demonstrated substantive loadings on both higher-order factors. Although less than ideal from a

21

Due to differences in the item-composition of indicator parcels, the same indicator reassignment could

not be specified during the race/ethnicity analysis.

92

measurement standpoint, the dual loading appears to be conceptually justified when considering

that communicative behaviors are applied in both responsive and proactive ways. In contrast, the

Empathy domain was not assigned to either of the higher-order factors, with results suggesting

that Empathy instead represents a distinct aspect of social behavior.

SSIS-PF Measurement Invariance: Race/Ethnicity. Given that preliminary findings

suggested a need for revision to the proposed higher-order structure of the SSIS-PF, subsequent

invariance analyses were restricted to an examination of first-order models only. For the analysis

of invariance as a function of race/ethnicity, the six-factor baseline model demonstrated

configural invariance, metric invariance, and structural invariance across groups of African

American, Caucasian, and Latino participants. Collectively, such results indicate that the SSIS-

PF is consistent in the way in which it measures first-order social skills domains across the three

race/ethnicity groups under study.

It is important to note that the use of a six-factor model – as opposed to the original

seven-factor model – was necessary due to the significant redundancy between Cooperation and

Responsibility factors across each of the race/ethnicity samples. Despite this modification, the

remaining six factors consistently reflected the same social skills domains across all three groups.

Moreover, the indicators used to reflect the six social skills domains also demonstrated

relationships of consistent magnitude with the hypothesized social skills domains regardless of

whether the participants were African American, Caucasian, or Latino.

The observation of invariance at the structural level (i.e., invariant factor variances and

covariances) suggests that the amount of intra-group variability observed for each social skills

domain and pattern of relationships among the full set of social skills domains also were

equivalent across race/ethnicity groups. Such structural invariance is not necessarily critical to

the assertion of general measurement invariance or, subsequently, the validity of scores for

members of different groups. Groups could reasonably be expected to differ in the amount of

93

within-group variation for a particular construct, or in the way in which multiple constructs relate

to one another (Vandenberg & Lance, 2000). Nevertheless, the finding of structural invariance in

the current analysis is informative in that it suggests that first-order social skills domains from the

SSIS-PF relate to each other in similar ways across race/ethnicity groups. This finding has

implications for the measurement of higher-order social skills construct(s) as well. From an

SEM/CFA perspective, first-order domains serve as indicators of higher-order constructs.

Therefore, invariance of structural parameters within first-order measurement models (i.e., factor

variances and covariances) indicates potential invariance for higher-order measurement structures

(i.e., second-order factor loadings), provided that the second order model has been appropriately

conceptualized and sufficiently validated.

SSIS-PF Measurement Invariance: Language Format. For the analysis of invariance

by language format, the seven-factor baseline model demonstrated configural invariance and

metric invariance. Thus, the first-order measurement structure and the magnitude of indicator

loadings for the revised baseline model of the SSIS-PF were invariant across English language

and Spanish language groups. Such results suggest that the Spanish translation of the SSIS-PF

retains the same first-order measurement structure as the original English format. Moreover, the

factor indicators also reflect the same domains to the same degree regardless of the language in

which scale items are presented. The study did not support structural invariance of the SSIS-PF

across language formats, which suggests that the pattern of variances and covariances of social

skills domains were not equivalent for English language and Spanish language groups. As noted

previously, such structural invariance is not necessarily critical to the assertion of general

measurement invariance, at least as it pertains to the first-order measurement model under study.

Still, the lack of invariance for SSIS-PF structural parameters across language format groups is

noteworthy given that structural invariance was supported across race/ethnicity groups.

Unfortunately, due to design limitations, it remains unclear as to whether the difference in

94

structural parameters between language format groups is linked directly to the language of the

measurement instrument/respondents, broader cultural differences of the participants included in

the respective language format samples, or simply the use of discrepant baseline models in the

two analyses.

Interpretation of Primary Findings in the Context of Prior Research

SSIS-PF Structure and the Social Skills Construct. As noted previously, the seven

factor first-order measurement structure of the SSIS-PF was tentatively supported, which fits with

general findings suggesting that the broad set of behaviors traditionally viewed under the label

‘social skills’ can be reliably classified into a set of unique domains (Caldarella & Merrell, 1997).

However, prior research also has been inconsistent in its application of social skills

measurement/classification systems (Matson & Wilkinson, 2009). Despite the efforts of a few

researchers (e.g., Caldarella & Merrell, 1997), there remains no uniformly accepted taxonomy of

social skills. As a result, distinctions between social skills domains have not been clearly defined,

which may explain the mixed evidence of discriminant validity among several first-order factors

from the SSIS-PF.

With some modification (e.g., indicator reassignment), concerns regarding discriminant

validity for the SSIS-PF subscales were reduced. And, it should be emphasized that issues of

discriminant validity are not unique to the SSIS-PF, but rather seem to represent a more pervasive

obstacle within the field of social skills assessment. For example, Caldarella and Merrell’s

(1997) preliminary attempt to develop a taxonomy of social skills was similarly limited, as their

taxonomic categories, based on the five most commonly identified social skills ‘dimensions’

observed in prior research, also showed evidence of considerable overlap. Thus, taken in the

context of the larger body of research, current findings underscore the need for additional

research – particularly that which applies focused, systematic, multivariate analytic procedures

(Achenbach, 1995) to refine understanding of first-order social skills domains.

95

Further study of a higher order structure for the social skills domain also is warranted,

given current results suggesting that a unitary social skills factor may not adequately account for

all identified first-order social skills domains on the SSIS-PF. In fact, with its expansion, the

authors of the SSIS Rating Scales may have unintentionally introduced the instrument’s

assessment of a secondary higher-order social skills factor, labeled Proactive Social Skills in the

current study. The observed higher-order Proactive Social Skills factor is particularly noteworthy

when considered in concert with findings reported in a relatively recent CFA study of the SSRS,

which asserted the presence of a previously unidentified first-order ‘Extroversion’ factor (Van

Horn et al., 2007). The emergence of a Proactive Social Skills factor for the SSIS-PF may reflect

the revised instrument’s expanded coverage of an ‘extroverted’ social skills domain (e.g.,

Assertion and Engagement). Supporting this interpretation, items assigned to the Proactive Social

Skills factor of the SSIS-PF are similar in content to those assigned to the Extroversion factor of

the SSRS identified by Van Horn et al. (2007). The construct validity of the newly posited

Proactive Social Skills factor is further bolstered by the presence of comparable factors (e.g.,

Assertiveness, Interest/Participation, Assertiveness-Prosociablity, etc.) on a variety of alternative

social skills measures (see Matson & Wilkins, 2009).

Finally, current results also indicate the need to further examine the role of empathy as it

relates to the broader domain of social skills. Though the Empathy factor from the SSIS-PF did

share significant relationships with both the Responsive and Proactive Social Skills factors, the

best fitting measurement model was observed with Empathy being included as a stand-alone

factor. Therefore, empathy appears to reflect a unique domain of behavior within (or perhaps

related to) the social skills domain. It has been suggested previously that empathy, as a domain

of social skills, tends to emerge at a later stage of development once individuals have acquired

more advanced cognitive and emotional perspective-taking abilities (Merrell & Gimpel, 1998).

Therefore, developmental characteristics of the sample used in the current study, being comprised

96

of children between the ages of 5 and 12, also may have influenced current findings relative to the

Empathy domain.

Measuring Social Skills across Cultural Groups. An overarching goal for the current

investigation was to examine the degree to which the social skills construct, as operationalized by

the SSIS-PF, is generalizable to the Latino population(s) of the United States. Pursuit of this goal

prompted separate analyses of measurement invariance by race/ethnicity and language format,

respectively.

The current analysis of invariance by race/ethnicity is the first to have been conducted

with the SSIS Rating Scales. However, similar studies were conducted with the instrument’s

predecessor, the SSRS. One such study looked at invariance in the teacher version of the SSRS

across groups of White and Non-White participants (Walthold et al., 2005), and a second study

examined measurement invariance in the parent version of the SSRS as a function of several

different grouping variables including race (i.e., African American, Caucasian, and Hispanic; Van

Horn et al., 2007). Both previous studies were consistent in their support for measurement

invariance across racial groups, which is also consistent with findings from the present study of

the SSIS-PF. Thus, collectively, results suggest that social skills can be measured reliably and

consistently across broadly defined racial/ethnic groups.

In reference to language, current results support configural and metric invariance in the

first order SSIS-PF measurement model across English and Spanish language format groups.

Thus, the nature of the seven first-order SSIS-PF domains and the behaviors selected to reflect

these domains remained intact through the translation process. However, a lack of structural

invariance for the SSIS-PF across language format groups suggests potential differences in the

way first-order social skills domains hold together for individuals from English speaking and

Spanish speaking groups.

97

Interpretation of structural variability for the language format analysis is complicated due

to the limited extant research on standardized assessments of social skills with Spanish-speaking

populations, particularly as it relates to the nature of underlying constructs. Ethnographic

research findings do suggest that traditional Latino child rearing practices emphasize the

cultivation of specific, socially relevant qualities that may not have direct correlates in other

cultures (e.g., familismo, respeto, educación; Halgunseth, 2006). However, these terms also have

yet to be adequately operationalized for measurement purposes. Still, it may be that qualities

such as familismo, respeto, and educación actually constitute unique combinations of social

skills domains and/or items already represented on the SSIS-PF. For example, the Empathy

domain was identified as a source of substantial redundancy in the original first-order

measurement model for both the Spanish language format group in the language invariance

analysis and the Latino subsample of the English language format group in the race/ethnicity

study. Thus, it may be that Empathy is a more pervasive element of the social skills construct as

cultivated in Latino/Spanish-speaking cultures when compared with non-Latino/English-speaking

cultures in the United States. At present, such an interpretation is a working hypothesis that

needs to be empirically tested in future studies.

Despite the complexities of interpretation, the collection of current results indicates that

the social skills construct can be measured with consistency across cultural groups defined by

race/ethnicity and language. Such findings are clearly promising in terms of potential uses for the

SSIS-PF as a measurement instrument. However, there also appears to be some inconsistency

with eco-cultural developmental theory (e.g., Rogoff, 2003) in terms of the lack of observed

variation in the nature of the social skills construct across groups. Specifically, the behavioral

construct of social skills likely would be expected to show some variation across groups as a

function of differences in community based practices, beliefs, and traditions. To explain this

apparent inconsistency, it is helpful to consider that several factors may have contributed to

98

findings of measurement invariance in the current study. First, the SSIS-PF utilizes parent

judgments of behavior frequency, as opposed to direct observations of operationally defined

behaviors. Moreover, all SSIS-PF items are not necessarily designed to assess behaviors at the

molecular level (Elliott et al., 2008). Therefore, differences in the topography of specific

behaviors enacted by individuals from different groups may not result in measurement differences

on the SSIS-PF as long as the behaviors achieve the same function and are judged as such by

parent respondents. Second, the variability in cultural identity and affiliation for individuals

within groups, and the similarities among individuals from different groups based on their

exposure/interactions with the same social networks and community-based institutions also may

have reduced the likelihood of observing measurement non-invariance. Thus, while current

findings of measurement invariance for the SSIS-PF are important and have various implications

for research and practice, further study of social skills within and across various cultural groups is

still needed.

Limitations and Future Directions

Limitations of the current study can be broadly classified into two categories. A first set

of limitations arises from the format and composition of the raw data file that was used for

analysis. A second set reflects shortcomings relative to the general design of the study. Both sets

of limitations need to be addressed through future research.

Data Limitations. Perhaps the most significant limitation of the current study is the fact

that item-level data were not available for CFA. Instead, items were combined into parcels,

which in turn, were used as the observed variables in all analyses. Parcel indicators are actually

well suited to CFA when examining rating scales that employ ordinal item-level scores. The

process of grouping items into parcels typically results in observed variable distributions that

approximate normality more closely than their individual item counterparts (Hau & Marsh, 2004).

As was the case in the current study, traditional methods of estimation (e.g., ML, MLM) can

99

often be applied when item parcels are used as indicators. However, grouping items into parcels

also has several drawbacks. Most notably, when parcel indicators are used, the adequacy of

individual items cannot be directly examined (e.g., item reliability, factor loadings, factor

assignment). Moreover, item parceling precludes an examination of invariance at the item level.

In the current study of invariance, item-level analysis would have facilitated a direct comparison

of parameters for the original English-format items from the SSIS-PF and their translated

Spanish-format counterparts. Instead, the use of item-parcel indicators in the analysis of

measurement invariance may have reduced the likelihood of detecting true non-invariance at the

indicator level for both the race/ethnicity and language format studies (Meade & Kroustalis,

2006).

Steps were taken to address limitations related to the use of item parcels. First, all parcels

were developed according to specific procedures designed to either optimize parcel

unidimensionality or minimize the likelihood that non-invariance would be obscured at the

indicator level. In addition, as a means of assessing the effects of item parceling on subscale

measurement properties, item-generated and parcel-generated subscale internal consistency

reliability coefficients and intercorrelations were compared. Minimal differences were observed

upon comparison, providing some added justification for the use of item parcels. Nevertheless,

future investigations of invariance with item-level data from the SSIS Rating Scales are

warranted. Through such efforts, researchers would be able to examine the instrument’s general

measurement model more closely in terms of the adequacy of individual items and their

assignment to first-order domains. Similarly, item-level analysis would permit a more precise

examination of metric invariance, with the potential to inform understanding of similarities and

differences in specific social skills behaviors and their respective contributions to first-order

domains across groups.

100

A second limitation pertaining to the dataset used for the current study involves the

observed sample sizes of certain subsamples. The full sample was appropriately large in

reference to recommendations for CFA, and all groups were proportionately represented in the

sample in accordance with March 2006 U.S. population estimates (Gresham & Elliot, 2008).

However, while the Caucasian subsample remained sufficiently large when group data were

disaggregated, the African American and Latino/Spanish-format subsamples were only

moderately sized, and the Latino/English-format subsample was small. Thus, the stability of

parameter estimates and accuracy of global fit statistics for these latter groups cannot be asserted

with as much confidence as can those for the Caucasian group. All findings concerning the

Latino/English-format subsample specifically should be qualified as tentative in light of the small

sample size for this group.

An additional limitation relative to the dataset used for analysis involves the lack of select

demographic data. First, the dataset provided by the SSIS publisher did not include the sex of the

student participants. It is important to note that the standardization sample for the SSIS Rating

Scales was stratified according to sex, though, with equal representation of girls and boys in the

sample. Thus, it is likely that the distribution was similar in the current sample. Perhaps more

directly relevant to the current study, information was not available regarding the language status

and acculturation of participants. In order to examine the social skills construct across cultures

more precisely, data accounting for these more continuous elements of ‘culture’ should be

considered in future analyses.

Design Limitations. As initially conceptualized, the identification of a best-fitting

measurement model for the SSIS-PF was sought to serve as a baseline for the analysis of

invariance by race/ethnicity. Similarly, the final model from the race/ethnicity invariance

analysis was intended to be used as the baseline for a within group assessment of invariance by

language format. However, due to the collective impact of a number of factors – several of which

101

have been mentioned in the preceding section – adjustments to the original design and sequence

of analyses were required. As a result, analyses did not build from one to the next as efficiently

as possible.

First, a lack of item-level data hindered the study’s pursuit of a single ‘best- fitting’

measurement model for the SSIS-PF. The subsequent need to employ two different sets of item

parcels for the race/ethnicity and language format invariance studies, respectively, meant that

findings from one study were not necessarily directly comparable to the next. For example, the

baseline models tested for invariance differed across the two studies with respect to the number of

first order factors that were included. In addition, sample size restrictions precluded the use of

specific analyses intended to parse out the effects of language format and racial/ethnic group

membership on structural parameters for the SSIS-PF measurement model. As such, another

direction for future research is a study through which measurement invariance can be examined

across race/ethnicity and language format via a single set of iterative analyses (e.g., Non-

Latino/English-format; Latino/English-format; Latino/Spanish-format).

A second caveat to the interpretation of results for the current study concerns the nature

of measurement invariance as a psychometric property that is inferred on the basis of a collection

of evidence. Indices of global fit for equality-constrained models are typically used as primary

sources of evidence. However, comparison of unconstrained model parameters can also be

informative. Relatedly, it is important to note that testing for invariance in model-specified

patterns of parameters across groups does not guarantee detection of differences in specific

parameters, particularly when more than two groups are included in the analysis. Although

omnibus results generally supported SSIS-PF measurement invariance as a function of

race/ethnicity at the configural, metric, and structural levels, conflicting evidence should not be

dismissed. Specifically, the observation of potential group differences for model-implied factor

correlations at the baseline stage of the race/ethnicity invariance analysis, and the noted decrease

102

in absolute model fit when equality constraints were placed on structural parameters at a later

stage in the same study,22

are findings that require further investigation.

In general, findings supporting the invariance of the first-order measurement model of the

SSIS-PF across race/ethnicity and language format are promising and should encourage a variety

of research extensions. However, an acknowledgement of the possibility that relationships

between specific social skills factors may actually differ across race/ethnicity groups also is

justified. The lack of structural invariance for the SSIS-PF as a function of language format also

raises the question of potential race/ethnicity differences in structural parameters, given the

respective demographic composition of the English language format and Spanish language format

samples.

A final caveat regarding study design concerns the difference between full measurement

invariance and partial measurement invariance. The current study was designed to examine full

measurement invariance. As such, although group-specific modifications to the baseline SSIS-PF

model were considered, the goal was to retain equivalent models for each group across all stages

of the analysis where possible. In adhering to this goal, optimal group-specific measurement

models were not explicitly sought. Rather, analyses reflect the degree to which a common

measurement model demonstrated equivalence across groups. Furthermore, at various stages in

the analysis, group-specific models demonstrated less than adequate model fit and/or yielded

solutions that were otherwise problematic (e.g., observed factor covariance matrix that was not

positive definite). Such instances were addressed whenever possible. However, in order to

complete the set of analyses as planned, it was necessary at times to move forward with analyses

despite less than optimal findings at preliminary stages. As such, group-specific research should

22

Although chi-square difference testing revealed no significant difference in model fit when structural

constraints were added to the model, a notable increase in the magnitude of the SRMR absolute fit index

was observed.

103

also be conducted to further inform understanding of the social skills construct as it exists within

specific cultural groups.

The preceding list of design limitations also represent caveats to the interpretation of

results. In addition to those already mentioned, there are several other caveats to the current

study that should be addressed in future research. First, the present study focused solely on the

parent form of the SSIS Rating Scales. Therefore, independent studies of the teacher and self-

report forms of the instrument are needed. Similarly, studies that examine invariance along other

dimensions (i.e., gender, age, time) also would be informative. Finally, given the relative lack of

independent work with the SSIS Rating Scales, studies that examine other aspects of

measurement validity (e.g., predictive validity, sensitivity to change, etc.) should be completed as

well.

Implications for the Use of the SSIS-PF in Research and Practice

As it relates to the general factor structure of the SSIS-PF in its published format,

evidence tentatively supports the validity of a seven-factor structure for the instrument. Two

iterations of the full first-order model showed adequate fit to sample data. Moreover, all

indicators demonstrated strong positive loadings on their respective factors. Thus, the seven first-

order SSIS-PF factors (i.e., Communication, Cooperation, Assertion, Responsibility, Empathy,

Engagement, Self-Control) appear to reflect meaningful domains under the ‘social skills’

construct, and the indicators used to tap into these domains appear to be have been appropriately

selected. A lack of discriminant validity among several of the first-order domains does temper

support for the seven-factor structure despite evidence of strong global model fit. Thus, further

examination and potential revision of item content for highly correlated domains (e.g.,

Cooperation and Responsibility) is warranted.

The higher-order factor structure implied by the published version of the SSIS-PF is also

in need of further examination and potential revision. The existence of a unitary ‘Social Skills’

104

factor subsuming all seven first-order domains was not supported in the current study. Instead,

based on exploratory analysis, a more complex higher-order structuring of the social skills

construct with two distinct second-order factors was shown to be more appropriate. In terms of

practical implications, the validity of using a single Social Skills score based on the aggregate

total of ratings across all seven domains of the SSIS-PF is called into question. If the revised

higher-order structure identified through the current study were to be replicated in future studies,

revision to the recommended procedures for SSIS-PF scoring and interpretation would likely be

required.

Implications for Cross-Cultural Social Skills Assessment

Current results are promising in terms of supporting first-order measurement invariance

for the SSIS-PF across groups defined by race/ethnicity and language format. It is worth noting,

however, that measurement invariance is only meaningful to the extent that an instrument’s

general measurement structure has been well validated. As such, initial focus on continuing to

generate evidence regarding the validity of the general measurement model of the SSIS-PF

should be the highest priority in future studies.

When moving beyond this specific instrument, current results suggest that the social

skills construct, at least in terms of behavioral indicators and first order domains, can be

meaningfully operationalized in ways that remain consistent across cultures defined by broad

race/ethnicity categories (i.e., African American, Caucasian, Latino) and language groups (i.e.,

English, Spanish). At this point, further study of the structural relationships among first-order

and higher-order social skills factors across cultural groups is needed before firm assertion can be

made in reference to similarities and differences. Similarly, studies designed with more

specificity in reference to cultural groupings also are needed (e.g., accounting for SES,

acculturation, language status, etc.).

105

Conclusions

Current evidence supports the first-order measurement structure of the SSIS-PF. All

seven first-order social skills domains (i.e., Communication, Cooperation, Assertion,

Responsibility, Empathy, Engagement, Self-Control) were supported, and all indicators

(comprised of two- and three-item sets) demonstrated appropriately strong relationships with their

respective domains. Within the context of the SSIS integrated assessment-for-intervention

paradigm, valid assessment at the item and domain levels is critical in terms of appropriately

guiding the intervention process. Thus, current evidence generally supports the instrument’s

utility for such purposes. It is important to note, however, that results also indicated limited

discriminant validity among several first-order social skills domains. Redundancy in first-order

measurement is not necessarily overly detrimental to the SSIS assessment-for- intervention

framework. Such results, however, do underscore the absence of a consistent and well-validated

classification structure for the social skills construct/domain (Gresham, 1986; Caldarella &

Merrell, 1997; Matson & Wilkins, 2009).

In contrast to results supporting the first-order measurement structure of the SSIS-PF, current

findings did not support the presence of a single, unitary social skills factor subsuming all seven

of the instrument’s first-order domains. In terms of practical implications, such results suggest

that the calculation of a single norm-referenced score may not be the most appropriate means of

quantifying an individual’s social skills as rated by parents on the SSIS-PF. Instead, exploratory

analysis suggests that the SSIS-PF may actually reflect two distinct higher-order domains:

Responsive Social Skills and Proactive Social Skills. More research is needed before conclusions

can be drawn regarding the higher-order measurement structure of the SSIS-PF. In terms of

methodological implications, the potential for iterative (exploratory and confirmatory)

multivariate analysis to move the field closer to consensus in reference to the most appropriate

means of conceptualizing the social skills construct should be noted.

106

Finally, as it relates to measurement invariance of the SSIS-PF and, more broadly, to the

study of social skills across cultures, current results should be interpreted with some caution. The

study of invariance is predicated on the validity of the general measurement model under study.

And, while the first order model of the SSIS-PF was supported in the current study, results also

indicate the need for further examination of the instrument’s higher-order structure. Limitations

notwithstanding, the first-order measurement structure of the SSIS-PF was shown to be invariant

across race/ethnicity and language format groups, indicating that the instrument can be used with

confidence to assess first-order social skills domains within and across each of the groups under

study. More broadly, such results indicate that narrow domains of social behavior (e.g.,

cooperation, engagement, self-control, etc.) can be objectively measured in consistent ways

across African American, Caucasian, and Latino individuals, as well as English-speaking and

Spanish-speaking groups. Based on results of the current study, though, conclusions and

implications concerning the existence of a broader, unitary social skills construct across groups

must be reserved until further research has been conducted.

107

References

Achenbach, T. M. (1995). Empirically based assessment and taxonomy: Applications to clinical

research. Psychological Assessment, 7, 261-274. doi: 10.1037/1040-3590.7.3.261

American Educational Research Association, American Psychological Association, and National

Council on Measurement in Education (1999). Standards for educational and

psychological testing. Washington, DC: American Educational Research Association.

Ang, R. P., & Hughes, J. N. (2001). Differential benefits of skills training with antisocial youth

based on group composition: A meta-analytic investigation. School Psychology Review, 31,

164-185.

Bandalos, D. L., & Finney, S. J. (2001). Item parceling issues in structural equation modeling. In

G.A. Marcoulides (ed.). New developments and techniques in structural equation modeling

(pp. 269-296). Mahwah, NJ: Lawrence Erlbaum.

Barrera, M., & Schulte, F. (2010). A group social skills intervention program for survivors of

childhood brain tumors. Journal of Pediatric Psychology, 34, 1108-1118. doi:

10.1093/jpepsy/jsp018

Barron-McKeagney, T., Woody, J. D., & D’Souza, H. J. (2001). Mentoring at-risk Latino

children and their parents: Impact on social skills and problem behaviors. Child and

Adolescent Social Work, 18, 119-136. doi: 10.1023/A:1007698728775

Bell-Dolan, D. J., Reaven, N. M., & Peterson, L. (1993). Depression and social functioning: A

multidimensional study of the linkages. Journal of Clinical Child Psychology, 22, 306-315.

doi: 10.1207/s15374424jccp2203_1

Bracken, B. A., Keith, L. K., & Walker, K. C. (1998). Assessment of preschool behavior and

social-emotional functioning: A review of thirteen third-party instruments. Journal of

Psychoeducational Assessment, 16, 153-169. doi: 10.1177/073428299801600204

108

Bronfenbrenner, U. (1986). Ecology of the family as a context for human development: Research

perspectives. Developmental Psychology, 22, 723-742. doi: 10.1037/0012-1649.22.6.723

Byrne, B. A. (2012). Structural Equation Modeling with Mplus: Basic Concepts, Applications,

and Programming. New York: Taylor & Francis Group, LLC.

Caldarella, P., & Merrell, K. W. (1997). Common dimensions of social skills of children and

adolescents: A taxonomy of positive behaviors. School Psychology Review, 26, 264-278.

Chang, F., Crawford, G., Early, D., Bryant, D., Howes, C., Burchinal, M., Barbarin, O.,

Clifford, R., & Pianta, R. (2007). Spanish-speaking children’s social and language

development in pre-kindergarten classrooms. Early Education and Development, 18, 243-

269. doi: 10.1080/10409280701282959

Cowen, E. L., Pederson, A., Babigan, H., Izzo, L. D., & Trost, M. A. (1978). Long-term follow-

up of early detected vulnerable children. Journal of Consulting and Clinical Psychology,

41, 438-446. doi: 10.1037/h0035373

Crick, N. R., & Dodge, K. A. (1994). A review and reformulation of social information-

processing mechanisms in children’s social adjustment. Psychological Bulletin, 115, 74 –

101. doi: 10.1037/0033-2909.115.1.74

Crowley, S. L., & Merrell, K. W. (2000). Convergent and discriminant validity of the

Internalizing Symptoms Scale for Children. Journal of Psychoeducational Assessment, 18,

4-16. doi: 10.1177/073428290001800101

Dawson, B. A., & Williams, S. A. (2008). The impact of language status as an acculturative

stressor on internalizing and externalizing behaviors among Latino/a children: A

longitudinal analysis from school entry through third grade. Journal of Youth and

Adolescence, 37, 399-411. doi: 10.1007/s10964-007-9233-z

Demaray, M. K., & Ruffalo, S. L. (1995). Social skills assessment: A comparative evaluation of

six published rating scales. School Psychology Review, 24, 648-671.

109

Edl, H. M., Jones, M. H., & Estell, D. B. (2008). Ethnicity and English proficiency: Teacher

perceptions of academic and interpersonal competence in European American and Latino

students. School Psychology Review, 37, 38-45.

Elliott, S. N., & Gresham, F. M. (1987). Children’s social skills: Assessment and classificaiton

practices. Journal of Counseling and Development, 66, 96-99.

Elliott, S. N., Gresham, F. M., Frank, J. L., & Beddow, P. A., III. (2008). Intervention validity of

social behavior rating scales: Features of assessments that link results to treatment plans.

Assessment for Effective Intervention, 34, 15-24. doi: 10.1177/1534508408314111

Elliott, S. N., Malecki, C. K., & Demaray, M. K. (2001). New directions in social skills

assessment and intervention for elementary and middle school students. Exceptionality, 9,

19-32.

Fantuzzo, J., Manz, P. H., & McDermott, P. (1998). Preschool version of the Social Skills Rating

System: An empirical analysis of its use with low-income children. Journal of School

Psychology, 36, 199-214. doi: 10.1016/S0022-4405(98)00005-3

Flanagan, D. P., Alfonso, V. C., Primavera, L. H., Povall, L., & Higgins, D. (1996). Convergent

validity of the BASC and SSRS: Implications for social skills assessment. Psychology in

the Schools, 33, 13-23. doi: 10.1002/(SICI)1520-6807(199601)33:1<13::AID-

PITS2>3.0.CO;2-X

Fuller, B., & Garciá-Coll, C. (2010). Learning from Latinos: Contexts, families, and child

development in motion. Developmental Psychology, 46, 559-565. doi: 10.1037/a0019412

Galindo, C., & Fuller, B. (2010). The social competence of Latino kindergartners and growth in

mathematical understanding. Developmental Psychology, 46, 579-592. doi:

10.1037/a0017821

110

Genesee, F., Lindholm-Leary, K., Saunders, W. M., & Christian, D. (Eds.). (2006). Educating

English language learners: A synthesis of research evidence. New York: Cambridge

University Press.

Gifford-Smith, M. E., & Brownell, C. A. (2003). Childhood peer relationships: Social acceptance,

friendships, and peer networks. Journal of School Psychology, 41, 235-284. doi:

10.1016/S0022-4405(03)00048-7

Greenspan, S. (1981). Defining childhood social competence: A proposed working model.

Advances in Special Educaiton, 3, 1-39.

Gresham, F. M. (1981). Assessment of children’s social skills. Journal of School Psychology, 19,

120-133. doi: 10.1016/0022-4405(81)90054-6

Gresham, F. M. (1986). Conceptual Issues in the Assessment of Social Competence in Children.

In P. S. Strain, M. M. Guralnick, & H. M. Walker (Eds.), Children’s social behavior:

development, assessment, and modification. (pp. 143-179). Orlando, FL: Academic Press.

Gresham, F. M. (2011). Social behavioral assessment and intervention: Observations and

impressions. School Psychology Review, 40, 275-283.Gresham, F. M., Cook, C. R., Crews,

S. D., & Kern, L. (2004). Social skills training for children and youth with emotional and

behavioral disorders: Intervention validity considerations and future directions. Behavioral

Disorders, 30, 32-46.

Gresham, F. M., & Elliott, S. N. (1990). Social skills rating system. Minneapolis, MN: NCS

Pearson, Inc.

Gresham, F. M., & Elliott, S. N. (2008). Social skills improvement system: Rating scales manual.

Minneapolis, MN: NCS Pearson, Inc.

Gresham, F. M., & Reschly, D. J. (1986). Social skill deficits and low peer acceptance of

mainstreamed learning disabled children. Learning Disability Quarterly, 9, 23-32. doi:

10.2307/1510398

111

Gresham, F. M., & Reschly, D. J. (1987). Dimensions of social competence: Method factors in

the assessment of adaptive behavior, social skills, and peer acceptance. Journal of School

Psychology, 25, 367-381. doi: 10.1016/0022-4405(87)90038-0

Gresham, F. M., Van, M. B., & Cook, C. R. (2006). Social skills training for teaching

replacement behaviors: Remediating acquisition deficits in at-risk students. Behavioral

Disorders, 31, 363-378.

Halgunseth, L. C., Ispa, J. M., & Ruddy, D. (2006). Parental control in Latino families: An

integrated review of the literature. Child Development, 77, 1282-1297. doi: 10.1111/j.1467-

8624.2006.00934.x

Hau, K. & Marsh, H. W. (2004). The use of item parcels in structural equation modeling: Non-

normal data and small sample sizes. British Journal of Mathematical Statistical

Psychology, 57, 327-351. doi: 10.1111/j.2044-8317.2004.tb00142.x

Hu, L. & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis:

Conventional criteria versus new alternatives. Structural Equation Modeling, 6, 1-55. doi:

10.1080/10705519909540118

Hu, L. & Bentler, P. M. (1998). Fit indices in covariance structure modeling: Sensitivity to

underparameterized model misspecification. Psychological Methods, 1998, 3, 424-453. doi:

10.1037/1082-989X.3.4.424

KewalRamani, A., Gilbertson, L., Fox, M., & Provasnik, S. (2007). Status and trends in the

education of racial and ethnic minorities (NCES 2007-039). National Center for Education

Statistics, Institute of Education Sciences, U.S. Department of Education. Washington,

D.C.

Knight, G. P., & Hill, N. E. (1998). Measurement equivalence in research involving minority

adolescents. In V. C. McLoyd and L. Steinberg (Eds.), Studying Minority Adolescents:

112

Conceptual, Methodological, and Theoretical Issues (pp. 183-210). Mahwah, NJ: Lawrence

Erlbaum Associates.

Ladd, G. W. (1999). Peer relationships and social competence during early and middle childhood.

Annual Revue of Psychology, 50, 333-359. doi: 10.1146/annurev.psych.50.1.333

Ladd, G. W., Herald, S. L., & Andrews, R. K. (2005). Young children’s peer relations and social

competence. In B. Spodek, & O.N. Saracho (Eds.), Handbook of research on the education

of young children, (2nd ed., pp. 23-54). Mahwah, NJ: Lawrence Erlbaum Associates

Publishers.

Ladd, G. W., Kochenderfer, B. J., & Coleman, C. C. (1996). Friendship quality as a predictor of

young children’s early school adjustment. Child Development, 67, 1103-1118. doi:

10.2307/1131882

Lane, K. L., Carter, E. W., Pierson, M. R., & Glaeser, B. C. (2008). Academic, social, and

behavioral characteristics of high school students with emotional disturbances or learning

disabilities. Journal of Emotional and Behavioral Disorders, 14, 108-117. doi.

10.1177/10634266060140020101

Lansford, J. E., Malone, P. S., Dodge, K. A., Pettit, G. S., & Bates, J. E. (2010). Developmental

cascades of peer rejection, social information processing biases, and aggression during

middle childhood. Development and Psychopathology, 22, 593-602. doi:

10.1017/S095479410000301

Leidy, M. S., Guerra, N. G., & Toro, R. I. (2010). Positive parenting, family cohesion, and child

social competence among immigrant Latino families. Journal of Family Psychology, 24,

252-260. doi: 10.1037/a0019407

Livas-Dlott, A., Fuller, B., Stein, G. L., Bridges, M., Figueroa, A. M., & Mireles, L. (2010).

Commands, competence, and cariño: Maternal socialization practices in Mexican American

families. Developmental Psychology, 46, 566-578. doi: 10.1037/a0018016

113

López, A., Correa-Chávez, M., Rogoff, B., & Gutiérez, K. (2010). Attention to instruction

directed to another by U.S. Mexican-heritage children of varying cultural backgrounds.

Developmental Psychology, 46, 593-601.

Manz, P. H., Fantuzzo, J. W., & McDermott, P. A. (1999). The parent version of the preschool

Social Skills Rating Scale: An analysis of its use with low-income, Ethnic Minority

Children. School Psychology Review, 28(3), 493-504.

Matson, J. L. & Wilkins, J. (2009). Psychometric testing methods for children’s social skills.

Research in Developmental Disabilities, 30, 249-274. doi: 10.1016/j.rid.2008.04.002

McFall, R. M. (1982). A review and reformulation of the concept of social skills. Behavioral

Assessment, 4, 1-33. doi: 10.1007/BF01321377

Meade, A. W., & Kroustalis, C. M. (2006). Problems with item parceling for confirmatory factor

analytic tests of measurement invariance. Organizational Research Methods, 9, p. 369-403.

doi: 10.1177/1094428105283384

Merrell, K. (1999). Behavioral, Social, and Emotional Assessment of Children and Adolescents.

Mahwah, NJ: Lawrence Erlbaum Associates.

Merrell, K. W. (2001). Assessment of children’s social skills: Recent developments, best

practices, and new directions. Exceptionality, 9, 3-18. doi: 10.1207/S15327035EX091&2_2

Merrell, K. W., & Gimpel, G. A. (1998). Social skills of children and adolescents:

Conceptualization, assessment, treatment. Mahwah, NJ: Lawrence Erlbaum Associates

Publishers.

Merydith, S. P. (2001). Temporal stability and convergent validity of the Behavior Assessment

System for Children. Journal of School Psychology, 39, 253-265. doi: 10.1016/S0022-

4405(01)00064-4

Muthén, L.K. and Muthén, B.O. (2011). Mplus (Version 6.11). Los Angeles, CA: Muthén &

Muthén.

114

Najaka, S. S., Gottfredson, D. C., & Wilson, D. B. (2001). A meta-analytic inquiry into the

relationship between selected risk factors and problem behavior. Prevention Science, 2,

257-271. doi: 10.1023/A:1013610115351

Norton, P. J., Washington, C. S., Peters, J. E., & Hayes, S. A. (2010). Diversity considerations in

assessing social skills. In D.W. Nangle, D.J. Hansen, C.A. Erdley, & P.J. Norton (Eds.),

Practitioner’s guide to empirically based measures of social skills (pp. 135-153). New

York: Springer Publishing Co.

Okagaki, L., & Frensch, P. A. (1998). Parenting and children’s school achievement: A

multiethnic perspective. American Educational Research Journal, 35, 123-144. doi:

10.2307/1163454

Odom, S. L., McConnell, S. R., & Brown, W. H. (2008). Social competence of young children. In

W. H. Brown, S. L. Odom, & S. R. McConnell (Eds.), Social competence of young

children: Risk, disability, & intervention (pp. 3-29). Baltimore, MD: Paul H. Brookes

Publishing Co.

Parker, J. G., & Asher, S. R. (1993). Friendship and friendship quality in middle childhood: Links

with peer group acceptance and feelings of loneliness and social dissatisfaction.

Developmental Psychology, 29, 611-621. doi: 10.1037/0012-1649.29.4.611

Pedersen, J. A., Worrell, F. C., & French, J. L. (2001). Reliability of the Social Skills Rating

System with rural Appalachian children from families with low incomes. Journal of

Psychoeducational Assessment, 19, 45-53. doi: 10.1177/073428290101900103

Phelan, P. Davidson, A. L., and Cao, H. T. (1991). Students’ multiple worlds: Negotiating the

boundaries of family, peer, and school cultures. Anthrpology & Education Quarterly, 22,

224-250. doi: 10.1525/aeq.1991.22.3.05x1051k

Reichow, B., & Volkmar, F. R. (2010). Social skills interventions for individuals with autism:

Evaluation for evidence-based practices within a best evidence synthesis framework.

115

Journal of Autism and Developmental Disorders, 40, 149-166. doi: 10.1007/s10803-009-

0842-0

Rogoff, B. (2003). The Cultural Nature of Human Development. New York: Oxford University

Press.

Sheridan, S. M., & Walker, D. (1999). Social skills in context: Considerations for assessment,

intervention, and generalization. In C.R. Reynolds & T.B. Gutkin (Eds.), The handbook of

school psychology (3rd

ed., pp. 686-708). New York: Wiley.

Skoulos, V., & Shick Tryon, G. (2007). Social skills of adolescents in special education who

display symptoms of oppositional defiant disorder. American Secondary Education, 35,

103-115.

Spomer, M. L., & Cowen, E. L. (2001). A comparison of the school mental health referral

profiles of young ESL and English-speaking children. Journal of Community Psychology,

29, 69-82. doi: 10.1002/1520-6629(200101)29:1<69::AID-JCOP5>3.3.CO;2-M

Suárez-Orozco, C. S., Gaytán, F. X., Bang, H. J., Pakes, J., O’Connor, E., & Rhodes, J. (2010).

Academic trajectories of newcomer immigrant youth. Developmental Psychology, 46, 602-

618. doi: 10.1037/a0018201

U.S. Census Bureau. (2009) State & County QuickFacts: USA QuickFacts. Retrieved from

http://quickfacts.census.gov/qfd/states/00000.html

Van der Oord, S., Van der Meulen, E. M., Prins, P. J. M., Oosterlaan, J., Buitelaar, J. K., &

Emmelkamp, P. M. G. (2005). A psychometric evaluaton of the social skills rating system

in children with attention deficit hyperactivity disorder. Behaviour Research and Therapy,

43, 733-746. doi: 10.1016/j.brat.2004.06.004

Van Horn, M. L., Atkins-Burnett, S., Karlin, E., Ramey, S. L., & Snyder, S. (2007). Parent

ratings of children’s social skills: Longitudinal psychometric analyses of the social skills

116

rating system. School Psychology Quarterly, 22, 162-199. doi: 10.1037/1045-

3830.22.2.162

Vandenberg, R. J. & Lance, C. E. (2000). A review and synthesis of the measurement invariance

literature: Suggestions, practices, and recommendations for organizational research.

Organizational Research Methods, 3, 4-70. doi: 10.1177/109442810031002

Walthall, J. C., Konold, T. R., & Pianta, R. C. (2005). Factor structure of the social skills rating

system across child gender and ethnicity. Journal of Psycholoeducational Assessment, 23,

201-215. doi: 10.1177/073428290502300301

Ward, S., Sylva, J., & Gresham, F. M. (2010). School-based predictors of early adolescent

depression. School Mental Health, 2, 125-131. doi: 10.1007/s12310-010-9028-3

Warnes, E. D., Sheridan, S. M., Geske, J., & Warnes W. A. (2005). A contextual approach to the

assessment of social skills: Identifying meaningful behaviors for social competence.

Psychology in the Schools, 42, 173-187. doi: 10.1002/pits.20052

Warzon, K. B., & Ginsburg-Block, M. (2008). Cultural continuity between home and school as a

predictor of student motivation: What we know, what we need to learn, and implications

for practice. In C. Hudley & A.E. Gottfried (Eds.) Academic motivation and the culture of

school in childhood and adolescence (pp. 121-145). New York: Oxford University Press.

Whiteside, S. P., McCarthy, D. M., & Miller, J. D. (2007). An examination of the factor structure

of the Social Skills Rating System parent elementary form. Assessment, 14, 246-254. doi:

10.1177/1073191107302062.

Webster-Stratton, C., Reid, J., & Hammond, M. (2001). Social skills and problem-solving

training for children with early-onset conduct problems: Who benefits? Journal of Child

Psychology and Psychiatry, 42, 943-952. doi: 10.1111/1469-7610.00790

Weisner, T. S. (2002). Ecocultural understanding of children’s developmental pathways. Human

Development, 45, 275-281. doi: 10.1159/000064989

117

Weiner, J. (2004). Do peer relationships foster behavioral adjustment in children with learning

disabilities? Learning Disability Quarterly, 27, 21-30. doi: 10.2307/1593629

Wentzel, K. R. (1993). Does being good make the grade? Social behavior and academic

competence in middle school. Journal of Educaitonal Psychology, 85, 357-364. doi:

10.1037/0022-0663.85.2.357

Zins, J. E., Bloodworth, M. R., Weissberg, R. P., & Walberg, H. J. (2007). The scientific base

linking social and emotional learning to school success. Journal of Educaitonal and

Psychological Consultation, 17, 191-210. doi: 10.1080/10474410701413145

VITA

Brian Schneider

Education

Ph. D., School Psychology, Pennsylvania State University, University Park, PA, 2012

M. Ed., School Psychology, Pennsylvania State University, University Park, PA, 2009

B.A., Psychology, Muhlenberg College, Allentown, PA 2006

Fellowship

Specialization in Culture and Language Education (SCALE) Fellowship, 2008 – 2012

Professional Positions

School Psychologist, Owen J. Roberts School District, Pottstown, PA, 2012 – present

School Psychologist, Chester County Intermediate Unit, Downingtown, PA 2011 – 2012

Pre-Doctoral Intern, CORA Services, Inc., Philadelphia, PA 2010 – 2011

Professional Certification

School Psychologist (Pennsylvania)

Research Interests

Cross-Cultural Social Skills Assessment

Language and literacy development of English Language Learners

Multicultural issues in education

Early literacy development

Professional Presentations Schneider, B. P., & DiPerna, J.C. (2012, February). A Structural Analysis of the Social Skills

Improvement System Rating Scales, Parent Form: Measurement Invariance by Race/Ethnicity.

Poster presented at the National Association of School Psychologists Annual Convention,

Philadelphia, PA.

Schneider, B. P., & DiPerna, J. C. (2009, February). The home literacy environment’s effect on

emergent literacy outcomes. Poster presented at the National Association of School Psychologists

Annual Convention, Boston, MA.

Professional Memberships

National Association of School Psychologists, 2011 – present

National Association of School Psychologists, Student Member 2007 – 2011

American Psychological Associate, Student Affiliate, 2009 – 2010

Association of School Psychologists of Pennsylvania, 2006 – 2009

Documents

A STRUCTURAL ANALYSIS OF THE SOCIAL SKILLS …