Upload
nevada-reno
View
0
Download
0
Embed Size (px)
Citation preview
University of IowaIowa Research Online
Theses and Dissertations
2014
Processing long-distance dependencies: Clitic LeftDislocation in L2 SpanishTania Lorena LealUniversity of Iowa
Copyright 2014 Tania L. Leal
This dissertation is available at Iowa Research Online: http://ir.uiowa.edu/etd/1353
Follow this and additional works at: http://ir.uiowa.edu/etd
Part of the First and Second Language Acquisition Commons
Recommended CitationLeal, Tania Lorena. "Processing long-distance dependencies: Clitic Left Dislocation in L2 Spanish." PhD (Doctor of Philosophy)thesis, University of Iowa, 2014.http://ir.uiowa.edu/etd/1353.
1
PROCESSING LONG-DISTANCE DEPENDENCIES: CLITIC LEFT DISLOCATION
IN L2 SPANISH
by
Tania Lorena Leal
A thesis submitted in partial fulfillment of the requirements for the Doctor of
Philosophy degree in Second Language Acquisition in the Graduate College of
The University of Iowa
August 2014
Thesis Supervisors: Professor Roumyana Slabakova Associate Professor Paula Kempchinsky
Graduate College The University of Iowa
Iowa City, Iowa
CERTIFICATE OF APPROVAL
_______________________
PH.D. THESIS
_______________
This is to certify that the Ph.D. thesis of
Tania Lorena Leal
has been approved by the Examining Committee for the thesis requirement for the Doctor of Philosophy degree in Second Language Acquisition at the August 2014 graduation.
Thesis Committee: ___________________________________ Roumyana Slabakova, Thesis Supervisor
___________________________________ Paula Kempchinsky, Thesis Supervisor
___________________________________ Thomas Farmer
___________________________________ Luis López
___________________________________ Christine Shea
iii
3
Puedo desconectar, pero sólo en ciertos estados de ánimo irresponsable o bien mediante un gran esfuerzo, y por eso a veces me alegro de que los murmullos sean de veras
indistinguibles y los susurros imperceptibles, y de que existan tantas lenguas que me son extrañas y no son deducibles, porque así descanso.
Javier Marías Corazón Tan Blanco
iv
4
ACKNOWLEDGMENTS
This dissertation is the product of the help and support of many people, a few of
whom I will invariably forget to mention—my apologies in advance. Back in 2008, when
I first came to the University of Iowa, I got to meet Roumyana Slabakova in Syntax I. I
probably read more during that semester than I did during the entire final year of my
M.A. studies. I had a lot of catching up to do, but even beyond that I couldn’t help
myself—it is to her credit that I became fascinated with the syntactic side of language,
and it is no exaggeration to say that meeting her changed my life. Roumyana’s mentoring
has never stopped at the classroom door: She has always answered every email and
request, however small, and has managed to keep up with pretty much every aspect of my
academic training. I am forever thankful to her. My other co-director, Paula
Kempchinsky, a.k.a. the most native Spanish near-native I know, is also one of the most
selfless scholars I’ve ever met. Her knowledge of the syntactic literature has been
invaluable in the course of my training and her pithy summative statements (on pretty
much every subject imaginable) have provided me with many moments of spontaneous
laughter. My research (dissertation and beyond) has been greatly improved because of
her insights and her encouragement. Mis más sinceras gracias por todo, incluyendo todas
las tortillas españolas tuyas que me comí durante estos años.
I am also especially thankful to my committee members for their help. I owe the
cheeriest parts of my take on academia to Richard Hurtig. He must be one of the most
well-liked professors I know, and for good reason. His contributions to this dissertation
went well beyond the design but his comments always encouraged me to think about the
fine-grained consequences of any choice in methodology. The second chapter in this
dissertation would have never existed had I not been lucky enough to take his
Psycholinguistics course. Christine Shea was also instrumental in helping me to choose
between the types of methodologies I considered early on in the planning stages of the
v
5
dissertation; she also helped me see the bright side of things when pilot data was, let’s
say, surprising. I am also very thankful to Luis López for having written his book on
Information Structure and even more thankful that he agreed to be on this committee. His
model of syntax-IS interaction has inspired many of the research projects I have
completed.
When I think of my department, I always think of three wonderful women: Judy,
Sue, and Kathy. Judith Liskin-Gasparro, without whom my data collection in the U.S.
would have never happened, changes everything she touches for the better, and I am
beyond lucky to have met her. She is the little voice in my head that tells me to think
beyond my syntax box and get my hands dirty—both when dealing with data and
(especially) when eating grapefruit. When I want to think of something lovely, I think of
giving her a hug under the apple tree in her backyard. I will never think of Iowa City
without thinking of her smile. Sue Otto is my idea of a perfectly balanced human being:
one of the most graceful and yet strong women I know. However hard a day I have had, I
always feel instantly better upon crossing the threshold of her door. She handles every
request, large and small, tech-related or not, with kindness and efficiency. I don’t think I
ever left her office without laughing at least once. I have often thought to myself, in times
of difficulty, “What would Sue say?” and never failed to feel better. I have never stopped
missing Kathy Heilenman. She had the uncanny ability to say the right thing even before
the right time appeared on the horizon. During our first conversation, she gave me the
one piece of advice I kept holding on to after my comprehensive exams. Things will get
rough, she said, but you just keep trying. She taught me how to walk in the snow (I fell
only once!) and provided me with the most impressive example of professionalism by
being absolutely present in class through her painful treatment. Son un ejemplo a seguir,
las tres. Gracias.
I was incredibly lucky to have landed in Thomas Farmer’s seminar right before I
decided on the methodology for the dissertation. Without his insight and dedication, I
vi
6
would never have attempted to place processing at the center of the project. He has
always been willing to help, and he has inspired me to work hard by example—I had all
but forgotten what all-nighters were until I met him. I am especially thankful for his
willingness to engage with second language research (and linguistic terminology) and to
do so with gusto. Grazie. I was also extremely lucky to have met Jason Rothman during
his time at Iowa. I owe much of my linguistic foundation to the many seminars of his that
I attended. Beyond Iowa, Jason was incredibly helpful, a pesar de la distancia y el
charco, during my last semester. He never failed to be supportive throughout, both as a
mentor and as a co-author. My deepest thanks to him. I am also thankful for the
professors of the FLARE, Spanish and Portuguese, Linguistics, and Psychology
departments at the University of Iowa for their help and support.
If my dissertation was only possible because of my mentors, graduate school was
only possible because of my colleagues. Elena Shimanskaya single-handedly made my
data collection at Iowa viable, offering encouragement and bringing (and buying) me
lunch and coffee whenever I forgot to make room between data collection sessions
(which happened often). She has never failed to make me laugh through the most
frustrating parts of the experience, and she even got me to be a regular at the gym at one
point (!). I can’t imagine grad school without her. Jeff Renaud patiently listened to all of
my patoaventuras and always had either the right words or gestures to make me feel
better. He never failed to be helpful in every respect, and my dogs, Chap and Bean, for
whom he cared many times, will miss him dearly. Mike Iverson, who is great at finding
links on Internets, was extremely helpful and patient through all of my questions, stats-
related or not. His Facebook statuses are the reason why I no longer check Facebook
while drinking coffee. Tiffany Judy, Gonzalo Campos, and Marta Tecedor have always
been helpful, encouraging, kind, and just all-around fun. Thank you guys. Brianna
Janssen Sánchez has the best advice about most anything and she has been generous in
every way, both with her help and time. Frau Kley was always willing to lend an ear over
vii
7
a Blue Moon and discuss data with me—Danke. I am also indebted the rest of the
FLAREs gang: Akiko, Anna, Brian, Céline, Chie, Chiemi, Denise, Ella, Elizabeth, Jacee,
Jen (Cabrelli), Jen (Vojtko), Jia, Karen, Lyudmila, Raychel, Olesya, Olga, Rajiv, Takako,
and Yi-Tzu. Gracias chic@s.
I am also extremely thankful to all the faculty and staff (at UI and Middlebury)
that made testing possible: Amber Brian, Denise Filios, Brian Gollnick, Christina
Isabelli, Luis Martín Estudillo, Ana Merino, Mercedes Nino Murcia, Ana Rodriguez,
Fanny Roncal Ramírez, Marda Rose, Jacobo Sefamí, Becky Bohde, Jennifer Crawford,
Greg Johnson, and Audrey LaRock. Thank you for letting me recruit in your classes and
for helping with the details of testing.
I am eternally thankful for my family and friends in Mexico who have always
been willing to help with my recruitment efforts. Erika Méndez Quero is the most
cheerful recruiter and I will be forever thankful for her and her family for their help. Mis
tíos (en orden) Raúl y Antonieta, Jorge y Juanita, y Migue y Analí: Gracias por su ayuda
y su apoyo, tanto incondicional como constante. Dr. Kimberly Davis, thank you for
always treating me like family. You are what I strive to be as a mentor. George and Cathy
Stack are the most supportive in-laws I have ever even heard of. They have traveled
thousands of miles to help at home whenever I have gone to gather data or to present
research. My most sincere thanks. My brother Oscar Leal is the brightest and most hard-
working person I know…also one of the funniest, which is really not fair. He has served
as an informant more times than he wishes to remember. No te escogí como hermano
pero si pudiera, lo haría dos veces. My parents Guillermo and Socorro were incredibly
helpful with the logistics of testing and recruiting. I would have no native data if it
weren’t for their efforts. En verdad que todas las páginas de esta tesis no serían
suficientes para listar todo lo han hecho por mí. Gracias. My husband, Micah Stack, has
sat through every draft of every paper I’ve written (including this dissertation, of course)
and picked up the slack at home whenever I have to leave town and gather (even more)
viii
8
data. He has been selfless and kind in every respect—time and time again he has placed
my research efforts above his own objectives and done so in admirable good humor. My
son Sebastian has provided me with the most joyous reprieves from my work and has
pouted minimally when I’ve had to leave him to do work.
This research was funded by the following grants: Executive Council of Graduate
and Professional Students Research Grant (University of Iowa), Ann Cleary Dissertation
Grant, and a Dissertation Improvement Grant from the National Science Foundation,
Award No. BCS-1323229.
ix
9
ABSTRACT
It has long been theorized that, after the so-called critical period has passed,
acquiring language becomes a more difficult enterprise. While general differences
between adult second language (L2) learners and normally developed child (L1)
acquirers have been more or less empirically established, a strand of recent L2 accounts
have focused on the specific locus of these differences. The main goal of this dissertation
project is to test the predictions of one such account: Clahsen and Felser’s Shallow
Structure Hypothesis (SSH; Clahsen & Felser 2006a, 2006b). The SSH places emphasis
on the empirical testing of native/non-native language processing asymmetries, which are
argued to be due to less detailed L2 grammatical representations. This dissertation tests
the predictions of the SSH using a long-distance dependency: Clitic Left Dislocation
(CLLD) in L2 Spanish. The study includes on-line and an off-line tasks, which were
completed by a control group of native speakers of Spanish and an experimental group
constituted by L2 learners of Spanish whose first language was English.
In view of the well-known fact that L2 learning outcomes vary widely across
individuals, a secondary goal of this dissertation project is to determine whether
variability in individual learning abilities, such as inhibitory control and statistical
learning predicts variability in L2 learning. Part of L2 learning involves detecting the
probabilistic patterns of a language (Saffran, Aslin, & Newport, 1996), such that
individuals who are better pattern learners may be better able to learn the structural
regularities of the L2 input.
Results were analyzed in order to determine whether the predictions of the SSH
could account for the patterns present in the data. These results suggest that although the
acquisition of long-distance dependencies is a protracted process, both intermediate and
advanced L2 learners of Spanish could anticipate (predict) a syntactic element based in
previously occurring cues. Thus, these results fail to support the predictions of the SSH.
x
10
In terms of individual differences, overall, neither statistical learning nor inhibitory
control appear to modulate the on-line processing of this particular long-distance
dependency in Spanish.
xi
11
TABLE OF CONTENTS
CHAPTER 1 INTRODUCTION .........................................................................................1!1.1 The Research Problem ................................................................................4!1.2 Research Questions .....................................................................................7!1.3 Organization of the Dissertation .................................................................7!
CHAPTER 2 SECOND LANGUAGE ACQUISITION RESEARCH ON PROCESSING ..................................................................................................9!2.1 General Introduction to the Chapter ...........................................................9!2.2 On the plurality of SLA frameworks ..........................................................9!2.3 Generative Approaches to SLA ................................................................11!
2.3.1 Main tenets and goals .....................................................................11!2.3.2 Current Research in Generative L2A: Moving beyond the initial state ...............................................................................................14!2.3.3. Generative L2A and Language Processing Research ....................16!2.3.4 The Shallow Structure Hypothesis (SSH) ......................................17!
2.4 Usage-Based Approaches to SLA ............................................................20!2.4.1. Main tenets ....................................................................................20!
2.5 Statistical Learning and the Richness of the Input ...................................22!2.6 Research on L2 processing .......................................................................32!
2.6.1. Two Approaches to L2 Processing ................................................32!2.7 The Construct of Proficiency in L2 Research and the Relationship between Proficiency and Exposure .................................................................36!
2.7.1 The Relationship between Proficiency and L2 Processing ............39!2.8. The Generation of Predictions and L2 Processing ..................................46!2.9. Summary ..................................................................................................48!
CHAPTER 3 THE SYNTAX OF CLITIC LEFT DISLOCATION IN SPANISH ..........50!3.1 General Introduction to the Chapter .........................................................50!3.2 Properties of CLLD ..................................................................................51!
3.2.1 General structure and availability of CLLD in Romance Languages ................................................................................................51!3.2.2 Structural Properties of Spanish CLLD ..........................................55!
3.2.2.1 Types of phrases that can be (clitic) left dislocated .............55!3.2.2.2 The semantic relationship of the dislocated element and the clitic .....................................................................................58!3.2.2.3 CLLD and a-marking ...........................................................60!3.2.2.4 Iterative CLLD .....................................................................62!3.2.2.5 CLLD in combination with other elements in the left periphery ...........................................................................................64!3.2.2.6 CLLD in embedded clauses .................................................65!
3.2.3 Discourse Properties of Spanish CLLD .........................................67!3.2.3.1 CLLD and the notion of “Topic” .........................................67!
3.2.4 Other Dislocations in Spanish ........................................................68!3.2.4.1 CLLD and Hanging Topic Left Dislocation (HTLD) ..........69!3.2.4.2 CLLD and (Fronted) Focus ..................................................72!
3.3 CLLD “analogues”: English-style topicalization .....................................75!3.4 Syntactic analyses of CLLD .....................................................................77!
3.4.1 Movement vs. Base-Generation approaches ..................................78!3.4.2 The location (or landing site) of the dislocate ................................80!
3.4.2.1 Rizzi’s (1997, 2004) cartographic approach ........................81!3.4.2.2 López’s (2009) approach to Information Structure ..............85!
xii
12
3.5 Summary ...................................................................................................91!CHAPTER 4 METHODS ..................................................................................................92!
4.1 Introduction to the Chapter .......................................................................92!4.2 Participants ...............................................................................................93!
4.2.1. Second language learners ..............................................................93!4.2.1. Native speaker control group .........................................................94!
4.3 Tasks and Materials ..................................................................................95!4.3.1 Off-line Tasks .................................................................................96!
4.3.1.1 Proficiency Test: Materials and Procedures .........................97!4.3.1.2 Clitic Knowledge Test ..........................................................98!4.3.1.3 (English) Vocabulary Knowledge (L1 Verbal Skills) ........100!4.3.1.4 Audio-Visual Rating Task of CLLD ..................................101!4.3.1.5 Statistical Learning Task ....................................................104!4.3.1.6 Language Background Questionnaire ................................105!
4.3.2 On-line Tasks ................................................................................105!4.3.2.1 Self-paced reading task ......................................................106!4.3.2.2 Non-verbal Stroop task .......................................................108!
4.4 Summary .................................................................................................111!CHAPTER 5 RESULTS ..................................................................................................112!
5.1 Introduction to the Chapter .....................................................................112!5.2 Off-line Tasks .........................................................................................113!
5.2.1 Proficiency Test ............................................................................113!5.2.2 Clitic Knowledge Test ..................................................................115!5.2.3 (English) Vocabulary Knowledge (L1 Verbal Skills) ..................116!5.2.4 Audio-Visual Rating Task of CLLD ............................................116!
5.2.4.1 Rating Task: Main Clause CLLD (Group results) .............117!5.2.4.2 Rating Task: Embedded Clause CLLD (Group results) .....119!
5.2.5 Statistical Learning Task ..............................................................126!5.3 Interim Summary: Offline tasks .............................................................128!5.4 On-line Tasks ..........................................................................................129!
5.4.1 Self-paced reading task .................................................................129!5.4.1.1 Accuracy scores on comprehension questions ...................130!5.4.1.2 Reading Times ....................................................................131!
5.4.2 Non-verbal Stroop task .................................................................142!5.5 Inter-correlations between tasks .............................................................145!5.6 Summary .................................................................................................151!
CHAPTER 6 DISCUSSION AND CONCLUSIONS .....................................................153!6.1 Introduction to the Chapter .....................................................................153!6.2 Summary of Main Findings ....................................................................154!
6.2.1 Research Question 1 .....................................................................154!6.2.2 Research Question 2 .....................................................................158!6.2.3 Research Question 3 .....................................................................165!
6.3 Limitations of the Study and Directions for Future Research ................173!6.4. Conclusions ............................................................................................175!
REFERENCES ................................................................................................................177
xiii
13
LIST OF TABLES
Table 2-1 L2 processing studies using indirect measures of proficiency (no independent proficiency measure). ...........................................................................40!
Table 2-2 L2 processing studies using direct measures of proficiency (independent proficiency measure were used). ..............................................................................44!
Table 3-1 The Spanish clitic paradigm for dative and accusative clitics ...........................56!Table 3-2 Feature combinations and syntax-discourse structures, López (2009) ..............87!Table 4-1 Demographics of the L2 Learner group ............................................................94!Table 4-2 Demographics of the Native Speaker group ......................................................95!Table 4-3 Conditions and sample tokens from the Offline Rating Task .........................10�!Table 4-4 Grammar and items in Gómez’s Language 1 ..................................................105!Table 5-1 Means, standard deviations, and range of scores on the proficiency
measure per group. ..................................................................................................114!Table 5-2 Means, standard deviations, and ranges of the scores of the test of clitic
knowledge per group ..............................................................................................115!Table 5-3 Means, standard deviations, and ranges of the scores on the SILS
Vocabulary test for the L2 groups. .........................................................................116!Table 5-4 Pairwise comparisons (grammaticality*condition*group): Embedded
CLLD (native speakers). Conditions: C—Subjunctive Complements, D—Temporal Adverbials, E—Sentential Subjects. ......................................................124!
Table 5-5 Pairwise comparisons (grammaticality*condition*group): Embedded CLLD (L2 advanced learners) ................................................................................125!
Table 5-6 Pairwise comparisons (grammaticality*condition*group): Embedded CLLD (L2 intermediate learners) ...........................................................................126!
Table 5-7 Mean accuracy rates and standard deviations for the statistical learning task (all groups) ......................................................................................................127!
Table 5-8 Mean overall accuracy rates (fillers included) and standard deviations for the self-paced reading comprehension questions (all groups) ..........................130!
Table 5-9 Mean accuracy rates and standard deviations on the experimental items only (n=24 per list) for the self-paced reading comprehension questions (all groups) ....................................................................................................................131!
Table 5-10 Self-paced reading pairwise comparisons (segment * group): all segments except segment 8 for all participant groups. Superscripts indicate the group with the higher RT mean. .......................................................................137!
xiv
14
Table 5-11 Self-paced reading pairwise comparisons (condition*segment*group): all segments except segment 8 for all participant groups. Superscripts indicate the condition with the higher mean. ........................................................................139!
Table 5-12 Inter-correlation between task measures (significant correlations in shaded cells). ...........................................................................................................147!
xv
15
LIST OF FIGURES
Figure 3-1 López’s model: interaction of syntactic objects with pragmatics ....................86!Figure 4-1 Tasks with approximate durations (L2 group) .................................................96!Figure 4-2 Screenshot from the audio-visual rating task, condition e (temporal
adverbial clauses) ....................................................................................................101!Figure 4-3 Sample stimuli from the non-verbal Stroop task. Panel (a) depicts
congruent stimuli (left and right) and panel (b) depicts incongruent stimuli (left and right). ........................................................................................................110!
Figure 5-1 Mean acceptability rates for Condition A (CLLD + Fronted Focus) and B (CLLD + wh-operator) ........................................................................................118!
Figure 5-2 Mean acceptability rates for Embedded CLLD condition C (Subjunctive Complements) for all groups. .................................................................................120!
Figure 5-3 Mean acceptability rates for Embedded CLLD condition D (Temporal Adverbials) for all groups. ......................................................................................121!
Figure 5-4 Mean acceptability rates for Embedded CLLD condition E (Sentential Subjects) for all groups. ..........................................................................................122!
Figure 5-5 Reading times per region for the early and late completion conditions (error bars +/- 1SE) for the native speaker group (see (5-�)): A1 aquellas2 estudiantes3, la4 linda5 secretaria6 felizmente7 les8 contó9 que10 probablemente11 las12 admitirán13 en14 el_programa15 ............................................133!
Figure 5-6 Reading times per region for the early and late completion conditions (error bars +/- 1SE) for the advanced L2 learner group (see (5-�)): A1�aquellas2 estudiantes3, la4 linda5 secretaria6 felizmente7 les8 contó9 que10 probablemente11 las12 admitirán13 en14 el_programa15 ............................................134!
Figure 5-7 Reading times per region for the early and late completion conditions (error bars +/- 1SE) for the intermediate L2 learner group (see (5-�)): A1�aquellas2 estudiantes3, la4 linda5 secretaria6 felizmente7 les8 contó9 que10 probablemente11 las12 admitirán13 en14 el_programa15 ............................................135!
Figure 5-8 Scatter plot of proficiency test scores and reaction times at segment 9 in the late (expectation violation) and early (expectation met) conditions for the L2 learner group. .....................................................................................................141!
Figure 5-9 Mean accuracy scores and standard errors for the congruent and incongruent trials for the non-verbal Stroop task (all groups). ...............................143!
Figure 5-10 Mean reaction times and standard deviations for the congruent and incongruent trials for the non-verbal Stroop task (all groups). ...............................144!
1
CHAPTER 1
INTRODUCTION
The present dissertation examines the acquisition and processing (comprehension)
of a long-distance dependency by learners of Spanish as a second language (L2) in
adulthood, using insights from both Generative and Usage-based approaches to language
acquisition. Although these two approaches have been, at least nominally, incompatible
in many well-documented regards (e.g. views on nativism, innate representational
systems), there are other issues on which the two have developed many common points
of contact, both in terms of assumptions, methodologies, and even the types of inferences
can be drawn from experimental data. These developments are partly due to internal
changes and advances within the Generative Paradigm—changes that have not
necessarily altered the view of generative linguists from an outsider’s perspective.
Marantz (2005) notes that the general (and, he argues, largely mistaken) perception of a
disconnection between linguistic theory and (experimental) data analysis has resulted in
many cognitive researchers basically rejecting the idea that generative linguistics can be a
foundational starting point for the cognitive scientific study of language. In fact, Marantz
notes about this perceived “gap” between generative linguistics and experimental
cognitive researchers, that “generative grammarians perhaps suffer from a public
relations problem” (p. 430). Although several generative linguists (e.g. Jackendoff, 2003;
Marantz, 2005) have attempted to carve a more substantial and broad role for linguistics
in fields such as psycholinguistics and computational neuroscience, this (re) assimilation
is far from being universally accepted or attained.
The developments that have taken place within the Generative paradigm, which
have been admittedly gradual and somewhat localized, have been spurred by several
factors, including conceptual advancements in linguistic theory (e.g. Chomsky, 2001,
2005; Jackendoff, 2003), changes in the kinds of constructs that have been argued to
influence the acquisition process (e.g. in L2 acquisition research, Sorace, 2011 for
2
2
cognitive control; Juffs & Harrington, 2011 for memory; Hawkins, 2011 for statistical
learning abilities), as well as from collective attempts to incorporate findings from other
fields, especially from Usage-based empirical studies within psycholinguistics. It should
be noted that this perceived gap is not exclusively one-sided. Many researchers have
noted that the majority of generative linguists have not wholeheartedly embraced the
adoption of methodologies and constructs from experimental psychology and
psycholinguistics (see Marantz, 2005 for discussion).
The current study focuses on the L2 acquisition/processing of a long-distance
dependency in Spanish (i.e. Clitic Left Dislocation; CLLD) by a group of English-native
speakers learning Spanish as a second language in adulthood. In order to weigh in on
current debates regarding second language acquisition and processing, the study includes
both off-line tasks, which do not necessitate the collection of data that is time-sensitive
and do not place a time constraint on completion, and on-line tasks, where the data
collected is time-sensitive and results from the manipulation of an independent variable
that controls specific linguistic aspects. Given that the goal of these tasks is to measure
the effect of a given linguistic manipulation, usually in terms of differences in reaction or
processing times (see Garrod, 2006 for detailed descriptions of these types of
methodologies), this dissertation includes time-sensitive measures (henceforth on-line) in
order to better understand the process and mechanisms involved in the learning and
comprehension of long-distance dependencies in a second language.
The point of departure of this dissertation, as is the case with most research done
in second language acquisition, is the universally acknowledged fact that the ultimate
outcomes of first (L1) versus L2 language acquisition are remarkably different. These
differences must be accounted for by any theory of second language acquisition,
regardless of the theoretical approach or the methodologies employed. In this respect,
making use of off-line methodologies (e.g. grammaticality judgment tasks, truth-value
judgment tasks) L2 researchers have documented finding differences in the linguistic
3
3
knowledge possessed by L1 vs. L2 speakers, providing evidence for a “lack of
convergence” across multiple linguistic domains including phonology (e.g. Fathman &
Precup, 1983; Munro, Flege & MacKay, 1996) and morpho-syntax (e.g. Hyltestam &
Abrahamsson, 2003; Lardiere, 2009).
However, there is also evidence to the contrary: A significant number of studies
have shown evidence, especially in study of linguistic structures involving the acquisition
of morphosyntax and syntax-semantics, that these across-group differences are not
always insurmountable (e.g. Hopp, 2006; Leal Méndez & Slabakova, 2012; Iverson,
Rothman & Kempchinsky, 2008; Montrul & Slabakova, 2003). The evidence for
“convergence” usually comes in the form of a lack of qualitative differences between the
performances of native speakers and L2 learners.
In the last decade and a half, L2 acquisition researchers have sought alternative
explanations for native vs. L2 ultimate-attainment differences by focusing especially on
L1-L2 processing-based differences. Many studies in this vein have produced evidence
that the moment-by-moment processing of an unfolding linguistic signal is different
between these two groups (e.g. Jiang, 2004; Papadopoulou & Clahsen, 2003), although
other studies show areas of convergence (e.g. Hopp 2006, 2013). Accounts that center on
processing vary from one to another across a wide variety of dimensions. It has been
noted (e.g. in Leal Méndez, Farmer, & Slabakova, in press) that they can be roughly
categorized into two groups, based upon the degree to which they propose that the
processes driving on-line L2 comprehension are the same as, rather than different from,
those responsible for L1 comprehension. In any case, this shift to processing-centric
explanations for L2 acquisition has broadened the research questions that have been
addressed within the generative L2 acquisition paradigm.
4
4
1.1 The Research Problem
The acquisition and comprehension of a language beyond one’s native language is
a difficult endeavor with infamously variable and often imperfect outcomes. A
notoriously protracted process, L2 acquisition is a gradual undertaking in which a learner
must learn an extensive number of arbitrary sound-meaning parings as well as a series of
constraints (i.e. rules) involving a multitude of linguistic levels (e.g. syntax, morphology,
phonology). These levels must be subsequently integrated with social contexts and world
knowledge in order to construct meaning (Kutas, DeLong & Smith, 2011). Thus, L2
comprehension requires that a speaker/comprehender incrementally incorporate these
various linguistic levels in a matter of hundreds of milliseconds in order to form mental
representations (i.e. meaning). These representations have been argued to result from
processes such as the online mapping of lexical items into grammatical categories, the
computation of semantic interpretations, and the interpretation of pragmatic and
communicative intentions (Renaud & Dekydspotter, in press).
Research in L1 sentence processing has presented a wealth of evidence showing
that speakers can anticipate/predict upcoming information at multiple levels, including
morphosyntax, lexicon, or even word-form typicality (e.g., Altmann & Mirkovic, 2009;
Arai & Keller, 2013; Farmer, Christiansen & Monaghan, 2006; Farmer, Brown &
Tanenhaus, 2013; Jaeger & Snider, 2013; Kutas, DeLong & Smith, 2011; Levy, 2008;
MacDonald, 2013; Pickering & Garrod, 2011, 2013; Van Petten & Luka, 2012). The
question of whether L2 learners can predict in their L2, however, has been explored only
recently (e.g. Hopp, 2013). The investigation of L2 predictive processing has yielded
somewhat mixed results, with some studies finding that L2 learners do not predict in a
native-like way while others have found fewer differences (see Kaan, in press for
discussion). Nevertheless, the investigation of prediction should occupy an important
place in L2 acquisition research because it is especially significant in terms of weighing
in on the different accounts that have been proposed for L2 processing.
5
5
One important account of L2 processing is the Shallow Structure Hypothesis
(Clahsen & Felser, 2006 a,b; SSH), which proposes that L2 learners do not process
language in the same way as native speakers of the language due to differences in their
grammatical representations. Moreover, the SSH posits that L1-L2 processing differences
are localized. The processing of long-distance dependencies (e.g. wh-movement) is
argued to lie at the most problematic end of the processing difficulty continuum, while
adjacent dependencies (e.g. morphosyntactic agreement) are argued to posit less
difficulty. In this regard, although the SSH does not explicitly address the generation of
predictions or predictive processing (but see Kaan, in press for discussion), the expected
outcomes are clear: Under the SSH, L2 learners are not expected to predict/anticipate
syntactic information that depends on long-distance syntactic relationships. Instead, L2
learners are expected to succeed only if (and when) they can use heuristics based on
semantic and pragmatic contextual information.
Testing the SSH empirically by investigating the processing of long-distance
dependencies is important from a theoretical point of view because this investigation
advances our knowledge of the underlying mental architecture responsible for L2
learning. By testing the SSH, we can address a long-standing and crucial question in
second language acquisition research: whether differences between native speakers and
L2 learners (including those at the end of the proficiency continuum often referred to as
“near-natives”) are qualitative (e.g. L2 learners have less detailed mental representations
which may lead to inefficient or unavailable predictive processing, as predicted by the
SSH) or quantitative (e.g. the differences can be traced to explanations such as
differences in L2 proficiency or exposure to the L2).
In addition to accounting theoretically for the acquisition of long-distance
dependencies, it is also important to account for how individual differences may or may
not influence L2 acquisition. It is well known that L2 learning outcomes vary widely
across individuals, a fact that is distinct from typical L1 acquisition where the ultimate
6
6
outcomes are much more uniform. Thus, a secondary goal of this dissertation is to
investigate whether variability in individual measures of ability, including non-linguistic
measures of inhibitory control and statistical learning abilities, can predict variability in
L2 learning. Measures of ability on these two different cognitive constructs were used for
this purpose. In addition to these non-linguistic measures, this dissertation also included
linguistic measures such as L1 Vocabulary measures and measures of syntactic
knowledge in L2.
In order to measure inhibitory control, this dissertation includes a classic
inhibitory control measure (Stroop, 1935). The version employed in this study is a (non-
linguistic) version of the Stroop task that has been previously, and successfully, used in
research with bilinguals (Blumenfeld & Marian, 2011). This task was used to investigate
the possible contribution of inhibitory control in the acquisition and processing of long-
distance dependences. Although some L2 acquisition theories (e.g. the Interface
Hypothesis, Sorace, 2011; Sorace & Serratrice, 2009) predict that inhibitory control can
modulate L2 acquisition and processing, the SSH does not predict such a relationship. In
fact, the SSH does not predict that processing is affected by individual cognitive
differences (e.g. differences in reading-span measures).
In order to investigate the potential contributions of statistical learning on L2
acquisition and processing, this study used a statistical learning task (Gómez, 2002),
which measured participants’ ability to detect probabilistic regularities in an inherently
noisy input. Part of L2 learning involves learning about the probabilistic patterns of a
language (Saffran, Aslin & Newport, 1996), such that individuals who are better pattern
learners may be better able to learn about structural regularities inherent to the L2 input.
This study aimed to test this hypothesis. Although the study of statistical learning within
L2 research is still burgeoning, there is indication that statistical learning abilities could
be critical for a more detailed understanding of the L2 acquisition process. For instance,
Kaufman, DeYoung, Gray, Jiménez, Brown, & Mackintosh (2010) found that subject-
7
7
based variability in scores on statistical learning tasks significantly predicted variability
on reading and other language-related measures. Based on this research, it would be
logical to predict a positive relationship between statistical learning scores and
knowledge of long-distance dependencies, such as CLLD.
1.2 Research Questions
The goal of the present study is to investigate (adult) second language
development by examining the L2 acquisition and processing of CLLD, a long-distance
dependency in Spanish. In doing so, this study seeks to test the predictions of the Shallow
Structure Hypothesis, which proposes that the processing of long-distance dependencies
in a second language is impeded by L2 grammatical representations that lack detail. A
secondary goal is to investigate the contribution of individual differences in measures of
inhibitory control and statistical learning abilities to the processing of CLLD in L2
Spanish.
In view of the previous discussion, the research questions that guide this study are
the following:
1. Do L2 learners of Spanish, native speakers of English, show off-line knowledge
of the syntactic and pragmatic constraints of CLLD in Spanish?
2. Are long-distance dependencies such as Spanish CLLD especially difficult for L2
learners to process in real time, as predicted by the Shallow Structure Hypothesis?
3. Can individual differences in measures of linguistic- (e.g. L1 vocabulary) and
non-linguistic abilities (e.g. inhibitory control and statistical learning ability)
predict variability in the acquisition and processing of CLLD in L2 Spanish?
1.3 Organization of the Dissertation
The present dissertation is organized into six chapters. Chapter 2 presents the
theoretical frameworks that inform the study, and it explores points of contact and shared
interest between the Generative and Usage-based approaches to L2 acquisition. This
8
8
chapter also includes a selective review of processing studies in L2 from various
approaches. Chapter 3 details the syntactic and discourse properties of CLLD in
Romance languages, with a special emphasis on the properties of Spanish CLLD.
Additionally, it includes two syntactic analyses that intend to account for Romance
CLLD. Chapter 4 presents the methodology used in the empirical study, including the
research populations, tasks, and procedures. Chapter 5 presents the results and statistical
analyses from all experimental tasks. Finally, Chapter 6 presents a discussion of the
findings of the experimental tasks and offers the essential conclusions and interpretations
of the results. This chapter concludes with a brief presentation of the shortcomings of the
experimental study and offers some ideas and potential avenues for future research in L2
processing.
9
CHAPTER 2
SECOND LANGUAGE ACQUISITION RESEARCH
ON PROCESSING
2.1 General Introduction to the Chapter
In this chapter, I will present the theoretical backdrop that informs the empirical
study described in Chapter 4. For this purpose, I will focus on two different frameworks
in Second Language Acquisition (Generative approaches to SLA and Usage-based
approaches to SLA) that have historically differed in their departing assumptions, the
types of data that are commonly used as evidence within each, as well as the
interpretations that are drawn from that data. Although these two frameworks have
orthogonal views with regard to a wide range of issues, I will strive to highlight several
commonalities between them with regard to the constructs that are used in this
dissertation (i.e. inhibitory control, statistical learning abilities) as well as in other many
other recent studies. In part, these commonalities constitute the byproduct of both
theoretical and methodological advances (e.g. the increased use of time-sensitive
methodologies) as well as of important empirical findings in language research (e.g.
Saffran, Aslin & Newport, 1996).
Additionally, in this chapter I will present a review of L2 research studies that,
using time-sensitive methodologies, have investigated the moment-to-moment processing
of a linguistic signal, with a special focus on a subset of studies that have taken L2
proficiency as a variable. Finally, this chapter will present the notion of “prediction”
(anticipatory processing) and will spell out how examining predictive processes can be a
good measure of L2 processing development.
2.2 On the plurality of SLA frameworks
SLA is widely considered to be a relatively young field, one that goes back, as an
independent discipline, to the 1960s at the earliest (Gass, 2006; Gass, Fleck, Leder, &
10
10
Svetics, 1998; but see Thomas, 1998, for a different argument). However, talking about
SLA as a unitary discipline would be somewhat misleading and perhaps tendentious.
From its inception, SLA research has been informed by other disciplines (most notably
linguistics, psychology, and pedagogy) and, to this day, there are a multitude of SLA
frameworks and approaches, many of which exist largely without influencing one
another. This should be to some extent unsurprising given the multiple questions and
issues that SLA as a field has sought to address over the years. Thus, the plurality of
frameworks may be a matter of sheer necessity.
R. Ellis (1997), for example, proposes that no single framework, theory, or model
can adequately address the wide array of hypotheses that have been proposed in order to
account for the L2 acquisition process. Although there have been a great number of
fruitful collaborations between SLA frameworks and between SLA and other fields (e.g.
linguistics, psychology, sociology), the plurality of frameworks used in SLA research has
not co-existed peacefully. In fact, the field has seen several debates about the nature of
SLA inquiry and the types of frameworks that should (or should not) be part of the field
(e.g. Firth & Wagner, 1997, 1998; Gregg, 1993; Gregg, Long, Jordan & Berreta, 1997;
Lantolf, 1996; Long, 1997).
With the integration of different approaches and disciplines have come different
research methodologies. As noted by Seliger and Shohamy (1989), choosing an
experimental method over another constitutes an indication of the types of questions that
are essential to the researcher, as well as of her/his assumptions about the subject of
inquiry. Studies in L2 processing (see section 2.6 for details) have their basis in the
psycholinguistic tradition, where the main focus has been on the mechanisms involved in
processing, rather than on the constraints on grammar formation (Mackey & Gass, 2005).
Despite their differences, the two approaches that will be discussed below are generally
considered to be approaches to SLA that focus on quantitative data analysis.
11
11
2.3 Generative Approaches to SLA
2.3.1 Main tenets and goals
Since its inception, the study of generative second language acquisition has had as
a fundamental goal the characterization of the linguistic systems developed by second
language learners—the so-called interlanguage (Selinker, 1972)—as it pertains to the
universal principles that are argued to constrain natural languages in general (e.g.
Chomsky, 1985; White, 2003). In addition to the characterization of second language
learners’ linguistic knowledge, the study of generative second language acquisition has
also striven to account for what is (and, perhaps more crucially, what is not) acquirable in
the L2 (Gass & Selinker, 2008), as well as the ways in which this knowledge is attained
(Gregg, 1993). Although it would be unfair to say that Generative approaches to SLA
have not focused on L2 development, the historical focus of the Generative study of L2A
has usually centered on either the initial state of L2 knowledge (e.g. issues of what type
of knowledge can be transferred from the L1) or the ultimate state of L2 knowledge (also
referred to as “ultimate attainment”).1 These foci have led to a very productive research
program, but it has also neglected some areas of L2 study. I will elaborate on this point
below.
Regarding attainment, the Generative paradigm has historically framed this
question in terms of accessibility to Universal Grammar (UG), where various, often
dichotomous, proposals have been put forward in order to account for the (im)possibility
of native-like convergence. Indeed, the question of accessibility to UG in the
development of interlanguage grammars had been, until very recently, perhaps the central
research topic within the generative study of second language acquisition. Much of this
1 The initial state of L2 acquisition represents the set of initial linguistic circumstances (e.g. L1 background and possible transfer) that constitute the departing point of L2ers when exposed to L2 language data (White, 2003).
12
12
research has addressed the (im)possibility of parameter resetting, especially of L2
acquisition beyond critical (or sensitive) periods (e.g. adult L2 learners).
As is to be expected, some of these original proposals represented antithetical
accounts that posit, on one end of the spectrum, the unequivocal (albeit not inevitable)
possibility of resetting parameters (e.g. Schwartz & Sprouse, 1996) or the categorical
lack of such a possibility (Flynn & Martohardjono, 1994, Epstein, et al., 1996) on the
other end. However, there also exist partial-access accounts, which postulate either local
impairments or qualified possibilities of resetting (e.g. Hawkins & Hattori, 2006; Tsimpli
& Dimitrakopoulou, 2007; Vainikka & Young-Scholten, 1994).
Among these proposals, two have been widely regarded as the main contenders.
On the one hand, proponents of the so-called Representational Deficit Accounts
(Hawkins & Chan, 1997; Hawkins & Hattori, 2006; Tsimpli & Dimitrakopoulou, 2007;
Tsimpli & Mastropavlou, 2008, inter alia) argue for impaired L2 representations due to
partial availability of UG.2 This partial access is argued to affect a subset of features that
are not instantiated in the L1 (i.e. they cannot be transferred). The Interpretability
Hypothesis (Tsimpli & Dimitrakopoulou, 2006), for example, proposes that this subset of
features is constituted of uninterpretable features. Uninterpretable features are, of course,
the opposite of interpretable features, which make an essential contribution to meaning
(e.g. animate, human, tense). Uninterpretable features (e.g. case or agreement), on the
other hand, are necessary only for grammatical reasons and are only relevant to the
morphosyntax.3
2 It is important to note that although these accounts all propose representational deficits of one sort or another, there are important differences among theme. These differences, however, are not crucial for the purposes of the current discussion.
3 Within the Minimalist program (e.g. Chomsky, 1995, 2001), only uninterpretable features trigger the movement of constituents (e.g. wh-movement). Therefore, uninterpretable features are crucial in all transformations within narrow syntax.
13
13
The other hypothesis is the Full Transfer/Full Access Hypothesis (Schwartz &
Sprouse, 1996), which proposes that L2 learners have full accessibility to UG.4 Under
Full Transfer/Full Access, learners are not limited to the representations they transfer
from their native language but are able to restructure their grammar based on the
available L2 input. Crucially, although Full Transfer/Full Access argues that L2 learners
can attain a native-like grammar, convergence with the target grammar is far from being
effortless or guaranteed. Therefore, if a given learner does not find evidence to re-set
her/his grammar, even though she/he may have access to UG, the learner’s grammar
might not converge with the grammar of the target language.
As a matter of illustration, White (2003, p. 64) notes that an English native
speaker learning Mandarin might have problems acquiring long-distance reflexives (e.g.
ziji “self”) because Mandarin makes use of both local (i.e. taziji) and long-distance
reflexives. White notes that this fact could obscure the phenomenon and thus not
motivate grammar restructuring. Both local and long distance antecedents can bind
Mandarin long-distance reflexives, which can be unrestrictedly far from their
antecedents. Thus, White surmises that, given that an English speaker learning Mandarin
can exclusively (and successfully) use local reflexives (as in his/her native language) and
that, moreover, it is unlikely that a problem in interpretation arises in the use of a long-
distance reflexive, this learner might never find instances in the L2 input that would
motivate restructuring.
Hawkins (2011) argues that Full Transfer/Full Access theories have a problem in
terms of accounting for L2 variability (optionality). As is commonly known, L2 learners
have been widely documented to show variability in their use of inflectional
morphology—even if they demonstrate knowledge of inflectional morphology using one
4 Full Transfer refers to the notion that the initial state includes all the abstract representations of a learner’s L1. Full Access refers to the notion of unrestricted accessibility to Universal Grammar.
14
14
measure, they might fail to use it correctly in another context. In order to account for this
variability, Prévost and White (2001) proposed the Missing Surface Inflection
Hypothesis, which proposes that faulty performance is not necessarily indicative of a lack
of functional categories or of features associated with them. Consequently, Prévost and
White argue that the source of the problems such as target-deviant use of morphology is
due to a mapping problem: A disconnect between knowledge of abstract syntactic
features and categories and the (overt) inflectional morphology (features are not always
accessible, although present).5,6
2.3.2 Current Research in Generative L2A: Moving beyond
the initial state
More recently, the generative study of L2A has focused on a much broader set of
issues (see Slabakova, Leal Méndez & Liskin-Gasparro, in review). By way of example,
we can examine the evolution of the “light switch” simile. The original notion of
“instantaneous” parametric acquisition, held currency at one time but it was challenged
early on by empirical L2 data (e.g. White, 1985). Thus, the “light switch” as a simile for
L2 acquisition was mostly abandoned by the early 1990s. Presently, the focus has shifted
from parametric acquisition to the acquisition and re-assembly of features (Lardiere,
2009).7
5 Prévost and White appeal to Halle and Marantz’s Distributed Morphology (1993), which proposes that lexical items can be inserted into a hosting node even if the features do not match exactly. The reader is referred to the original sources for details.
6 Another Generative hypothesis that attempts to account for L2 variability of outcomes is the Prosodic Transfer Hypothesis (Goad & White, 2006). This hypothesis, like Full Transfer/Full Access, proposes that all syntactic properties can be acquirable, at least in principle. Goad and White propose that non-nativelike use of inflectional morphology might be explained by (negative) L1 phonological transfer effects.
7 See Yang (2002, 2010) for an updated acquisition model that incorporates parameters and notions such as frequency of use.
15
15
Additionally, advances in linguistic theory, the use of empirical methodologies,
and the increasing reliance on resources such as L1 and L2 corpora have also resulted in
the postulation of new acquisition models and hypotheses (e.g. Lardiere, 2009;
Slabakova, 2008; Yang, 2002, 2010). Although these new models maintain a focus on the
genesis of L2 interlanguage, many of these proposals have capitalized on notions like
linguistic variation and frequency of occurrence—a far cry from the notions informing
early generative L2A studies.
In the last decade, the study of generative L2A has seen a proliferation of studies
focusing on the linguistic interfaces (e.g. syntax–discourse, syntax–phonology, syntax–
semantics).8 Within the study of interfaces, one of the most influential hypotheses has
been the Interface Hypothesis (Sorace & Filiaci, 2006; Sorace & Serratrice, 2009; Sorace,
2011), which proposes a principled distinction between external and internal interfaces.
Internal interfaces (e.g. syntax-semantics, syntax-morphology, phonology-morphology
and syntax-morphology) involve only interactions between language modules, while
external interfaces involve both linguistic and non-linguistic cognitive systems (e.g.
syntax-discourse). According to Sorace and colleagues, external interfaces are argued to
pose a formidable challenge for L2 ultimate native-like attainment. The Interface
Hypothesis has generated an extensive amount of research that, crucially, has engaged a
diversity of learner populations (e.g. child L2 acquisition, adult L2 acquisition, L1
attrition) and has enriched the field because it has encouraged fruitful cross-discipline
collaborations (Rothman & Slabakova, 2011).
8 In a modular view of language (e.g. Fodor, 1983; Jackendoff, 2002; Reinhart, 2006), interfaces embody contact points where modules of language interact with each other or with other cognitive domains.
16
16
2.3.3. Generative L2A and Language Processing Research
A recent development within the generative L2A paradigm is that explanations
for native vs. learner ultimate-attainment differences have focused more prominently on
processing-based differences. This new research focus has produced a spate of
demonstrations that the moment-to-moment processing of an unfolding linguistic signal
can be different for native speakers than it is for L2 learners. Furthermore, it has been
argued that different processing signatures can provide insight into the underlying
systems of native speakers and L2 learners. For instance, the current version of the
Interface Hypothesis (Sorace, 2011) places much of the burden of explanation for
native/L2 learner asymmetries on issues related to language processing.
Drawing on the concept of inhibitory control (Green, 1986), and in light of
research showing that all systems in a bilingual mind are simultaneously activated even
when one is contextually unnecessary (see Bialystok, 2009 for review), Sorace suggests
that there is an inherent cognitive cost to bilingualism because the inhibition of a (first)
language is assumed to be burdensome on the processor. In a series of psycholinguistic
experiments involving diverse methodologies, including eye-tracking and brain-imaging
methodologies, Marian and colleagues (Spivey & Marian, 1999; Marian, Spivey &
Hirsch, 2003) present evidence that dual language activation in bilinguals causes
languages to compete. Even when linguistic input is presented in one language (be it the
L1 or L2), both L1 and L2 show evidence of activation. Increased bilingual experience
(i.e. experience managing the interference caused by the presence of an additional
language), however, has been argued to improve bilinguals’ ability to inhibit one
language (but see Schwartz & Kroll, 2006 for discussion).
Aside from the Interface Hypothesis, another hypothesis that has placed
processing in a central position is the Shallow Structure Hypothesis (SSH, Clahsen &
Felser 2006a, b), which will be presented in the following section.
17
17
2.3.4 The Shallow Structure Hypothesis (SSH)
The SSH proposes a dual-route processing model with two distinct parsing paths.
The first is a full syntactic path (Full Parsing), which is based on a wholly specified
syntactic description. The second is a “shallow” path, which is based on lexical, thematic,
pragmatic, and contextual information (to the exclusion of syntactic representations).
Shallow representations are argued to display both deficient hierarchical structure and a
lack of “configurationally-determined elements,” (e.g. movement traces), such that this
shallow parsing route is necessarily confined to local domains (e.g. morpho-syntactic
agreement of adjacent dependencies, word segmentation).
The main claim of the SSH is that L2 learners process language differently from
native speakers because L2 learners are restricted to the shallow route. The SSH
explicitly proposes that the fundamental differences between native and non-native
processing are due not to more general processing deficits, per se (contra Sorace, 2011),
but rather to differences in the underlying syntactic representations available to L2
learners. These grammatical representations, unlike those of native speakers, are
hypothesized to be anomalous (i.e. incomplete or deficient). Based on results from
experiments that fail to detect relationships between more general processing abilities,
such as working memory, and patterns of on-line processing for L2ers (Juffs, 2004; Juffs
& Harrigton, 2011), the SSH proposes that differences in L2 processing can be isolated to
L2 reliance on shallow grammatical structures and heuristics.
In support of the SSH, Clahsen and Felser (2006a) cite evidence from several
studies on relative clause attachment (Felser, et al., 2003; Papadopoulou & Clahsen,
2003). These studies involve the resolution of relative clause attachment ambiguities such
as the one found the example below.
(2-1) Someone saw the servant of the actress who was on the balcony.
The resolution of these syntactic ambiguities is subject to cross-linguistic
variation (i.e. language vary in terms of preferences for either high- or low-attachment).
18
18
English speakers, for example, have a preference for attaching to the lower NP (NP2
disambiguation; i.e. the servant was on the balcony), whereas Spanish and German
speakers prefer high attachment (NP1 disambiguation; the actress was on the balcony). In
these studies, native speakers were found to attach the relative clauses as predicted in the
linguistic literature (e.g. Carreiras & Clifton, 1999; Cuetos & Mitchell, 1988; Fernández,
2003), but L2 learners, even those of high proficiency, failed to attach these relative
clauses in the same manner as the natives (Dussias, 2003; Felser et al., 2003;
Papadopoulou & Clahsen, 2003). Given that previous research studies have shown that
L2 learners can attach in a native-like manner when they are provided with semantic cues
(e.g. Juffs, 1998), Clahsen and Felser (2006a) argue that the deficits in L2 processing are
localized to deficient phrase-structure-based strategies caused by incomplete L2
grammatical knowledge.
The SSH has inspired a large number of studies, a number of which have also
found counterevidence to the predicted outcomes (e.g. Hopp 2006; Lim & Christianson,
2013; Pliatsikas & Marinis, 2013). An interesting study to consider in this context is
Marinis, Roberts, Felser and Clahsen (2005), because it has been the center of some
controversy. Clahsen and Felser (2006) single out this study as providing evidence
supporting the hypothesis that L2ers process shallowly, while the natives speaker controls
do not. On the other hand, Dekydstpotter, Schwartz, and Sprouse (2006) have criticized
Clahsen and Felser’s interpretations, highlighting what they call a crucial methodological
point (p. 33):
“The mere fact that there is an observed non-isomorphy between natives and L2ers does not entail that the natives and the L2ers deploy fundamentally different mechanisms.”
Marinis et al. (2005) used a self-paced reading task to examine the processing of
sentences involving long-distance wh-dependencies in English L2ers from different L1
backgrounds. Experimental sentences had either an intermediate gap, as in (2-2a), or did
19
19
not (2-2b). The slash (/) indicates a new (self-paced) computer screen, while the bold
numbers in parentheses indicate the segment number.
(2-2) a. (1) The nurse who / (2) the doctor argued / (3) __ that the rude patient / (4) had angered /(5) ____ is refusing to work late. (Intermediate gap)
b. (1) The nurse who / (2) the doctor’s argument / (3) about the rude patient / (4) had angered / (5) ____is refusing to work late. (No intermediate gap)
Participants were asked to read each segment quickly, pressing a button to bring
up the next screen (segment). At the end of the last segment, participants were presented
with a question (e.g. Who angered the nurse?), whereupon they chose one of two options
on the screen.
The results indicated that the learners had no trouble with comprehension, but
their reaction times (RTs) did not pattern exactly with those of the natives speakers.
Specifically, while the natives had longer RTs on segment 3 when processing sentences
like (2-2a) than when processing sentences like (2-2b), the L2 learners displayed no
difference between the two sentences in terms of RTs. Crucially, both groups showed
longer RTs for (2-2a, intermediate gap) than for (2-2b, no intermediate gap) on segment
5. Whereas Marinis et al. concluded that L2 learners did not rely on intermediate traces
and thus processed shallowly, Dekydstpotter et al. (2006) argued that this (segment-to-
segment) comparison was an invalid one. In their reanalysis of Marinis and colleagues’
data, Dekydspotter et al. found that two sub-groups of L2 learners (Japanese-English and
German-English) indeed had longer RTs in the right direction—albeit a segment
afterwards, (i.e. segment 4). Dekydspotter and colleagues thus argued that these results
suggest a similar pattern, albeit a delayed effect.
Dekydspotter and colleagues raise very interesting issues regarding the kinds of
inferences we can draw from data based on time-sensitive measurements. Namely, if, as
they argue, non-isomorphy between native speakers and L2 learners cannot constitute
20
20
grounds for appeal to different mechanisms (e.g. “shallow parsing” in L2), a logical
consequence is that, in turn, isomorphy cannot not necessarily mean that the same
mechanisms used in L2 are those deployed in L1 processing (see Leal Méndez, Farmer,
& Slabakova, in press). Additionally, one important variable that is missing from this
discussion concerns the proficiency of the L2 learners. These issues will be addressed in
more detail in section 2.7.
2.4 Usage-Based Approaches to SLA
2.4.1. Main tenets
The term “Usage-based theories” of language covers a wide spectrum of theories
and approaches that share a number of basic principles (e.g. Bates & MacWhinney, 1987;
Beckner, Ellis, Blythe, Holland, Bybee, Ke, Christiansen, Larsen-Freeman, &
Schoenemann, 2010; Ellis, 2002, 2008, Larsen-Freeman, 2009; MacWhinney, 2005). In
this regard, Bybee (2008) notes that what usage-based approaches all have in common is
that these theories acknowledge the effects of usage on the cognitive representations of
language, connecting frequency of use to structural phenomena. One of the most
important findings from studies made under this paradigm is that language users
remember fine-grained details about their experience with language. This fine-grained
detail has shown to be multi-leveled and to include information about occurrences and
co-occurrences (e.g. transitional probabilities). Thus, every individual language
experience, which is imbued with detail at multiple levels, is gradually gathered and
organized (e.g. categorized) by speakers. When faced with additional evidence (e.g.
differences in use or in frequency of use), these experiences can change over time. This
change can take place both in individuals (development) and in groups of individuals
(language change).
In accordance with this worldview, Usage-based researchers such as Bybee
(2008) consider grammar to be “the cognitive organization of one’s experience with
21
21
language” (p. 216), where abstract representations result from language users’ cumulative
experience with language. Thus, while Generative approaches to L2A view grammar as a
finite set of abstract universal rules (Universal Grammar) that generate those (and only
those) sentences that are acceptable in a given language, Usage-based approaches view
grammar as a network constructed from individual instances of language use/experience
which are categorized into basic units. These basic units are form-meaning pairings that
range from individual lexical items (e.g. words, idioms) all the way up to discourse-level
units. Crucially, Usage-based theories place a significant importance on the
communicative nature of language and highlight the notion that these units are learned
through (interpersonal) communication (Barlow & Kemmer, 2000).
Another important tenet that Usage-based theories share is the rejection of
Chomsky’s (1957) “competence” vs. “performance” dichotomy.9 Broadly, “competence”
is viewed as a given speaker’s abstract grammatical knowledge, while “performance”
refers to the “real-time application of linguistic knowledge/forms” (Rothman, 2007). In
Usage-based theories, both competence and performance are two facets that emerge from
the same (dynamic) system, where competence is argued to be the “integrated sum of
prior usage,” while performance constitutes its “dynamic contextualized activation”
(Ellis, 2008). Within Usage-based approaches, variable performance cannot be
discounted as “noise” that is generated by random performance factors (Ellis & Larsen-
Freeman, 2006). Therefore, while UG-approaches (at least initially) have traditionally
relied on data gathered from tasks that were presented devoid of social context, such as
Grammaticality or Acceptability Judgment Tasks, Usage-based researchers have used a
9 Rothman (2007) argues that this distinction is a necessary one in L2 research in order to explain “how real-time production can undermine the L2 system, which is independently demonstrated via complementary empirical testing (p. 610).”
22
22
wider variety of methodologies and data, including more naturalistic data from variable
spontaneous production in different social contexts.10
Moreover, Usage-based approaches view the notions of language use and
language acquisition as intimately linked. Language acquisition (both L1 and L2
acquisition) is argued to involve the sampling and categorization of a given speaker’s
specific language experiences (e.g. language use in varying social contexts; Beckner et
al., 2010). Categorization and use are simultaneous so that no language user’s experience
can ever be considered to be at an “end-state” or what is commonly referred to as the
stage of “ultimate attainment.” This is because, within this approach, any language user
has the potential to change at any point, with additional experience (e.g. Bybee 2008;
Ellis, 2002; Larsen-Freeman, 2010). In fact, many recent Usage-based (connectivist)
psycholinguistic models suggest that processing and learning are deeply intertwined
(Chang et al., 2006; Christiansen & Chater, 2001; Elman, 1997: Elman et al., 1996;
Seidenberg & McClelland, 1989; McMurray, Horst & Samuelson, 2012), and that
mechanisms such as prediction (anticipatory processing) or inhibition are fundamental
not only for real-time processing, but to the shaping of the representations that are
simultaneously learned through use.
2.5 Statistical Learning and the Richness of the Input
One of the most crucial discoveries in psycholinguistic research in the last decade
or so represents the widespread recognition that humans are unconsciously attuned to
statistical (distributional) probabilities (Saffran, Aslin & Newport, 1996; Saffran,
Johnson, Aslin & Newport, 1999; Werker & Gerken, 2002). Probabilistic information is
10 Note, however, that Generative researchers have used methodologies that extend beyond gathering judgments. In this respect, White (2003) notes that a common misconception regarding the UG-framework (e.g. Carroll & Meisel, 1990; Ellis, 1990) is that grammaticality judgments have a “privileged” status “such that they provide a direct reflection of linguistic competence (p. 17).” White argues that grammaticality judgments constitute “performance” data, just like any data elicited by other behavioral methods.
23
23
present at multiple levels (e.g. linguistic domains), including syntactic dependencies,
word-co-occurrence information, and even the phonotactic probabilities inherent to a
word (Dikker, Rabagliati, Farmer, & Pylkkänen, 2010). Consequently, it appears to be
the case that the input signal, which is rich in probabilistic and distributional information,
can be used to guide the learner through the acquisition process.
The implicit ability to be attuned to and learn about distributional regularities in
the input is usually referred to as statistical learning. Statistical learning has been found to
be associated with multiple aspects of the acquisition process, at least in L1 (e.g. speech
segmentation and auditory processes, Saffran, Aslin & Newport, 1996; Saffran, Johnson,
Aslin & Newport, 1999, phonetic category formation, Werker & Gerken, 2002, higher-
level language comprehension, Misyak & Christiansen, 2012, and L2 literacy (Frost et
al., 2013).
In light of the results from these research studies, it should be easy to conceive
how the inclusion of the findings concerning statistical learning abilities can be
seamlessly woven in a Usage-based language approach. Usage-based approaches view
language learning as a statistical learning problem, where the language user’s task is to
learn the probability distribution of a given form-meaning mapping (Chater & Manning,
2006). Within SLA research, many researchers have used probabilistic and frequency-
based theories that focus on the effects of frequency and repetition in both
comprehension and production (e.g. Bybee & Hopper, 2001, Ellis, 2002; Hoey, 2005;
Jurafsky, 2002). UG-based approaches, on the other hand, have not typically found a
place for statistical learning abilities within their theoretical frameworks (see Hawkins,
2011 for discussion). By and large, UG-based approaches have historically been less
focused on the probabilistic and distributional properties of the linguistic signal.
Although the notion of input in general has always been important for generative
linguists, this interest has not necessarily extended to notions like frequency of
occurrence.
24
24
Under most Generative approaches, the general assumption is that a language
learner has a certain amount of prior knowledge—often argued to be innate knowledge—
that helps to constrain the hypothesis space of said language acquirer. In simple terms,
this means that a child acquiring a language does not consider every single linguistic
possibility available. Under this view, there are a number of language universals
(linguistic principles that are shared by all human languages), which are “given,” so to
speak. In addition to principles, there are also language-specific parameters. Unlike
principles, language-specific parameters require exposure to the input in order to be set
(i.e. learned) according to the specifications of a given language. Phillips summarizes this
state of affairs thus (Phillips, 2012, italics added):
The UG hypothesis is a provocative name for the idea that there are constraints on the hypotheses that learners entertain. It is not a claim that learners do not need to learn. Experience is obviously necessary. However, we still know very little about how much experience is needed for successful learning, and how much variation in experience is needed to yield different learning outcomes.
Some of the issues that Phillips raises are of vital importance—especially the need
for researchers to address the link between amount of exposure and acquisition (i.e. how
much experience is necessary for the successful acquisition/learning of a given
structure?). However, within the UG framework, input-related questions such as these
were, at least initially, somewhat sidestepped by the early-held notion in generative
linguistics that the primary function of input was comparatively more limited: to provide
key data (or even a single datum according to Eubank, 1991) that would “trigger” the
activation of a given pre-existing linguistic category (Hawkins, 2006). Hyams (1991), for
example, explicitly rejects the notion that frequency in the input is of any great
importance:
“[T]he quantity of input required for acquisition says absolutely nothing about the function of the input. The notion of a trigger, in language acquisition as in biology, refers to the relationship
25
25
between the input experience and the behavioral output, not to the quantity of the input in absolute terms. There are, moreover, good empirical reasons for doubting the claim that [L1] acquisition actually requires a vast amount of input” (p.77).
Although the notion of “trigger” has been defined by several researchers, most
notably Fodor (1998), Gibon & Wexler (1994) and Lightfoot (1991), certain aspects of
these definitions have been disputed (e.g. whether triggers could be ambiguous or not). In
general a “trigger” is considered to be the subset of linguistic data that activates a given
property of UG. However, there is no overall consensus on whether a trigger should be
encountered at a certain point or development, or how many times a trigger should appear
before prompting activation. That is not to say that there hadn’t been calls, even early on,
to change this state of affairs. In the early 1990s, Meisel (1991) recognized the
importance of defining the notion of “trigger” and “triggering data”:
[M]uch more information is needed concerning the triggering data for [L2 grammar-] restructuring purposes. As it stands, it is not clear what these data might look like and why they are only available at a specific point of the acquisitional process, although they undoubtedly had been present in the input before.” (p. 238)
Hilles (1991) and Hyams (1991) view the notion of “trigger” as parallel to how it
is viewed in biological systems. Hilles (1991) compares it to the triggering data needed in
vision: for development to proceed properly, the trigger must be encountered in a timely
fashion. Thus, the trigger, which is external to UG, is subject to timing constraints: if a
trigger does not appear within a certain time frame (e.g. developmental stage), linguistic
development will suffer.
Although, as noted in the previous discussion, the importance of input frequency
has not figured centrally in Generative approaches, it is not the case that these approaches
(especially in the last decade) ignore or disregard the importance of frequency altogether
(see Slabakova, 2003, and Slabakova et al., in press, for discussion). Slabakova et al. (in
review) note that generative acquisitions heed O’Grady, Lee and Kwak’s (2009: 72)
recommendation that:
26
26
“[i]n considering the role of input frequency in language acquisition (first or second), it is vital to bear in mind a key point: what counts is not how many times learners hear a particular form—it is how many times they encounter mappings between a form and its meaning.”
One example of a UG-based study focusing on the effect of input on child L1
acquisition is Miller and Schmitt (2010). Miller and Schmitt examine the acquisition of
plural morphology in two different dialects of Spanish: Chilean and Mexican Spanish.
These dialects have different phonological realizations of plural: While Mexican Spanish
consistently realizes plural as [s], the realization of plural in Chilean Spanish is highly
variable and can range from overt realization, to aspiration, or even complete lenition in
some cases. Under Miller and Schmitt’s count, Chilean morphology is present in about
50% of all instances of plural. This asymmetry indicates that Chilean children are
exposed to unreliable input, where the form-meaning mapping (i.e. plural = [s]) is present
only half the time. On the other hand, Mexican children are exposed to extremely reliable
plural morphology markings. Results show that 5-year-old, working-class Chilean
children were significantly less accurate in their comprehension of plural morphology
when compared to their Mexican counterparts, matched for socio-economic status (32%
vs. 65% accurate). Importantly, older Chilean children eventually converge with the adult
grammar. In view of these results, the authors fittingly conclude that higher ambiguity
(unreliability) in the input is crucial in terms of the time course of the acquisition process.
A study by Meisel, Elsig, and Bonnesen (2011) is another example of a study that
explores how differences in frequency of exposure to a certain construction impact its
correct comprehension and usage. Meisel and et al. (2011) study the acquisition of
different French interrogatives that vary in frequency of occurrence depending on
different registers and social contexts. Specifically, interrogatives involving subject–verb
inversion are extremely rare in pre-scholar child-directed speech (based on corpora by
Suppes, Smith & Leveillé (1973) and the DuFDE; German and French — Simultaneous
27
27
Acquisition of two First Languages). Example (2-2) illustrates subject-verb inversion
interrogatives. (2-2) Où travalle ta mère?
where she-works your mother “Where does your mother work?”
This state of affairs changes quickly once these children attend school, where they
get significantly more exposure to standard French and to interrogatives such as (2-2).
Based on the fact that children are exposed to this particular construction later in life,
Meisel et al. (2011) predict that these L1 acquirers may show evidence of the optionality
effects that are so famously pervasive in L2 grammars.11 The results of a grammaticality
judgment task testing knowledge of interrogatives in twenty adult native French speakers
show that, as predicted by the authors, subject-verb inversion interrogatives (which are a
feature of standard but, crucially, not colloquial French) show evidence of optionality. On
the contrary, interrogatives that are present in both colloquial and standard French appear
to be significantly more homogenous and stable. In light of the evidence showing that
frequency of occurrence can shape not only the time course but also the ultimate outcome
of speakers, Meisel and colleagues conclude that even native grammars can contain
“islands of non-native knowledge” (Meisel et al., 2011: 369).
Recently, there have been a number of proposals to accommodate for frequency
effects and statistical learning mechanisms within the Generative framework itself.
Chomsky (2005:6, bold not in the original), for example, proposes that the three
determining factors of language growth (i.e. acquisition) are:
1. A uniform Genetic endowment (UG), which imposes constraints on how the
environment is interpreted as linguistic experience,
11 Sorace (2003) defines optionality as “the existence of two or more variants of a given construction that are identical in meaning and have a clear correspondence in form” (p. 135).
28
28
2. Exposure (i.e. linguistic experience), which is unique to each individual and leads
to “fairly narrow” linguistic variation, and
3. Principles that are outside of the language faculty, including principles of data
analysis that are not exclusive to language and principles of efficient
computation.
The third principle could reasonably be taken to be something akin to pattern-
learning or statistical learning. Although presently Chomsky does not relinquish the idea
of UG playing a role in the acquisition processes, he paves the way for the possibility that
the theory will evolve to the point that UG may be, eventually, subsumed under general
cognitive principles (i.e. not exclusive to language). In Chomsky’s view, these advances
in syntactic theory have helped to overcome:
“a difficult conceptual barrier to shifting the burden of explanation from the first factor, the genetic endowment, to the third factor, language-independent principles of data processing, structural architecture, and computational efficiency. (p. 10)”
Chomsky notes that early on in the UG framework, linguistics principles were
expressed in terms of “grammatical constructions” which could only be viewed as
linguistic in nature (e.g. islands, filters, constraints on operations). In other words, these
notions were considered to be “linguistic” because they could only be found in language,
and thus had no other biological analogues. However, in Principles and Parameters, the
“basic computational ingredients” became increasingly abstract (i.e. recursion, locality,
economy, minimal search). These computational ingredients, as envisioned under
Principles and Parameters, are much more amenable to be viewed as operations that are
available both within and beyond language itself. Although Chomsky stops short of
subsuming the first principle (i.e. the genetic endowment) into the third (i.e. the
29
29
principles of data analysis), the shift from Chomsky’s original position (e.g. Chomsky,
1957) is quite striking.12
Another attempt to account for frequency effects within the UG framework is
Yang’s Variational Learning Model (Yang, 2002, 2004, 2010). Yang’s model (based on
insights by Roeper, 1999) builds on the concept of parameter setting while, crucially, also
appealing to general learning mechanisms and input frequency effects. In this model,
general learning mechanisms keep track of the frequency of parametric evidence
(“unambiguous” evidence, which is a factor in estimating a parameter’s “signature”).
Yang predicts that a parameter that is substantiated by ample evidence (i.e. it is more
frequent) will be acquired earlier than a parameter that is substantiated by sparse
evidence (i.e. it is less frequent). Parameter “signatures” are calculated as the proportion
of the unambiguous evidence out of the overall evidence in the input that is relevant for a
given parameter.
Hawkins (2011) makes specific reference to statistical learning abilities in order
to account for the famous optionality of morphological variants that commonly
characterizes L2 learners. As mentioned earlier (see section 2.2.1.1.), Hawkins subscribes
to a view in which adult L2 learners are subject to representational deficits.
Representational-deficit accounts argue that adult L2 learners, unlike L1 acquirers, do not
have access to UG mechanisms, especially as it pertains to a subset of features, namely
uninterpretable features, such as gender agreement.13 Thus, within Representational-
deficit accounts, the burden of explanation is not on accounting for lack of convergence
(since this could be explained by lack of access to UG) but rather on accounting for: a)
positive development (towards more target-like grammar) and b) native-like
12 For example, Boeckx (2008) argues that Chomsky’s (2005) viewpoint may, eventually, lead to disappearance of parameters. Other researchers (e.g. de Villiers & Roeper, 2011: 3) see that extension as a “highly problematic” logical possibility.
13 As seen in section 2.2.1.1, agreement is argued to involve uninterpretable features.
30
30
performance. Hawkins suggests that the development observed in L2 grammars (i.e.
positive proficiency effects) can be explained through statistical learning abilities.
Hawkins proposes that L2 learners “resort to using statistical information […] to
compensate for the lack of UG-derived categories (p. 72).” Hawkins submits evidence
from a study by Bloom, Polišenská, and Weerman (2008) examining the acquisition of
gender agreement by child and adult L2 learners of Dutch.14
The Dutch gender agreement paradigm is especially apt for testing Hawkins’s
proposal because, in terms of noun-adjective gender concord, it instantiates a relatively
rare situation where a morphologically underspecified form (usually masculine, or
“common” in the case of Dutch) is actually less frequent than the fully specified form
(usually feminine or “neuter,” the latter being the case in Dutch). Hawkin’s reasoning is
as follows: given that L2 learners are not guided by UG principles, they will use the
statistical properties of the input in order to guide their acquisition process.
Although it is very often the case that the statistical properties of the input will
yield an accurate organization of the input, Dutch noun-adjective agreement is an
exception in this regard. If L2 learners are indeed guided by the distributional properties
of noun-adjective agreement, they should be at a disadvantage and, thus, L2 learners
would overgeneralize the specified form (neuter in the case of Dutch) rather than the
common form. Bloom et al.’s (2008) study shows exactly that pattern. While child
acquirers (both L1 and L2 acquirers) overgeneralized the more specified form (which is
also the least frequent), adult L2 learners display the opposite pattern, overgeneralizing
the least specified form (which is also more frequent than its counterpart). Results also
14 In Bloom et al.’s study, the participants included a control group that was comprised by a child L1 learner group and two experimental L2 groups (child and adult). The child L2 group was comprised of speakers of two different L1s: Moroccan Arabic and Berber. Each of these groups was further divided into two subgroups based on proficiency levels. The adult L2 group also included Moroccan Arabic and Berber speakers and was also divided into two proficiency levels.
31
31
show that these overgeneralizations are not insurmountable and can be alleviated with
increased language experience (i.e. proficiency).
Hawkins (2011) is not the only researcher within Generative linguistics who has
tried to weave statistical learning into the existing theory. Within the sentence-processing
literature, there have been several “hybrid” proposals that have attempted to include
aspects of both Generative linguistics and statistical models. One example of a proposal
that attempts to take into consideration the findings from statistical models as well as
(generative) syntactic principles is Townsend and Bever’s (2001) Late Assignment of
Syntax Theory (LAST).15 LAST proposes that the analysis (i.e. processing) of a
sentence, which proceeds in steps, involves both statistical strategies and a grammar. In
LAST, the initial analysis of a sentence is semantic. This semantic analysis is argued to
be sensitive to statistical information and strategies. The initial analysis gives rise to a
preliminary hypothesis that pairs form and meaning and may include procedures such as
theta-role assignment.16 The initial semantic analysis, which is probabilistic, limits the
possibilities entertained by the syntactic parse.
The second step, termed “pseudosyntax,” uses the initial analysis and matches it
onto an array of syntactic structures to form a more detailed “candidate real syntax,”
which will be subsequently checked against the input in the next step. During the second
step, the conceptual structure adds likely information on functional relations (e.g. in
English, agents are usually nominative, patients of verbs and prepositions are usually
accusative, verbs agree with their subjects, etc.). The last step is syntax generation and
matching. In this step, the parser produces: i) a detailed syntactic description, and ii) a
derivational analysis with a complete surface phrase structure (e.g. agreement markers,
15 Townsend and Bever (2001) explicitly propose that their model is “entirely consistent with the minimalist syntactic model” (p. 178).
16 In Generative syntax, theta roles are formal terms that describe the semantic relationship that an argument plays in regard to a given predicate. (Carnie, 2006).
32
32
word order, etc.). If the generated result matches the input, the computation is finalized.
If the result is a mismatch then the computation can restart at particular points (strategies
for recovery). Although LAST has only been proposed for L1 processing, it bears a
striking resemblance to Clahsen and Felser’s proposal for L2 processing albeit one that
Clahsen and Felser mostly reject.
In summary, the Generative framework has evolved considerably in the last
twenty years. A number of these changes have been driven by attempts to incorporate
findings from other language disciplines. Other developments have resulted from
advances in linguistic theory. A third type of development, such as the Feature
Reassembly hypothesis, was theory-internal and in response to masses of findings, right?
We can’t say that generative L2 linguists have only been kicked into change from
outside. Nonetheless, although this progressive evolution has the potential to bring the
field out of its relative isolation, one of the major hurdles to surmount at the moment is
the lack of a coherent narrative that spells out the “new” positions, as it were. Therefore,
the efforts conducted by generative researchers such as Hawkins (2011) and Phillips
(2006) have been instrumental in this regard. As mentioned earlier, one of the shifts
within the generative framework has been the use of time-sensitive methodologies that
can shed light on the moment-by-moment processing of language. In the following
section, I will review a series of studies that focus on L2 processing—both from
Generative and Usage-based approaches.
2.6 Research on L2 processing
2.6.1. Two Approaches to L2 Processing
As mentioned in Chapter 1, it is generally agreed upon that the outcomes of first
versus second language acquisition are remarkably different. On the whole, in early SLA
research, the most common data collecting instruments consisted of off-line tasks (i.e. not
time-sensitive) such as grammaticality/acceptability judgments, magnitude estimation,
33
33
elicited imitation, and a variety of interpretation tasks (e.g. truth-value judgment tasks,
picture-matching tasks). Using these off-line methodologies, there has been a wealth of
empirical evidence supporting the notion of a “lack of convergence” between the
grammar of L1 and L2 speakers, differences that span multiple linguistic domains
including phonology (e.g. Fathman & Precup, 1983; Munro, Flege & MacKay, 1996),
morpho-syntax (e.g. Hyltenstam & Abrahamsson, 2003; Lardiere, 1998a,b), and syntax-
pragmatics (Belleti, Bennati, & Sorace, 2007; Sorace, 2003; Sorace & Filiaci, 2006).
Accounts such as the Fundamental Difference Hypothesis (Bley-Vroman, 1990) propose
that this lack of convergence is due to core differences in the grammatical representations
of L1 and L2 speakers.
More recently, explanations for native vs. learner ultimate-attainment differences
have focused more prominently on processing-based differences, producing a series of
demonstrations that the real-time processing of an unfolding linguistic signal differs
between groups of L1 and L2 speakers (e.g. Clahsen & Felser, 2006; Dussias, 2001;
Frenk-Mestre, 2002). Although these processing-centric accounts vary from one to
another across a wide variety of dimensions, Leal Méndez, Farmer & Slabakova (2013)
posit that they can be roughly categorized into two groups, which differ on the degree to
which they propose that the processes driving on-line L2 comprehension are the same as,
rather than different from, those responsible for L1 comprehension.
“Different-system” accounts of L2 processing propose that there is a qualitative
difference between L1 and L2 processing because the latter does not rely on the same
processing systems that are responsible for L1 comprehension. An example of one such
account is the Shallow Structure Hypothesis (SSH; Clahsen & Felser, 2006a, b), which
was discussed in section 2.3.4. As noted earlier, the SSH proposes that L2 learners
process language “shallowly” because of incomplete L2 grammatical representations.
Another instantiation of a different-system account of L2 processing is Ullman’s (2001)
Declarative-Procedural Model.
34
34
The Declarative-Procedural Model (Ullman, 2001) builds on an independent
distinction between procedural and declarative memory, both of which are considered to
be mediated by different neural mechanisms. Declarative memory stores lexical
information about words, sounds, meanings, and other memorized knowledge and
associations. This information is argued to be accessible to various systems rather than
being encapsulated (pace Fodor, 1983), and has been argued to be involved in the
learning/use of world facts and knowledge. Procedural memory, on the other hand, is
related to implicit (unconscious) learning and use of grammar (syntax, semantics,
morphology, phonology). Furthermore, procedural structures have been argued to be
involved in the acquisition articulation of motor and general cognitive skills, especially
those involving sequences (Graybiel, 1995; Willingham, 1998).
The Declarative Procedural Model proposes that in L2 grammatical processing,
there is, at least initially, little involvement of procedural memory, and thus L2 learners
are expected to be dependent upon declarative memory when processing their L2. This
shift in dependence is argued to cause L2 learners to compute the L2 grammar using
memorized lexical items stored in declarative memory. Increased productivity in L2
learners is argued to be due to processes such as associative memorization, generalization
of patterns, and further pattern abstraction. Furthermore, the model posits that at least
some rules can be learned in declarative memory—with the exception of “grammatical”
rules, which supposedly depend on procedural memory and computations. More recently,
however, Ullman (2004) has suggested that L2 learners could, with enough practice,
improve at learning grammatical rules in procedural memory.
On the other hand, Leal Méndez et al. (2013) use the term “same-system”
approaches to describe those approaches that maintain that the systems responsible for
processing an L2 are the same as those responsible for L1 processing (e.g. Dekydtspotter,
Schwartz & Sprouse, 2006). As a whole, same-system approaches typically posit that L1
vs. L2 processing differences do not obtain because of differences in the systems
35
35
themselves but may come about because of factors such as divergences in exposure or
types of exposure (e.g. Leal Méndez et al, 2013; Pliatsikas & Marinis, 2012), or lexical
access routines and differences in the “timing” of computational moments (Dekydtspotter
et al., 2006; Hopp 2013).
Lim and Christianson (2013) also propose that L1 and L2 processing are
qualitatively equivalent, arguing that both L1 and L2 speakers show evidence of “good
enough” processing (GEP). Several studies supporting the GEP have shown evidence that
even native speakers misinterpret linguistic input when the syntactic structure (passive,
for example) is in conflict with world knowledge/plausibility of events (e.g. The man bit
the dog.). To account for these misinterpretations, GEP approaches propose that native
speakers can, as L2 learners have often been found to do, over-rely on lexical-semantic
information under some circumstances (e.g. task demands; Swets, Desmet, Clifton &
Ferreira, 2008). Using self-paced-reading and translation tasks, Lim and Christianson
(2013) found that the patterns of processing of native- and L2 speakers were both
modulated by 1) the goals of the task (reading for comprehension vs. for translation) and
2) proficiency in the target language. Therefore, Lim and Christianson argue that the
differences between the native and learner groups are quantitative (e.g. speed of
processing) rather than qualitative.
As can be observed in the research described above, it is typically the case that the
evidence that is submitted as supporting a “same-system” approach consists of the
observation of similar processing patterns between natives and L2 learners (e.g. Hopp,
2006; Lim & Christianson, 2013). However, as mentioned earlier (see also Leal Méndez
et al., 2013 for discussion), it is unclear whether homogeneous processing patterns—even
with data collected with more sensitive methodologies such as Event-Related Potentials;
ERP—necessarily entail identical processing mechanisms.
Perhaps unsurprisingly, some of these studies have found evidence that the
processing patterns of learners as a group are more variable than those displayed by
36
36
native speaker groups (e.g. Trenkic, 2007). To explain this variability in L2 processing,
some researchers have argued that L2 learners may process language differently because
of differences in their individual learning histories (e.g. age of exposure, L1, individual
abilities) and language experiences (e.g. McDonald, 2006). Indeed, it appears to be the
case that L2 language experience and proficiency are good predictors of increasingly
more native-like processing.
2.7 The Construct of Proficiency in L2 Research and the
Relationship between Proficiency and Exposure
An important number of recent L2 processing studies have found a positive
relationship between L2 proficiency and increasing similarity to native-like processing
patterns (e.g. Hopp, 2006; Hoover & Dwivedi, 1998; Jackson, 2008; Lim & Christianson,
2013). However, this relationship cannot be straightforwardly interpreted in part because
“proficiency” is not a unitary concept (e.g. Cummins, 1980; Hughes, 2007; Hulstijn,
2011). Furthermore, as I will discuss momentarily, there are substantial differences
among studies in the way that proficiency is gauged (i.e. standardized tests, in-house
assessment, c-tests, self-rating, measure of language exposure, language metrics, teacher
judgments, etc.). An additional problem is that even comparable standardized proficiency
tests have not been designed to provide comparable measurements among different
languages.17 In other words, what it means to be “intermediate” in German using a given
measure might not necessarily correspond to “intermediate” in Arabic using a different
measure.18
17 For an attempt at providing such a diagnostic measure for European Languages, see Alderson (2008). In the context of foreign languages in the U.S., see the American Council for the Teaching of Foreign Languages (ACTFL) guidelines.
18 Skehan’s Trade-off Hypothesis (2009) emphasizes another important complication when measuring learner performance. Based on the notion that human attentional capacity and working memory are limited (Baddeley, 2007), Skehan argues that when learners direct their attention to one dimension of performance (be it complexity, accuracy, or fluency), they must, necessarily, reduce their attention for other areas. Thus, Skehan suggests that in addition to measuring these
37
37
Previous attempts at gauging language development have used different measures
(i.e. L2 developmental indices) such as standardized proficiency tests, comparative
longitudinal data (Hakuta, 1976), or measurements such as T-Units (Wolfe-Quintero,
Inagaki, & Jim, 1998).19,20 Nevertheless, to date, there is no consensus in the literature
on what constitutes the relevant dimensions of proficiency or how these can be measured
(i.e. how to measure L2 development) (e.g. Benati, 2009; Verhoeven & De Jong, 1992).
Thomas (1994, 2006) conducted two synthetic analyses focusing on the types of
assessments used to measure proficiency in empirical L2 studies. Comparing two
different time frames (1988-1992 and 2000-2004), Thomas found that L2 studies had
progressed from using blanket assessments to measuring proficiency in “finer detail,”
using statistical measures to estimate the validity of their groupings. In addition, more
recent studies have included several measures gauging different dimensions of
proficiency. However, she also noted a trend in the second cohort of studies to downplay
the assessment of proficiency. The authors of some of these studies claim that proficiency
is a dynamic, multi-dimensional, and (crucially), context-dependent phenomenon.
three dimensions of performance, these should be accompanied (i.e. supplemented) by measures of lexical knowledge.
19 A T-unit is a minimal terminal unit that consists of an independent clause, accompanied by any associated dependent (e.g. subordinated) clauses. T-units have been used to measure different aspects of proficiency. In the L2 literature focusing on fluency, T-units are measured for length, accuracy (e.g. number of error-free T-units), and clause length. In terms of accuracy, the most commonly used measures are the number of error-free T-units, proportion of overall error-free T-units, and errors per T-unit; for syntactic complexity, the most commonly used measures are clauses per T-unit, number of dependent clauses per total clauses, or number of dependent clauses per T-unit (Larsen-Freeman, 2009).
20 The types of proficiency tests that are usually employed in SLA testing are “general” proficiency tests, such as the TOEFL (Test of English as a Foreign Language, Educational Testing Service) or the Cambridge Certificate in English Examination. In general, these tests are designed to gauge learner’s “ability in a language regardless of any training they may have had in the language. The content of a proficiency test, therefore, is not based on the content or the objectives of language courses […] [but rather] on a specification of what candidates have to be able to do in the language in order to be considered proficient.” (Hughes, 1989).
38
38
Therefore, some authors argue that the (static) measurement of proficiency “seems so
reductionist as to lack all usefulness” (Thomas, 2006:293).
A less contentious notion is that L2 proficiency is likely to emerge gradually—it
is improbable that L2 learners will fully acquire any given grammatical construction at
some discrete point during the learning process. On the contrary, there is evidence that
shows evidence of the progressive accumulation of knowledge over the course of
exposure to the input (DeKeyser, 2005; Ellis, 2002; MacWhinney, 2004). The
relationship between L2 proficiency and exposure, however, has not been a
straightforward one in L2 research. Although prima facie it seems uncontroversial to
assume that proficiency and exposure are positively related, previous research has cast
doubt on this premise. In the realm of L2 speaking, for example, Moyer (1999) reviewed
a series of studies investigating the relationship between exposure (measured by length of
residence) and foreign accents and concluded that the effects of language exposure on L2
learning were “inconsequential” (p. 83).
Flege and Liu (2001), on the other hand, argue that the lack of exposure effects in
these studies is the direct consequence of methodological confounds (e.g. sampling
errors, factors that confounded length of residence with age of acquisition). In their
empirical investigation, Flege and Liu (2001) found that when they statistically controlled
these factors, adult L2 learners’ performance improved over time only when they were
exposed to “the kind of input-rich environment that is typical for children who immigrate
to North America” (p. 547). These results mirror the results obtained from a similar task
with Korean-English bilinguals investigating the development of English morpho-syntax
(Flege, Yeni-Komishian, & Liu, 1999). What these results underscore is that the type of
exposure may play a crucial role in explaining the relationship (or lack thereof) between
L2 proficiency and exposure.
39
39
2.7.1 The Relationship between Proficiency and L2
Processing
As noted in the previous section, “proficiency” in L2 studies is often
operationalized as the numerical outcome of a standardized proficiency test examining
knowledge of specific domains such as grammar and vocabulary. Under this
operationalization, overall, there seems to be a positive relationship between L2
proficiency and length of exposure to the L2—a relationship that seems be a good
predictor of more native-like L2 processing. As noted by Flege and colleagues (Flege &
Liu, 2001; Flege, Yeni-Komishian & Liu, 1999), this relationship has been typically
framed around the age-of-exposure and critical-period debates, where the general finding
is that younger learners (who are also those learners that have been exposed to the
language the longest) display more native-like language outcomes. Although the
investigation of L2 proficiency controlling for age-of-exposure effects has been less
examined, several studies have found that longer exposure translates to higher L2
proficiency/learning outcomes (e.g. Flege & Liu, 2001; Flege, Yeni-Komishian & Liu,
1999; García Mayo & García Lecumberri, 2003).
40
Study Linguistic domain: structure
Method Languages, N, proficiency
Findings Proficiency test
Dussias & Piñar (2010)
Syntax: Long distance wh dependencies (intermediate gaps), manipulating plausibility
Self-paced Reading
L2 English (L1 Chinese; 36) Min TOEFL 550/213 L1 English (34, monolingual)
Proficiency was not a factor but there were effects of reading span NSs: subject extraction sentences posed more difficulties (three word critical region) than object extractions (plausibility information used) High-span L2ers: same pattern Low-span L2ers: effect is driven by implausibility (reverse pattern than NSs)
No independent test—self ratings, years of study, length of stay
Fernández (1998)
Syntax: relative clause attachment manipulating length
Self-paced Reading
? Bilinguals were sensitive to native-like preference according to proficiency. (Self-reported) proficiency was the best prediction for native-like attachment--even more so than age of acquisition.
No independent test—self ratings in multiple skills aimed at determining “dominance”
Hahne (2001) Semantic violations (selectional restrictions of the verb), phrase structure violations (e.g. participle followed by preposition)
ERP L2 German (L1 Russian; 16); L1 German. (16)
Proficiency not a variable, but proficiency effects are suggested (compared to other studies: p. 264) Semantically incorrect L1: N400, pronounced, peaked 200ms earlier than the L2 group L2: N400, less pronounced, later Syntactically incorrect L1: ELAN followed by P600 L2: No ELAN, P600 peaking about 150ms later
No independent test—Self-reported in four skills
Table 2-1 L2 processing studies using indirect measures of proficiency (no independent proficiency measure).
41
Study Linguistic domain: structure
Method Languages, N, proficiency
Findings Proficiency test
Jiang (2004) Morphosyntax: Number subject-verb agreement
Self-paced reading
L2 English (L1 Chinese; 30) L1 English (30) All L2 learners scored 550 or higher in the TOEFL (M=608)
No proficiency effects were found NSs were sensitive to morpho-syntactic “idiosyncracies” (verb subcategorization, grammatical number, etc.) L2ers were not sensitive to number violations as reflected in RTs, although were able to do so offline
No independent test—entrance TOEFL scores were used and all were described as “advanced”
Rossi et al. (2006)
Morphosyntax: word category violations, agreement violations and category + agreement violations
ERP; GJT accuracy
L2 German (L1 It high; 16 and low prof; 19) L2 Italian (high; 16 and low prof; 18) (no control groups)
Proficiency effects found High proficiency learners of German and Italian showed the same ERP components as in the NSs literature: LAN followed by P600 Low proficiency learners failed to show LAN (tendency but did not reach significance)
No independent prof. measure: language-learning history, self-rating, and performance on translation tests
Tanner et al. (2013)
Inflection: Verbal agreement
ERP L2 German (L1 English; 33; 20 novice, 13 3rd year) L1 German (13)
Proficiency effects found NSs: grammatical violations elicited P600 L2 (3rd year): same L1 (1st year): group: bi-phasic response (N4---P600 but correlations showed either N400 or P600)
No independent proficiency test—groups were based on exposure (year)
Tokowicz & MacWhinney (2005)
Morphosyntax (inflection): auxiliary marking (L1|L2 match), number agreement (mismatch), gender agreement (no match)
ERP; GJT accuracy
L2 Spanish (L1 English; 20) No control group, experimental group tested in L1 and L2 (different constructions)
No proficiency effects but scores were proxies (self-report and length of study (exposure). Length of exposure appeared to be was fairly homogeneous Learners: Marginally significant effects for grammaticality, but there was an interaction of language similarity and grammaticality: marginal sensitivity for match, no sensitivity to mismatch, and significant sensitivity for no match
No independent test—self ratings and years of study were used as proxies
Table 2-2 (continued) L2 processing studies using indirect measures of proficiency (no independent proficiency measure).
42
42
In L2 processing research, there are a significant number of studies that either
explicitly address the existence (or non-existence) of proficiency effects or else draw
indirect conclusions about such effects even in the absence of an independent (e.g.
standardized) proficiency test. Table 2-1 presents a list of representative studies in this
vein. This list, although hardly exhaustive, focuses on the acquisition of either syntactic
or morpho-syntactic phenomena. Thus, evidence from single word activation or studies
focused on the L2 lexicon were not included (e.g. Chee, Tanm & Thiel, 1999; Klein,
Milner, Zatorre, Meyer & Evans, 1995). Additionally, for ease of comparison, I focused
on studies using specific time-sensitive methodologies: Self-paced reading, eye tracking,
ERP, and fMRI (e.g. priming studies were not included).21
In light of the findings that greater exposure to the L2 is positively related to
higher L2 learning/proficiency, it would be unreasonable to bypass the findings of those
studies that didn’t measure proficiency using an independent measure. However, it does
make the task of comparing these effects across studies more challenging. This is
especially so because these comparisons usually involve different linguistic structures
and language combinations.
The use of self-assessment data as a proxy for L2 proficiency, however, might
appear potentially more problematic. In this regard, research findings focusing on the
reliability of self-reported proficiency have produced mixed results. Blanche and Merino
(1989), for example, carried out a review of L2 studies using self-assessment. Overall,
they note (p. 315):
21 Other important studies (e.g. Kim, Relkin, Lee, & Hirsch, 1997) were not included because these used such open-ended tasks (“internally expressive linguistic tasks”) that it is difficult to determine which linguistic domain was being tested (most likely, all). In Kim et al. (1997) using fMRI, for example, subjects were asked to imagine describing events in a given language (either L1 or L2), describing those events that happened during the morning, afternoon and night.
43
43
The emerging pattern is one of consistent overall agreement between self-assessments and ratings based on a variety of external criteria. The accuracy of most students’ self-estimates often varies depending on the linguistic skills and materials involved in the evaluations […] but these estimates are generally good or very good.
Nevertheless, there is evidence that there is substantial individual variation
in the accuracy of self-ratings related to factors such as personality, motivation,
and anxiety levels. A study by MacIntyre, Noels and Cléments (1997) found that
a regression analysis between self-assessment and proficiency level
(independently assessed through a proficiency test) showed evidence that student
anxiety played a role in self-assessment. Namely, anxious students showed a
tendency to underestimate their proficiency, while less anxious students tended to
overestimate it. AlFallay (2004) found that students with low self-esteem were the
most accurate in assessing their performance. Additionally, learners with
instrumental (extrinsic) motivation were the least accurate at self-rating their
abilities.
However, even with use of these proxy measures, several studies (e.g. Rossi et al.,
2006; Tanner et al., 2013) have found evidence of proficiency effects. Another study that
has been reported to provide evidence for proficiency effects is Hahne (2001). However,
Hahne’s conclusion stems from comparative evidence—by comparing the results of that
study with other studies using the same testing materials (e.g. Hahne & Friederici, 2001).
Two other studies (Jiang, 2004; Tokowicz & MacWhinney, 2005) failed to find these
effects but, importantly, it seems like their groups were too heterogeneous (too similar in
terms of length of exposure) to make such effects detectable.
44
Authors Linguistic domain /
structure Method Languages, N,
proficiency Findings Proficiency
test
Dallas et al. (2013)
Syntax: Long-distance wh dependencies manipulating plausibility
ERP L2 Eng (L1 Chi; 20) L1 Eng (19) L2ers took either the TOEFL or IELTS
Proficiency effects found L2 speakers as a group not sensitive to plausibility (N400 effect at the embedded verb), correlations indicate that more proficient L2 speakers, like the L1 speakers, are sensitive to plausibility variations while processing filler-gap sentences. Working memory did not play a role.
Composite score: Two vocab tests: SILS and vocab subtest of WAIS, and a lexical decision task
Dussias et al. (2014)
Inflection: encoding gender information in articles (anticipatory processing)
VWP L2 Sp (L1 Eng; 18, high and low prof. and L1 It; 15) L1 Sp (16)
Proficiency effects found NSs exhibited gender-anticipatory effects, as expected. L2 Sp-L1 It showed gender-anticipatory effects but only for feminine. Highly proficiency L2 Sp-L1 Engl showed evidence of using gender information during processing, but low proficiency L2ers did not.
Standardized test (DELE, probably modified, 50 pts; adv: over 40) Self-reported proficiency scores
Hopp (2006) Syntax: German scrambling (SO to OS orders)
Self-paced reading, speeded GJT
(L2 groups: adv & near n) L2 Ger (L1 Eng; 20); -L2 Ger (L1 Dut; 20); L1 Ger (20)
Proficiency effects found Near-native L2ers converge in processing patterns with native speakers—strength of the effect is modulated by the type of syntactic features (like in NSs)
Web-based C-test (timed)
Hopp (2013) Inflection: lexical and syntactic gender (anticipatory) processing
Elicited production, VWP
L2 Ger (L1 Eng; 20-adv. & near n); L1 Ger (20)
Proficiency effects found Target-like performance in a subgroup of learners: those who had consistent target-like overall gender assignment in the elicited production. These learners also had significantly higher proficiency levels.
Goethe Institute Test (30 items)
Table 2-2 L2 processing studies using direct measures of proficiency (independent proficiency measure were used).
45
Authors Linguistic domain / structure
Method Languages,
N, proficiency
Findings Proficiency test
Jackson (2008) Morphosyntax: subject-object temporal ambiguity in wh questions
Self-paced reading
L2 Ger (L1 Eng; 40 adv and int)
Proficiency effects were found (although the advanced did not exactly pattern after NSs). Segment 3 (immediately following manipulation) NSs: Lower comprehension and higher RTs on object-first sentences but no effect of tense Adv L2: longer RTs immediately when case markings disambiguated to a dispreferred object-first (like NSs) but greater difference in past tense vs. present perfect Int L2: did not show sensitivity to case-marking information but there was a difference (in the right direction) on segment 5 (last one)
Goethe Institute Test (30 items)
Lim & Christianson (2013)
Syntax: Subject and object relatives (manipulating plausibility)
Self-paced reading, translation task
L2 Eng (L1 Kor; 36 adv, int) L1 Eng (32)
Proficiency effects found NSs: SRCs were easier to process than ORC; speakers were influence by structural and plausibility information. Learners slowed down but at a later time: they are slower in retrieving syntactic information: reading strategies varied with task and proficiency (accuracy was modulated by proficiency)
Cloze test (Guili Dussias, Penn State, 40 points)
Pliatsikas & Marinis (2013)
Syntax: Long distance wh dependencies (intermediate gaps)
Self-paced reading
L2 Eng (L1 Greek; 26 naturalistic; 30 classroom) L1 Eng (30)
No proficiency effects per se but effects of type of exposure (classroom vs. naturalistic). **Proficiency not treated continuously (all learners where taken to be advanced) NSs: facilitation of the final gap (like NSs) when the intermediate gap was present, higher RTs at the gap in the extraction condition. L2 CE: No facilitation effects or higher RTs at the int gap but did show a spill over effect L2 NE: facilitation of the final gap (like NSs) when the intermediate gap was present but no higher RTs in extraction condition (but extraction x phrase type interaction)
Quick Placement English Test (UCLES, 2001)-20 min testing comprehension and language skills.
Table 2-2 (continued) L2 processing studies using direct measures of proficiency (independent proficiency measure were used).
46
46
Table 2.2 shows another set of representative studies that focused on L2
processing. These studies included proficiency as a variable and tested it so
independently. The majority of these used either standardized proficiency tests or
modified versions of such tests. Others included other types of measures, such as c-tests.
One study (i.e. Dallas et al., 2013) used vocabulary measures that have been shown to
correlate highly with language outcomes. As Table 2.2 shows, the presence of
proficiency effects in L2 processing is pervasive. Overall, these results fit well within
experiential models of language acquisition, although generative models of L2
acquisition and processing can account for these effects as well (Dekydspotter et al.,
2006; Hopp, 2006).
In sum, overall, a considerable amount of literature focusing on L2 processing has
found that L2 Proficiency constitutes a key predictor in more native-like processing. As
was pointed out in the introduction to this paper, one of the goals of this dissertation is to
test the prediction of the SSH. Examining the effects of L2 processing is vital in this
regard because the SSH does not predict L2 effects—in fact, the only development that is
predicted is brought about through the use of semantic and pragmatic cues, but not
syntactic knowledge. Thus, any study intending to test the SSH should include a measure
of proficiency.
2.8. The Generation of Predictions and L2 Processing
In the psycholinguistic L1 processing literature, there have been a significant
number of studies providing evidence that predictive/anticipatory processes can aid
speakers to quickly and accurately interpret an incoming linguistic signal (e.g. Altmann
& Kamide, 1999; DeLong, Urbach, & Kutas, 2005; Dikker, Rabagliati, Farmer, &
Pylkkänen, 2010; Farmer, Brown, & Tanenhaus, 2013; Fine, Jaeger, Farmer, & Qian,
2013). These anticipatory processes (i.e. predictions) seem to be multi-leveled and fine-
47
47
grained, and include many different aspects of the incoming linguistic input ranging from
grammatical properties of the signal (e.g. verb sub-categorization preferences, Arai &
Keller, 2013) to semantic properties of a word (e.g. DeLong, Urbach, & Kutas, 2005;
Kamide, Altman, & Haywood, 2003), or even the form-based perceptual properties of the
physical input (Dikker, Rabagliati, Farmer, & Pylkkänen, 2010; Farmer, Christiansen, &
Monaghan, 2006; Gow & McMurray, 2007).
Predictions aid the quick and accurate interpretation of an unfolding linguistic
signal. It has been argued that predictions might play a role in development: When
predictions are violated by a given property of the input, an error-signal is created. This
error signal has been proposed to be an element that can drive learning (Clark, 2013;
Elman, 1991; Elman et al., 1996). Language is imbued with probabilistic regularities (e.g.
long-distance dependencies), and the progressive learning of those regularities over
exposure is likely to drive a progressive ability to predict downstream linguistic units
from previously encountered information in the signal. The role of predictive processing
in L2, however, is still unclear.
To date, there exists a relatively small body of research work concerning whether
or not L2 learners actively predict upcoming linguistic material (but see Hopp, 2013;
Kaan, in press). In the present experimental study, the goal is to examine the
learning/processing of a long-distance dependency in L2 Spanish with a special focus on
whether learners develop an emerging sensitivity to the violation of expectations
(predictions) during on-line processing. In order to find evidence for predictive processes,
the data would need to demonstrate that when a prediction is violated (not completed
where expected), there would be an increase in RTs (assuming that increased RTs are
indicative of processing difficulties). In order to determine any effects of proficiency and
experience, the experimental study samples learners from broad swaths of the proficiency
spectrum and collects their language learning histories.
48
48
Investigating predictive processes in L2 can be particularly useful because it may
provide a suitable index of development in L2 processing. In other words, if there is
indication that learners of higher proficiency develop a higher sensitivity to the violation
of expectations, we could, at least in principle, use the estimated magnitude of the
expectation violation as an index of a given learner’s L2 processing development. As
mentioned earlier, there is evidence that language users can learn about regularities
inherent to the input using statistical probabilities (e.g. Gómez, 2002; Newport & Aslin,
2004). Furthermore, there is also evidence (e.g. Ellis, 2002) that L2 learners are
unconsciously attuned to statistical regularities of the input (e.g. frequency). Thus, it
would stand to reason that these abilities can be useful in an L2 learner’s development—
in other words, it can help her/him to better predict in an L2.
In fact, there is some very recent evidence suggesting that individual differences
in the ability to learn about probabilistic statistical regularities in structured input (i.e.
statistical learning ability) predict some variability in L2 learning outcomes (Frost,
Siegelman, Narkiss, & Afek, 2013; Kaufman, DeYoung, Gray, Jiménez, Brown, &
Mackintosh, 2010). In the present empirical study, I include a measure of statistical
learning abilities (Gómez, 2002) in order to determine whether differences in statistical
learning ability have a positive relationship with the development of L2 predictive
processes.
2.9. Summary
This chapter introduced the theoretical backdrop that informs this work, focusing
on both Generative and Usage-based approaches to SLA. Additionally, I have outlined
how these approaches have incorporated (albeit to different degrees) the evidence
concerning statistical learning abilities. I have also presented a selective review of L2
research studies focusing on processing with a special focus on those studies that have
actively attempted to investigate the existence of proficiency effects. As a corollary, I
49
49
also introduced the notion of “prediction,” arguing that the investigation of predictive
abilities can provide a suitable developmental index for L2 processing.
50
CHAPTER 3
THE SYNTAX OF CLITIC LEFT DISLOCATION IN SPANISH
3.1 General Introduction to the Chapter
The purpose of this chapter is to present a brief overview of the basic properties
and general structure of the long-distance dependency under study: Clitic Left
Dislocation (CLLD) in Spanish. Although some cross-linguistic comparisons with other
Romance languages will be made where necessary, the focus will be mostly on Spanish
CLLD. CLLD has been extensively discussed, both in the formal literature (e.g. Arregui,
2003; Bianchi & Frascarelli, 2010; Cinque, 1990; Casielles-Suárez, 2003, 2004;
Kempchinsky, 2013; López, 2009; Suñer, 2006; Zubizarreta, 1998, to name a few) and in
empirical generative studies (e.g. Donaldson, 2011; Ivanov, 2012; Leal Méndez,
Rothman & Slabakova, 2014, in press; Pablos, 2006, 2008; Valenzuela, 2006). For this
reason, only those properties and issues that are most relevant to the empirical study will
be described in any detail.
Additionally, this chapter will also present the discourse properties of CLLD,
introducing key Information Structure terms such as “topic” and “focus.”22These notions
form the information structure of a sentence (Birner & Ward, 1998; Gundel, 1988;
Halliday, 1967; Lambrecht, 1994; Ward, 1988). Topic is intimately related to CLLD,
which is generally considered to be an instantiation of topicalization in Spanish (and in
Romance in general). For the purposes of comparison, this chapter will also review the
properties of two other left dislocations in Spanish, Hanging Topic Left Dislocation
22 A commonly used definition of information structure, albeit not necessarily a universally accepted one (see, for example, López, 2009 for an alternative), is the following: “[Information structure is the] component of sentence grammar in which propositions as conceptual representations of states of affairs are paired with lexicogrammatical structures in accordance with the mental states of interlocutors who use and interpret these structures as units of information in given discourse contents” (Lambrecht, 1994:5).
51
51
(HTLD) and Fronted Focus, in view of the fact that these structures share many surface
characteristics with CLLD.
Furthermore, in order to delineate the learning tasks for the group of L2 learners
of Spanish, whose L1 is English (see Chapter 4 for details on the participant groups), this
chapter will briefly outline the general syntactic and discourse properties of English-style
topicalization. Although a detailed syntactic analysis for this structure is outside of the
scope of the present work, I will draw some comparisons between the English and
Spanish topicalization in order to better to establish learnability issues for the L2 learners.
The last part of the chapter presents two competing syntactic analyses that aim to account
for the syntactic and pragmatic properties of CLLD as well as for other Romance
dislocations (López, 2009; Rizzi, 1997, 2001).
3.2 Properties of CLLD
3.2.1 General structure and availability of CLLD in
Romance Languages
In Romance languages, CLLD is a frequently employed marking of topicalization,
where a phrase that has been previously mentioned, or is otherwise sufficiently salient in
the discourse (i.e. a topic), is re-introduced into the discourse by virtue of being set apart
(i.e. left-dislocated). 23,24 CLLD is an example of a natural long-distance syntactic
23A note on translation: When translating CLLD into English, the closest analogue (in syntactic and informational structure terms) would be English-style Topicalization (e.g. Al presidente de la asociación realmente lo admirarías, “The president of the association, you would really admire”). However, CLLD is also often translated into English using the passive voice (Al ladrón lo agarraron con las manos en la masa, “The thief was caught red handed”). These two translations are quite valid and highlight different aspects of its relationship with Spanish CLLD. In fact, these different translations might be somewhat systematically linked to discourse notions thus might be illuminating in regards to different types of CLLD (see Bianchi & Frascarelli, 2010 for a detailed classification). Kempchinsky (p.c.) notes that it seems to be the case that when CLLD is translated into English, Identity-CLLD represents topic continuation—these would be the cases where CLLD is translated into English as passive. Other cases are representative of what Arregi (2003) calls ‘contrastive topicalization’, and would translate into English as topicalization. I agree that this observation accurately captures these preferences. However, throughout this chapter, however, I forgo these two possibilities because, at least in the case of
52
52
dependency, where a topicalized phrase is connected to the comment via a clitic.
Syntactically, the general structure of CLLD is as follows (from Kempchinsky, 2008):
(3.1) General (underlying) structure: [LP … XPi … [TP … Cli-V … ]]
where:
LP = ‘left periphery’
XP = the CLLD dislocate
English-style topicalization, some of these translations seem to be somewhat marked (see Slabakova, 2013 for a comparison of the acceptability of Spanish CLLD by Spanish native speakers and English-style topicalization by English native speakers). Although I am aware that the translation I use throughout the chapter may not necessarily reflect the same information structure status as in the original, I believe that it captures the unmarked status of CLLD in both spoken and written Spanish. It is my hope that the information status will be abundantly clear from both the transliteration and the preceding discussion. Thus, when I use data from other authors I use their transliteration but use the unmarked English order in the translation for most examples.
24Although the term CLLD was first coined in the early eighties (van Haaften, Smits, & Vat, 1983) Cinque’s work on A-bar dependencies in the early nineties (Cinque, 1990) constituted the most comprehensive early attempt to provide a syntactic analysis of these constructions.
53
53
As is evident from the co-indexation in (3.1), the clitic and the left-dislocated
phrase must agree. In Spanish, the dislocated phrase and the clitic agree in person,
number, and occasionally gender, as is evident in (3.2).
(3.2) CONTEXT: Are you not going to the party with Elena?
[A Elena]i sí *(lai)invitaron, pero a mí no. to Elena (fem.sg) indeed cl.ACC.3f.sg invited.PAST.3rd.PL but to me NEG “Well, Elena was indeed invited, but I wasn’t.”
Once the topicalized phrase (a Elena: feminine, singular) has been introduced, an
agreeing clitic (la: feminine, singular) must subsequently appear in order for the sentence
to be grammatical, as indicated by the asterisk in (3.2).25 Romance clitics appear before
finite verbs such that with these verbs, speakers would expect the clitic to appear pre-
verbally.
CLLD has been extensively documented and is instantiated in virtually all of the
Romance languages. Before presenting cross-linguistic CLLD examples, I would like to
address an issue regarding notation. Namely, there are some inconsistencies in terms of
how CLLD is notated by different authors. These inconsistencies reflect differences in
opinion regarding whether a comma (marking a boundary realization) should follow the
dislocated element or not. While some authors note that CLLD is not necessarily (but
rather optionally) followed by a pause (e.g. Kempchinsky, 2013), others find the pause to
be necessary.
In one of the very few empirical investigations on the prosodic aspects of CLLD,
Feldhausen (2014) examined the production of a homogeneous group of four native
speakers of Peninsular Spanish reading scripted (embedded and non-embedded) CLLD
25There is a construction in Spanish (Fronted Focus) that involves the dislocation of a phrase that is not doubled by an agreeing clitic. This construction, which must be accompanied by a specific intonation and has different interpretative properties, will be described in detail in section 3.2.4.2.
54
54
sentences. Feldhausen found that, in this very homogeneous group, the data showed an
obligatory right boundary. Nevertheless, given the lack of consensus in this matter, I use
the notation as presented by the original sources throughout this chapter. In my own
examples, I forgo the use of a comma, based on my own intuitions. Examples from
Spanish, Italian, Catalan, European Portuguese, French, and Romanian can be found in
(3.3a-f) below.
(3.3) a. [A Juan]i loi vi en el parque hoy.
to Juan CL.ACC.M.SG I-saw in the park today “I saw Juan in the park today.”
Spanish
b. Giannii, io loi odio. Gianni I Cl.ACC.M.SG I-hate “I hate Gianni.”
Italian (Cecchetto, 1999)
c. [Les taules]i lesi vaig portar al matí. the tables Cl.ACC PAST.SG bring in-the morning “I brought the tables in the morning.”
Catalan (López, 2003)
d. [Este livro]i, só oi encontrei na FNAC this book only Cl.ACC.3.SG found in-the FNAC “This book, I only found it at FNAC.”
European Port. (Kato & Raposo, 2007)
e. [Les malotrus]i, on ne lesi invite pas. the louts one NEG them.ACC invites not “We don’t invite louts.”
French (De Cat, 2007)
f. (Pe) Ionili-am întîlnit anul trecut. (on) John him-I met year last ‘I met John last year.’
Romanian (Dobrovie-Sorin, 1990)
An interesting exception appears to be Brazilian Portuguese. The clitic paradigm
of spoken Brazilian Portuguese (when compared to European Portuguese) is deficient in
that it lacks 3rd person clitics. In Brazilian Portuguese, nominative forms are used for both
direct and indirect objects. This characteristic stands in contrast to European Portuguese,
which instantiates object clitics for 1st, 2nd, and 3rd persons (Kato, 1993; Kato & Raposo,
55
55
2007). The differences between European and Brazilian Portuguese are evident upon
comparing (3.3d) with (3.4) below (from Kato & Raposo, 2007:206). In these examples,
the nominative form is in bold letters.
(3.4) [Este livro]i, só encontreiele na FNAC
this book only I-found it in-the FNAC “This book, I only found it at FNAC.”
(Kato & Raposo, 2007)
Based on these differences, Kato and Raposo (2007) argue that CLLD is a
phenomenon that is only instantiated in European, not Brazilian, Portuguese.
3.2.2 Structural Properties of Spanish CLLD
3.2.2.1 Types of phrases that can be (clitic) left dislocated
As illustrated in the examples (3.3a-f), Romance CLLD allows for an accusative
noun phrase to be left dislocated.26 In Spanish, as in many other Romance languages, a
dative NPs can also be left dislocated, as exemplified in (3.527).
(3.5) CONTEXT: Did you tell Elena what happened at the party?
26I use “noun phrase” here for ease of exposition, and it is therefore not an indication of any theoretical underpinning. The most accurate term would be Determiner Phrase (DP), but nothing hinges on this distinction for the purposes of the present discussion.
27 An additional requirement for Spanish CLLD is related to specificity. When left-dislocating an element, it must be doubled with a clitic only if it is specific. If the element is non-specific, it cannot be doubled by a clitic (Contreras 1976; Arregi 2003) The examples below are from Valenzuela, 2005).
(a) Este libro, *(lo) he leído muchas veces. this book, *(CL) I-have read many times “I have read this book many times.’
(b) Revistas, (*las) leo a menudo. Magazines, (*CL) I-read often ‘I often read magazines.’
56
56
No. [A Elena]i no lei dije nada. no to Elena (fem.SG)NEG cl.DAT.3.SG tell.PAST.1st.sg nothing “I didn’t tell Elena anything.”
The Spanish clitic paradigm has distinct forms for 3rd person accusative and
dative clitics, while 1st and 2nd person accusative and dative forms are virtually
homophonous (see ft. 28) . Table 3-1 illustrates the Spanish clitic paradigm of accusative
and dative clitics. Note that the accusative clitic in (3.2) agrees in gender (feminine),
person (3rd person) and number (singular), while the dative clitic in (3.5) agrees in person
(3rd) and number (singular) only.
Accusative clitics Dative Clitics
Person 1st 2nd 3rd 1st 2nd 3rd fem. masc.
Singular me te la lo me te le
Plural nos os/los28 las los nos os/les les
Gender agreement no yes no no
Table 3-1 The Spanish clitic paradigm for dative and accusative clitics
In addition to dislocating noun phrases, Spanish CLLD allows for the dislocation
of a variety of phrases. These include prepositional phrases (PPs), adjective phrases,
quantifier phrases, bare nominals, or even whole clauses (Arregi, 2003; Casiellez-Suárez,
28 Morphologically, the accusative second person plural clitic is “os,” which is mostly used in Peninsular Spanish varieties. However, in most Latin American countries, the third person accusative plural is used instead (“los”). This dialectal difference does not affect the central CLLD tests (rating task and self-paced reading task). However, in view of the fact that the native Spanish speaker participants were recruited in Mexico, the clitic test included “los” instead of “os.”
57
57
2003; Pablos, 2006).29 The examples provided below (3.6a-d) illustrate CLLD with
predicative adjectives, quantifier phrases, whole clauses, propositional phrases, and bare
nouns.
(3.6) a.
Listo no lo parece ser. clever NEG Cl.neut he-seems be “He doesn’t seem to be clever.”
Predicative adjective (Casielles-Suárez, 2003)
b. Algunos libros, Juan los leyó ayer. some books Juan Cl.ACC.3m.pl read yesterday “Juan read some books yesterday.”
QP (Arregi, 2003)
c. Que fumas lo sabemos todos. that you-smoke CL.neut we-know all “We all know that you smoke.”
Clause Casielles-Suárez, 2003)
d. Sobre la mesa no puse nada. on the table NEG I-put anything “I didn’t put anything on the table.”
Prepositional Phrase
e. Manzanas, dicen que no ha cogido. apples they-say that NEG has gotten “They say that they didn’t get apples.”
Bare noun (Pablos, 2006)
Spanish, unlike other Romance languages such as Catalan, French, and Italian,
lacks a full paradigm of clitics (e.g. partitive and locative resumptive clitics do not exist
in Modern Spanish, which only instantiates accusative and dative structural case).
29Arregi (2003) notes that there are some restrictions on QPs in Spanish CLLD. Contrast the grammaticality of (3.6b) with the ungrammaticality of the example below.
*Algo, Juan lo leyó ayer. something Juan Cl.ACC.3M.SG read yesterday “Juan read something yesterday.' The example above shows that while definite descriptions and indefinites can be dislocated and clitic-doubled, some indefinites cannot. Arregi argues that this data is evidence that CLLD is interpreted as contrastive and, moreover, evidence that the clitic is interpreted as a variable that ranges over individuals.
58
58
Consequently, in (3.6d), where a prepositional phrase has been left dislocated, we can
observe that there is no (overt) locative clitic that can agree with the dislocated phrase
(sobre la mesa, “on the table”). Similarly, in (3.6e), there is no partitive clitic that can
agree with the dislocated bare noun (manzanas, “apples”). In fact, many researchers have
proposed that these structures are instantiations of CLLD that differ only in that they are
doubled by a phonetically null clitic (Casielles-Suárez, 2003; López, 2009; Zagona,
2002). Therefore, although these dislocations are generally argued to constitute examples
of CLLD and not of other types of dislocation, the present experimental study (see
Chapter 4) only included unambiguous cases where a clitic (either accusative or dative)
was overt.
3.2.2.2 The semantic relationship of the dislocated element
and the clitic
CLLD can display a variety of semantic relationships between the dislocated
constituent and its antecedent. This relationship can be one of semantic identity, set-
subset, set-set member, or part/whole. Examples (slightly modified from López, 2009) of
these relationships are found below.
(3.7) a. CONTEXT: What did you do with the furniture?
Las mesas las traje en la mañana… the tables CL.ACC.3F.PL I-brought in the morning “The tables, I brought in the morning…”
Set-Subset
b. CONTEXT: What did you do with the tables? Las mesas las traje en la mañana…
Identity
c. CONTEXT: What did you do with the tables? La mesa blanca la puse adentro… the table white CL.ACC.3F.SG put.PAST.1SG inside “I put the white one inside…”
Set-membership
59
59
d. CONTEXT: What shall we do with the table? It is too big! Mira, las patas, las doblas así.... Look the legs CL.ACC.3F.PL fold.PRES.2sg this-way. “Look, you can fold the legs like this…”
Part/whole
As the examples above demonstrate, all of these four types of semantic
relationships between the dislocated element and its discourse antecedent are felicitous.
Although all of these options are grammatical, there exist no corpus studies that can give
us an indication of whether these types of semantic relationships are in a preferential
hierarchy (e.g. one type of semantic relationship is preferred to another) or if one is more
frequent than another in real discourse. A previous investigation (Leal Méndez, Rothman
& Slabakova, 2014) suggests that there might be some (arguably subtle) distinctions
among these different semantic relationships.
Leal Méndez and colleagues (2014) tested CLLD judgments on four groups of
Spanish speakers: Spanish monolinguals (tested in their country of residence, Mexico),
Spanish-English bilinguals with more than seven years residence in an English-speaking
country, and two groups of Heritage Spanish speakers (divided into two groups of
intermediate and advanced proficiency). CLLD stimuli included only two types of
semantic relations: Identity (3.7b) and set-subset (3.7a). Although the overall findings
suggested that all groups were able to accurately judge the felicity of the CLLD stimuli,
there was evidence of a small performance difference among the groups. The
unacceptable sentences in both the Identity and the Set-Subset conditions were uniformly
rejected, with no differences between the participant groups. However, with regard to the
acceptable sentences, the rate of acceptance between the two types of CLLD was
different in the monolingual group. The monolingual native speakers rated Identity
CLLD higher than Set-Subset CLLD. None of the other groups differentiated between the
two acceptable types of sentence. The authors argued that this difference could reflect
differences in the frequency of occurrence between the two types. In light of this possible
60
60
evidence, the stimuli in the present experimental study only included the type of
relationship between the dislocated element and the discourse antecedent that was rated
the highest: Identity.
3.2.2.3 CLLD and a-marking
Note that the NPs in (3.2) and (3.5) are preceded by a preposition, a phenomenon
that is also referred to as “a-marking.” In Spanish, dative objects are case marked with
the preposition a (“to”). Dative objects are often doubled by a clitic, as shown in (3.8),
which is the non-dislocated version of (3.5). Unlike its CLLD analogue, (3-8) does not
require a previous context, and can be uttered out of the blue.
(3.8) Yo no lei dije nada a Elena.
no NEG cl.DAT.3.SG tell.PAST.1st.SG nothing to Elena “I didn’t tell Elena anything.”
Additionally, Spanish a-marks a subset of accusative objects in a phenomenon
that is often referred to as “differential object marking” (DOM). To date, there is
considerable debate surrounding what defines DOM and DOM objects (e.g. Aissen,
2003; Leonetti, 2004; Torrego, 1998), with notions like specificity and animacy being
frequently invoked. Notoriously complex to define, DOM employs dative case markings
on some (but not other) direct objects, where these a-marked objects are argued to be
semantically and pragmatically more salient. According to Torrego (1998), DOM
involves both structural and inherent case.
In Spanish, animate, specific accusative objects are typically a-marked, while
inanimate, specific objects are unmarked. This contrast is exemplified in the minimal pair
(3.9) and (3.10).
61
61
(3.9) Luisa trajo *(a)la niña a la escuela. Luisa brought DOM the girl to the school “Luisa brought the girl to school.”
(3.10) Luisa trajo (*a) la mochila a la escuela. Luisa brought DOM the schoolbag to the school “Luisa brought the schoolbag to school.”
Nonetheless, animacy is not the only crucial notion in Spanish DOM. Importantly,
the lack of a-marking with animate objects does not necessarily result in
ungrammaticality. In fact, for certain animate objects that lack DOM, a special meaning
may arise (i.e. non-specificity). This effect is illustrated in the minimal pairs below.
(3.11) Necesitamos a una enfermera. [+ animate, + specific] we-need DOM a nurse “We need a nurse” (a particular one).
(3.12) Necesitamos una enfermera [+ animate, - specific] “We need a nurse” (any nurse).
Apart from the notions of specificity and animacy, DOM is often associated with
certain polarity items. Negative polarity words that encode animacy (e.g. nadie “no one”)
are [-specific], and yet these are obligatorily a-marked. Note the examples below.
(3.13) No llamé a nadie. NEG called-1st.sg DOM no one “I didn’t call anyone.”
(3.14) *No llamé nadie.
In addition to the aforementioned considerations, DOM has been argued to be
sensitive to personifications, topicality, and the degree of eventiveness of the verb
(Montrul & Bowles, 2009; Torrego, 1998).
62
62
With regard to information structure, it has been noted (Laca, 1987; Leonetti,
2004; Melis, 1995) that DOM is habitually present in topicalization structures (i.e.
CLLD). This is especially interesting because CLLD requires that some dislocates be a-
marked even though the non-dislocated form may not require such marking. Pablos
(2008) has noted that this is related to CLLD’s required grammatical and selectional
connectivity between the dislocated phrase and the agreeing clitic30. The examples below
are slightly modified from Leonetti (2004:86).
(3.15) Ya conocía (a) muchos estudiantes. already I-met to many students “I had already met many students.”
(3.16) *(A) muchos estudiantes, ya los conocía. to many students already Cl.ACC.3sg I-met “I had already met many students.”
Leonetti (2004) argues that the ungrammaticality in (3.16) arises as a consequence
of CLLD imposing certain interpretations on the dislocated DPs—interpretations that are
typically associated with a-marking. Given that DOM is strongly linked to CLLD in
Spanish, the present experimental study only included topicalizations that were a-marked
and were both [+animate] and [+specific].
3.2.2.4 Iterative CLLD
Spanish allows for multiple CLLD elements to be left dislocated. These
dislocations can combine different types of phrases (as in 3.17), or appear in any order
(compare 3.18 and 3.19) without affecting grammaticality. This is the case with full NPs
30 The term connectivity was coined by Cinque (1990), who viewed it as a property of chains (irrespective of movement). In general, it could represent any grammatically encoded relationships between two elements, including agreement and binding.
63
63
or with bare nouns as well (compare 3.20, and 3.21). Furthermore, as illustrated in (3.22),
there appears to be no limit to the number of CLLD elements that can be preposed
(examples from Casielles-Suárez, 2003:328; 335).
(3.17) A mí listo no me lo parece. to me smart NEG CL.1sg CL.ACC.3m.sg seems “He doesn’t seem smart to me.”
(3.18) El regalo a mi madre no se lo he the gift to my mother NEG CL.DAT.3rd CL.ACC.3m.sg have dado todavía given yet “I haven’t given the gift to my mother yet.”
(3.19) A mi madre el regalo no se lo he dado todavía. “I haven’t given the gift to my mother yet.”
(3.20) A mí dinero Juan nunca me deja. to me money Juan never CL.ACC.1.SG leaves “Juan never leaves me any money.”
(3.21) Dinero a mi Juan nunca me deja. / Juan dinero a mi nunca me deja. “Juan never leaves me any money.
(3.22) Un libro a mí Juan en Navidad nunca me lo a book to me Juan in Christmas never CL.1SG CL.ACC.3.M.SG ha regalado has gifted “Juan has never given me a book for Christmas.”
As mentioned earlier, the order in which these CLLD elements appear does not
affect grammaticality. It is less clear, however, whether there exist interpretive effects
that are associated with these different orders. In terms of other restrictions for iteration,
Frascarelli and Hinterhötlz (2004) have proposed, using data from Italian CLLD, a
typology of topics that discriminates between aboutness topics, contrastive topics, and
64
64
familiarity topics. With regard to iterative CLLD in Italian, Frascarelli and Hinterhöltz
propose that only one type, namely familiarity topics, can be iterative (see also Bianchi &
Frascarelli, 2010).
3.2.2.5 CLLD in combination with other elements in the
left periphery
In Spanish, and Romance in general, CLLD can also appear in combination with
other elements in the left periphery such as Fronted Focus and wh-elements.31 When
these different left-periphery elements combine, unlike with iterative CLLD, there are
important ordering restrictions affecting grammaticality. In terms of wh-elements, it has
been observed that, in Romance languages, question operators must follow topics. This is
evident when comparing the grammaticality of (3.23; CLLD > wh operator) with the
ungrammaticality of (3.24; * wh operator > CLLD). The examples below (based on
original data in Italian) are modified from Barbosa (2001:25).
(3.23) El premio Nobel, a quién se lo dieron?32 the prize Nobel to who CL.DAT.3rd CL.ACC.3M.SG they-give.PAST “To whom did they give the Nobel prize?”
(3.24) *A quién el premio Nobel se lo dieron?
31Fronted Focus will be discussed in detail in section 3.2.4.2.
32As is customary, questions in Spanish are presented without the initial question mark (¿), in order to avoid any potential confusion with regard to the grammaticality status of the sentence or segment.
65
65
The combination of Fronted Focus and CLLD is also sensitive to ordering
restrictions that determine grammaticality. In the case of Fronted Focus, we can verify
this generalization when comparing the grammaticality of (3.25a-b; CLLD > Fronted
Focus) with the ungrammaticality of (3.25c-d; * Fronted Focus > CLLD). Examples
below are modified from Casielles-Suárez (2003:334).
(3.25) a. A sus padres MENTIRAS les cuenta siempre. to his parents lies CL.DAT.3pl tell.3sg always “S/he always tells lies to her/his parents.”
b. Mentiras A SUS PADRES les cuenta siempre.c. *A SUS PADRES mentiras les cuenta siempre.
d. *ESE DINERO a Juan no le voy a dar nunca.
Thus, the ordering of these types of elements has been noted as the generalization
that topic precedes focus as well as question operators (Campos & Zampini, 1990). When
these left periphery operations are combined, the only possible order is CLLD > {FF,
wh} or, more generally, Topic > Focus.
3.2.2.6 CLLD in embedded clauses
CLLD has been argued to be acceptable in both root and subordinate clauses (e.g.
Anagnostopoulou, 1997; Cinque, 1990; Haegeman, 2006; Valenzuela, 2006; but see
Escobar, 1997 for an argument that CLLD is a root phenomenon). The example in (3.26)
shows that multiple CLLD elements can occur in embedded environments (example from
Valenzuela, 2006:3).
(3.26) Me pregunto que, a María, el libro, quién se me I-ask that to María the book who CL.DAT.3rd lo dio CL.ACC.3M.SG gave “I wonder who gave the book to Maria.”
66
66
Haegeman’s (e.g. 2006, 2012) work has examined the distribution of CLLD in
root and subordinate environments. Haegeman has noted that Romance CLLD is not
confined to root or root-like contexts and can thus appear in subordinate argument
clauses, sentential subjects, and adverbial clauses, among others. The examples below,
slightly modified from Haegeman (2006), illustrate examples of CLLD in central
adverbials (3.27), temporal adverbial clauses (3.28), and sentential subjects (3.29).
(3.27) Si este examen no lo apruebas con un cinco, perderás if this exam NEG CL.ACC.3m.sg pass-2sgwith a five lose-FUT-2SG el curso entero. the course whole “If you don’t pass this exam with a five, you’ll fail the whole course.”
(3.28) Mientras la cena la preparaba Pedro, Miguel barría el patio while the dinner CL.ACC.3f.sg prepared Pedro Miguel swept the patio “While Pedro prepared dinner, Miguel was sweeping the patio.”
(3.29) Que este problema no lo hayan podido resolver me that this problem NEG CL.ACC.3m.sg have-3pl can solve me parece inacceptable. seems unacceptable “That they were not able to solve the problem seems unacceptable to me.”
In fact, Haegeman notes that (3.29) is less acceptable in Spanish than in other
Romance languages. However, as the results of the empirical investigation will show,
sentential subjects were as acceptable as other embedded constructions for a group of
native speakers of Spanish.
67
67
3.2.3 Discourse Properties of Spanish CLLD
3.2.3.1 CLLD and the notion of “Topic”
As mentioned earlier, CLLD is an expression of topicalization in Spanish.33
Although this much is generally uncontroversial, the definition of “Topic” is not
universally agreed upon. In fact, definitions of topic abound. Notions such as
“aboutness,” “old knowledge,” “discourse referent,” “discourse oldness,” “context-
boundness,” “shared knowledge,” or “discourse salience” are often invoked in these
definitions. This is not to say, however, that because there is no all-encompassing
definition that accounts for all topics, everything is in dispute. Casielles-Suárez (2004)
considers the following generalization as prompting minimal disagreement in the
literature:
“A topic tends to be what the sentence is about; what the speaker takes as the point of departure for the sentence; that which is not part of the focus; placed towards the beginning of the sentence; often a subject; often active or salient in the discourse; and often expressed by pronominal or unaccented lexical phrases.” (p. 22)
The expression of topic varies widely among languages, which can mark topic
lexically, phonologically, morphologically, and, as is the case in Spanish CLLD,
syntactically. This variation has been argued to be at the core of the difficulty of defining
the concept. In fact, Casielles-Suárez (2004) suggests that a unitary (i.e. cross-linguistic)
definition of topics does not exist, as the characterization of topic in one language might
be inadequate to define the same notion in another language.
33It should be noted that although CLLD is a very frequent topicalization strategy in Spanish, it is not the only one. Other structures, such as p-movement (Zubizarreta, 1998) and Clitic Right Dislocation also involve topicalization.
68
68
3.2.4 Other Dislocations in Spanish
Since the earliest work on Romance dislocations (Cinque, 1977, 1990), many
researchers have devoted considerable attention to the classification of different types of
dislocations through diagnostic tests.34 In terms of similarity and possible overlap, the
two constructions that are most frequently discussed are Hanging Topic Left Dislocations
(HTLD) and Fronted Focus (also called “Contrastive Left Dislocation” or even
“Contrastive Focus”). Although the goal of this section is not to exhaustively describe
these three structures, I will present some relevant differences between them, as they are
relevant to the empirical study. First, we will compare CLLD to HTLD, and then CLLD
to Focus Fronting.
34There is another type of dislocation that exhibits a great deal of overlap with Spanish CLLD: Verum Focus Fronting (VFF; Leonetti & Escandell-Vidal, 2009). An example of VFF can be found below:
Nada tengo que añadir. Nothing I-have to add “I have nothing to add.”
According to Leonetti and colleagues (Leonetti & Escandell-Vidal, 2009) dislocated elements that are VF-fronted are usually limited to a relatively small set: Indefinite quantifiers (e.g. nada, see X above), fronted lexical definites (Al presidente conocí, “I met the president”), quantified NPs (A pocos colegas ha invitado, “S/he has invited few colleagues.”), adverbials (Pronto te has cansado, “You have tired quickly”), adjective phrases (Sevillano soy, sí señor, “I am indeed from Seville”), and PPs (Con la iglesia hemos topado, “We have stopped at the church.”). However, given that VFF is not a part of the present empirical study, I will not discuss VFF in any detail. Nonetheless, it is interesting to note that CLLD and VFF can indeed combine (Leonetti, 2009) and, furthermore, there appear to be similar ordering restrictions as in the case of CLLD and Fronted Focus and wh-elements (Namely, CLLD must precede these three dislocations, when combined). The examples below are from Kempchinsky, 2013.
A Pedro nada le tengo que decir sobre el asunto. to Pedro nothing CL.DAT.3.sg have to say on the matter.
“I have nothing to say to Pedro in regards to this matter.”
b. *Nada a Pedro le tengo que decir sobre el asunto.
69
69
3.2.4.1 CLLD and Hanging Topic Left Dislocation (HTLD)
A minimal pair comparing CLLD and HTLD can be found in (3.33a) and (3.33b).
Examples are from López (2009: 3-4).
(3.33) a.
A María hace tiempo que no la veo. to Maria does time that NEG CL.ACC.3f.sg I-see. “I haven’t seen Maria in a long time.”
CLLD
b. María, hace tiempo que no la veo. HTLD
A cursory comparison of the pair above will reveal one important difference. In
(3.33a), the CLLD example, the dislocated NP, which is a CLLD element that is also
[+animate] and [+specific], is a-marked, while the HTLD example in (3.33b) is not. As
noted earlier, in addition to being argued to be a topic marker (Leonetti, 2004), the a-
marking has been taken to realize the grammatical and selectional connectivity between
the dislocated element and the resumptive clitic (Pablos, 2008). As is evident in (3.33b),
HTLD does not require such connectivity. Further evidence of this lack of connectivity is
illustrated in (3.34) and (3.35a-b).
(3.34) Nosotros,, nadie nos ha visto.35 We.NOM nobody CL.ACC.1.pl has seen
HTLD (Zagona, 2002)
(3.35) a.
A mi, me has dicho la verdad. to me.ACC you-have said the truth “You have told me the truth.”
CLLD
b. *Yo, me has dicho la verdad CLLD (Valenzuela, 2006)
35 The double comma notation (,,) indicates a longer pause (what Anagnostopoulou (1997:301) calls a “sharp intonational break” between the dislocated element and the rest of the sentence), which is a characteristic of HTLD but not of CLLD.
70
70
In (3.34), we can verify that HTLD does not require that case be uniform between
the dislocated element and the resumptive element (i.e. the dislocated element is
nominative whereas the clitic is accusative). However, the ungrammaticality of (3.35b)
shows that connectivity is required in CLLD (i.e. fact that the dislocated element is
nominative and clitic is accusative causes ungrammaticality).
Cinque (1990) points out an additional difference between HTLD and CLLD,
which involves the resumptive element. Although the examples of HTLD we have
encountered so far have been clitics, HTLD can also be resumed by other elements (see
Villalba, 2000, for review), including epithets (see example 3.35a-b) or strong pronouns
(see example 3.36a-b).
(3.35)a. *A María hace tiempo que no la veo to Maria does time that NEG CL.ACC.3f.sg I-see a esa sinvergüenza. to that scoundrel
CLLD López (2009)
b. María,, hace tiempo que no la veo a esa sinvergüenza.“I haven’t seen that scoundrel Maria in a long time.”
HTLD
(3.36)a. A Julia no la pude encontrar (*a ella). to Julia NEG CL.ACC.3f.sg I-could find to her
CLLD Casielles-S (2003)
b. Julia,, no la pude encontrar a ella“Julia, I couldn’t find her.”
HTLD
These examples show that doubling the dislocated phrase with a resumptive
element that is either an epithet (esa sinvergüenza “that scoundrel”) or a strong pronoun
(ella “she”) causes ungrammaticality in CLLD but not in HTLD. In the case of CLLD,
the dislocated phrase can only be doubled by a resumptive clitic.
71
71
In addition to the aforementioned differences, HTLD and CLLD have been
argued to differ in their sensitivity to island constraints (HTLD is not sensitive, while
CLLD is sensitive to strong islands; 3.37a-b),36 their availability in embedded clauses
(HTLD is not; 3.38a-b), and the type of constituent that can be dislocated (HTLD only
allows for specific indefinite or referential definite DPs, while CLLD allows for a variety
of phrases to be dislocated; 3.39a-b). Additionally, while HTLD allows a topicalizing
expression to precede it, CLLD does not (3.40a-b). Examples (3.37)-(3.40) illustrate
these differences.
(3.37)a. *A Juan me enojé porque lo desaprobaron. to Juan me angered because CL.ACC.3m.sg failed “I got angry because they failed Juan.”
CLLD: Adjunct island Saab (2010)
b. Juan,, me enojé porque lo desaprobaron. HTLD: Adjunct island
(3.38)a. Creo que a Elena la ayudará su tía. I-believe that to Elena CL.ACC.3f.sg help.FUT.3sg her aunt “I believe that Elena’s aunt will help her.”
Embedded CLLD
b. *Creo que, Elena,,la ayudará su tía. Embedded HTLD
(3.39)a. De Lucas, he oído hablar mucho. of Lucas I-have heard talk much “I’ve heard a lot about Lucas.”
CLLD: PP (Valenzuela, 2006)
b. *De Lucas,, he oído hablar mucho de él. of Lucas I-have heard talk much of he.NOM
HTLD: PP
36 López (2009) argues that CLLD is also subject to weak island effect. This issue will be discussed further in section 3.4.
72
72
(3.40)a. ?Hablando de Pepe, lo vi ayer. talking of Pepe CL.ACC.3m.sg I-saw yesterday “Speaking of Pepe, I saw him yesterday.”
CLLD
b. Hablando de Pepe,, vi a ese mentiroso ayer. talking of Pepe I-saw to that liar yesterday “Speaking of Pepe, I saw that liar yesterday.”
HTLD (Valenzuela, 2006)
In terms of their syntactic analysis, HTLD and CLLD have been argued to differ
in the way in which the structures are derived. In this regard, however, there is no overall
consensus. HTLD has been argued to be base generated in the left periphery although
other researchers have also argued that HTLD involves movement (e.g. Boeckx, 2003).
As far as CLLD is concerned, while some researchers have argued that it is generated by
movement to the left periphery (López, 2009; Pablos, 2008; Rizzi, 2004), other
researchers have proposed that CLLD is not generated by such movement
(Anagnostopoulou, 1997; Cinque, 1990; Suñer, 2006). This issue will be further
discussed in section 3.4.
3.2.4.2 CLLD and (Fronted) Focus
Another structure that bears a high degree of resemblance to CLLD is Fronted
Focus. In terms of information structure, the main difference between these two
structures is that CLLD is an expression of topicalization in Spanish, whereas Fronted
Focus, as evident from its name, is a focalization strategy (Campos & Zampini, 1990).
Vallduví & Engdahl (1996) describe focus as the part of the sentence that is “the
informative, newsy, dominant, or contrary-to-expectation part” (p. 462). López (2009)
claims that the least controversial definition of focus is one proposed by Jackendoff
(1972): Focus is that which resolves a variable left open in the discourse. In the example
below, that information which resolves the open variable opened by the wh-element is
focus—informational Focus in this case.
73
73
(3.41) What did Lena buy? [x | Lena bought x]
Lena bought beer.37 [x= the beer; “the beer” is focus]
Much like in the case of Topic, although perhaps slightly less contentiously, it has
been noted that the concept of Focus is not unitary. The literature on information
structure (e.g. Gundel, 1998; Kiss, 1998) discriminates between two types of Foci:
presentational (i.e. informational focus, the new information predicated about the topic)
and contrastive focus (information that is placed in a position of prominence for the
purposes of contrast or correction).38 Example () above is an example of Informational
focus. In this case, the focus (“beer”) resolves a variable that is opened by the wh-
question (“what”).
Fronted Focus is an example of the second type of focus (i.e. contrastive).
Contrastive foci, like presentational foci, also resolve a variable. The crucial distinction
between the two is that while presentational foci resolve a variable that was opened by
the previous discourse context (e.g. by a wh-operator), contrastive foci both open and
close a variable simultaneously (López, 2009). Thus, what Informational and Contrastive
foci have in common is that they both involve a presupposition. In Example (3.41), for
instance, what is presupposed is that Lena bought something. ([Lena bought] [x], = { x |
Lena bought x}, x = beer}). As we will see, in example 3.42 below, this presupposition is
also operant in Fronted Focus, so that what is presupposed is that They invited someone
37In English, as shown in (3.41), foci are prosodically prominent (underlining denotes prosodic prominence). Languages like Spanish tend to mark focus in different ways (e.g. using word order rather than prosody; Zubizarreta, 1998). Crucially, within a single language, there can be multiple structures that can express these notions. However, the common test for identifying foci (question/answer pairs: “Who came to the party last night?” “JOHN did”) is valid for both English and Spanish.
38It should be noted that within the functional descriptive literature (e.g. Dik, 1997), these two functions are not syncretic; corrective focus and contrastive focus are themselves distinct types (or alternatives) of focus. The terminology in the information structure literature is notorious for the lack of consistency with which these terms are used.
74
74
([They invited] [x], = { x | They invited x}, x = Juan}). The difference is that, in the later
case, this instance of Focus is corrective in nature.
Fronted Focus and CLLD display many superficial resemblances. The pairs in
(3.42a) and (3.42b) illustrate CLLD and Fronted Focus in Spanish.
(3.42) a.
CONTEXT: Do you know what happened to Juan? A Juan lo invitaron a la cena. to Juan CL.ACC.3f.sg they-invited to the dinner “Juan was invited to the dinner.”
CLLD
b. CONTEXT: Pedro was invited to the dinner.A JUAN invitaron a la cena (no a Pedro).to Juan they-invited to the dinner NEG to Pedro“It was Juan who was invited to dinner (not Pedro).”
Fronted Focus
Fronted Focus is similar to CLLD in that it also involves the left dislocation of an
element that is not necessarily restricted to a noun phrase. Crucially, however, it is not
doubled by a resumptive clitic, and it has a distinctive intonation—a “prominent pitch
accent” (Gundel, 1999: 298), as indicated by the caps in (3.42b). The example also
illustrates the fact that CLLD and Fronted Focus occur in different discourse contexts.
While in (3.42a) CLLD requires that the dislocated element be previously mentioned or
else be retrievable from the discourse context (i.e. it requires a discourse anaphor),
Fronted Focus does not (i.e. Juan was not previously mentioned). It is important to note
that Fronted Focus does not require for the dislocated element to be discourse new. In
fact, the element can be either discourse-old or discourse salient. The difference, thus is
that Fronted Focus allows for the dislocated element to be either discourse-old or
discourse-new whereas the dislocated element in CLLD can only be discourse-old (or
else discourse-salient).
75
75
3.3 CLLD “analogues”: English-style topicalization
English, like most Romance languages, also has a linguistic mechanism to
(syntactically) mark a topic: (English-style) Topicalization, which is CLLD’s
“functional” but not morpho-phonological equivalent (Slabakova, 2013). This structure
also involves a topicalized phrase that is moved to the left periphery. Example (3.43),
which appeared in the New York Times, illustrates such a property. The article covers the
trial of an embattled former editor of a series of British tabloids as the prosecutor is
interrogating her. In the immediately preceding context, the prosecutor asked the editor,
Rebecca Brooks, if she had given her employees the specific instruction ‘Do not hack
phones,’ to which she responded:
(3.43) No, that phrase I did not use. (New York Times, 3/5/2014)
As in the case of CLLD, a topicalized element (“that phrase”) moved out of the
unmarked SVO order. Unlike CLLD, however, there is no clitic to resume the dislocated
phrase. In fact, English does not allow for any other resumptive element to double the
dislocated phrase. This is illustrated in (3.44).
(3.44) *No, that phrase I did not use it.
When comparing (3.43) and (3.44), we can see that English topicalization does
not allow for the doubling of the dislocated phrase with a strong pronoun (it), which
causes ungrammaticality. Another important difference between CLLD and English-
style Topicalization is that while CLLD allows for multiple topics to be dislocated (see
example 3.45), only one topicalization is allowed in English (example 3.46).
Furthermore, English-style topicalization cannot combine with other constructions in the
left periphery (see section 3.4 for details).
76
76
(3.45) [A mí][dinero][Juan] nunca me deja. to me money Juan never CL.ACC.1.sg leaves “Juan never leaves me any money.”
Multiple CLLD (Casielles-Suárez, 2003)
(3.46) *[The whole weekend] [that stupid campaign] Mary spent on.
E-Topicalization (Emonds, 2004)
There is a rich tradition of the work investigating English-style topicalization that
has been done outside Generative linguistics. Within this tradition, Ward (1988) has
argued that movement to the left periphery in English could be an instantiation of two
different strategies. One is what Ward terms “focus preposing” while the second one is
topicalization. These two strategies are not argued to be distinct based on syntactic
differences, but rather on the basis of the discourse environment in which they appear as
well as by the intonational contours of the sentence. Upon a perusal of the illustrative
examples provided by Ward (1988), Slabakova, Rothman, and Kempchinsky (2012)
uncovered a similarity between those examples that were illustrative of “focus preposing”
and those examples that Arregi (2003) classifies as “contrastive topicalization.”
Arregi (2003) proposes that contrastive topicalization does not involve the
correction of an utterance but involves instead choosing an alternative from a set that is
contextually salient. Although in Spanish there are diagnostic tests (see section 3.2.4.2)
that help distinguish Fronted Focus from CLLD, Slabakova and colleagues note that there
are no analogous syntactic diagnostics in English to distinguish “topicalization” from
“focus preposing.” Thus, minimally, a native speaker of English learning Spanish as an
L2 could (positively) transfer the topicalization strategy. However, the transfer would not
be straightforward, as this hypothetical learner would have to realize that the English
preposing strategy could potentially map to two different structures: CLLD and Fronted
77
77
Focus. In view of the fact that English lacks a clitic system, however, a prerequisite for
the acquisition and processing of CLLD is the acquisition of the syntactic properties of
clitics. This issue will be further discussed briefly in section 3.4.1.
3.4 Syntactic analyses of CLLD
As mentioned at the outset of the chapter, a considerable amount of literature has
been published on the syntactic analysis of CLLD (e.g. Arregui, 2003; Bianchi &
Frascarelli, 2010; Cinque, 1990; Contreras, 1976; Casielles-Suárez, 2003, 2004;
Kempchinsky, 2013; López, 2009; Suñer, 2006; Zubizarreta, 1998). As far as the
syntactic analysis of CLLD is concerned, there have been several questions that have
dominated this area of inquiry. In the sections that follow, I will focus on two of the most
important questions that have been addressed.
The first question, which I will address only briefly, regards the manner in which
CLLD is generated. Overall, the two main positions are that CLLD is either: a) base-
generated (i.e. initially merged) to later establish a relationship with the clitic, or b)
generated by movement to the left periphery.39, 40 I will argue that the analyses that
assume movement can most effectively account for the data. The second question regards
the position of the dislocated element, and whether this position is exclusive to CLLD. In
this regard, I will argue that López’s analysis, which proposes a simplified left periphery
with only two heads, is preferable.
39 Researchers such as Aoun and Benmanoun (1998) opt for a different explanation, in which CLLD can be generated by movement or by base-generation but that movement is not available in some environments (i.e. island contexts).
40 Within the movement approaches, several researchers have proposed that the type of movement may involve A-bar movement or a combination of A-movement and A-bar movement. These distinctions are related to a researcher’s syntactic assumptions on the place where the clitic is initially merged.
78
78
3.4.1 Movement vs. Base-Generation approaches
Overall, the approaches that propose that CLLD is generated (or not) through
movement have used evidence from diagnostic tests for A-bar movement, which is
argued to proceed in steps. As point of reference, the prototypical illustration of A-bar
long distance movement is wh-movement. These (short) movement steps have been
argued to take place in order to satisfy locality requirements (Boeckx, 2001; Bošković,
2001). In the literature on A-bar movement, it has been well demonstrated that this type
of movement (i.e. long-distance (cyclic) movement) is subject to strong islands (i.e. wh-
elements cannot be extracted from strong islands). Example (3.47) below illustrates this
fact for Spanish (complex NP island).
(3.47) *[A quién] es sorprendente [el hecho de [que Juan ame [e]]? to whom is surprising the fact that Juan loves “It is surprising that Juan loves who?”
Cinque (1977, 1990) noted that CLLD is also sensitive to (strong) islands. An
illustration of Spanish CLLD (complex NP island) can be found in example (3.48), which
is analogous to example (3.47).
(3.48) *[A María] es sorprendente [el hecho de [que Juan la ame [e]]. to María is surprising the fact that Juan her loves “The fact that Juan loves Maria is surprising.”
Although it has been well established that CLLD is subject to strong island
effects, CLLD has been argued not to be sensitive to weak island effects (e.g. Cinque,
1990). This situation is illustrated in example (3.49). Among those researchers that have
proposed non-movement approaches, the lack of weak island effects is routinely taken as
evidence for base generation.
(3.49) A ese hombre me averguenzo de haberlo amado. to that man me shame.1.SG of have.INF-CL.3rd.ACC.SG loved “I am ashamed to have loved that man.”
79
79
López (2009) notes that although these examples have been routinely accepted
ever since Cinque’s (1990) analysis of CLLD, there is reason to examine them more
closely. Specifically, López proposes that the lack of weak-island effects is a byproduct
of using only definite object DPs as dislocates. López argues that DP objects can be
easily extracted, both from CLLD from a wh-phrase. Indeed, example (3.50) shows this
lack of effect for wh-movement. (3.50) A qué hombre te averguenzas de haber amado? to what man CL.2.SG shame.2.SG of have.INF.ACC.SG loved “What man are you ashamed to have loved?”
As mentioned earlier, CLLD in Spanish can only be doubled by accusative and
dative clitics because of the incomplete clitic paradigm of Spanish (e.g. no partitive
clitics). Catalan, however, does not have this restriction. As part of his argument that
CLLD does not show weak island effects, López provides a Catalan example of a
dislocated phrase that is not a definite DP (in this case, it is an indefinite DP). (5.51) ?? D’històries, m’ avergonyeixo d’haver-ne explicat diumenge.
of stories CL.DAT shame.1.SG of’have.INF-CL.PART told Sunday “I’m ashamed to have told stories on Sunday.”
I believe that these arguments are indeed indicative that CLLD generates though
movement. An additional argument made against the movement accounts involves
parasitic-gap licensing (Cinque, 1990). The examples below illustrate that clitics are not
able to license parasitic gaps in Spanish. (5.52) A María la despidieron sin avisarle [e]. to Maria CL.ACC.3rd.F.SG they-fired without notice-CL.DAT.3rd.SG “They fired Maria without notifying her.” (5.53) *A María la despidieron sin avisar [e].
80
80
However, Pablos (2006) notes that some clitics appear to license parasitic gaps—
specifically those that double inanimate CLLD. The examples (5.54) and (5.55) below
(examples from Pablos, 2006: 53) illustrate this fact. In view of this evidence, Pablos
argues that the parasitic-gap test is not strong evidence against movement.
(5.54) ?El libro, lo ha vendido sin leer [e]. The book, CL.ACC.3rd.M.SG has sold without reading “He has sold the book without reading it.”
(5.55) El libro, lo ha vendido sin leerlo.
Although there are additional arguments in favor and against movement-
generation in CLLD, I believe that the data put forth by López (2009) and Pablos (2006)
are strong evidence for movement in terms of the generation of CLLD.41 If this approach
is on the right path, then this would entail that those L2 learners whose first language is
English could transfer, in principle, the movement strategy.
3.4.2 The location (or landing site) of the dislocate
In structural terms, another aspect that has been frequently discussed in the CLLD
literature is the location of the dislocated element—or the landing site, if one espouses a
movement approach. In this respect, I will present two current proposals that detail the
location of the dislocated element. Although the second proposal (López, 2009)
necessarily requires that CLLD must be generated through movement, the first (Rizzi,
1997, 2004) is, in principle, compatible with both movement and base-generation
approaches.
41 For example, another argument in favor of movement involves reconstruction effects (see Zubizarreta, 1998). The evidence that Zubizarreta has advanced highlights the fact that reconstruction effects in Spanish CLLD can be localized in a position that is higher than the subject. The reader is referred to López (2009) and Zubizarreta (1998) for further discussion.
81
81
3.4.2.1 Rizzi’s (1997, 2004) cartographic approach
3.4.2.1.1 Main Tenets
Rizzi (1997) argues for the articulation of multiple functional projections within
the complementizer phrase (CP) layer. In Rizzi’s view, this proposal is a necessary
parallel to what has been proposed in terms of the articulation of verb phrase (VP) and
tense phrase (TP), the other two main layers in the structural representation of the clause.
Rizzi proposes that the expanded left periphery constitutes an intermediary (i.e. interface)
layer between the propositional content expressed by TP and the superordinate structure.
This superordinate structure, which is expressed in Rizzi’s proposal to expand the CP
field, is argued to consist of a “higher clause or, possibly, the articulation of discourse”
(p. 283).
Therefore, Rizzi proposes that the C system must accommodate for the expression
of two different types of information. The first type of information faces inward (towards
the embedded TP under it), while the second one faces outward (toward discourse or the
matrix clause, in the case of embedded clauses). The first type of information
(propositional layer) includes agreement rules between C and T and the expression of
Finiteness, while the second type (superordinate layer) includes, among other things, the
specification of (illocutionary) Force (Chomsky, 1995). The articulated structure
proposed by Rizzi is found in example (3.56), where ForceP specifies clause type, Topic
Phrase (TopP) hosts topics, Focus Phrase (FocP) hosts foci (such as Fronted Focus and
wh-phrases), and ForceP marks finiteness. Thus, under Rizzi’s proposal, CLLD is
equivalent to English-style topicalization, because they both structures occupy the
specifier position in TopP. Importantly, although Rizzi’s articulated left periphery is in
principle compatible with movement generation, his proposal remains agnostic on the
issue of whether movement is necessary for the extended left periphery.
82
82
(3.56)
In order to account for a known asymmetry between TopP and FocP, namely, the
fact that numerous languages allow for multiple topics whereas only one focus per
sentence is sanctioned, Rizzi proposes that TopP is recursive (marked with an asterisk in
(3.56)), while FocP is not. The example below (Rizzi, 1997:295) illustrates this
asymmetry with data from Italian.
(3.57) Credo que a Gianni, QUESTO, domani, gli doveremo dire. I-believe that to Gianni this tomorrow to-him we-should say “I believe that tomorrow we should tell this to Gianni.”
Rizzi proposes that the Topic-Focus system is available on an “as-needed” basis
(i.e. the periphery only expands when a certain element needs to be accommodated),
sanctioned by “activation.” This operation entails the presence of a feature (e.g. topic or
focus) under a Spec-Head relationship, which is essentially present to satisfy a structural
principle. Rizzi argues that these features ([+Topic], [+Focus]) can be phonetically null,
as in Italian or Spanish, or phonetically realized, as in Gunghe (Aboh, 1995), cited in
Rizzi, 1997). In the case of the former languages, Rizzi’s analysis assumes that a
83
83
topicalized element will be in Spec, TopP, while the null anaphoric operator will be in the
(immediately lower) specifier.
3.4.2.1.2. Issues with Cartographic Approaches
Although this seminal analysis has been immensely influential, Rizzi’s
cartographic approach has been subject to much criticism (Casielles-Suárez, 2004;
Edmonds, 2004; Kempchinsky, 2013; López, 2009), especially in the view of the tenets
of Minimalism. One such criticism is that this analysis lends itself to a proliferation of
layers far beyond those noted in (3.56) above. Kempchinsky (2013) notes that Bernicà
and Poletto (2004), for example, carve a space within Rizzi’s cartography for categories
such as scene-setting adverbials (e.g. duman in the V2 Rhaeto-Romance variety spoken
in southern Italy, “tomorrow”). Not surprisingly, there has been considerable pushback
from some researchers, like Edmonds (2004), who notes that a negative consequence of
this proliferation of layers. One of these negative consequences involves what he believes
are the ad-hoc propositions that criterial heads are the heads of phrases (e.g. TopP, FocP),
as well as the proposed orderings.
Kempchinsky (2013) notes that the main problem with this approach is a
conceptual one. By having these (discourse) features checked against a head, the
approach “puts into the syntactic component notions that are in fact discourse notions (p.
312).” Additionally, she notes that English, unlike Spanish, allows for only one head to
be active in the Left Periphery. This is true in the case of two topics (see 3.56 above) but
also when two different structures combine. Examples (3.58a-b) and (3.59) depicts the
combination of English Topicalization and Focal Preposing in English and Spanish,
respectively.42
42English Focal Preposing (Ward’s 1998 term) and Spanish Focus Fronting are essentially equivalent in discourse and syntactic terms. An example of Focal Preposing is found below (Example from Prince, 1981:259).
FIDO, they named their dog.
84
84
(3.58) a. *The dog FIDO they named (not Rex). (Kempchinsky, 2013)
b. *FIDO the dog they named (not Rex).
(3.59) Al niño PEPE lo llamaron (no Pablo). to-the boy Pepe CL.ACC.3m.sg they-called “PEPE they named the kid (not Pablo).”
CLLD + Fronted Focus
However, given that languages like Spanish allow for those positions to be active
simultaneously (c.f. 3.59 above), Kempchinsky notes that there is no principled
explanation for this asymmetry within this approach. Examples (3.60a and 3.60b) below
are the schematic representations of English Topicalization and Focus Preposing,
respectively, while example (3.60c) is the representation of CLLD + Fronted Focus in
Spanish (see 3.59 above).
Emonds (2004) also notes that these two structures are limited to root clauses and to “root-like indirect discourse embedding” contexts (Emonds, 2004), which, as we saw earlier, is quite distinct from the distribution of CLLD.
85
85
In view of this lack of specification, Kempchinsky further argues that the
cartographic approach “overdetermines” the discourse functions of constituents without
really providing an explanation that accounts for the discourse properties of dislocated
elements. Therefore, she argues that a more conceptually appealing solution would be to
“put some of the burden of [interpreting]” a constituent as either topic or focus beyond
the syntax, in the discourse component. The approach described below is an example of
an analysis that attempts to account for the data by distributing the burden between
syntax and pragmatics.
3.4.2.2 López’s (2009) approach to Information Structure
Firmly grounded in Minimalist principles, López argues for a modular approach
in which the pragmatic module is independent from the Computational System (c.f.,
Erteschik-Shir, 1997; Lambrecht, 1994; Vallduví, 1990).43 In López’s view, discourse is
a computational module that assembles sentences into Discourse Representation
43For an example of a different, non-modular approach, see Büring (2007), who argues that information structure is actually an aspect of syntactic representations.
(3.60)
86
86
Structures, while pragmatics (a distinct module) assigns (pragmatic) features to the
syntactic object. Hence, the information structure of a given syntactic object ∑ is ∑[p],
which is the original syntactic object augmented with the relevant pragmatic features.
This object ∑[p] then maps onto discourse. A critical point for López’s approach is the
place in the derivation where the pragmatic module intervenes, assigning its features to
the syntactic object. Following phase theory (Chomsky, 2001), López proposes that the
points of contact (between syntax and pragmatics) are precisely the edges of phases (vP
and CP). His interaction model is reproduced in Figure 3.1 below.
Figure 3-1 López’s model: interaction of syntactic objects with pragmatics
Rejecting the traditional notions of topic and focus (as in Reinhart, 2006; Rizzi,
1997, inter alia) as inadequate to “identify a coherent class of constructions” (p. 24),
López proposes that the discourse notions of “anaphor” and “contrast” are the crucial
notions (i.e. features) of information structure. Discourse Anaphors have two
characteristics: (i) they obligatorily take a discourse antecedent that is necessarily local
87
87
(in the sense of Villalba, 2000),44 and (ii) they are in a discourse-structural asymmetry
with respect to their dependent.
The second binary feature, contrast, opens up a variable in the quantification
domain (after Vallduví & Vilkuna, 1998). The syntax-discourse structures that López
uses as evidence for his syntactic analysis (data from Catalan and, to a lesser extent,
Spanish) are then posited as combinations of these binary features.
[+ contrast] [- contrast]
[+anaphor] - Clitic Left Dislocation (CLLD)
- Clitic Right Dislocation (CLRD) - P-movement
[-anaphor] - Fronted Focus - Rheme (Informational focus)
Table 3-2 Feature combinations and syntax-discourse structures, López (2009)
As Table 3.2 illustrates, CLLD and Fronted Focus differ in one feature: CLLD is
[+anaphor], whereas Fronted Focus is not [-anaphor]. These two structures (CLLD and
Fronted Focus) are similar in that both of them open a variable in the quantificational
domain, while simultaneously resolving it.45 As mentioned earlier, this is in contrast with
simple focus (Rheme), which resolves a variable that was left open in the discourse. One
important difference between Fronted Focus and CLLD is that CLLD requires that the
dislocated element necessarily be a discourse anaphor (i.e. it needs to take a local
antecedent and being in a discourse-structural asymmetry), while Fronted Focus does not
have such a requirement (i.e. the dislocated element can be either discourse-old or
discourse-new).
44Roughly, anaphors should be explicitly mentioned in the discourse or else be abundantly prominent in the context but, crucially, cannot be the answer to wh-questions.
45 Which is, incidentally, precisely what the feature [+contrast] is intended to capture in López’s model.
88
88
López’s approach, which is crucially dependent on movement, outlines in detail
how the step-by-step derivation of these structures proceeds. In the case of CLLD, the
dislocated element moves from its base position to the edge of the phase (the specifier of
vP; Spec,vP). In (3.61) below, the solid black line indicates the movement of the
dislocated constituent (a Elena), while the copy left behind is indicated by brackets (<>).
Once the dislocated phrase moves to Spec,vP, the interpretive pragmatics module can
then mark the clitic (and not the dislocated element directly) as anaphoric [+a]. López
takes the pragmatics module to be invasive, meaning that it can “inspect a syntactic
structure and attach a feature to it” (p. 98). In accordance with phase theory (Chomsky,
2001), however, the pragmatics can only attach a feature after a phase is completed.
(3.61) Step 1: López’s derivation of A Elena la vieron ayer. (“Elena, they saw
yesterday.”)
Crucially, the clitic (adjoined to the head of vP) enters in an Agree relationship
with the dislocated (moved) constituent (phi features are checked). Because of this Agree
relationship, the dislocated phrase (a Elena) comes to be marked [+anaphoric] as well.
89
89
(This provision will be of crucial importance in terms of how the derivation of Fronted
Focus will proceed.)
The following step involves movement to the next phase edge, the specifier of
Finite Phrase (FinP). After the dislocated phrase is marked [+a], by virtue of agreeing
with the clitic, which is also [+a], the dislocated phrase moves to the left periphery.
Specifically, López argues that the (target) landing site of this movement is FinP. Once
the dislocated phrase is at the edge of the next phase (FinP, this time), the pragmatics
module will mark it as contrast [+c]. These steps are illustrated in the tree in example
(3.62) below.
(3.62) Step 2: López’s derivation of A Elena la vieron ayer. (“Elena, they saw
yesterday”)
As can be seen in (3.61 and 3.62), the pragmatics module marks the dislocated
phrase. One of this is performed directly and the other via Agree so that the dislocated
element ends up with the feature composition [+contrast, +anaphor].
In López’s analysis, the difference between how CLLD and Fronted Focus are
derived involves the manner in which a dislocated element is assigned [+anaphor] (vs.
[+contrast]). As illustrated in the tree in (3.62), the pragmatics module assigns [+contrast]
90
90
to the actual dislocated phrase. However, the feature [+anaphor] is only assigned through
agreement with the clitic. As noted earlier, Fronted Focus does not have an agreeing clitic
that doubles the dislocated phrase. Thus, in López’s proposal, Fronted Focus is NOT
marked [+anaphor] precisely because there is no clitic to assign it to—in spite of the fact
that it lands in Spec,vP (the edge of the phase). The rest of the derivation proceeds in the
same manner as for CLLD, and thus Fronted Focus, which does not have a doubling clitic
and thus cannot be marked [+a], ends up with the feature composition [+contrast, -
anaphor].
Thus, Lopez’s approach to the characterization of CLLD and Fronted Focus
dislocates (defining these in terms of the pragmatic features [+a] or [-a] and [+c] or [-c] is
not just compatible with a movement analysis but, instead, it is absolutely contingent on
movement of these dislocates. On the other hand, Rizzi’s cartographic approach can be
compatible with movement but it is not contingent on it. In fact, Rizzi remains mostly
agnostic on this point. In terms of the acquisition of CLLD by English speakers, there
might be subtle but meaningful differences in terms of the predictions that can be gleaned
from both approaches. If CLLD involves movement, L1 English speakers should, in
principle, easily transfer the movement strategy from English topicalization. This fact
could be compatible with both approaches.
However, in terms of the landing sites, recall that Rizzi’s articulated periphery
proposes different landing sites for each head, while López proposes that the landing site
(the phase) is the same for both CLLD and Fronted Focus (i.e. FinP). Thus, while Rizzi
explains the order CLLD > Fronted Focus by positing different landing sites (and
syntactic requirements of feature-matching), López resorts to a discourse (as opposed to a
syntactic) restriction, whereby CLLD precedes Fronted Focus because the CLLD (being
[+a]) is required to be closer to its antecedent than Fronted Focus. Fronted Focus, by
virtue of being [-a], does not have such requirements. As will be described in Chapter 4,
one of the conditions in the Acceptability Judgment Task will ask participants to judge
91
91
these structures in combination (e.g. CLLD + Fronted Focus, CLLD + wh-elements). In
Chapter 5, I will speculate how these experimental tokens support the view that López’s
account might better account for the results.
3.5 Summary
I this chapter I have focused on the presentation and those syntactic and discourse
properties of CLLD that are relevant for the empirical study. For the purposes of
comparison, I have presented cross-linguistic data from other Romance languages.
Additionally, in order to establish what the transfer effects (learning tasks) are for the L2
group, I have also outlined the general syntactic and discourse properties of English-style
topicalization. The last part of the chapter presented two (competing) syntactic analyses
that aim to account for the syntactic and pragmatic properties of CLLD (López, 2009;
Rizzi, 1997, 2001).
92
CHAPTER 4
METHODS
4.1 Introduction to the Chapter
The purpose of the present chapter is to introduce the different methodologies
used in this study, detailing the tasks and procedures for each. It also details the
demographics of the participant groups included therein. The tasks that were used in this
study included on-line (time-sensitive) and off-line tasks that measured both linguistic
and non-linguistic cognitive capacities. In order to test the viability of the claims of the
(dis)continuity debate in L2 processing as well as the generation of predictions in L2
Spanish, this study uses Clitic Left Dislocation in Spanish (CLLD). CLLD is a long-
distance dependency that is felicitous only in particular discourse contexts, as reviewed in
the previous chapter. CLLD is an apt construction to test anticipatory language
processing in L2 because it requires a specific syntactic reflex (an agreeing dative or
accusative clitic) to appear downstream once a topicalized phrase has been encountered
(i.e. the clitic is necessary for the sentence to be completed felicitously).
According to the Shallow Structure Hypothesis (SSH; Clahsen & Felser 2006a,
b), post-puberty second language learners can only process language “shallowly”
because, unlike native speakers, L2 learners are argued to not have detailed grammatical
representations of their L2. Given that the SSH predicts that when processing their L2 the
learners in this study will only have access to pragmatic and semantic information, rather
than detailed syntactic representations, the learners in this study are not expected to show
evidence of native-like language processing. One of the experimental tasks of this study
focuses on investigating the generation of syntactic predictions for a downstream
linguistic element after encountering a linguistic cue. Thus, native-like prediction of
long-distance dependencies (such as CLLD) in the L2 learner group would not be
predicted under the SSH.
93
93
Prior to the presentation of the methods used in the study, I will reiterate the
research questions put forth in the introductory chapter:
4. Do L2 learners of Spanish, native speakers of English, show off-line knowledge
of the syntactic and pragmatic constraints of CLLD in Spanish?
5. Are long-distance dependencies such as Spanish CLLD especially difficult for L2
learners to process in real time, as predicted by the Shallow Structure Hypothesis?
6. Can individual differences in measures of linguistic (e.g. L1 vocabulary) and non-
linguistic (e.g. inhibitory control and statistical learning ability) predict variability
in the acquisition and processing of CLLD in L2 Spanish?
Thus, the present study has direct relevance to the understanding of how a second
language is learned over the course of development and how (non)linguistic skills can
transfer across languages. Additionally, it can further our understanding of how
individual differences in linguistic and non-linguistic cognitive abilities can modulate this
development. In order to address the contributions of individual differences in the
development of linguistic prediction in L2, this study uses a variety of tests. Both on-line
and off-line tests were included in order to gain a more complete understanding of the L2
linguistic system.
4.2 Participants
4.2.1. Second language learners
This study includes data from a total of 120 native speakers of English, post-
puberty learners of Spanish (85 women; mean age = 26.4 years, SD = 9.48 years). These
participants were recruited at two universities in the U.S. At the time of testing, these
participants were either enrolled in Spanish courses or were teaching said courses at the
college or high-school level. Participants in the L2 groups reported having normal
hearing and normal or corrected-to-normal vision. No cognitive disabilities were
94
94
reported. Table 4-1 presents the demographics of the L2 participants, grouped by
proficiency level.
Advanced Intermediate
N 56 37
Women 37 24
Age (mean) 32.5 21.3
Age (SD) 10.5 3.6
Years of exposure (mean) 14.9 7.5
Years of exposure (SD) 8.8 2.8
Table 4-1 Demographics of the L2 Learner group
L2 participants were sequential bilinguals who were exposed to Spanish during
puberty (mean age of exposure = 14.4), after they had already learned their first language.
None reported fluency in a language other than English or Spanish. L2 participants
received compensation for their participation.
4.2.1. Native speaker control group
The participants in the Spanish NSs group (n=36) were mostly monolingual; they
were tested in their native country (Puebla, Mexico) and reported having normal hearing
and normal or corrected-to-normal vision. These participants were included in order to
have a baseline of linguistic and processing behavior. Spanish NSs reported having had
only minimal exposure to English. However, most of them had taken at least a year of
foreign language (usually English) because secondary foreign language education is
compulsory in Mexico for at least a year. Additionally, many of them have been exposed
to English from movies, music, and other American cultural products that are ubiquitous
in Mexico. Nevertheless, none of the native Spanish participants reported fluency in any
95
95
language other than their native one. Table 4-2 presents the demographics of the native
speaker group.
Native Speaker participants
N 36
Women 20
Age (mean) 40.7
Age (SD) 18.7
Table 4-2 Demographics of the Native Speaker group
Given that previous research has shown evidence that level of education and other
variables such as socioeconomic background are closely related to language outcomes
(Mulder & Hulstijn, 2011; Pakulak & Neville, 2010; Street & Dąbrowska, 2010), NS
participants were recruited from a middle-class local arts music conservatory and from
the state hospital (Puebla, Mexico). In order to seek a close match in educational
backgrounds, all NS participants had to have completed at least two years of college in
order to be included in the study.
4.3 Tasks and Materials
Participants completed a battery of off-line and on-line tests over the course of
two separate data collection sessions. L2 participants took the tests in a computer lab,
while NS participants took the tests in a private office. Figure X presents the order of
tasks and the duration of each. Participants completed the tasks in the same order.
96
96
Figure 4-1 Tasks with approximate durations (L2 group)
For the L2 learner group, data collection sessions were at least two days apart, but
no more than five days apart. For the native speaker group, data collections took place a
month apart. Given that data collection occurred over the course of two different trips, it
was difficult to locate some of the speakers and thus only twenty-seven of the thirty-six
completed the full battery of tests.
4.3.1 Off-line Tasks
The off-line tests in this study included a) an independent proficiency test, b) a
test of syntactic knowledge of clitics, c) an English vocabulary task (only for the L2
group), d) an audio-visual rating task, e) a statistical learning task, and f) a language
background questionnaire. These tasks are detailed below.
97
97
4.3.1.1 Proficiency Test: Materials and Procedures
In order to discriminate between proficiency levels, this battery of tests included
an independent proficiency test that tested grammar and vocabulary. This test is a
shortened version of a standardized test used for official Spanish language accreditation
in Spain called DELE (Diploma de Español como Lengua Extranjera) issued by the
Ministry of Education, Culture, and Sport of Spain. It also includes a section from the
reading and vocabulary sections of the Modern Language Association Cooperative
Foreign Language Test (Educational Testing Service, Princeton, NJ). This proficiency
test has been used successfully as a discriminator of proficiency in prior L2 Spanish
acquisition research (e.g., White, Valenzuela, Kozlowska-MacGregor & Leung, 2004;
Montrul, 2004; Rothman, 2009). The test was uploaded to an independent survey
software and was available online, although all the participants took the test under
supervision. Instructions and test items were in Spanish. (4-1) presents an example from
the proficiency task (correct answer is in bold).
(4-1) Sus amigos pudieron salvarlo pero lo dejaron ____. His friends could have saved him, but they let him ____. a. ganar (win)b. parecer (resemble)c. perecer (perish)d. acabar (finish)
The test consisted of fifty multiple-choice items focusing on vocabulary and
grammar (maximum possible score=50). The first 30 items were randomized per
participant. The last 20 were part of a coherent paragraph, and consequently they were
presented in the same order to all participants. Most participants completed this task
within 35 minutes, although participants had up to 40 minutes to complete the proficiency
task. Partly based on previous literature (e.g. Slabakova, Rothman & Kempchinsky,
2011), the accuracy range cutoffs were set at 40-50 for the advanced group, 25-39 for the
98
98
intermediate group, and 10-24 for the beginner group. Because the self-paced reading
task was rather difficult in both syntactic and lexical counts, participants needed to have
knowledge of basic lexical items. Thus, this study included only the data of participants
who scored at least 50% on the proficiency task (n=93).
4.3.1.2 Clitic Knowledge Test
In order to determine whether the L2 learners had knowledge of the syntactic
properties of clitics, participants completed a multiple-choice test that included ten
multiple-choice items. This test has been successfully used in previous research on Clitic
doubling and Fronting operations in Spanish with L2 and Heritage Speaker populations
(e.g. Leal Méndez, Rothman & Slabakova, 2014, in press; Slabakova et al., 2011;
Slabakova, Rothman & Kempchinsky, 2012). Romance clitics appear either before finite
verbs or, in cases where there is an auxiliary (in many cases, a restructuring verb in the
terminology of Rizzi, 1982) and a semantic verb, they can either precede the first verb or
it can attach to the second, which can be either a gerund or an infinitive. Examples (4-2)
and (4-3) reproduce the instructions as well as two items from this test, one with a single
finite verb (only one option is grammatical) and one with two verbs (two options are
grammatical).
INSTRUCCIONES: Los segmentos de diálogo que están abajo están incompletos. Escoge cualquier respuesta que considere natural y apropiada. Puede haber (o no) más de una respuesta correcta.
INSTRUCTIONS: Each dialogue below is incomplete. Choose any answer that you consider to sound natural and appropriate. There may or may not be more than one correct answer.
(4-2) Pablo: Me encanta esta camisa pero es muy cara Juana: No te preocupes, yo ___________. Pablo: I love this shirt, but it is too expensive. Juana: Don’t worry, I’ll (buy it for you).
99
99
� Te compro la ��Compro ! Te la compro � La te compro ��Cómprotela
(4-3) José: Le voy a enviar las instrucciones a Pablo. Alfonso: Te pido un favor, ¿_________ a mí también? José: I will send Pablo the instructions. Alfonso: Can I ask you a favor? Can you (send them to me) as well?
� podrías me las enviar ! me las podrías enviar ��me podrías enviar ! podrías enviármelas � podrías enviar
As shown in (4-2) and (4-3), each item included a small dialogue that ended on a
question, the answer to which required the use of accusative clitics. Every item included
two clitics, one accusative clitic and one dative clitic. These clitics have ordering
restrictions in Spanish (Zagona, 2002), with dative clitics preceding accusative clitics.
Dialogues were followed by five answer options, some of which were ungrammatical due
to the position of the clitic in relation to the verb, others because of the position of a clitic
in relation to the other, and still others because a clitic was missing. Participants were
instructed to click on every right answer. They were also instructed that although at least
one answer was correct, they could pick more than one. Items and answer choices were
randomized per participant. The maximum score for this task was 50 points. This task
was administered via an independent online survey service (under supervision) after all
other tasks, in order to avoid priming effects. Most participants completed this task
within 15 minutes although there was no set time limit.
100
100
4.3.1.3 (English) Vocabulary Knowledge (L1 Verbal
Skills)
In order to obtain a broad measure of L1 verbal skills of the L2 participants, those
in the learner group completed the vocabulary subtest of a standardized vocabulary
measure, the Shipley Institute of Living Scale (SILS; Zachary, 1994). Native Spanish
participants did not complete this task or any other task of native verbal skill ability. This
pen-and-pencil test includes forty multiple-choice vocabulary items, which vary
substantially in terms of difficulty. Items were presented in the standard order, which
roughly presents vocabulary in order of difficulty. Participants were asked to choose
among four options what they considered to be the closest synonym for each vocabulary
item. Example (4-4) presents a sample item along with the instructions (italics and bold
in the original). The correct answer is marked with a circle.
(4-4) INSTRUCTIONS: In the test below, the first word in each line is printed in capital letters. Opposite it are four other words. Circle the one word that means the same thing, or most nearly the same thing as the first word. If you don’t know, guess. Be sure to circle the one word in each line that means the same thing as the first word.
QUERULOUS Maniacal Curious Devout Complaining
Although this task is rather brief (participants completed it within 10 minutes), the
SILS vocabulary subtest correlates highly with other verbal IQ measures, such as
Kaufman and Kaufman’s (1990) Brief Intelligence Test (r = .77 in a sample of college
students; Bowers & Pantle, 1985). Additionally, Zachary, Crumpton and Spiegel (1985)
reported correlations between the SILS and the Wechsler Adult Intelligence Scale
(Wechsler, 1955) ranging from .73 to .90 (median r = .795) across eight research studies.
As such, the SILS vocabulary subtest has been estimated to be a “brief but accurate
101
101
(verbal) IQ estimate” (Zachary et al., 1985). Furthermore, these effects have been
replicated successfully (Weiss & Schell, 1991).
4.3.1.4 Audio-Visual Rating Task of CLLD
In order to gauge off-line knowledge of CLLD in Spanish, participants completed
an untimed audio-visual acceptability judgment (rating) task. This task aimed to
determine whether L2 participants could exhibit knowledge of either word order
restrictions or of the required clitics after a topicalized phrase was introduced when there
were no time constraints. Each test item was presented entirely in Spanish (text and audio
recording) and included a context followed by a short question and an answer. Answers
either conformed to the discourse-conferred expectations or not, meaning that they did or
did not include an accusative clitic before the finite verb (Embedded CLLD items) or,
alternatively, that they were presented in the right word order (Main Clause CLLD
items). Figure 4-2 presents a screenshot from the rating task.
Figure 4-2 Screenshot from the audio-visual rating task, condition e (temporal adverbial clauses)
102
102
Five native speakers of Spanish recorded the test items in order to avoid any
intonation coercions. The inclusion of the audio recordings was crucial because
dislocated structures in Spanish have special intonation patterns. While CLLD lacks
stress, Focus Fronting has emphatic stress on the fronted constituent. In this respect, the
difference between these constructions might include both clitic presence/absence and the
intonation. If the presentation were only visual, participants could accept some
infelicitous structures for the wrong reasons (e.g. an infelicitous context-test sentence
combination for CLLD could be accepted as felicitous if participants mentally assigned
emphatic (e.g. contrastive) stress).
Previous research using virtually identical methodologies has shown that L2
learners and simultaneous bilinguals (Heritage Speakers) were successful in
demonstrating knowledge of CLLD by the intermediate proficiency level (Leal Méndez,
in press; Slabakova et al., 2011, 2012). In addition, the participants in these studies were
sensitive to Set-Subset and Identity semantic distinctions. However, these studies used
clitic left dislocations involving no embeddings and dislocating a single NP only. In
order to determine whether L2 learners could demonstrate knowledge of the syntactic
and pragmatic properties of CLLD in other syntactic environments (such as embedded
CLLD), this study includes six different conditions of CLLD. Table (4-3) presents the
conditions included in this task along with the number of tokens per list.
Main Clause CLLD
Condition Tokens per list A) Topic + Fronted Focus Grammaticality depends on word order.
3 grammatical 3 ungrammatical
i. Ese dinero A JUAN no se lo daré. ii. *A JUAN ese dinero no se lo daré. ‘It is to Juan that I will never give that money.’
Table 4-3 Conditions and sample tokens from the Offline Rating Task
103
103
B) Topic + wh-constructionsGrammaticality depends on word order.
3 grammatical 3 ungrammatical
i. Ese dinero ¿a quién se lo darás?to whom that money CL.DAT CL.ACC 2nd.give.FUT
ii. *A quién ese dinero se lo darás? ‘To whom will you give that money?’
Embedded CLLD Condition Tokens per list C) Subjunctive complements
Acceptability depends on clitic presence.3 grammatical 3 ungrammatical
i. El presidente quiere que al secretario lo asciendan. the president wants that to.the secretary CL.ACC.3.M.SG promote.SUBJ.3PL
ii. #El presidente quiere que al secretario asciendan. ‘The president wants the secretary to be promoted.’ D) Temporal adverbial clauses Acceptability depends on clitic presence.
3 grammatical 3 ungrammatical
i. Mientras a Don Armando lo ayudaban a preparar el asado, while to Don Armando CL.ACC.3.M.SG they-helped to prepare the grill a Jaimito se le ocurrió prenderle fuego a una cortina. to Jaimito CL CL occurred light fire to a curtain
ii. #Mientras a Don Armando ayudaban a preparer el asado, a Jaimito se le ocurrióprenderle fuego a una Cortina. .
“While they helped Don Armando to prepare the barbeque, Jaimito thought to set the curtain on fire.”. E) Subjunctive complements
Acceptability depends on clitic presence.3 grammatical 3 ungrammatical
i. Que a ese hombre lo hayan despedido no me sorprende. That to that man CL.ACC have.SUB.3rd.PL fired neg me surprise
ii. Que a ese hombre hayan despedido no me sorprende. ‘That they fired that man doesn’t surprise me.’
Table 4-3 �FRQGLQXHG��Conditions and sample tokens from the Offline Rating Task
For each item, participants evaluated the acceptability of each answer on a 7-point
Likert scale, ranging from “Normal (normal)—7” to “Very strange (muy rara)—1.”
104
104
Participants were also provided with an “I don’t know” option, although they were
instructed that this option was only to represent uncertainty, rather than the middle of the
scale. Items were presented in a Latin square such that no participant saw both
(grammatical and ungrammatical) versions of each item. Each list contained 33 test
sentences and 10 fillers. The small number of fillers is due to the fact that the study
comprised a great number of tasks and this one in particular was quite lengthy, with an
approximate duration of 45 minutes.
4.3.1.5 Statistical Learning Task
In order to gauge individual statistical learning abilities, participants completed a
long-distance dependency artificial grammar-learning task (Gómez, 2002) in which
subjects were expected to learn probabilistic regularities of the input implicitly, through
unguided exposure. The materials that were used in the study have been successfully used
in statistical learning tasks. Long-distance dependencies have been argued to have strong
natural language structure correlates (Gómez, 2002; Misyak & Christiansen, 2012) such
as center embeddings and natural long-distance dependencies. Friederici, Bahlmann,
Heim, Schubotz and Anwander (2006) have argued that only long-distance dependencies
are relevant for language, although the evidence for this claim is not beyond dispute
(Misyak & Christiansen, 2012).
In this task, subjects were first aurally exposed to an artificial grammar comprised
of a non-adjacent dependency (aXb). Three elements, which are non-words that adhere to
the phonological properties of English (e.g. ‘pel’, ‘jic’) (a, b, c), are uniformly linked to
three others (d, e, f), with an intervening X element, which was randomly chosen from a
pool of 24 items. Participants heard a total of 436 strings drawn from this AXB grammar
(approximately 25 minutes of training). Table X presents the artificial grammar words
from each category (A, B, X).
105
105
Artificial grammar (Language 1 from Gómez, 2002)
S"{ aXd bXe
cXf }
A B X
a = pel b = vot c = dak
d = rud e = jic f = tood
wadim, kicey, puser, fengle, coomo, loga, gople, taspu, hiftam, deecha, vamey, skiger, benez, gensim, feenam, laeljeen, chila, roosa, plizet, balip, malsig, suleb, nilbo, wiffle
Table 4-4 Grammar and items in Gómez’s Language 1
After training, subjects were presented with six strings that were consistent with
the grammar and six strings that were inconsistent with the grammar. In a forced choice
task with two alternatives, participants were asked to judge whether each string was
consistent with the patterns of the input or not. Test items contained one new X element,
such that accurate performance on this test necessarily involved generalization. Accuracy
rates were used as an index of an individual’s ability to pick up on non-adjacent
relationships in an artificial grammar.
4.3.1.6 Language Background Questionnaire
All participants also completed an online language background questionnaire that
collected demographic (non-identifying) information, as well as language background
(profile on language use) and language learning history.
4.3.2 On-line Tasks
In order to have an index of language processing as well as obtaining a measure
of cognitive control, this study also included two time-sensitive measures: a self-paced
reading task and a non-verbal Stroop task. Both tasks are detailed below.
106
106
4.3.2.1 Self-paced reading task
This study included a self-paced reading task in order to investigate whether
participants were able to predict downstream linguistic material based on previously
appearing syntactic cues. Stimuli were presented on a computer using a word-by-word
moving window display (Just, Carpenter & Wooley, 1982). Each target sentence was
preceded by a one- or two-sentence context that was necessary to make the topicalization
natural (i.e. the topic was introduced). Each target sentence was followed by a yes/no
comprehension question. Half of the comprehension questions were based on the context
and half on the target sentence. Stimuli included 24 dislocated sentences in two versions
(early and late conditions): one where a sentential-conferred prediction was violated
(dependency was completed late), relative to another where the prediction was met
(dependency was completed early). In total, the self-paced reading experiment had forty-
eight target sentences.
These materials were previously designed (Pablos, 2006) to test long-distance
dependencies in a group of Peninsular Spanish NSs. In order to independently verify
NSs’ preferences, Pablos used an off-line grammaticality-rating task to verify that NSs
preferred clitic over clitic-less sentence continuations after a dislocated phrase was
introduced. The sentences used in this study were only slightly simplified from Pablos
(2006) by removing a level of embedding; some lexical items were also replaced with
more standard counterparts in order to avoid any differences due to dialectal varieties.
Example (5-1) is a sample token.
(5-1) CONTEXT: Varias estudiantes visitaron el departamento al que habían mandado su inscripción. (Several students visited the department where they had sent their registration.)
Target: A aquellas estudiantes la linda secretaria felizmente (les) contóTo those students the lovely secretary happily (clitic) told.3rd.sg que probablemente las admitirán en el programa. that probably (clitic) admit.pl.3rd.FUT in the program
107
‘The lovely secretary happily told the students that they would probably be admitted to the program.’
In order to determine whether the sentences used in the experimental items did
require a clitic in the grammar of native speakers, an additional group of participants
completed a preliminary normative sentence-completion task. This task was presented in
two forms: the first included the target verb while the second one did not (i.e. the
participants could complete the sentence using a verb of their choosing). Monolingual
speakers of Mexican Spanish (none of whom participated in any other tasks) completed
these tasks while in their native country. The target items were constituted by 24 sentence
fragments based of the stimuli used in the self-paced reading tasks. Each sentence
fragment was preceded by a context and was identical right before the main verb. The
rest of the sentence (including the main verb) was replaced by a line. Participants were
asked to complete the sentence quickly, using the first completion that they could think
of. These 24 items along with 24 fillers were randomized per subject. A total of 24
participants completed the first list (verb included) while 25 participants completed the
second list (participants could use a verb of their choosing). Sentences were scored by
hand and grouped into two categories: (a) sentences were completed using an accusative
or dative doubling clitic or (b) sentences were not completed using a doubling clitic.
Results showed that 87.1% of the sentences in the first list (verb included) were
completed using a clitic while 89.5% of sentences in the second list (verb open) were
completed with a clitic. Thus, it is clear that native Spanish speakers have a clear
preference for completing such sentences using a doubling clitic.
In this manipulation, the sentential-conferred expectation after encountering a
left-dislocated phrase (e.g. ‘A aquellas estudiantes’, in 1) is that a clitic will appear later
on. Because Spanish clitics appear before finite verbs (Rizzi, 1982), the clitic would be
anticipated before the matrix verb (contó in (X)). If speakers had predicted an upcoming
108
108
clitic after encountering a topicalized phrase, their reaction times (RTs) should be slower
at the main verb if the clitic was not present. In the original design, Pablos (2006) chose
matrix verbs that were optionally ditransitive (e.g. contar ‘to tell’). These verbs can take a
dative argument or not, which increases the chances of an equal bias for the
presence/absence of the clitic.
The researcher constructed two lists with the 48 experimental sentences so that
each participant would only see one version of each sentence (every participant was
exposed to 24 experimental sentences only, 12 in the early condition and 12 in the late
condition). An additional forty-eight sentences, which did not include any dislocated
items but were also preceded by a context, were included as fillers. Items were
randomized per participant.
4.3.2.2 Non-verbal Stroop task
In order to measure an individual’s ability to suppress information that is
irrelevant to a given task (i.e. ‘inhibitory control’), the participants in this study
completed a non-verbal Stroop task (Stroop, 1935, specifically the version adapted by
Blumenfeld & Marian, 2011 from Liu, Banich, Jacobson, & Tanabe, 2004) as part of the
battery of tests. It has been argued that when bilinguals process language, there is
simultaneous activation of the L1 and the L2 (called “parallel activation”), regardless of
linguistic context (Blumenfeld & Marian, 2007; Marian & Spivey, 2003; Marian, Spivey
& Hirsch, 2003). Thus, some researchers propose that bilingual speakers face higher
levels of linguistic competition on a regular basis (e.g. Bialystok, 2009; Linck, Kroll, &
Sunderman, 2009) and, furthermore, that this increased competition results in a cognitive
advantages which might include advantages in inhibitory control mechanisms (Bialystok
& Codd, 1997; Bialystok & Martin, 2004).
109
109
Blumenfeld and Marian’s Stroop task creates cognitive conflict by manipulating
different aspects of the stimulus so that these are either congruent or incongruent. The
stimulus consisted of an arrow, and the aspects that were manipulated were arrow
location (arrow was located at either the right or left side of the screen) and arrow
direction (arrow could either point right or left). In this study, participants were asked to
push “3,” which was located on the right side in a numerical pad when they saw an arrow
pointing rightward and “1,” which was located left on a numerical pad when they saw an
arrow pointing leftward. In other words, participants were asked to mind arrow direction
while ignoring (i.e. inhibiting) arrow location. Figure 4-3, panel (a) presents the initial
fixation point along with two examples of congruity (arrow direction and arrow location
match). Panel (b) depicts the fixation point and two examples of incongruity (arrow
direction and arrow location do not match).
110
110
Figure 4-3 Sample stimuli from the non-verbal Stroop task. Panel (a) depicts congruent stimuli (left and right) and panel (b) depicts incongruent stimuli (left and right).
Each trial started with a centrally-located fixating cross (Figure X, panels a and b,
left side) that was presented for 500 ms. This fixation cross was presented in order to
have participants focus on the middle of the screen. After the fixation point, participants
were exposed to a congruent or an incongruent stimulus for 700ms. In total, participants
were exposed to 160 trials (3:1 ratio congruent/incongruent): 120 congruent trials (60
congruent right and 60 congruent left) and 40 incongruent trials (20 incongruent right
and 20 incongruent left). At the end of each trial, participants were exposed to a blank
screen for 800 ms. Trials were presented in a pseudo-randomized order so that all
participants saw the same order. Before the actual experiment, participants had a small
practice run which included both incongruent and congruent items. Although the software
111
111
recorded reaction times during the practice runs, participants could not move on until
they answered every item in the practice right.
4.4 Summary
This chapter presented the tasks used in this study in order to weigh in on the
(dis)continuity debate in L2 processing. This study examines the processing of L2 long-
distance dependencies and the development of linguistic predictions (anticipatory
language processing) in an L2. Discontinuous accounts of L2 processing posit that the
mechanisms used in L1 and L2 processing are different from each other (i.e. L1 and L2
processing are qualitatively different). Specifically, the SSH (Clahsen & Felser, 2006a,
2006b) predicts that L2 learners will display qualitative differences from native speakers
when processing long-distance dependencies in a second language. This is argued to be
due to lack of detail in the syntactic representation in the L2. On the other hand,
continuous accounts of L2 processing do not posit either impaired representations or
different processing mechanisms for L2 processing. Instead, these accounts predict a
gradual emergence of sensitivity for violation of expectations.
112
CHAPTER 5
RESULTS
5.1 Introduction to the Chapter
The current chapter presents the results of the experimental tasks described in
Chapter 4. The tasks used in this study, which measured both linguistic and non-linguistic
cognitive capacities, were completed by a group of native speaker Spanish controls
(tested in Mexico) and a group of L2 learners of Spanish, native speakers of English
(tested in two universities in the United States). The L2 (experimental) group was divided
into two subgroups (advanced and intermediate), based on the independent proficiency
test detailed in Chapter Four. The purpose of including a control group was to ascertain
that the tasks are reliable as well as to ensure test validity (that the tasks measure what
they purpose to measure). Following Bley-Vroman (1983) and Dekydtspotter et al.
(2006), I will also focus on examining whether L2 learners can show evidence of
sensitivity to grammaticality or the to the (on-line) violation of expectations within their
own groups (thus avoiding Bley-Vroman’s Comparative Fallacy). As presented in the
previous chapter, there were two types of tasks: off-line tasks, which where completed in
an untimed fashion, and on-line tasks, which were time-sensitive tasks where reaction
times were recorded.
The tasks included in this study have a three-fold purpose: a) to examine whether
learners show off-line knowledge of the syntactic properties CLLD in Spanish, b) to test
the claims of the Shallow Structure Hypothesis (Clahsen & Felser 2006a, b), which
argues that adult L2 learners process their L2 “shallowly” due to incomplete grammatical
representations by way of examining the processing of a long-distance dependency in L2
Spanish and c) c) to examine the contributions of individual differences in the acquisition
and processing of long-distance dependencies L2 Spanish. The linguistic property
examined in this study is Clitic Left Dislocation in Spanish (CLLD), a long-distance
113
113
dependency in Spanish. As argued earlier, CLLD is an apt construction to test prediction
(anticipatory language processing) in L2 because it requires for a specific syntactic reflex
(an agreeing dative or accusative clitic) to appear downstream once a topicalized phrase
has been encountered (i.e. the clitic is necessary for the sentence to be completed
felicitously).
In addition to descriptive statistics, the data will be analyzed using repeated-
measures ANOVA with Bonferroni post-hoc tests in both intergroup and intragroup
comparisons. Additionally, to describe the potential relationships between the several
measures of individual abilities, I will use a Pearson Product-Moment correlation and
regression analyses.
5.2 Off-line Tasks
As noted in the previous chapter, the off-line tests in this study included: a) an
independent proficiency test, b) a test of syntactic knowledge of clitics, c) an English
vocabulary task (only for the L2 group), d) an audio-visual rating task focusing on
CLLD, e) a statistical learning task, and a f) language background questionnaire. These
results of each task will be detailed below.
5.2.1 Proficiency Test
The proficiency test tested grammar and vocabulary and was used to divide the
experimental group into two subgroups. The test consisted of fifty multiple-choice items,
so that the maximum possible score was 50 points. This test has been used extensively in
L2 acquisition research, both with on-line and off-line studies (e.g. Dussias, 2004;
Montrul, 2004; Rothman, 2009; White et al., 2004;) and only a subset of the native
speaker participants completed the task (n=27). Multiple-choice proficiency tests are one
of the more practical ways to assess language level, although they are admittedly limited
in terms of assessing overall language skill (only reading and comprehension).
114
114
Group (n) Mean SD Range
Native speakers (n=27) 48.5 1.5 5 (45-50)
Advanced L2 (n=56) 45 3.0 10 (40-50)
Intermediate L2 (n=37) 31 4.6 14 (25-39)
Table 5-1 Means, standard deviations, and range of scores on the proficiency measure per group.
Table 5-1 displays the means and standard deviations of the proficiency test for
each group. Using (admittedly) arbitrary cutoff points, which have been used in the
literature (e.g. Slabakova et al., 2012), I used the scores from the proficiency test to
assign L2 learners to two sub-groups.46 The accuracy range cutoffs were set at 40-50 for
the advanced group and 25-39 for the intermediate group. The native speakers ranged
from 45-50 points. Because the battery of tests included linguistic tasks that were rather
difficult on both syntactic and lexical counts, participants needed to have knowledge of
basic lexical items and syntactic structures in order to complete the tasks. Thus, this study
included only the data of participants who scored at least 50% on the proficiency task
(n=93). The data from the participants scoring less than 50% was not analyzed.
A one-way ANOVA revealed that there were significant differences in the mean
proficiency scores across groups, F(2, 117) = 265.282, p < .0001. Post-hoc Bonferroni
comparisons also indicated that the mean scores of all three groups were significantly
different from each other. The proficiency scores of the natives were higher than those of
the L2 advanced and, in turn, the L2 advanced group scored higher than the L2
intermediate group.
46 The only difference is that in the present study, the intermediate group includes a larger range of scores. Slabakova et al. (2012) used 30-40 for the intermediate group. This approach was chosen because in this study we did not include the data of novice or beginner learners.
115
115
5.2.2 Clitic Knowledge Test
Participants also completed a multiple-choice test that included ten multiple-
choice items (maximum 50 points). This test has been successfully used in previous
research on L2 and Heritage Speaker populations (e.g. Leal Méndez, Rothman, &
Slabakova, in press; Slabakova et al., 2011; Slabakova, Rothman & Kempchinsky, 2012).
This test focused on the syntactic knowledge of clitics, specifically, on clitic location
(rather than agreement). Again, only a subset of native Spanish speakers (n=27)
completed this test.
Group (n) Mean SD Range
Native speakers (n=27) 49.37 1.14 37 (25-62)
Advanced L2 (n=56) 48.51 2.19 45 (21-66)
Intermediate L2 (n=37) 39.19 6.9 17 (18-35)
Table 5-2 Means, standard deviations, and ranges of the scores of the test of clitic knowledge per group
Table 5-2 displays the means, standard deviations, and ranges of scores on the
clitic knowledge test for each participant group. A one-way ANOVA revealed that there
were significant differences in the mean scores across groups (F(2, 117) = 68.98,
p < .0001). Post-hoc comparisons using Bonferroni corrections indicated that the mean
scores for the intermediate L2 group (ML2.I = 39.19, SDL2.I = 6.9) were significantly
different from both the advanced L2 group and the native speaker groups (ML2.A = 48.51,
SDL2.A = 2.19, p < .0001; MNS = 39.19, SDNS = 6.9, p < .0001, respectively). The means
of the advanced group, however, did not significantly differ from the means of the native
speaker group (p = 1.000).
116
116
5.2.3 (English) Vocabulary Knowledge (L1 Verbal Skills)
The participants in the learner group completed the multiple-choice vocabulary
subtest of a standardized vocabulary measure, the Shipley Institute of Living Scale
(SILS; Zachary, 1994) with the purpose of obtaining a broad measure of their verbal
skills in their native language (i.e. English). The maximum score for this test was forty
points (no half points were possible). Table 5-3 presents the means, standard deviations
and ranges of the scores on the Vocabulary measure per group.
Group (n) Mean SD Range
Advanced L2 (n=55) 35.82 2.60 13 (27-40)
Intermediate L2 (n=37) 31.73 3.81 19 (21-38)
Table 5-3 Means, standard deviations, and ranges of the scores on the SILS Vocabulary test for the L2 groups.
An independent-samples t-test was conducted to compare the mean scores in the
SILS measure for both the intermediate and advanced L2 groups. The test revealed a
significant difference at the p = 0.05 level in the scores for intermediate and advanced
groups; t(91) = , p < .0001. Thus, the scores on the Vocabulary test were reliably higher
for the participants of the L2 advanced group than for the L2 intermediate group.
5.2.4 Audio-Visual Rating Task of CLLD
This task consisted of an untimed, audio-visual acceptability judgment task
focusing on CLLD. I will first present the results of the Main Clause CLLD items
(conditions b, c), given that the grammaticality of these sentences involved word order
alternations (rather than the absence (ungrammatical) or presence (grammatical) of an
accusative clitic.
117
117
5.2.4.1 Rating Task: Main Clause CLLD (Group results)
Mean group responses for all three participant groups included in the study (Main
clause CLLD structures) for conditions A (CLLD Topic + Fronted Focus) and B (CLLD
Topic + wh-constructions) are displayed Figure 5-1. Examples (5-1 a,b) below illustrate
condition A while example (5-2 a,b) illustrate condition B.
(5-1) Condition A: Topic + Fronted Focus a. Ese dinero A JUAN no se lo daré.
that money to Juan neg cl.DAT CL.ACC.3m.sg 1st.give.FUTb. *A JUAN ese dinero no se lo daré.
“It is to Juan that I will never give that money.”
(5-2) Condition B: Topic + wh-constructions a. Ese dinero ¿a quién se lo darás?
that money to whom cl.DAT cl.ACC 2nd.give.FUT b. *A quién ese dinero se lo darás?
“To whom will you give that money?”
As noted in Chapter Four, the experimental design required participants to read
and simultaneously listen to a short context. After the context, participants listened to and
read a question and answer pair after which participants evaluated the answers with the
order Topic > {FF, wh} (which were expected to be grammatical based on the literature),
and the answers with the order {FF, wh} > Topic (which were expected to be
ungrammatical) independently of each other. Items were distributed in two lists, so no
participant saw both versions of each item (grammatical and ungrammatical). The scale
provided was 1 – 7, with the upper end of the scale indicating grammaticality and the
lower end of the scale ungrammaticality. Participants were also presented with the option
“I don’t know.” These answers were not analyzed and only constituted 1.62% of all
responses.
118
118
Figure 5-1 Mean acceptability rates for Condition A (CLLD + Fronted Focus) and B (CLLD + wh-operator)
A two-way repeated-measures ANOVA was performed with condition (i.e.
CLLD + Fronted Focus, CLLD + wh-operator) and grammaticality (✓ word order
Topic > {FF, wh} and * word order {FF, wh} > Focus) as within-participant variable and
group as a between-participant variable (native speakers, L2 advanced, L2 intermediate).
The significance level was established at p < 0.05. There was a significant effect of
grammaticality (F(1, 117) = 269.303, p < 0.0001), but no effect of condition (F(1, 117) =
.440, p > 0.05). There was an effect of group (F(2, 117) = 4.457, p > 0.01), although the
interaction between condition and group was not significant (F(2,117) = .559, p > 0.05).
Tests also revealed a significant interaction between grammaticality and condition
(F(1, 117) = 50.226, p < 0.0001), indicating that the grammaticality effect was greater in
the CLLD + wh-operator condition (MG = 5.533, SDG = 1.22; MUG = 3.05, SDUG = 1.34)
119
119
than in the CLLD + Fronted Focus condition (MG = 4.757, SDG = 1.5; MUG = 3.733,
SDUG = 1.4). This result, which may appear at first sight to be somewhat unexpected, will
be discussed in Chapter 6. The only other significant interaction was between
grammaticality and group (F(2, 117) = 13.71, p < 0.0001), which was examined further.
Pairwise comparisons with Bonferroni corrections revealed that all three groups made a
significant contrast between the grammatical and ungrammatical sentences (p = 0.0001
for all three participant groups).
5.2.4.2 Rating Task: Embedded Clause CLLD (Group
results)
In this task, participants were asked to read and listen to the sentences and
evaluate the answers with clitics (which were expected to be grammatical), and the
answers without clitics (which were expected to be ungrammatical) independently of
each other, as presented in a Latin square. Figure 5-2, displays the mean group ratings for
all three participant groups included in the study. Examples (5-3 a,b) are illustrative of
the grammatical and ungrammatical options in condition C .
(5-3) Condition C Subjunctive complements a. El presidente quiere que al secretario lo asciendan.
the president wants that to.the secretary cl.ACC.3m.sg promote.SUBJ.3PL b. *El presidente quiere que al secretario asciendan. “The president wants the secretary to be promoted.”
120
120
Figure 5-2 Mean acceptability rates for Embedded CLLD condition C (Subjunctive Complements) for all groups.
Figure 5-4 illustrates the ratings for condition D (Temporal Adverbials). Example
5-4 illustrates the ungrammatical and grammatical options in each condition. As
mentioned earlier, these items were distributed in two lists so that participants were only
exposed to one version of the items.
(5-4) Condition D Temporal adverbial clauses a. Mientras a ella la adulan, a él lo ignoran. while to she cl.ACC.3f.sg idolize.3rd.PL to he cl.ACC ignore.3rd.PL b. Mientras a ella adulan, a él lo ignoran. “While they idolize her, they ignore him.”
121
121
Figure 5-3 Mean acceptability rates for Embedded CLLD condition D (Temporal Adverbials) for all groups.
Finally, Figure 5-4 below presents the ratings of condition E (Sentential Subjects)
for all groups. Example 5-5 is illustrative of the grammatical and ungrammatical options
that were rated in this condition.
(5-5) Condition E Sentential Subjects a. Que a ese hombre lo hayan despedido no me sorprende. that to that man cl.ACC have.SUB.3rd.PL fired neg me surprise b. Que a ese hombre hayan despedido no me sorprende. “It doesn’t surprise me that they fired that man.”
122
122
Figure 5-4 Mean acceptability rates for Embedded CLLD condition E (Sentential Subjects) for all groups.
A two-way repeated-measures ANOVA was performed with condition (C—
Subjunctive Complements, D—Temporal Adverbials, and E—Sentential Subjects), and
grammaticality (clitic ✓ and without clitic *) as within-participant variables and group as
a between-participants variable. A significance level of p < 0.5 was established. There
was a significant effect of condition (F(2, 234) = 4.439, p = 0.013), an effect of
grammaticality (F(1, 117) = 111.535, p < 0.0001), and an effect of participant group
(native speakers, L2 advanced, L2 intermediate; F(2, 117) = 3.360, p < 0.038). The
interaction between condition and group was not significant (F(4, 234) = 1.908, p > 0.05),
however.
123
123
The interactions between grammaticality and group (F(2, 117) = 14.67,
p < 0.0001) and between condition and grammaticality (F(2, 117) = 9.278, p < 0.0001)
we both statistically significant. Finally, there was a significant three-way interaction
among condition, grammaticality, and group (F(4, 234) = 2.745, p < 0.029). These
interactions show that the intermediate L2 participants did not make a robust distinction
between the ungrammatical and grammatical sentences while the native speaker and the
L2 advanced participants did make a distinction between the two. This three-way
interaction was further examined.
Table 5-4 displays the contrasts between the types of test sentences in native
speaker judgments (Bonferroni adjustment). Blank cells represent irrelevant comparisons.
As expected, native speaker participants made a reliable distinction between the
grammatical and the ungrammatical sentences in all three conditions (Subjunctive
Complements, Temporal Adverbials, and Sentential Subjects test sentences). The cells
where these contrasts were expected have a darker background. Additionally, these
pairwise comparisons show that the native speakers did not display a significant
difference among all grammatical sentence conditions (sentences with the clitics,
regardless of condition).
124
124
Grammaticality
Condition C (Subjunctive compl.)
Condition D Temporal Adverbials
Condition E Sentential Subj.
✓ * ✓ * ✓
Condition C * p < 0.0001
Condition D ✓ n.s.
* n.s. p < 0.0001
Condition E ✓ n.s. n.s.
* p < 0.001 p < 0.004 p < 0.0001
Table 5-4 Pairwise comparisons (grammaticality*condition*group): Embedded CLLD (native speakers). Conditions: C—Subjunctive Complements, D—Temporal Adverbials, E—Sentential Subjects.
However, the native speaker judgments exhibited an interesting contrast between
the three types of ungrammatical sentences (conditions C, D, E). Namely, native speakers
judged the ungrammatical sentences in condition E (Sentential subjects; MUG = 4.83,
SDUG = 1.10) significantly higher (more acceptable) than the ungrammatical sentences in
conditions D (Subjunctive complements; MUG = 3.29, SDUG = 1.36) and e (Temporal
adverbials; MUG = 3.86, SDUG = 1.02). Note that in section X, Haegeman (2006) noted
that her informants rated Sentential Subjects somewhat lower than the other embedded
CLLD types. However, the native speakers in this task did not accept CLLD Sentential
Subjects at a lower rate—instead, they rejected the ungrammatical sentences significantly
less robustly than the other two conditions.
Table 5-5 displays the aforementioned contrasts presented in Table 5-4 but in the
judgments of L2 advanced speakers. Again, blank cells represent irrelevant comparisons.
As in the case of the native speaker group, L2 advanced participants made a reliable
distinction between the grammatical and the ungrammatical sentences in all three
125
125
conditions (C—Subjunctive Complements,�D—Temporal Adverbials, and E—
Sentential Subjects test sentences). Also in keeping in the contrasts shown by native
speakers, the L2 advanced speakers did not display a significant difference among all
grammatical sentence conditions (clitic sentences, regardless of condition).
Grammaticality
Condition C (Subjunctive compl.)
Condition D Temporal Adverbials
Condition E Sentential Subj.
✓ * ✓ * ✓
Condition C * p < 0.0001
Condition D ✓ n.s.
* n.s. p < 0.0001
Condition E ✓ n.s. n.s.
* p = 0.06 n.s. p < 0.0001
Table 5-5 Pairwise comparisons (grammaticality*condition*group): Embedded CLLD (L2 advanced learners)
Although the L2 advanced speakers exhibited the same expected contrasts as
native speakers, the L2 advanced group did not display the contrast between the three
types of ungrammatical sentences (conditions C, D, E) present in the judgments of native
speakers. L2 learners made a marginal distinction (p = 0.06) between the ungrammatical
sentences in condition E (Sentential subjects; MUG = 4.68, SDUG = 1.38) and the
ungrammatical sentences in conditions C (Subjunctive complements; MUG = 4.14, SDUG
= 1.58). However, the distinction of the ungrammatical sentences in condition C and
condition D (Temporal adverbials; MUG = 4.45, SDUG = 1.62) was not statistically
significant.
126
126
Finally, Table 5-6 displays the same contrasts in the judgments of L2 intermediate
speakers. Unlike the native speaker group and the L2 advanced group, however, the L2
intermediate participants only exhibited a statistically significant distinction between the
ungrammatical and grammatical sentences in condition E (Sentential subjects). On the
other hand, as in the case of other two groups, L2 intermediate participants did not
display a significant difference among any of the grammatical sentences, regardless of
condition.
Grammaticality
Condition C (Subjunctive compl.)
Condition D Temporal Adverbials
Condition E Sentential Subj.
✓ * ✓ * ✓
Condition C * p = .101
Condition D ✓ n.s.
* n.s. p = .235
Condition E ✓ n.s. n.s.
* n.s. n.s. p < 0.040
Table 5-6 Pairwise comparisons (grammaticality*condition*group): Embedded CLLD (L2 intermediate learners)
Additionally, L2 intermediate speakers did not display any contrasts among the
three types of ungrammatical sentences (conditions C, D, E).
5.2.5 Statistical Learning Task
Participants completed an artificial grammar-learning task (Gómez, 2002) in order
to determine individual statistic learning abilities. In this task, participants were exposed
to the probabilistic regularities of the artificial grammar in the hopes that they would
127
127
(implicitly) learn a long-distance dependency in the language (See section X for details).
Importantly, the words in the artificial grammar were non-words that nonetheless adhered
to English phonotactic rules. After the twenty-five minute exposure phase, participants
were asked to complete a twelve-item forced-choice task where they had to judge
whether the options were consistent or not with the input to which they were exposed.
Table 5-7 illustrates the mean accuracy rates and standard deviations for each participant
group.
Participant groups Mean Accuracy (percentage) SD Ranges
Native Speakers (n=27) 6.41 (53%) 1.62
L2 Advanced (n=56) 8.87 (74%) 2.75 L2 Intermediate (n=37) 8.54 (71%) 2.71
Table 5-7 Mean accuracy rates and standard deviations for the statistical learning task (all groups)
As mentioned in Chapter Four, accuracy rates were used as an index of individual
ability to pick up on long-distance dependencies in the artificial grammar. A one-way
ANOVA revealed that there were significant differences in the mean scores across groups
(F(2, 117) = 9.092, p < 0.0001). Post-hoc comparisons using Bonferroni tests indicated
that the mean scores for the native speaker group were significantly different from both
the advanced L2 group (p < 0.0001) and the intermediate L2 group (p = .003). The means
of the L2 advanced group, however, did not significantly differ from the means of the L2
intermediate group. Although there is no reason to believe that statistical learning ability
varies according to native language, recall that our task (Gómez, 2002) followed the rules
of English phonotactics. Thus, the lower performance of the native Spanish speakers
(essentially monolingual-) group appears to indicate a perceptual bottleneck that regulates
128
128
performance in this task. In other words, the Spanish native speakers were probably
unable to pick up on the regularities of the language because they had trouble perceiving
the non-words of the artificial language.
5.3 Interim Summary: Offline tasks
The two main tasks were the tasks focusing on CLLD: an offline rating task and
an online self-paced reading task. A portion of the off-line rating task tested main clause
CLLD focusing on the combination of CLLD with other left-periphery structures
(Fronted Focus, wh-operators). The other portion tested the occurrence of embedded
CLLD in Subjunctive Complements, Temporal adverbials and Sentential Subjects.
The offline rating task tested main clause CLLD where the grammatical and
ungrammaticality options differed in terms of word order (grammatical: Topic > {FF,
wh}; ungrammatical: {FF, wh} > Topic. In these items, all the participant groups were
able to discriminate between the grammatical and ungrammatical sentences and, as a
whole, they displayed a larger difference between the ungrammatical-grammatical pairs
involving wh-operators than the pairs involving Fronted Focus.
In terms of embedded CLLD, both the native speaker and advanced L2 learner
group were able to make a clear distinction between the grammatical-ungrammatical
pairs in the three conditions (Subjunctive Complements, Temporal Adverbials, and
Sentential Subjects), while the intermediate L2 group was only able to display this
distinction in the condition of Sentential Subjects. There is another difference that merits
discussion. Namely, within the ungrammatical sentences, the native speakers had lower
ratings for the ungrammatical conditions in Subjunctive Complements and Temporal
Adverbials. Thus, they rated the ungrammatical sentences in the Sentential Subjects
condition reliably higher (“less” ungrammatical, so to speak). The advanced group
displayed only one difference in this respect: While they rated the ungrammatical
sentences in the Subjunctive Complements lower, there was no significant contrast in
129
129
terms of the rate of their rejection of ungrammatical sentences between the sentences in
the Temporal Adverbials condition and the Sentential Subject condition.
5.4 On-line Tasks
In order to obtain indices of language processing as well as of cognitive control,
this study also included two time-sensitive measures: a self-paced reading task (Pablos,
2006) and a non-verbal Stroop task (Blumenfeld & Marian, 2010). The results of these
tasks are presented in turn.
5.4.1 Self-paced reading task
The self-paced reading task was included to determine whether participants were
able to predict downstream linguistic material based on previously appearing cues.
Stimuli (slightly modified from Pablos, 2006) included twenty-four CLLD sentences in
two conditions (forty-eight total sentences in two randomized lists so that no single
participant saw both early and late versions of each sentence). In the early condition, a
sentential-conferred prediction was violated (dependency completed late), as compared
to another where the prediction was met (dependency completed early). The example
below (5-1) illustrates a sample token.
(5-1) CONTEXT: (“Estudiantes” are mentioned) Target: A aquellas estudiantes la linda secretaria felizmente (les) contó
To those students the lovely secretary happily (clitic) told.3rd.sg que probablemente las admitirán en el programa. that probably (clitic) admit.pl.3rd.FUT in the program “The lovely secretary happily told the students that they would probably be admitted to the program.”
Thus, for Spanish speakers, the expectation after encountering an a-marked left-
dislocated phrase should be that a clitic will appear later on (before a finite verb). Thus,
RTs should be higher at the main verb when the clitic was not present (dependency
completed late) compared to when it was (dependency completed early). Each context
130
130
and test sentence was followed by a yes/no comprehension question. Per list there were
seventy-two comprehension questions in total, twenty-four comprehension questions
associated with the CLLD items.
5.4.1.1 Accuracy scores on comprehension questions
Accuracy scores will be presented in two different ways: overall accuracy and
accuracy on experimental items only. Table 5-8 presents the mean accuracy scores and
standard deviation per participant group of all the items in each list (overall accuracy),
including fillers (total of 72 items).
Participant groups Mean Accuracy (percentage) SD
Native Speakers (n=36) 63.4 (88%) 4.9
L2 Advanced (n=56) 64.8 (90%) 3.6 L2 Intermediate (n=37) 59.7 (83%) 6.7
Table 5-8 Mean overall accuracy rates (fillers included) and standard deviations for the self-paced reading comprehension questions (all groups)
A one-way ANOVA revealed that there were significant differences in the mean
scores across the three participant groups (F(2, 126) = 11.763, p < 0.0001). Post-hoc
comparisons using Bonferroni tests indicated that the mean scores for the native speaker
group were not significantly different from the advanced L2 group (p > 0.05). However,
the mean accuracy scores of the intermediate group were significantly different to both
the mean scores of the native speakers (p = .003) and the mean scores of the advanced L2
group (p < 0.0001).
Table 5-9 presents the mean accuracy scores and standard deviation per
participant group only for the experimental CLLD items in each list (accuracy in
131
131
experimental stimuli), thus not including fillers (each participant was exposed to a total of
twenty-four items per list: twelve in the late condition, twelve in the early condition).
Participant groups Mean Accuracy (percentage) SD
Native Speakers (n=27) 20.3 (85%) 9.94
L2 Advanced (n=56) 21.5 (90%) 7.73 L2 Intermediate (n=37) 18.8 (78%) 13.1
Table 5-9 Mean accuracy rates and standard deviations on the experimental items only (n=24 per list) for the self-paced reading comprehension questions (all groups)
A one-way ANOVA revealed that there were significant differences in the mean
scores across the three participant groups (F(2, 126) = 14.687, p < 0.0001). Post-hoc
comparisons using Bonferroni tests indicated that the mean scores for all three participant
groups were different from each other. The native speaker group scores were different
from both learner groups: L2 advanced (p = .042) and the L2 intermediate (p = .030).
Additionally, the scores of the L2 advanced group were also significantly different from
the intermediate L2 group (p < 0.0001). Although the difference in the performance of
the L2 advanced and native speaker groups merits discussion (the learners answered on
average 5% higher than the natives did) it is important to note that overall, all of these
comprehension score rates (including the scores of the native speakers) are in keeping
with comparable L1 and L2 processing experiments.
5.4.1.2 Reading Times
As mentioned in Chapter Four, the software package used in the experiment
recorded the time spent on every segment by each participant in milliseconds. Figure 5-5
graphs the magnitude of the RTs (in milliseconds) of both the early condition
(expectation met) and late condition (expectation not met) across all fifteen regions for
132
132
native Spanish speaker participants. As mentioned in Chapter 4, stimuli included 24
dislocated sentences in two versions (early and late conditions): one where a sentential-
conferred prediction was violated (dependency was completed late—clitic did not appear
until the embedded verb), relative to another where the prediction was met (dependency
was completed early—clitic appeared at the first possible opportunity, the main verb).
Figures 5-6 and 5-7 plot the RT magnitudes in both conditions for the L2 advanced and
L2 intermediate groups, respectively. Reading times are presented in their original (raw)
form and were trimmed at the 200 ms and 3000 ms lower and upper boundaries,
respectively. Screening for outliers proceeded as follows.
Figure 5-5 Reading times per region for the early and late completion conditions (error bars +/- 1SE) for the native speaker group (see (5-1)): A1 aquellas2 estudiantes3, la4 linda5 secretaria6 felizmente7 les8 contó9 que10 probablemente11 las12 admitirán13 en14 el_programa15
1��
134
Figure 5-6 Reading times per region for the early and late completion conditions (error bars +/- 1SE) for the advanced L2 learner group (see (5-1)): A1 aquellas2 estudiantes3, la4 linda5 secretaria6 felizmente7 les8 contó9 que10 probablemente11 las12 admitirán13 en14 el_programa15
135
Figure 5-7 Reading times per region for the early and late completion conditions (error bars +/- 1SE) for the intermediate L2 learner group (see (5-�)): A1 aquellas2 estudiantes3, la4 linda5 secretaria6 felizmente7 les8 contó9 que10 probablemente11 las12 admitirán13 en14 el_programa15
136
136
As is customary, we excluded from the analyses those participants whose
performances were clearly distant from the rest of the participant groups. Some outliers
scored less than 3 standard deviations in the yes/no (experimental) comprehension
questions. This resulted in the exclusion of four participants: two from the advanced
group and one from each the native and intermediate L2 groups. In view of the fact that
offline knowledge of clitic presence in CLLD (as demonstrated in the offline rating task)
constitutes a syntactic pre-requirement to the processing of CLLD, we excluded a total of
one additional participant. This individual (native speaker) was the only participant from
the native speaker group who did not made a distinction between any of the offline
CLLD conditions, or did so in the wrong direction (0 out of five). Thus, the data
discussed below included the responses from 34 native speakers, 54 advanced L2
participants and 36 intermediate L2 participants.
As illustrated in Figures 5-5, 5-6, and 5-7, there are very few differences in mean
RT values between the early and late conditions for all groups. This is unsurprising, given
that these sentences were identical expect for the presence (early condition) or absence
(late condition) of the clitic (segment 8). A two-way repeated-measures ANOVA was
performed with condition (early—expectation met and late—expectation not met) and
segment as within-participant variables and group as a between-participants variable
(native speakers, L2 advanced, L2 intermediate). The total number of segments compared
was fourteen. Segment eight was excluded from this analysis because the first clitic was
absent in the late condition; therefore, no comparisons could be made. The significance
level was established at p < 0.05. A Mauchly's Test of Sphericity indicated that the
assumption of sphericity had been violated for segment, (χ2(90) = 1658.569, p < 0.0001)
and for condition*segment (χ2(90) = 682.406, p < 0.0001). Thus, the df values reported
include a Greenhouse-Geisser correction throughout
Analyses of variance revealed that there was a significant effect of segment
(F(2.855, 339.350) = 342.612, p < 0.0001) on reading times, but no effect of condition
137
137
(F(1, 120) = .200, p > 0.05). This is unsurprising given that both conditions were nearly
identical. Furthermore, there was no effect of participant group (F(1, 120) = 2.114,
p > 0.05). Additionally, the interaction between group and condition did not have a
significant effect on reading times (F(2, 120) = 1.493, p > 0.05). Tests revealed a
significant interaction between segment and group (F(5.710, 342.612) = 4.252,
p < 0.0001). Table 5-10 displays the pairwise comparisons (Bonferroni correction) with p
values per segment. Only significant values are shown. All other segments fell over the
established significance level. The superscripts (“n” for native speakers, “a” for L2
advanced and “i” for L2 intermediate) indicate the group with the higher mean (i.e.
slower reading times overall).
L2 Intermediate (n=36)
L2 Advanced (n=54)
Native Speakers (n=34)
Segment 4 (p = 0.003)n
Segment 12 (p = 0.001)n
Segment 14 (p = <0.0001)n
Segment 15 (p = 0.005)n
Segment 4 (p = 0.052)n
Segment 5 (p = 0.29)n
Segment 12 (p = 0.014)n
Segment 14 (p = 0.16)n
L2 Advanced (n=54) Segment 7 (p = 0.006)i
Table 5-10 Self-paced reading pairwise comparisons (segment * group): all segments except segment 8 for all participant groups. Superscripts indicate the group with the higher RT mean.
Note that in every case where there was a significant contrast involving native
speakers (both when compared to the L2 advanced and the L2 intermediate), the native
speaker participants had the highest RT mean (i.e. natives read slower). In the only case
where the advanced and the intermediate where compared, the intermediate had a higher
mean (i.e. they read segment 7 slower than the L2 advanced). This difference in reading
speed might reflect differences in the background of the participants, especially as it
138
138
relates to age, reading habits, ease with technology, and familiarity with linguistic
experiments.
The analyses of variance also revealed that there was a significant interaction
between condition and segment (F(5.122, 10.245) = 4.759, p < 0.0001). Overall, the early
and late conditions were read at significantly different speeds, with the late condition
been read slower on two different segments: segment 9 (ME.9 = 552.104, SDE.9 = 13.794,
ML.9 = 624.04, SDL.9 = 12.673, p < 0.0001) and segment 10 (ME.10 = 450.428,
SDE.10 = 8.88, ML.10 = 477.214, SDL.10 = 9.304, p < 0.0001). Additionally the last segment
(segment 15) was, overall, read faster in the early condition than in the late condition
(ME.15 = 1246.181, SDE.15 = 51.01, ML.15 = 1185.773, SDL.15 = 44.021, p = 0.044). Finally,
there was a also significant three-way interaction among condition, segment, and group
(F(10.322, 624.480) = 1.904, p < 0.040). This three-way interaction was further
examined.
Pairwise comparisons with Bonferroni corrections revealed that the native speaker
group did not display a significant difference in terms of reading times between the late
and early conditions on segments 1 through 7, and 11, though 13. Table 5-11 presents
these comparisons for all three groups with their respective p values. On the other hand,
the native group did display significant differences in reading times between the late and
early conditions in segments 9 (ME.9 = 545.19, SDE.9 = 192.17, ML.9 = 645.68, SDL.9 =
162.34, p < 0.0001), 10 (ME.10 = 467.24, SDE.10 = 147.06, ML.10 = 501.08, SDL.10 =
132.68, p = 0.004), 14 (ME.14 = 587.33, SDE.14 = 200.55, ML.14 = 558.90, SDL.14 = 216.33,
p = 0.032) and 15 (ME.15 = 1430.95, SDE.15 = 800.82, ML.15 = 1329.38, SDL.15 = 687.89,
p = 0.004). The advanced L2 group displayed the same contrasts as native speakers in
segments 9 (ME.9 = 519.55, SDE.9 = 138.95, ML.9 = 614.90, SDL.9 = 142.56, p < 0.0001),
and 10 (ME.10 = 436.23, SDE.10 =84.75, ML.10 = 4.68.00, SDL.10 = 90.55, p = 0.001), and
throughout except for the last three segments. Out of these last three segments, only
segment 13 (ME.13 = 779.17, SDE.13 = 248.45, ML.13 = 740.41, SDL.13 = 242.77, p = 0.05),
139
139
exhibited a significant contrast. The intermediate learners showed a different pattern.
These learners only made a significant contrast in terms of reading times between the
early and late conditions on segment 9 (ME.9 = 569.55, SDE.9 = 122.76, ML.9 = 610.34,
SDL.9 = 105.29, p = 0.037). Table 5-12 indicates the pairwise comparisons (per segment)
of the condition * segment * group interaction. The superscripts (“e” for early condition,
“l” for late condition) indicate the condition with the higher mean (i.e. which condition
was read slower per segment).
Segment / word (see 5-�)
Native Speakers (n=34)
L2 Advanced (n=54)
L2 Intermediate (n=36)
1 | A n.s. n.s. n.s.
2 | aquellas n.s. n.s. n.s.
3 | estudiantes n.s. n.s. n.s.
4 | la n.s. n.s. n.s.
5 | linda n.s. n.s. n.s.
6 | secretaria n.s. n.s. n.s.
7 | felizmente n.s. n.s. n.s.
9 | contó (p < 0.0001)l
(p < 0.0001)l
(p = 0.036)l
10 | que (p = 0.004)l
(p = 0.001)l n.s.
11 | probablemente n.s. n.s. n.s.
12 | las n.s. n.s. n.s.
13 | admitirán n.s. (p = 0.05)e n.s.
14 | en (p = 0.05)e n.s. n.s.
15 | el_programa (p = 0.004)e n.s. n.s.
Table 5-11 Self-paced reading pairwise comparisons (condition*segment*group): all segments except segment 8 for all participant groups. Superscripts indicate the condition with the higher mean.
All of the research outcomes on the prior discussion have been presented by
binning the L2 participant results by proficiency level. As mentioned earlier, this artificial
grouping based on admittedly arbitrary (albeit conventionally used in the literature) cut
140
140
off points can be problematic in several levels. Even if we consider using the same
proficiency test and same language pairings, one potential issue, for example, is that a
slight variation in cut-off points can jeopardize how replicable the results are. There are
other problems associated with artificial groupings. For example, researchers (e.g. Cohen,
1983 cited in West, Aiken & Kull, 1996) have shown how the artificial dichotomization
of a continuous variable can result in reduced statistical power, which has an effect on the
individual differences variables as well as the interactions between these and the
manipulated variables.
Although a review of the problems of this statistical analysis approach is outside
the scope of this chapter, it is important to note that the decision of grouping participants
has important repercussions. Therefore, in order to examine the relationship between L2
proficiency and the magnitude of the effect elicited by the violation of expectations (i.e.
whether participants were able to predict an upcoming element based on previous
information) in a more continuous fashion, this relationship was also examined through
the calculation of a Pearson correlation coefficient.
Figure 5-8 displays a scatterplot with proficiency, as measured with the
proficiency test, scores ranging from 25-50, and reaction times in the early (expectation
violation) and late (expectation met) conditions at segment 9. In view of the fact that the
purpose of this comparison is to determine whether with increasing proficiency
participants show heightened sensitivity to the violation of expectations, only the L2
learners are plotted in this graph. Note that, for ease of exposition, although I use the
difference score (late RTs minus early RTs) in the calculations, Figure 5-8 illustrates the
RTs in each condition separately.
141
141
Figure 5-8 Scatter plot of proficiency test scores and reaction times at segment 9 in the late (expectation violation) and early (expectation met) conditions for the L2 learner group.
Overall, using the difference score between the early and late conditions (the
subtraction of the RTs of the late condition from the RTs in the early condition where a
positive difference is interpreted as heightened sensitivity to the violation of
expectations), L2 learners demonstrated a significant positive correlation between
proficiency scores and matrix-verb RTs when the clitic was absent relative to when it was
present (r = .335, n = 91, p = .001). Thus, with increasing proficiency, the learners show
an increased sensitivity to the violation of expectations. It is important to note that
142
142
although it appears like the error signal is increasing in magnitude, the increase in the
magnitude in the difference scores appear to be driven in large part by as speed up on the
processing of the main verb when a clitic was present. In other words, although it is true
that with increasing proficiency, non-native RTs appeared progressively similar to NSs in
terms of the magnitude of RTs elicited by the expectation violation, this is due to two
factors. First, the learners took longer to read the late condition (expectation violated) as
their proficiency increased but, crucially, they also were faster when reading the main
verb in condition where their expectations were met.
5.4.2 Non-verbal Stroop task
This task was included to measure the ability to suppress information that is
irrelevant to a given task (i.e. ‘inhibitory control’). This task, adapted by Blumenfeld and
Marian, 2011 (from Liu, Banich, Jacobson & Tanabe, 2004) has been previously used in
research with bilinguals and has been found to relate to auditory comprehension. The
non-verbal Stroop task creates conflict by manipulating different aspects of the stimulus
(an arrow, in this case) so that these are either congruent or incongruent. The arrow
varied in two dimensions: the arrow’s location (arrow was located at either the right or
left side of the screen) and the arrow’s direction (arrow could either point right or left).
The task included 160 trials (3:1 ratio congruent/incongruent): 120 congruent trials and
40 incongruent trials. Trials were scored for accuracy (correct, incorrect). For a trial to be
coded as correct, it had to be answered correctly the first time within the time limit.
Figure 5-9 presents the mean accuracy rates and standard errors on the congruent and
incongruent trials.
143
143
Figure 5-9 Mean accuracy scores and standard errors for the congruent and incongruent trials for the non-verbal Stroop task (all groups).
In terms of reaction times, only the RTs on accurate trials were considered. Figure
5-10 presents the means and standard errors of reaction times per group on the congruent
and incongruent trials.
144
144
Figure 5-10 Mean reaction times and standard deviations for the congruent and incongruent trials for the non-verbal Stroop task (all groups).
Following the analysis carried out in Blumenfeld and Marian’s study, in order to
compare the groups in terms of two different measures (accuracy and response times) the
data was analyzed using 2 x 2 mixed ANOVAs with trial type (congruent, incongruent)
as within-subjects factor and group (native speakers, L2 advanced, L2 intermediate) as
between subjects factor. As in the case of Blumenfeld and Marian’s study, the analyses
yielded a significant effect of trial both in terms of accuracy and RTs. Namely,
participants were more accurate (F(1, 112) = 170.187, p < .0001) and faster (F(1, 112) =
853.457, p < .0001) in the Congruent trials than in the Incongruent trials. There was also
an effect of group both in terms of accuracy (F(2, 112) = 5.939, p = .004) and RTs (F(2,
112) = 16.980, p < .0001). Finally, there was no interaction between trial and group in
145
145
terms of RTs or accuracy. Bonferroni post hoc tests revealed that in terms of accuracy,
the native speaker group was significantly less accurate than both the L2 advanced and
the L2 intermediate groups (p = .035 and p = .003, respectively). However, there was no
significant difference in terms of accuracy between the L2 intermediate and advanced
groups (p > .05).
In terms of reaction times, tests revealed that the native speaker group was slower
than both the L2 advanced and the L2 intermediate groups (p = .029 and p < .0001,
respectively). Interestingly, the L2 advanced group was also significantly slower in terms
of RTs than the L2 intermediate group (p < .0001). Thus, overall, as far as accuracy is
concerned, the results of this task are in general agreement with the notion that there is an
advantage in inhibitory control for the L2 learner groups overall. Furthermore, in regards
to accuracy, this advantage appears to be modulated by proficiency in the L2. In terms of
RTs, however, there was an overall effect between the L2 learner group as a whole
(native speakers were slower than both of the L2 groups), but proficiency did not appear
to modulate this relationship. It is possible age effects might affect the nature of the
relationship, especially as the L2 group is concerned. On average, the native speaker
group was a decade older than the advanced L2 learners and, in turn, the L2 advanced
learners were also about a decade older than the L2 intermediate learners (MNS = 43.58,
SDNS = 12.02; ML2.A = 32.94, SDL2.A = 10.46; ML2.I = 21.42, SDL2.I = 3.6). Additionally,
it is also possible that the L2 group was more used to experimental settings. Although the
groups were somewhat match for education level (all the natives had completed at least
two years of college), most of the natives had attended college many years prior.
5.5 Inter-correlations between tasks
As mentioned in Chapter 4, one of the objectives of this study was to investigate
whether individual differences in abilities such as inhibitory control and statistical
learning ability could predict variability in the acquisition and processing of CLLD in L2
146
146
Spanish. For this reason, a correlation among the different tasks was calculated. Table 5-
12 presents the inter-correlations between the different task measures for the L2 group
only, using the same exclusion criteria described in section 5.4.1.2.
Table 5-12 shows that Proficiency scores were positively related to all but
one of the task measures and other variables: the measure of statistical learning
ability. In fact, statistical learning ability did not show a positive association with
any of the different task measures. The fact that proficiency scores correlated
significantly with both overall exposure to Spanish, (r = .478, n = 90, p < .0001,
and with study abroad, r = .511, n = 90, p < .0001) is significant although
expected under Usage-based models. These correlations, considered “moderate”
in magnitude, provide evidence that proficiency is driven, at least to some degree,
by the amount of exposure learner has had to the language.
147
Prof D9 Clitics SLT SILS Acc O
Acc Ex
Study Abroad Exp.
Cong RTs
Incong RTs
Cong Acc.
Incong Acc.
AJT CL
AJT WO
D9 .321**
Clitics .703** .239*
SLT .036 -.099 .055
SILS .682** .132 .462** .037
Acc O .537** .193 .354** .093 .525**
Acc Ex .619** .200 .560** .091 .618** .794**
S Abroad .511** .225* .397** -.056 .285** .243* .222*
Exp .478** .087 .381** .108 .383** .272** .246* .272**
Cong RTs .389** .003 .312** .130 .363** .351** .298** .221* .310**
Incong RTs .343** -.054 .270* .117 .320** .286** .198 .197 .309** .829**
Cong Acc -.170 .240* -.160 -.014 -.168 -.086 -.085 -.087 -.117 -.326** -.277**
Incong Acc -.105 .176 -.018 -.023 -.047 .026 .070 -.003 -.106 -.380** -.426** .653**
AJT CL .358** .054 .257* .044 .216* .177 .308** .406** .278** -.003 .010 -.141 -.047
AJT WO .346** .026 .365** -.128 .251* .141 .190 .289** .106 .202 .223* -.141 -.046 .180
Age .595** .023 .441** .114 .528** .351** .351** .427** .755** .549** .501** -.475** -.291 .363** .227**
* Correlation is significant at the 0.05 level; ** Correlation is significant at the 0.01 level
Table 5-12 Inter-correlation between task measures (significant correlations in shaded cells).
148
Note:
Prof = Proficiency score; D9 = Magnitude of Expectation violation in milliseconds (= late RTs – early RTs), segment 9; Clitics = Clitic test score; SLT = Statistical learning score; SILS = Vocabulary score; Acc O = Overall accuracy in comprehension questions in SPR; Acc Ex = Accuracy in comprehension questions in SPR on the experimental items only; SAbroad = Length of study abroad; Exp = Total exposure to Spanish; Cong RTs = Stroop task, RTs on Congruent items; Incong RTs = Stroop task, RTs on Incongruent items; Cong Acc = Stroop task, Accuracy scores on Congruent items (percentages); Incong RTs = Stroop task, Accuracy scores on Incongruent items (percentages); AJT CL = Acceptability Judgment Task difference scores (= grammatical – ungrammatical) in embedded clause items where the manipulation involved clitics; AJT WO = Acceptability Judgment Task difference scores (= grammatical – ungrammatical) in main clause items where the manipulation involved word order; Age = Age of participant.
Table 5-12 (continued) Inter-correlation between task measures (significant correlations in shaded cells).
149
In terms of the off-line measure of CLLD knowledge which was most
comparable to the self-paced reading task (AJT CL = Acceptability Judgment Task
difference scores (= grammatical – ungrammatical) in embedded clause items where the
manipulation involved clitics), we find that these scores were positively related to
Proficiency (r = .358, n = 90, p = .001), the scores of the clitic test (r = .257, n = 90, p =
.015), the measure of English vocabulary (r = .216, n = 90, p = .042), accuracy in the
self-paced reading compression questions in experimental items (r = .308, n = 90, p =
.003), length of study abroad (r = .406, n = 90, p < .0001) and overall exposure to
Spanish (r = .278, n = 90, p = .008). Importantly, the scores from the off-line CLLD task
were not related to the on-line CLLD task (r = .054, n = 90, p > 0.05).
Multiple regression analysis was used in order to assess the degree to which each
task measure predicted performance in the off-line CLLD task (both the portion
manipulating word order and the portion manipulating clitic presence/absence). The
predictors included were Proficiency, scores on the Clitic Test, SILS scores, length of
study abroad, total Exposure to Spanish, accuracy on the Stroop task (congruent and
incongruent trials), RTs on the congruent and incongruent trials of the Stroop task and
accuracy on the comprehension questions of the self-paced reading task. For the portion
manipulating clitic absence/presence (Embedded CLLD), the results of the regression
indicated that the predictors explained a significant portion of the variance (R2 = .299,
F(11,88) = 2.989, p = .002). These analyses showed that the reliable predictors of native-
like CLLD judgments were length of study abroad (β = .303, p = .010), total exposure to
Spanish (β = .242, p = .036), and accuracy on the self-paced reading comprehension
questions (β = .541, p = .007). The results of the multiple linear regression for the portion
of the AJT manipulating word order was not significant (R2 = .200, F(11,88) = 1.75,
p > 0.05).
150
In terms of the on-line measure (D9 in Table 5-13, or the magnitude of
expectation violation obtained by subtracting the RTs in the early condition from the RTs
in the late condition in segment 9), these scores were positively related to three other task
measures: Proficiency, (r = .321, n = 90, p = .002, moderate effect), the scores of the
Clitic Test (r = .239, n = 90, p < .0001, small effect) and study abroad (r = .225, n = 90,
p = .033, small effect). It is relevant to note that this on-line measure did not display a
positive association with overall exposure to Spanish (r = .087, n = 90, p > 0.05). Because
the number of months spent learning Spanish is logically confounded with the number of
months spent studying abroad, two partial-correlations were computed. These partial
correlations showed that there was a significant positive correlation between the
magnitude of the effect elicited by the violation of expectations and number of months of
study abroad, after statistically controlling for the total number of months spent learning
Spanish overall (r = .219, n = 87, p = .049). However, after controlling for the number of
months of studying abroad, there was no significant relationship between the number of
months spent learning Spanish and the magnitude of the expectation violation (r = .028,
n = 87, p = .796).
In order to assess the degree to which each task measure predicted performance in
the on-line CLLD task after statistically controlling for other measures in an alternative
way, regression coefficients were calculated for each measure (Proficiency, scores on the
Clitic Test, SILS scores, length of study abroad, total Exposure to Spanish, accuracy on
the Stroop task (congruent and incongruent trials), RTs on the congruent and incongruent
trials of the Stroop task, accuracy on the comprehension questions of the self-paced
reading task). The results of the regression indicated that the predictors explained a
significant portion of the variance (R2 = .226, F(11,88) = 2.040, p = .036). Analyses
showed that there were two reliable predictors of native-like RT patterns on segment 9:
L2 Proficiency (β = .386, p = .05) and accuracy on the Congruent trials of the Stroop task
151
(β = .308, p = .029). In summary, when controlling for the effects of all the other
measures, the only significant predictors of native-lie processing of CLLD was L2
proficiency level and accuracy on the Congruent trials on the Stroop task.
5.6 Summary
This chapter presented the results of the experimental tasks used in this study with
the intention to test the predictions of the Shallow Structure Hypothesis as well as to
investigate the relationship between proficiency and the ability to generate predictions.
These tasks included off-line and on-line tasks that measured both linguistic and non-
linguistic abilities. The two main tasks were the tasks focusing on CLLD: an offline
rating task and an online self-paced reading task. A portion of the off-line rating task
tested main clause CLLD while the other tested embedded CLLD. As mentioned in
section 5.3, overall, the results of the native speakers and the L2 advanced participants on
this task conformed to the expectations based on the linguistic literature. On the other
hand, the intermediate group had trouble making a distinction between the grammatical
and ungrammatical sentences in two of the five conditions.
The results of the on-line self-paced reading task were largely comparable to the
results of the off-line task. Namely, all participant groups displayed a significant
difference in reading times in segment 9 (in the late versus the early condition), following
the absence/presence of the clitic. There was a spill over effect on the following segment
for both the native speakers and the L2 advanced, but not the intermediate L2 group.
There was also an effect on the regions following the appearance of the second clitic.
Namely, both the native speaker group and the L2 advanced group some of the segments
in the early condition slower than in the late condition. While the advanced group
registered this effect earlier (segment 13, immediately following the clitic), the native
speaker group registered the effect on segments 14 and 15 (final segment). The
152
intermediate group showed only one effect, on segment 9. As I will argue in the next
chapter, I believe these results pose a problem for accounts such as the Shallow Structure
Hypothesis, which argues that the representations of the L2 learners will show processing
differences due to incomplete grammatical representations.
153
CHAPTER 6
DISCUSSION AND CONCLUSIONS
6.1 Introduction to the Chapter
This chapter will present a summary of the main findings of the experimental
tasks included in the study and detail how these results address the research questions that
informed this empirical study. Additionally, this discussion will situate the results within
previous research studies, especially those on L2 processing. Although some of the
results replicated some aspects of previous investigations (Blumenfeld & Marian, 2011;
Gómez, 2002; Pablos, 2006), the investigation of long-distance dependencies involving a
pronoun rather than a gap in L2 processing had not been previously carried out in L2
research. Although these novel findings are an important first step, more research in this
area is clearly needed in order to draw strong generalizations.
As noted earlier, the overarching goal of this dissertation was to examine the on-
line processing of long distance dependencies by L2 learners of Spanish with special
regard to the predictions of the Shallow Structure Hypothesis. Thus, in this chapter I will
address the degree to which the SSH can account (or not) for the experimental findings.
In order to draw connections between judgments and processing of long-distance
dependencies by L2 learners, I will compare the on-line and off-line results of the tests
measuring CLLD knowledge and processing.
Additionally, I will discuss the results of the tasks measuring linguistic and non-
linguistic individual abilities and detail the extent to which these tasks can account for the
variability found in the CLLD tasks. After presenting the main findings and how these
address the three main research questions, this chapter will include a discussion of the
shortcomings and limitations of the study. Finally, I will offer some possible avenues for
future research studies and offer a brief summative conclusion.
154
6.2 Summary of Main Findings
In this section, I will present the summary of the main findings as these relate to
the research questions put forth in Chapter 4, which are reproduced below.
6.2.1 Research Question 1
1. Do L2 learners of Spanish, native speakers of English, show evidence of off-line
knowledge of the syntactic and pragmatic constraints of CLLD in Spanish? If they
do, how does L2 proficiency modulate the L2 learners’ off-line judgments?
In view of the results of the untimed, audio-visual acceptability judgment task
focusing on CLLD, this research question can be answered, overall, in the affirmative: a
segment of the L2 learner group judged the grammaticality of embedded and main clause
CLLD in a native-like manner. Overall, the performance of the native Spanish speakers
also confirmed the expectations set by the linguistic literature. Importantly, the
performance of the L2 learners showed evidence of being modulated by proficiency
level: The more proficient the learners, the better they were at judging the sentences.
Recall that there were two main types of items in the rating task: Main clause CLLD,
where the dimension that was manipulated in the stimuli was word order and Embedded
CLLD, which manipulated clitic absence/presence.
With regard to Main clause CLLD items, there were two conditions that paired
CLLD with other left-periphery operations: Fronted Focus and wh-movement. In these
conditions, the crucial difference between the grammatical and ungrammatical sentences
involved word order. When CLLD is combined with other structures in the left periphery,
CLLD must precede any other structures. Thus, the order Topic > {FF, wh} was expected
to be grammatical according to the literature while the alternative order *{FF, wh} >
Topic was expected to be ungrammatical. Not surprisingly, the native speaker judgments
revealed just such a difference (see Figure 5.1). Additionally, the L2 advanced and
155
intermediate groups also showed a clear distinction between the ungrammatical and
grammatical sentences. These overall results are not completely surprising, given that the
even the intermediate group had been exposed to an average of 7.5 years of Spanish
instruction and practice. Nevertheless, this is not an easy feat, especially because this
knowledge is not likely to be something learned explicitly, as CLLD is not typically
covered in classroom instruction. Furthermore, although these sentences are
ungrammatical, the different orders would not necessarily impede the comprehensibility
of the sentences.
In terms of comparing conditions A and B (Main Clause CLLD), one result
clearly stands out. Namely, although the native speakers reliably distinguished between
the grammatical and ungrammatical sentences, they rated the grammatical sentences in
the CLLD + Fronted Focus reliably lower than those in the grammatical sentences in the
CLLD + wh-questions condition. One possibility is that this is a dialectal preference—the
native speaker group was tested in the same region in central Mexico. López (2009), for
example, has noted that Fronted Focus is less acceptable in Mexican Spanish. However,
prior investigation using this structure (e.g. Leal Méndez et al., 2014) testing a group of
native Mexican Spanish speakers did not reveal such a marked difference.
Another possibility is that the sentences were rated lower because they are more
infrequent. Although to my knowledge there are no corpora studies investigating the
frequency of Fronted Focus, there is no doubt that this structure is much less frequent
than wh-questions, regardless of the dialect. Furthermore, the frequency of Fronted Focus
in combination with CLLD should be even lower. In my own judgments, however, the
ungrammatical sentences involving CLLD + wh-questions are indeed much worse than
those involving CLLD + Fronted Focus. Thus, although it would appear, in the surface,
that the L2 speakers are outperforming the natives, it might be the case that the natives
156
are sensitive to these subtle frequency differences. Clearly, more research in this area is
needed.
If we assume López’s, analysis, however, there might be a more straightforward
explanation for these results. Recall that in López’s analysis, both CLLD dislocated
elements and Fronted Focus dislocates move to in Spec,Fin (multiple specifiers). The
order CLLD > Fronted Focus is thus driven by the requirement that the CLLD dislocated
element, which is marked [+a] by the pragmatics, must find its (discourse antecedent).
The dislocated element in Fronted Focus, which is marked [-a] by the pragmatics, does
not have such a requirement. Note that these requirements are brought into effect by
discourse constraints, rather than by syntactic ones. On the other hand, the CLLD > wh-
element order derives, in principle, from two constraints. The first, as in the case of
Fronted Focus, from the (discourse) requirement that the CLLD dislocated element be
closer to its antecedent. The second is syntactic in nature because wh-movement is also
driven by an uninterpretable feature that motivates the movement of the wh-constituent to
Spec,Fin. This uninterpretable feature, being a “narrow” syntactic feature, operates
independently of the surrounding discourse context. Thus, again, if we follow López’s
analysis, the difference between these requirements (discourse requirements vs. discourse
+ syntax requirements) appears explain the native speaker judgments (i.e. judging the
ungrammatical order *Fronted Focus > CLLD higher than the ungrammatical order *wh-
element > CLLD). Under Rizzi’s approach, however, there is no straightforward
explanation, as there is no a priori reason to assume differences between movement to the
speficier of TopP vs. movement to Spec,whP.
The portion of the rating task investigating Embedded CLLD included three
different conditions: conditions C (Subjunctive complements), D (Temporal adverbial
clauses) and E (Sentential Subjects). These conditions were chosen for two reasons. First,
several research investigations had previously uncovered that L2 learners were successful
157
at rating the acceptability of main clause CLLD when the relevant dimension on which
the grammatical and ungrammatical sentences differed was the absence/presence of a
doubling clitic (e.g. Slabakova et al., 2010, 2011, 2012).47 Second, the on-line task used
in this study (i.e. self-paced reading) involved embedded clauses, so including embedded
CLLD items were deemed to be most suitable in terms of comparing the on-line and off-
line tasks. With regard to the portion of the rating task focusing on Embedded CLLD,
both the native speakers and the L2 advanced learners made a significant distinction
between how they judged the grammatical versus the ungrammatical sentences in all
conditions. The intermediate L2 learners, however, only made a distinction between the
two in one condition: CLLD in Sentential Subjects.
Interestingly, however, it is precisely in the Sentential Subjects condition that the
judgments of native Speakers show an interesting effect. Haegemann (2006) noted that
her Spanish-speaking informants had rated these sentences lower. However, in this
investigation what we see is that native speakers did not rate the grammatical sentences
lower; instead, they rated the ungrammatical sentences higher (rejected them less
reliably) than in the other two conditions. Although the L2 advanced and intermediate
speakers reliably distinguished between the grammatical and ungrammatical sentence, we
do not see such an effect in their responses.
In sum, in regards to the embedded CLLD, CLLD in Sentential Subjects were
judged in a more native-like fashion earlier (at the intermediate level), whereas CLLD in
Temporal Adverbials and Subjunctive Complements is not mastered until the advanced
level. Although it might be tempting to appeal to a notion like syntactic complexity, it is
unclear how that would work in these particular conditions. Recall that this manipulation
47 Additionally, these investigations explored two different semantic relationships between the dislocated element and its antecedent (identity, set-subset). The stimuli for the rating task in the present study only included identity relationships.
158
only varied in one dimension: the absence or presence of the clitic. Additionally, for
independent reasons, all the sentences in the Sentential Subjects condition involved the
subjunctive, so it cannot be the case that the presence of the subjunctive alone (i.e. in the
Subjunctive Complements condition) could account for the difference. An alternative
explanation might again involve frequency. Although I have no personal intuitions about
the relative frequency of occurrence of these three sentence-types, it is not impossible to
imagine that there might be a difference among then.
Nevertheless, the most important result from the portion of the rating task
focusing on Embedded CLLD is that, overall, L2 learners can eventually reach native-
like performance. In terms the relationship between Proficiency and off-line CLLD
knowledge, we can see that as learners attain higher proficiency levels, their off-line
judgments of CLLD become increasingly more native-like. Additionally, there seems to
be an effect of the type of CLLD in the sequence of acquisition, with some properties
being acquired earlier than others.
6.2.2 Research Question 2
2. Are long-distance dependencies such as Spanish CLLD especially difficult for L2
learners to process in real time, as predicted by the Shallow Structure Hypothesis?
As mentioned in Chapter 2, the SSH (Clahsen & Felser, 2006a,b) makes clear and
testable predictions with regard to the processing of long-distance dependencies by
learners of a second language. Specifically, the SSH proposes that when processing a
sentence in their L2, learners can make use only of semantic and contextual information
(e.g. thematic information, argument structure, word order, world-knowledge), to the
exclusion of hierarchical syntactic information which is crucial to process long-distance
syntactic dependencies. This is proposed to be the case regardless of L1 background (i.e.
L1 English vs. L1 Japanese) or L2 proficiency level. Thus, the SSH would not expect an
159
improvement in performance that is specifically yoked to proficiency. The SSH has
received support from a variety of empirical investigations focusing on the L2 acquisition
and processing of wh-dependencies, relative clause attachment preferences, and verb
subcategorization (e.g. Clahsen, Felser, Neubauser, Sato, & Silva, 2010; Felser, Roberts,
Gross & Marinis, 2003; Marinis et al., 2005; Papadopoulou & Clahsen, 2003). However,
there is also a body of evidence that shows evidence against the predictions of the SSH
(e.g. Hopp, 2006, 2013; Tanner, 2011).
In terms of the present empirical investigation, the SSH predicts that the L2
learners in this study should not be able to predict a syntactic element (in this case, an
agreeing clitic) based on structural cues. Recall that the self-paced reading task
manipulated syntactic expectations—in one condition, the long-distance dependency was
completed at the first possible opportunity (early condition). On the other hand, the late
condition delayed this resolution, thereby investigating the effects of predictions when
these were not met (compared to when they were met). The absence of a clitic in the
sentences in the late condition is unexpected but it does not alter meaning—the clitic is a
purely syntactic reflex in Spanish. In the paragraphs that follow, I will address the results
in the main region of interest (segment 9, main verb). I will then address the effects in the
latter part of the sentence.
With regard to the main region of interest (segment 9), the results of the native
speakers essentially replicate the effects elicited in a prior investigation of long-distance
dependency processing in L1 Spanish speakers (Pablos, 2006). Namely, there was a main
effect of clitic presence at the main verb, where verbs in the late condition were read
substantially slower than those in the early condition. As indicated in Chapter 4, the only
difference between the two conditions was the presence of a (dative) clitic before the
main verb in the early but, crucially, not the late condition.
160
With respect of the explanatory adequacy of the SSH, even the results from the
L2 intermediate group appear problematic for the hypothesis because the data shows that
these learners, like the native Spanish speakers, read segment 9 (main verb) differentially,
when comparing RTs in the late and early conditions. In other words, these learners
showed sensitivity to the violation of expectations. As was the case with the native
speakers in the present study (and those in Pablos’s study), the intermediate L2
participants took longer to read the main verb in the late condition—the condition where
the expectation for a clitic was violated (clitic was absent). Interestingly, this same group
showed that, even if their off-line judgment of embedded CLLD was somewhat
inconsistent in some of the embedded CLLD conditions, the on-line processing of CLLD
embedded sentences showed sensitivity to the violation of expectations. However, it must
also be noted that the conditions in the off-line CLLD task different structures, which
may make the comparison of these two tasks far from straightforward.
The data from the advanced L2 learner group are also problematic for any
approaches to SLA, such as the SSH, which posit unassailable processing difficulties for
L2 learners. Again, with regard to the current study, the SSH predicts that learners will
not have access to detailed syntactic information when processing long-distance
dependencies in their L2—even if they show evidence of knowledge of syntactic
dependencies off-line. Unlike the L2 intermediate group, the advanced L2 group
performed within native-speaker expectations on the off-line task in every condition.
Under the SSH, even though L2 advanced learners showed evidence of off-line
knowledge, these learners were not expected to show evidence of native-like language
processing. However, our results showed evidence of native-like processing of CLLD by
advanced learners in segment 9.
As mentioned in the previous chapter, it seems clear that with increasing
proficiency, the learners show an increased sensitivity to the violation of expectations.
161
Once more, as figure 5.8 shows, the increase in the magnitude in the difference scores
was in part driven by faster reading times of the main verb when a clitic was present
(early condition = expectation met). Thus, the increased magnitude of the difference
score (late condition - early condition) indicates that learners are indeed more bothered
when their expectations are violated but it also shows that they have been successful at
learning the long-distance dependency, processing the main verb faster with increasing
proficiency.
In addition to the main effect found at segment 9 (main verb), there was a second
effect elicited by the violation of expectations at the second (embedded) verb, after the
(accusative) clitic. This effect is slightly different than that found in Pablos’s (2006) prior
investigation in terms of the timing of the effect. In Pablos’s study, the effect was visible
at the accusative clitic preceding the embedded verb whereas in the present investigation
the effect was delayed. This difference is very probably due to the fact that the present
study used a slightly modified version of Pablos’s stimuli. Specifically, the stimuli used
in this dissertation simplified the sentences by removing a third, neutral clitic that
appeared in between the main and embedded verbs. A sample stimuli from Pablos’
dissertation is presented in example (6-1) below.
(6-1) Early/Late Completion conditions in Pablo’s stimuli (Late condition omitted the material in parenthesis) A estas chicas, mi hermana mayor más tarde (les) dijo To these girls my sister older more late CL-DAT.3.PL told.3rd.PAST que ya lo cree que las conoce that indeed CL.NEUT.SG think.2nd that CL.ACC.3F.PL knows.2nd desde hace tiempo. since long time “My old sister later told these girls that she indeed thinks she has known them for a long time.”
162
In light of the fact that these sentences were already long and complex, especially
as far as the L2 learners were concened, the present study used a slightly modified
version. This approach was chosen because the sentences would be too complex for the
lower end of the proficiency scaled (three instead of two levels of embedding and three
clitics—accusative, dative, neutral—in the early condition).48 The second, perhaps most
important reason is that Pablos’s original results did not register an effect at that region
for the native speakers in her study. Therefore this middle region including an additional
level of embedding (“ya lo cree que”) was exchanged for a second adverb such as
probablemente (“probably”). Therefore, it is possible that the differences between the
present investigation and Pablos’s study are due to this additional level of embedding
and/or clitic.
In terms of the results of the present investigation, interestingly this second effect
surfaced earlier in the sentence in the data of the L2 advanced group than it did in the
data of the native Spanish speaker group. Whereas the advanced L2 group read the
embedded verb more slowly (segment 13, embedded verb), the native speakers of
Spanish did not show this effect until the word after (segment 14). However, although
this effect appeared earlier in the L2 advanced group (segment 13), the effect was also
more transitory. The native speaker group displayed that effect at segment 14
(preposition), which then spilled over in the final segment (segment 15). The intermediate
L2 group did not read any of the later segments significantly differently at any region
except segment 9. Although the timing of this effect might be somewhat surprising, it is
worthy to note that the advanced L2 group, overall, displayed significantly faster reading
times overall.
48 Additionally, some lexical items were modified to others that were deemed more standard. Recall that the native speakers in Pablos’s study were tested in Spain while the native speakers in this dissertation were tested in Mexico.
163
Nevertheless, the most important feature of this effect is that the direction of the
effect was the same. Namely, the regions in the early condition were read more slowly
than their analogues in the late condition both by the native speaker group in Pablos’s
investigation and the native speaker and L2 advanced groups in the present study. Pablos
interpreted this effect as being “facilitative”. She argued that, given that the sentences in
the late condition the long-distance dependency completion was delayed, participants
read the clitic faster because it was highly predictable. Indeed, this effect seems to point
again at predictive processing—these effects are only possible if participants have an
expectation upon encountering the topicalized element.
In the results of the present investigation, however, neither the native speakers nor
the L2 group showed an effect at the accusative clitic (segment 12). Thus, it might be
somewhat unlikely to hypothesize, based on this data set, that the high predictability of
the clitic would elicit a facilitative effect either one or two words after the clitic (for the
L2 advanced and native speaker groups, respectively). While this explanation might be
possible, especially given the fact that the stimuli were slightly different, I believe that
there might be another way to account for this effect, at least in this data set. In fact, this
alternative explanation might be additive, meaning that it might operate in parallel to
Pablos’s interpretation.
To wit, the generation of predictions might have consequences not only for those
elements for which predictions are generated but also for those elements that occur when
they are not predicted (unexpected elements). In the case of the specific manipulation, it
is reasonable to assume, as Pablos did, that participants would have a high expectation for
the clitic in the late condition, given that the long-distance dependency was not
completed at the first, predicted, opportunity. Therefore, the shorter reading times could
be interpreted as indicating facilitative effects. However, it is also possible that in the
early condition, participants were surprised to encounter a second clitic that agreed with
164
the same topicalized phrase. In this condition, the syntactic dependency was already
completed, so there would be no expectation for a second clitic—even if such a clitic
would be, in theory, grammatical.
Indeed, if we consider this explanation seriously, we would not necessarily expect
this effect at the (accusative) clitic. In Spanish, third person plural accusative clitics are
homonymous with definite articles (los, las). This is relevant because the stimuli in
segment 12 only included clitics which bore the phi features accusative 3rd person,
feminine, plural (las).49 Therefore, given that an accusative clitic would not be expected
in the early condition (i.e. the dependency had already been completed) the point of
disambiguation would not take place until the word following the clitic: the embedded
verb. Indeed, the results of this investigation show that this effect appears not at the clitic
but after encountering the clitic.
In any case, regardless of the interpretation, the data again pose a challenge to the
SSH. Namely, there was no evidence of a lack of sensitivity in the L2 advanced to the
syntactic long-distance dependencies investigated in this study. In fact, the participants in
this group reacted slightly sooner than the participants in the native speaker group. The
data of the intermediate L2 group, however, could very well be accounted by the SSH. As
mentioned earlier, however, the SSH explicitly rejects that these effects would be brought
about by differences in proficiency level. In that regard, the results of this dissertation
lend additional support to the notion that L2 proficiency is perhaps the key factor that
modulates L2 processing. In fact, there exist an increasing number of research studies
that lend support to this idea (Hopp, 2006, 2013; cite, cite).
49 Accordingly, all the dislocated noun phrases also involved the phi features plural, feminine.
165
6.2.3 Research Question 3
3. Can individual differences in measures of linguistic (e.g. L1 vocabulary) and non-
linguistic (e.g. inhibitory control and statistical learning ability) predict variability
in the acquisition and processing of long-distance dependencies such as CLLD in
L2 Spanish?
The study of individual differences in L2 acquisition has a long tradition and has
focused the investigation of numerous factors on various aspects of L2 acquisition
outcomes, including those related to L2 processing. In this regard, there have been studies
investigating the effects of individual differences in working memory (e.g. Baralt, 2010;
Harrington & Sawyer, 1992; Juffs, 2005; Juffs & Harrington, 2011; Mackey, Philip, Egi,
Fuji, & Tatsumi, 2002; Miyake & Friedman, 1998), language aptitude (e.g. Carroll, 1989;
Robinson, 2001; Skehan, 1991), anxiety levels (e.g. Baralt & Gurzynzki-Weiss, 2011;
Kim & Tracy-Ventura, 2011; MacIntyre & Gardner, 1989; Scovel, 1978), and motivation
(e.g. Dörnyei, 2001; Gardner, 2001; MacIntyre, 2002; Williams, Burden, & Lanvers,
2002). This is by no means an exclusive list.
Recently, increased theoretical and empirical attention has been devoted to the
construct of inhibitory control (Green, 1998) and its effects on second language
acquisition and on bilinguals in general (e.g. Bialystok, Craik, Klein, & Viswanathan,
2004; Blumenfeld & Marian, 2011; Kroll, Misra, & Guo, 2008; Levy, McVeigh, Marful,
& Anderson, 2007; Rodriguez-Fornells, Balaguer, & Münte, 2006; Sorace, 2011).50
Green (1998) proposed that the system responsible for inhibitory control must be able to
monitor the task goals as well as manage the attention system so that the system can
ultimately interact with linguistic systems.
50 Although some authors (e.g. Green 1998) draw a distinction between the terms such as “mental control,” “executive control,” “cognitive control” on one hand and “inhibitory control” on the other, where the latter is either a subcomponent or a level of control, I use the terms interchangeably here.
166
As mentioned earlier, Marian and colleagues (e.g. Blumenfeld & Marian, 2007;
Marian, Spivey, & Hirsch, 2003; Spivey & Marian, 1999) have presented evidence that
bilinguals show evidence of activating both their L1 and L2 during processing (parallel
activation). It has been proposed that this dual language activation in bilinguals causes
languages to compete and that that increased practice at managing this competition in
bilinguals can boost inhibitory control. Nevertheless, Schwartz and Kroll (2006) note
that, although it is perhaps counterintuitive, numerous research studies have shown
evidence that parallel activation does not appear to be specifically related to L2
proficiency levels, as even highly proficient bilinguals show significant levels of dual
activation.
Although proficiency level does not appear to modulate the degree of the
cognitive advantages that result from the competition that arises from parallel activation,
the effects have been replicated in several studies. In fact, many researchers working on
this cognitive construct have proposed that bilinguals’ increased need to handle two (or
more) different languages gives rise to lasting cognitive benefits which extend well
beyond the linguistic realm—the so-called “bilingual executive processing advantage”
(see Hilchey & Klein, 2011 for discussion).
In order to investigate the possible effects of inhibitory control task (as measured
by a non-linguistic arrow Stroop) on L2 acquisition and processing, Chapter 5 included
correlation analyses to examine the relationship between performance on this task and the
other measures of linguistic and non-linguistic knowledge. Stroop trials in both trial types
(congruent, incongruent) were compared both in terms of percentage of trial accuracy
(correct, incorrect) and reaction times in milliseconds. Recall that in the congruent trials,
the two dimensions that were manipulated to create a match (e.g. arrow location: right;
arrow direction: right-ward facing arrow). In the incongruent trials, however, the
167
manipulation created a mismatch (e.g. arrow location: right; arrow direction: left-ward
facing arrow).
In terms of accuracy, correlational analyses showed that the accuracy rates on the
Congruent trials displayed a positive relationship with the magnitude of expectation
violation on segment 9 (main verb) in the self-paced reading task. In these analyses, only
the data from the L2 learners were included. Thus, these findings indicate that higher
accuracy rates on the Congruent trials were associated with more native-like processing
in the self-paced reading task. In light of the previous discussion, it would be tempting to
interpret this as a ramification of the so-called bilingual advantage. As noted in the
previous chapter, overall, both of the L2 groups were faster and more accurate than the
monolingual native speakers. The regression analyses, however, showed that accuracy
rates on Congruent trials were only marginally significant.
The interpretation of this result in terms of a bilingual advantage cannot be
straightforwardly interpreted because the accuracy rates on the Incongruent trials did not
correlate significantly with native-like processing in the self-paced reading task. Recall
that these trials were designed to present a mismatch between arrow location and arrow
direction. Thus, one would naturally expect that the effect would be more prominent here.
Nonetheless, this effect was not present in the Incongruent trials.
As far as Stroop reaction times are concerned, the data showed a slightly different
pattern. Overall, in terms of RTs, the results of the Stroop task indicated that both of the
L2 groups were faster than the monolingual native Spanish speakers. These findings
appear to support the idea that bilinguals have a relative advantage in terms of inhibitory
control as measured by reaction times. The relationship between L2 proficiency,
however, is not as straightforward. This is complicated by the possibility that within the
L2 group, age effects might conceal the possible effects of proficiency level. To wit, the
age of L2 participants was correlated with higher reading times in both the Congruent and
168
Incongruent trials, so that older L2 participants were slower to respond in both trial types.
As mentioned earlier, Schwartz and Kroll (2006) note that dual activation does not seem
to be associated with proficiency. In this regard, the RT results of the Stroop task would
seem to suggest that the benefits of bilingualism might emerge even at intermediate
levels of proficiency. This suggestion, however, must be interpreted with caution because
of the possible effects of Age of the L2 participants in these responses. With regard to the
processing of CLLD, however, results did not show a relationship between faster RTs in
the Stroop task and native-like processing of CLLD.
As discussed at the outset of this dissertation, the purpose of including measures
of individual ability was an attempt to determine whether these differences could account
for the variability found in L2 outcomes with specific regard to the acquisition and
processing of long-distance dependencies in L2 Spanish. Given that statistical learning
has been shown to be a powerful learning mechanism both in infants and adults, this
study also included a task that measured the abilities of finding structural regularities in
an artificial language. In view of the statistical learning findings, it stands to reason that
individual variability in statistical learning abilities could have an impact on L2
acquisition and processing outcomes. The statistical learning task used in this study
(Gómez, 2002) was chosen because it contains a long-distance dependency that has been
proved to be learnable through exposure alone.
Overall, however, the data from this study did not provide any evidence that
statistical learning, as gauged by the specific learning task, was associated with L2
processing outcomes. Based on these results, then, we cannot support the hypothesis that
better pattern-learners (in an artificial long-distance dependency) were better at learning
and processing a natural long-distance dependency in their L2. Nevertheless, it is
possibly the case that the lack of association between statistical learning and L2
169
processing outcomes (as measured by the self-paced reading task) is related to the choice
of task used to measure statistical learning.
A disadvantage of using Gómez’s (2002) task is that the range of responses (0 to
12) is rather small, which limits the amount of variation that can be found between the
scores. Goodwin and Leech (2006) discuss several factors that affect the size of a Pearson
correlation, some of which might be at least partially at play in this particular case. One
of the factors that influences the Pearson correlation is the amount of variability among
the observations. This factor is commonly referred to as “restricted” or “truncated” range.
It is likely the case that the specific task does not discriminate statistical learning too
precisely, resulting in the scores clustering around either chance level (~6 points) or
ceiling (12 points).
Another factor that has an influence on the size of a correlation involves the
(dis)similarity between shapes of the distributions of the variables involved. In this
particular case, the distribution of the difference RT scores (late – early RTs) at segment
9 was substantially different from the distribution of statistical learning scores. RT
difference scores had a substantial range (653 ms), with scores ranging from -208.83 ms
to 444.61 ms (M = 73.51, SD = 111.52). RT difference scores were roughly normally
distributed, with skewness of .397 (SE = .254) and kurtosis of 1.176 (SE = .503). On the
other hand, the distribution of statistical learning scores had a range of 8 points, with
scores ranging from 4 to 12 (M = 8.75, SD = 2.74). Statistical learning scores were non-
normally distributed (roughly bi-modal), with skewness of -.085 (SE = .254) and kurtosis
of -1.611 (SE .503). Thus, the fact that the two distributions were quite different could
have affected the presence of a correlation.
A further potential factor is the presence of outliers. In regards to the RT
difference scores, there are several data points that clearly fall outside of the range of
observations. On the other hand, the SLT scores, with a smaller measurement range,
170
displayed virtually no outliers. Nevertheless, upon visual inspection, the number of
outliers in the RT difference scores was rather small, so this factor might not weigh in as
heavily as the others. Furthermore, it should be noted this explanation does not address
why there was no single correlation between the scores of the statistical learning task and
all the other variables. Lastly, there is some evidence (e.g. Conway & Christiansen, 2005)
that shows that statistical learning ability might be constrained by modality (e.g. visual,
auditory, tactile). Thus, it might be the case that the outcomes of Gómez’s task (which
exposed participants to auditory stimuli) might be modulated by some sort of perceptual
auditory bottleneck that does not translate well to our other tasks.
Thus, as far as statistical learning abilities are concerned, the results of this
investigation are to some extent inconclusive. Although the correlation analyses did not
detect evidence of a relationship between statistical learning scores and L2 processing (as
measured by the magnitude of expectation violation measured in milliseconds), there is
reason to believe that the choice of statistical learning task (Gómez, 2002) might have
had at least some impact in the absence of relationship between statistical learning
abilities on one hand and the measures of acquisition and processing of CLLD on the
other. In any case, it is quite clear that what is now needed is further research that
explores the impact of statistical learning abilities using tasks that are better
discriminators of individual ability.
One such measure, which appears to be very promising in this respect, is a task
used in a study by Karuza, Farmer, Fine, Smith, and Jaeger (submitted). The statistical
learning task used in this study, which is an adaptation of Gómez’s (2002) task, focuses
on a long-distance dependency by using a self-paced moving window methodology.
Participants are exposed to regularities that can be learned and, consequently, predict,
over the course of the experiment. Karuza and colleagues found that those participants
who made predictions that were subsequently met attained higher scores in the post-test.
171
There are several advantages to this task, especially in the context of comparing this
measure to the results obtained using the methodologies employed in this dissertation. A
first important advantage of this task is that the task employs visual shapes rather than
auditory stimuli of words in the artificial language.51 Presently, there is some evidence
that statistical learning ability measures might be constrained by modality. Thus,
presenting a visual (rather than an auditory task) would be preferable for the purposes of
comparison. Additionally, the self-paced presentation, where participants regulated the
speed in which they were exposed to the stimuli, is also more suitable because the
obtained measure is not only accuracy but also processing time (in milliseconds), which
might alleviate previously problems related to issues such as truncated range and the
presence of outliers.
If after using this more sensitive task, further research does not uncover a
relationship between statistical learning and L2 processing, it might indeed be the case
that statistical learning is not a good predictor of on-line language comprehension. To
date, there is no study substantiating this link (see Misyak & Christiansen, 2012 for a link
between statistical learning abilities and off-line language comprehension). There is,
however, some evidence that statistical learning has an impact on L2 literacy skills (Frost
et al., 2013).
In the previous discussion, the measures of individual ability that were discussed
involve differences in general cognitive abilities that can be regarded as non-linguistic.
The following discussion addresses the potential relationships between measures of
linguistic ability on the acquisition and processing of CLLD in L2 Spanish. For this
purpose, we will focus on two different variables: a measure of L1 linguistic ability (the
51 The shapes are glyphs from the Ge’ez writing system, which is employed in countries like Ethiopia and Eritrea, and were chosen because most native English speakers would be unfamiliar with said (non-Roman) script.
172
English vocabulary portion of the SILS test) and a measure of L2 knowledge of the
syntactic restrictions of clitics (Clitic Test). In the case of the L1 vocabulary measure,
correlation analyses found no relationship between SILS scores and L2 processing as
measured by the difference RT scores on segment 9. On the other hand, the results of the
Pearson correlation revealed that Vocabulary scores were positively associated with the
scores of the CLLD Acceptability Judgment task, both with the portion that manipulated
word order (Main Clause CLLD) and the portion manipulating the absence/presence of
the clitic (Embedded CLLD). However, the results of the regression analysis showed that
Vocabulary scores were not reliable as predictors of L2 acquisition and processing of
CLLD in L2 Spanish.
In terms of the test of (L2 syntactic) clitic knowledge, correlation analyses
revealed a small relationship between clitic test scores and L2 processing as measured by
RT difference scores on segment 9. However, after assessing the degree that these scores
could predict native-like L2 processing performance using a regression coefficient, clitic
test scores were not found to be significant predictors of L2 processing. Similarly,
although correlation analyses revealed a positive association between the scores on the
clitic test and rating task responses (both the portion manipulating word order and the
portion manipulating clitic presence/absence), regression analyses revealed that clitic
scores were not significant predictors for either measure of off-line CLLD knowledge
(rating task manipulating either word order or clitic absence/presence). Thus, although
knowledge of the syntactic properties of clitics is a first step in learning about the long-
distance dependency, our Clitic test measure was not a good predictor of native-like
processing of CLLD. In sum, neither of these linguistic measures (L1 Vocabulary and L2
test of syntactic clitic knowledge) appears to modulate the acquisition and processing of
long-distance dependencies in L2 Spanish. The only significant and reliable predictor of
native-like processing of CLLD appears to be proficiency level.
173
6.3 Limitations of the Study and Directions for Future
Research
The present study has several limitations that have potential repercussions in
terms of the generalizability of the results. One important limitation involves the choice
of measure in the statistical learning task. As discussed in the previous section, there are
several reasons why the statistical learning task might not have been the best
discriminator of individual learning abilities in the context of this particular study. Thus,
these data from the present study must be interpreted with caution—especially as it
pertains the potential contribution of statistical learning in L2 acquisition and processing
outcomes. Future research should explore this question using more sensitive and
appropriate tasks (e.g. Karuza et al., submitted).
Another limitation involves the choice of L2 processing task. Although the use of
self-paced reading might be perfectly suitable for testing the predictions of the SSH, this
particular methodology might be perceived by some as less suitable to test L2 prediction.
What we know about prediction in the L1 literature is largely based upon empirical
studies that use methodologies such as mouse and eye tracking, ERP, and fMRI. These
methodologies allow for the exploration of effects before the resolution of an ambiguity.
Although it is hard to envision for the processing findings in this study to be explained by
a mechanism other than prediction, future studies focusing on long-distance dependencies
should one or more of these, more sensitive, methodologies in order to more safely
extrapolate the results to a larger population.
In terms of sampling, another limitation of the study concerns the demographics
of the groups involved, in particular the age of the participant groups. Although this
limitation might not affect many of the tasks, there is evidence that the Stroop task results
might have been affected by it. Although the results are in general agreement with
previous literature, the inclusion of age-control participants might alter the replicability of
174
the present results. In view of the fact that one of the motivations for including a
monolingual group was to avoid the potential effects of L1 attrition, this study included
only those native Speakers of Spanish who were not functionally bilingual. Given that
English education is compulsory in Mexico and that this requirement has been enforced
more systematically in recent times, most of the monolingual speakers recruited in the
study were, on average, about a decade older than the participants in the L2 advanced
group. Similarly, given that the participants in the L2 advanced group were mostly
graduate students or high school and college professors, these participants were also, on
average, about a decade older than the L2 intermediate students. Therefore, future studies
would also benefit from including age-controlled participant groups.
In order to better specify the developmental time course of L1 acquisition/
processing, one possible avenue for future research could involve a training study aimed
at determining the degree to which implicitly exposing L2 learners to an increased
number of CLLD sentences can facilitate learning of the dependency (when compared to
a baseline L2 group). Based on the results of this dissertation, the prediction would be
that increased and systematic exposure for the Experimental group to CLLD should
facilitate learning of the long-distance dependency inherent to the CLLD manipulation,
and thus, an increase in the ability of L2 learners to predict the presence of a clitic when
topicalization occurs.
Another possible improvement to the design of the self-paced reading task itself
would be to include tokens that included both CLLD stimuli and HTLD stimuli (for
instance, introduced by a phrase such as En cuanto a X… “As for X…”). The experiment
in its present form does not allow for the researcher to know whether any given
participant responded in a particular way because of him/her failing to notice the a-
marking and thus processing a particular sentence as a HTLD rather than CLLD. The
inclusion of HTLD stimuli would allow the researcher to investigate whether participants
175
actually comprehended the sentence as CLLD rather than a hanging topic. A further
improvement would be to include comprehension questions that actually focused on the
manipulation itself. Pablos’s (2006) materials included questions that focused either on
the context (half the time) or the target sentence but did not focus on the actual
manipulation (the slight difference in interpretation between the sentences with the clitic
and the sentences without the clitic). Such a design could have the advantage or
providing a more meaningful measure of comprehension of CLLD.
6.4. Conclusions
This dissertation investigated the acquisition and processing of CLLD in L2
Spanish with special regard to the generation of predictions involving long-distance
dependences, which are hypothesized under the Shallow Structure Hypothesis (SSH;
Clahsen & Felser, 2006a, b) to pose extreme difficulties for L2 learners. Additionally,
this study also examined the relationship between off-line knowledge and on-line
processing of CLLD, focusing on the possible impact of individual differences in
variables such as inhibitory control and statistical learning abilities on L2 outcomes. It
was expected that these variables would predict some of the variability in the L2 off-line
and on-line data. Overall, the results did not support the predictions of the SSH. The
evidence suggests, instead, that the processing of long distance dependencies is subject to
developmental improvements related to proficiency level. Although native-like
processing of CLLD appears to be a gradual and slow process, the most advanced
learners showed that their off-line knowledge of CLLD as well as their CLLD processing
signatures were indistinguishable from native speakers’.
The results of this study, which are in agreement with previous findings,
underscore the need to focus on aspects beyond the native-like or non-native-like
dichotomy. In this regard, further studies would benefit from increased focus on the
176
developmental time course of processing and the generation of predictions in a second
language. In terms of individual differences, neither non-linguistic (statistical learning
scores and reaction times in the Stroop task) nor linguistic (SILS Vocabulary and scores
on the Clitic Test) appeared to be reliable predictors of L2 processing. In this regard, the
findings of this study show evidence that the only reliable predictor of native-like
processing of long-distance dependencies involves proficiency in the L2.
177
REFERENCES
Abutalebi, J., & Green, D. (2007). Bilingual language production: The neurocognition of language representation and control. Journal of Neurolinguistics, 20(3), 242-275.
Aissen, J. (2003). Differential Object Marking: Iconicity vs. Economy. Natural Language and Linguistic Theory, 21, 435-483.
Anagnostopoulou, E. (1997). Clitic left dislocation and contrastive left dislocation. In E. Anagnostopoulou, H. van Riemsdijk, & F. Zwarts, (Eds.), Materials on left dislocation, (151-192) Amsterdam/Philadelphia: John Benjamins Publishing.
AlFallay, I. (2004). The role of some selected psychological and personality traits of the rater in the accuracy of self-and peer-assessment. System, 32(3), 407-425.
Altmann, G. T. M., & Kamide, Y. (1999). Incremental interpretation at verbs: Restricting the domain of subsequent reference. Cognition, 73(3), 247–264.
Altmann, G., & Mirković, J. (2009). Incrementality and prediction in human sentence processing. Cognitive Science, 33(4), 583-609.
Aoun, J., & Benmamoun. E. (1998). Minimality, reconstruction, and PF-movement. Linguistic Inguirv 29, 569-597.
Arai, M., & Keller, F. (2013). The use of verb-specific information for prediction in sentence processing. Language and Cognitive Processes, 28(4), 525-560.
Arregi, K. (2003). Clitic left dislocation is contrastive topicalization. In E. Kaiser & S. Arunachalam (Eds.), Proceedings of the 26th Annual Penn Linguistics Colloquium, U. Penn Working Papers in Linguistics, 9(1), 31-44.
Aslin, R. N., Saffran, J. R., & Newport, E. L. (1998). Computation of conditional probability statistics by 8-month-old infants. Psychological science, 9(4), 321-324.
Baddeley, A. (2007). Working memory, thought, and action. Oxford: Oxford University Press.
Baralt, M. (2010). Task complexity, the Cognition Hypothesis, and interaction in CMC and FTF environments. Unpublished Dissertation. Washington, D.C.: Georgetown University.
Barlow, M., & Kemmer, S. (2000). Introduction. In M. Barlow, & S. Kemmer, (Eds,) Usage-based models of language. Standford: Center for the Study of Language
178
and Information.
Bates, E., & MacWhinney, B. (1987). Competition, variation and language learning. In B. MacWhinney (Ed.), Mechanisms of language acquisition. Hillsdale, NJ: Lawrence Erlbaum Associates Inc.
Baralt, M., & Gurzynski-Weiss, L. (2011). Comparing learners’ state anxiety during task-based interaction in computer-mediated and face-to-face communication. Language Teaching Research, 15(2), 201-229.
Barbosa, P. (2001). On inversion in wh-questions in Romance. Subject inversion in Romance and the theory of universal grammar, 20-59.
Beckner, C., Blythe, R., Bybee, J., Christiansen, M. H., Croft, W., Ellis, N. C., Holland, J., Ke, J., Larsen-Freeman, D., & Schoenemann, T. (2009). Language is a complex adaptive system: Position paper. Language learning, 59(1), 1-26.
Belletti, A., Bennati, E., & Sorace, A. (2007). Theoretical and developmental issues in the syntax of subjects: Evidence from near-native Italian. Natural Language & Linguistic Theory, 25(4), 657-689.
Benati, A. (Ed.). (2011). Issues in second language proficiency. New York: Continuum International Publishing Group.
Benincà, P., & Poletto, C. (2004). Topic, Focus and V2: Defining the CP sublayers. In L. Rizzi (Ed.), The Structure of CP and IP, (pp. 52-75). Oxford: Oxford University Press.
Bialystok, E. (2009). Bilingualism: The good, the bad, and the indifferent. Bilingualism: Language and Cognition, 12(1), 3-11.
Bialystok, E., Craik, F. I. M., Klein, R., & Viswanathan, M. (2004). Bilingualism, aging, and cognitive control: Evidence from the Simon task. Psychology and Aging, 19, 290–303.
Bianchi, V. & Frascarelli, M. (2010). Is Topic a root phenomenon? Iberia: An International Journal of Theoretical Linguistics, 2, 43-88.
Birner, B. J. & Ward, G. (1998). Information Status and Noncanonical Word Order in English. Amsterdam: John Benjamins.
Blanche, P., & Merino, B. J. (1989). Self‐Assessment of Foreign‐Language Skills: Implications for Teachers and Researchers. Language Learning, 39(3), 313-338.
Bley-Vroman, R. (1990). The logical problem of foreign language learning. Linguistic
179
Analysis, 20, 3-49.
Bley-Vroman, R. (2009). The evolving context of the fundamental difference hypothesis. Studies in Second Language Acquisition, 31(02), 175-198.
Bloom, E., Polišenská, D., & Weerman, F. (2008). Articles, adjectives and age of onset: The acquisition of Dutch grammatical gender. Second Language Research, 24, 297-331.
Blumenfeld, H. K., & Marian, V. (2007). Constraints on parallel activation in bilingual spoken language processing: Examining proficiency and lexical status using eye-tracking. Language and Cognitive Processes, 22(5), 633-660.
Blumenfeld, H. K., & Marian, V. (2011). Bilingualism influences inhibitory control in auditory comprehension. Cognition, 118(2), 245-257.
Boeckx, C. (2011). Approaching parameters from below. In A.M. Di Sciullo & C. Boeckx, The biolinguistic enterprise: New perspectives on the evolution and nature of the human language faculty, (205-221). Oxford: Oxford University Press.
Bowers, T. L., & Pantle, M. L. (1998). Shipley institute for living scale and the Kaufman Brief Intelligence Test as screening instruments for intelligence. Assessment, 5(2), 187-195.
Bybee, J. (2008). Usage-based grammar and second language acquisition. In. P. Robinson, & N.C. Ellis, Handbook of cognitive linguistics and second language acquisition, (pp. 216-236). New York: Taylor & Francis.
Bybee, J. L., & Hopper, P. J. (2001). Introduction to frequency and the emergence of linguistic structure. In J.L. Bybee, & P.J. Hopper, (Eds.) Frequency and the emergence of linguistic structure (1-26). Amsterdam: John Benjamins Publishing.
Campos, H., & Zampini, M. (1990). Focalization strategies in Spanish. Probus,2(1), 47-64.
Carreiras, M., & Clifton, C. (1999). Another word on parsing relative clauses: Eyetracking evidence from Spanish and English. Memory & Cognition, 27(5), 826-833.
Carroll, S., & Meisel, J. M. (1990). Universals and second language acquisition.Studies in Second Language Acquisition, 12(2), 201-208.
Casielles-Suárez, E. (2003). Left-dislocated structures in Spanish. Hispania, 326-338.
180
Casiellez-Suárez, E. (2004). The syntax-information structure interface. New York: Garland.
Cecchetto, C. (1999). A comparative analysis of left and right dislocation in Romance. Studia Linguistica, 53(1), 40-67.
Chang, F., Dell, G. S., & Bock, K. (2006). Becoming syntactic. Psychological Review, 113(2), 234– 272.
Chater, N. & Manning, C. (2006) Probabilistic models of language processing and acquisition. Trends in Cognitive Sciences 10(7): 335–44.
Chee, M. W., Tan, E. W., & Thiel, T. (1999). Mandarin and English single word processing studied with functional magnetic resonance imaging. The Journal of Neuroscience, 19(8), 3050-3056.
Chomsky, N. (1957). Syntactic structures. The Hague: Mouton.
Chomsky, N. (1986). Knowledge of language: Its nature, origin, and use. New York: Praeger.
Chomsky, N. (1995). The Minimalist program. Cambridge, MA: MIT Press.
Chomsky, N. (2001). Derivation by Phase. In M. Kenstowicz, (Ed.), Ken Hale: A life in language, (pp 1-52). Cambridge: MIT Press.
Chomsky, N. (2005). Three factors in language design. Linguistic inquiry, 36(1), 1-22.
Christiansen, M. H., & Chater, N. (2001). Connectionist psycholinguistics: Capturing the empirical data. Trends in Cognitive Sciences, 5(2), 82-88.
Cinque, G. (1977). The movement nature of left dislocation. Linguistic Inquiry, 8, 397-411.
Cinque, G. (1990). Types of Ā-dependencies. Cambridge, MA: MIT press.
Clahsen, H., & Felser, C. (2006a). Grammatical processing in language learners. Applied Psycholinguistics, 27, 3–42.
Clahsen, H., & Felser, C. (2006b). Grammatical processing in language learners. Applied Psycholinguistics, 27(1), 3–42.
Clark, A. (2013). Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behavioral and Brain Sciences, 36(3), 181–204.
181
Cohen, J. (1983). The cost of dichotomization. Applied Psychological Measurement, 7, 249-253.
Conway, C. M., & Christiansen, M. H. (2005). Modality-constrained statistical learning of tactile, visual, and auditory sequences. Journal of Experimental Psychology: Learning, Memory, and Cognition, 31(1), 24.
Cornilescu, A. (2006). On clitic doubling and parasitic gaps in Romanian. Revue roumaine de linguistique, 1, 23-42.
Cuetos, F., & Mitchell, D. C. (1988). Cross-linguistic differences in parsing: Restrictions on the use of the Late Closure strategy in Spanish. Cognition, 30(1), 73-105.
Cummins, J. (1980). The cross-lingual dimensions of language proficiency: Implications for bilingual education and the optimal age issue. Tesol Quarterly, 14(2), 175-187.
Dallas, A. C. (2008). Influences of verbal properties on second-language filler-gap resolution: A cross-methodological study. Unpublished doctoral dissertation, University of Florida.
Dallas, A. C., DeDe, G., & Nicol, J. (2013). An Event-Related-Potential (ERP) investigation of filler-gap processing in native and second language learners. Language Learning, 63, 766-799.
De Cat, C. (2007). French dislocation without movement. Natural Language & Linguistic Theory, 25(3), 485-534.
DeKeyser, R. M. (2005). What Makes Learning Second Language Grammar Difficult? A Review of Issues. Language Learning, 55(1), 1-25.
de Villers, J. & Roeper, T. (2011). Introduction. In J. de Villers & T. Roeper, Handbook of generative approaches to language acquisition, (pp. 3-12). Dordrecht: Springer.
Dekydspotter, L., & Renaud, C. (in press). On second language processing and grammatical development: The parser in second language acquisition. Linguistic Approaches to Bilingualism, 4(2), 131-165.
Dekydtspotter, L., Schwartz, B. D., & Sprouse, R. A. (2006). The comparative fallacy in L2 pro- cessing research. In M. G. O’Brien, C. Shea, & J. Archibald (Eds.), 8th Generative Approaches to Second Language Acquisition Conference (GASLA 2006) (pp. 33–40). Somerville, MA: Cascadilla Proceedings Project.
182
DeLong, K. A., Urbach, T. P., & Kutas, M. (2005). Probabilistic word pre-activation during language comprehension inferred from electrical brain activity. Nature Neuroscience, 8(8), 1117–1121.
Dik, S. C. (1997). The theory of functional grammar: the structure of the clause. Walter de Gruyter.
Dikker, S., Rabagliati, H., Farmer, T. A., & Pylkkänen, L. (2010). Early occipital sensitivity to syntactic category is based on form typicality. Psychological Science, 21(5), 629–634.
Dobrovie-Sorin, C. (1990). Clitic doubling, wh-movement, and quantification in Romanian. Linguistic Inquiry, 351-397.
Donaldson, B. (2011). Nativelike right-dislocation in near-native French. Second Language Research, 27(3), 361-390.
Dörnyei, Z. (2001). New themes and approaches in second language motivation research. Annual review of applied linguistics, 21, 43-59.
Dussias, P. (2003). Syntactic ambiguity resolution in second language learners: Some effects of Bilinguality on L1 and L2 processing strategies. Studies in Second Language Acquisition, 25, 529–557.
Dussias, P. E., & Piñar, P. (2010). Effects of reading span and plausibility in the reanalysis of wh-gaps by Chinese-English second language speakers. Second Language Research, 26(4), 443-472.
Dussias, P. E., Valdés Kroff, J. R., Guzzardo Tamargo, R. E., & Gerfen, C. (2013). When gender and looking go hand in hand. Studies in Second Language Acquisition, 35(02), 353-387.
Ellis, N. C. (2002). Frequency effects in language acquisition: A review with implications for theories of implicit and explicit language acquisition. Studies in Second Language Acquisition, 24, 143-188.
Ellis, N.C. (2007). Dynamic Systems Theory and SLA: The wood and the trees. Bilingualism: Language and Cognition, 10(1), 23-25.
Ellis, N. C. (2008). The dynamics of second language emergence: Cycles of language use, language change, and language acquisition. Modern Language Journal, 92:2, 232-249.
Ellis, N. C., & Larsen-Freeman, D. (2006). Language emergence: Implications for
183
applied linguistics—Introduction to the special issue. Applied Linguistics,27(4), 558-589.
Ellis, R. (1990). Grammaticality judgments and learner variability. In H. Burmeister & P. Rounds (Eds.). Proceedings of the 10th Second Language Research Forum, (pp. 25-60). American English Institute, University of Oregon.
Ellis, R. (1997). Second Language Acquisition (8th edition). Oxford: Oxford University Press.
Elman, J. L. (1991). Distributed representations, simple recurrent networks, and grammatical structure. Machine Learning, 7(2–3), 195–225.
Elman, J. L. (1993). Learning and development in neural networks: The importance of starting small. Cognition, 48(1), 71-99.
Emonds, J. (2004). Unspecified categories as the key to root constructions. In D. Adger, C. de Cat, & G. Tsoulas, Peripheries: Syntactic Edges and their Effects, (pp. 75-120). Dordrecht: Kluwer.
Epstein, S., Flynn, S., & Martohardjono, G. (1998). The strong continuity hypothesis: some evidence concerning functional categories in adult L2 acquisition. In S. Flynn, G. Martohardjono, & W. O’Neil (Eds.). The generative study of second language acquisition, (pp. 61-77). Mahwah, NJ: Lawrence Erlbaum.
Erteschik-Shir, N. (1997). The dynamics of focus structure. Cambridge: Cambridge University Press.
Farmer, T. A., Brown, M., & Tanenhaus, M. K. (2013). Prediction, explanation, and the role of generative models in language processing. Behavioral and Brain Sciences, 36(03), 211-212.
Farmer, T. A., Fine, A. B., & Jaeger, T. F. (2011). Implicit context-specific learning leads to rapid shifts in syntactic expectations. In L. Carlson, C. Holscher, & T. Shipley (Eds.), Proceedings of the 33rd annual meeting of the Cognitive Science Society (pp. 2055–2060). Austin, TX: Cognitive Science Society
Farmer, T. A., Christiansen, M. H., & Monaghan, P. (2006). Phonological typicality influences on-line sentence comprehension. Proceedings of the National Academy of Sciences, 103(32), 12203-12208.
Fathman, A., & Precup, L. (1983). Influences of age and setting on second language oral proficiency. Second language acquisition studies, 151-161.
Felser, C., Roberts, L., Gross, R., & Marinis, T. (2003). The processing of ambiguous sentences by first and second language learners of English. Applied
184
Psycholinguistics, 24(3), 453–489.
Feldhausen, I. (2014). Modeling individual variation in prosody: the case of Clitic Left-Dislocations in Spanish. Proceedings of the 10th International Seminar on Speech Production, Köln (Germany).
Fernández, E. M. (2003). Bilingual sentence processing: Relative clause attachment in English and Spanish. Amsterdam: John Benjamins Publishing.
Firth, A., & Wagner, J. (1997). On discourse, communication, and (some) fundamental concepts in SLA research. The Modern Language Journal, 81(3), 285-300.
Firth, A., & Wagner, J. (1998). SLA property: No trespassing!. The Modern Language Journal, 82(1), 91-94.
Flege, J. E., & Liu, S. (2001). The effect of experience on adults’ acquisition of a second language. Studies in second language acquisition, 23(4), 527-552.
Flege J.E., Yeni-Komishian G.H., & Liu, S. (1999). Age constraints on second-language acquisition. Journal of Memory and Language, 41, 78–104.
Flynn, S., & Martohardjono, G. (1994). Mapping from the Initial State to the Final State: The separation of universal principles and language specific properties. In B. Lust, M. Suñer, & J. Whitman (Eds.), Syntactic theory and first language acquisition: crosslinguistic perspectives, (pp. 319-336), Hillsdale, NJ: Lawrence Erlbaum.
Fodor, J. A. (1983). The modularity of mind: An essay on faculty psychology. Cambridge, MA: MIT press.
Frascarelli, M. & Hinterhölzl, R. (2007). Types of topics in German and Italian. In, S. Winkler & K. Schwabe (Eds.), On Information Structure, Meaning and Form, (pp. 87–116). Amsterdam/Philadelphia: John Benjamins.
Frenck-Mestre, C. (2002). An on-line look at sentence processing in the second language. In R. Heredia & J. Altarriba (Eds.), Bilingual sentence processing (pp. 217–236). Amsterdam: Elsevier
Friederici, A. D., Bahlmann, J., Heim, S., Schubotz, R. I., & Anwander, A. (2006). The brain differentiates human and non-human grammars: functional localization and structural connectivity. Proceedings of the National Academy of Sciences of the United States of America, 103(7), 2458-2463.
Frost, R., Siegelman, N., Narkiss, A., & Afek, L. (2013). What predicts successful literacy acquisition in a second language?. Psychological science,24(7), 1243-1252.
185
Garrod, S. (2006). Psycholinguistic Research Methods. In E.K. Brown (Ed.): Encyclopedia of Language & Linguistics. 2nd Edition, Vol. 10. S., 251-257. Amsterdam: Elsevier.
Gass, S. A Historical survey of SLA research. In W. Ritchie & T.J. Bhatia (Eds.), The New Handbook of Second Language Acquisition (pp. 3-28). Bingley, U.K.: Emerald Group Publishing Limited.
Gass, S., Flecker, C., Leder, N., & Svetics, I. (1998). Ahistoricity revisited: Does SLA have a history? Studies in Second Language Acquisition, 20, 407-421.
Gass, S., & Lee, J. (2007). Second language acquisition of relative clauses. Studies in Second Language Acquisition, 29(02), 329-335.
Gass, S. & Selinker, L. (2008). Second language acquisition: An introductory course (3rd edition). New York: Routledge.
Goad, H., & White, L. (2008). Prosodic structure and the representation of L2 functional morphology: A nativist approach. Lingua, 118(4), 577-594.
Gómez, R. L. (2002). Variability and detection of invariant structure. Psychological Science, 13(5), 431-436.
Goodwin, L. D., & Leech, N. L. (2006). Understanding correlation: Factors that affect the size of r. The Journal of Experimental Education, 74(3), 249-266.
Gow, D. W., & McMurray, B. (2007). Word recognition and phonology: The case of English coronal place assimilation. Papers in laboratory phonology, 9, 173-200.
Graybiel, A. M. (1995). Building action repertoires: memory and learning functions of the basal ganglia. Current Opinion in Neurobiology, 5, 733-741.
Green, D. W. (1998). Mental control of the bilingual lexico-semantic system. Bilingualism: Language and cognition, 1(02), 67-81.
Gregg, K. R. (1993). Taking explanation seriously; or, let a couple of flowers bloom. Applied Linguistics, 14(3), 276-294.
Gregg, K. R., Long, M, Jordan, G., & Berreta, A. (1997) Rationality and its discontents in SLA. Applied Linguistics, 18(4): 539-559.
Gundel, J. K. (1988). Universals of topic-comment structure In M. Hammond, E. Moravczik & J. Wirth (Eds.), Studies in Syntactic Typology (pp. 209-239). Amsterdam: John Benjamins.
Haegeman, L. (2006). Argument fronting in English, Romance CLLD, and the Left
186
Periphery. In R. Zanuttini, H. Campos, E. Erburger, P.H. Portner, (Eds,). Crosslinguistic Research in Syntax and Semantics: Negation, Tense, and Clausal Architecture (pp. 27-52). Washington, DC: Georgetown University Press.
Haegeman, L. (2012). Adverbial Clauses, Main Clause Phenomena, and Composition of the Left Periphery: The Cartography of Syntactic Structure. Oxford: Oxford University Press.
Hahne, A. (2001). What’s different in second-language processing? Evidence from event- related brain potentials. Journal of Psycholinguistics Research, 30, 251–266.
Hahne, A., & Friederici, A. D. (2001). Processing a second language: late learners’ comprehension mechanisms as revealed by event-related brain potentials. Bilingualism: Language and Cognition, 4(2), 123–141.
Hakuta, K. (1976). A case study of a Japanese child learning English. Language Learning, 26, 321-351.
Halliday, M.A.K. (1967). Notes on transitivity and theme in English. Part II. Journal of Linguistics, 3, 199-244.
Halle, M., & Marantz, A. (1993). Distributed Morphology and the Pieces of Inflection. In K. Hale & S.J. Keyser, (Eds.), The View From Building 20 (pp. 111-176), Cambridge, Mass.: MIT Press.
Harrington, M. W. & M. Sawyer (1992). L2 working memory capacity and L2 reading skills. Studies in Second Language Acquisition 14.1, 25–38.
Hawkins, R. (2011). Statistical learning and innate knowledge in the development of second language proficiency: evidence from the acquisition of gender concord. In A.G. Benati, (Ed.) Issues in second language proficiency. Clevedon: Multilingual Matters.
Hawkins, R., & Chan, C. Y.-H. (1997). The partial availability of Universal Grammar in second language acquisition: The ‘failed functional features hypothesis.’ Second Language Research, 13(3), 187–226.
Hawkins, R., & Hattori, H. (2006). Interpretation of English multiple wh-questions by Japanese speakers: A missing uninterpretable feature account. Second Language Research, 22(3), 269-301.
Hilchey, M. D., & Klein, R. M. (2011). Are there bilingual advantages on nonlinguistic interference tasks? Implications for the plasticity of executive control processes. Psychonomic bulletin & review, 18(4), 625-658.
187
Hilles, S. (1991). Access to Universal Grammar in second language acquisition. In L. Eubank, Point Counterpoint: Universal Grammar in the Second Language, (pp. 305-338). Amsterdam: John Benjamins Publishing Company.
Hoey, M. Lexical priming: A new theory of words and language. New York: Routledge.
Hoover, M. L., & Dwivedi, V. D. (1998). Syntactic processing by skilled bilinguals. Language Learning, 48(1), 1-29.
Hopp, H. (2006). Syntactic features and reanalysis in near-native processing. Second Language Research, 22(3), 369-397.
Hopp, H. (2007). Ultimate attainment at the interfaces in second language acquisition: Grammar and processing. Unpublished doctoral dissertation, University of Groningen, The Netherlands.
Hopp, H. (2010). Ultimate attainment in L2 inflection: Performance similarities between nonnative and native speakers. Lingua, 120(4), 901–931.
Hopp, H. (2013). Grammatical gender in adult L2 acquisition: Relations between lexical and syntactic variability. Second Language Research, 29(1), 33-56.
Hughes, A. (1989). Testing for language teachings. Cambridge: Cambridge University Press.
Hulstijn, Jan H. (2011). Language proficiency in native and nonnative speakers: An agenda for research and suggestions for second-language assessment. Language Assessment Quarterly, 8(3), 229-249.
Hyams, N. (1991). Seven Not-So-Trivial Trivia of Language Acquisition: Comments on Wolfgang Klein. In L. Eubank, Point Counterpoint: Universal Grammar in the Second Language, (pp. 71-103). Amsterdam: John Benjamins Publishing Company.
Hyltenstam, K., & Abrahamsson, N. (2003). Maturational constraints in SLA. In C. Doughty & M. H. Long (Eds.), The handbook of second language acquisition (pp. 539 – 588). Oxford : Blackwell .
Ivanov, I. P. (2012). L2 acquisition of Bulgarian clitic doubling: A test case for the Interface Hypothesis. Second Language Research, 28(3), 345-368.
Iverson, M., Kempchinsky, P., & Rothman, J. (2008). Interface vulnerability and knowledge of the subjunctive/indicative distinction with negated epistemic predicates in L2 Spanish. Eurosla Yearbook, 8(1), 135-163.
Jackendoff, R. (2003). Foundations of language: Brain, meaning, grammar, evolution.
188
Oxford University Press.
Jackson, C. (2008). Proficiency level and the interaction of lexical and morphsyntactic information duging L2 sentence processing. Language Learning, 58(4), 875-909.
Jaeger, T. F., & Snider, N. E. (2013). Alignment as a consequence of expectation adaptation: Syntactic priming is affected by the prime’s prediction error given both prior and recent experience. Cognition, 127(1), 57-83.
Jiang, N. (2004). Morphological insensitivity in second language processing. Applied Psycholinguistics, 25, 603-634.
Juffs, A. (1998). Some effects of first language argument structure and morphosyntax on second language sentence processing. Second Language Research, 14(4), 406–424.
Juffs, A. (2005). The influence of first language on the processing of wh-movement in English as a second language. Second Language Research, 21(2), 121–151.
Juffs, A., & Harrington, M. (2011). Aspects of working memory in L2 learning.Language Teaching, 44(02), 137-166.
Kaan, E. (in press). Predictive sentence processing in L2 and L1: What is different? Linguistic Approaches to Bilingualism 4(2)-257-282.
Kato, M. A. (1993). Object Position in Brazilian Portuguese. In Linguistic Perspectives
on Romance Languages: Selected Papers from the XXI Linguistic Symposium on Romance Languages, Santa Barbara, February 21–24, 1991 (Vol. 103, p. 225). John Benjamins Publishing.
Kato, M. A., & Raposo, E. P. (2007). Topicalization in European and Brazilian
Portuguese. Amsterdam studies in the theory and history of linguistic science series. 4, 287, 205.
Kaufman, S. B., DeYoung, C. G., Gray, J. R., Jiménez, L., Brown, J., & Mackintosh, N.
(2010). Implicit learning as an ability. Cognition, 116(3), 321-340.
Kempchinsky, P. (2013). CLLD as a Window on the Left Periphery. In C. Howe, S.E. Blackwell, & M. Libbers Quesada, Selected Proceedings of the 15th Hispanic Linguistics Symposium (pp. 310-327). Somerville, MA: Cascadilla Procesedings Project.
Kim, K. H., Relkin, N. R., Lee, K. M., & Hirsch, J. (1997). Distinct cortical areas associated with native and second languages. Nature, 388(6638), 171-174.
Kim, Y., & Tracy-Ventura, N. (2011). Task complexity, language anxiety, and the
189
development of the simple past. In P. Robinson (Ed.), Second Language Task Complexity: Researching the Cognition Hypothesis of language learning and performance, pp. 287-306. Amsterdam: John Benjamins.
Kiss, K. E. (1998). Identificational focus versus information focus. Language, 74, 245–73.
Klein, D., Milner, B., Zatorre, R. J., Meyer, E., & Evans, A. C. (1995). The neural substrates underlying word generation: a bilingual functional-imaging study. Proceedings of the National Academy of Sciences, 92(7), 2899-2903.
Kroll, J. F., Bobb, S. C., Misra, M., & Guo, T. (2008). Language selection in bilingual speech: Evidence for inhibitory processes. Acta psychologica, 128(3), 416-430.
Kutas, M., DeLong, K. A., & Smith, N. J. (2011). A look around at what lies ahead: Prediction and predictability in language processing. In M. Bar, (Ed.) Predictions in the brain: Using our past to generate a future, (pp. 190-207). Oxford: Oxford University Press.
Lantolf, J. P. (1996) SLA building: Letting all the flowers bloom. Language Learning, 46(4), 713-749.
Lambrecht, K. (1994). Information Structure and Sentence Form: Topic, Focus and the Mental Representation of Discourse Referents. Cambridge University Press.
Lardiere, D. (2009). Some thoughts on the contrastive analysis of features in second language acquisition. Second Language Research, 25(2), 173-227.
Larsen-Freeman, D. (2009). Adjusting expectations: The study of complexity, accuracy, and fluency in second language acquisition. Applied Linguistics, 30(4), 579-589.
Larsen‐Freeman, D. (2010). Not so fast: A discussion of L2 morpheme processing and acquisition. Language Learning, 60(1), 221-230.
Leonetti, M. (2004). Specificity and differential object marking in Spanish.Catalan Journal of Linguistics, 3(2004), 75-114.
Leal Méndez, T., Farmer, T. A., & Slabakova, R. (in press). A same-system view of L2 processing: Evidence from long-distance syntactic dependencies in L2 Spanish. BUCLD 38 Proceedings.
Leal Méndez, T., Slabakova, R. & Rothman, J (in press). A rare structure at the syntax-discourse interface: Heritage and Spanish-dominant Native Speakers weigh in. Language Acquisition.
190
Leal Méndez, T., Slabakova, R., & Rothman, J. (2014). Discourse-sensitive Clitic-doubled Dislocations in Heritage Spanish. Lingua.
Leal Méndez, T., & Slabakova, R. (2012). The Interpretability Hypothesis again: A partial replication of Tsimpli and Dimitrakopoulou (2007). International Journal of Bilingualism. DOI: 10.1177/1367006912448125.
Leonetti, M., & Escandell-Vidal, V. (2009). Fronting and verum focus in Spanish. In A. Duffer & D. Jacob, Focus and Background in Romance Languages. Amsterdam: John Benjamins, (pp. 155-204). Amsterdam: John Benjamins Publishing Company.
Levy, R. (2008). Expectation-based syntactic comprehension. Cognition, 106(3), 1126-1177.
Levy, B. J., McVeigh, N. D., Marful, A., & Anderson, M. C. (2007). Inhibiting Your Native Language The Role of Retrieval-Induced Forgetting During Second-Language Acquisition. Psychological Science, 18(1), 29-34.
Lim, J. H., & Christianson, K. (2013). Second language sentence processing in reading for comprehension and translation. Bilingualism: Language and Cognition, 16(03), 518-537.
Linck, J. A., Kroll, J. F., & Sunderman, G. (2009). Losing access to the native language while immersed in a second language: Evidence for the role of inhibition in second-language learning. Psychological Science, 20(12), 1507-1515.
Long, M. H. (1997). Construct validity in SLA research: A response to Firth and Wagner. Modern Language Journal, 81(3): 318-323.
López, L. (2009). A derivational syntax for information structure . Oxford: Oxford University Press.
MacDonald, M. C. (2013). How language production shapes language form and comprehension. Frontiers in Psychology, 4, 226.
MacIntyre, P. D., Dörnyei, Z., Clément, R., & Noels, K. A. (1998). Conceptualizing willingness to communicate in a L2: A situational model of L2 confidence and affiliation. The Modern Language Journal, 82(4), 545-562.
MacIntyre, P. D., & Gardner, R. C. (1989). Anxiety and Second‐Language Learning: Toward a Theoretical Clarification. Language learning, 39(2), 251-275.
Mackey, A., & Gass, S. M. (2005). Second language research: Methodology and design. New York: Routledge.
191
Mackey, A., Philp, J., Egi, T., Fujii, A., & Tatsumi, T. (2002). Individual differences in
working memory, noticing of interactional feedback and L2 development. In P. Robinson (Ed.,) Individual differences and instructed language learning (pp. 181-209). Philadelphia: John Benjamins.
MacWhinney, B. (2005). A unified model of language acquisition. In J. F. Kroll & A. M.
B. de Groot (Eds.), Handbook of bilingualism: Psycholinguistic approaches (pp. 49–67). New York: Oxford University Press.
Marantz, A. (2005). Generative linguistics within the cognitive neuroscience of language.
The Linguistic Review 22, 429-445. Marian, V., & Spivey, M. (2003). Bilingual and monolingual processing of competing
lexical items. Applied Psycholinguistics, 24(02), 173-193. Marian, V., Spivey, M., & Hirsch, J. (2003). Shared and separate systems in bilingual
language processing: Converging evidence from eyetracking and brain imaging. Brain and language, 86(1), 70-82.
Marinis, T., Roberts, L., Felser, C., & Clahsen, H. (2005). Gaps in second language
sentence processing. Studies in Second Language Acquisition, 27(1), 53-78. Maye, J., Werker, J. F., & Gerken, L. (2002). Infant sensitivity to distributional
information can affect phonetic discrimination. Cognition, 82(3), B101-B111. McDonald, J. L. (2006). Beyond the critical period: Processing-based explanations for
poor grammaticahty judgment performance by late second language learners. Journal of Memory and Language, 55, 381-401.
McMurray, B., Horst, J. S., & Samuelson, L. K. (2012). Word learning emerges from the
interaction of online referent selection and slow associative learning.Psychological review, 119(4), 831.
Meisel, J. (1991). Principles of Universal Grammar and strategies of language learning:
Some similarities and differences between first and second language acquisition. In L. Eubank, Point Counterpoint: Universal Grammar in the Second Language, (pp. 231-276). Amsterdam: John Benjamins Publishing Company.
Meisel, J. M., Elsig, M., & Bonnesen, M. (2011). Delayed grammatical acquisition in
first language development: Subject-verb inversion and subject clitics in French interrogatives. Linguistic Approaches to Bilingualism, 1(4), 347-390.
Miller, K., & Schmitt, C. (2010). Effects of variable input in the acquisition of plural in
two dialects of Spanish. Lingua, 120(5), 1178-1193.
192
Misyak, J. B., & Christiansen, M. H. (2012). Statistical learning and language: an individual differences study. Language Learning, 62(1), 302-331.
Miyake, A. & N. P. Friedman (1998). Individual differences in second language proficiency: Working memory as language aptitude. In A. F. Healy & L. E. Bourne (eds.), Foreign language learning: Psycholinguistic studies on retention and training. Mahwah, NJ: Lawrence Erlbaum, 339–364.
Montrul, S. (2004). Subject and object expression in Spanish heritage speakers: A case of morphosyntactic convergence. Bilingualism Language and Cognition, 7(2), 125-142.
Montrul, S., & Bowles, M. (2009). Back to basics: Incomplete knowledge of Differential Object Marking in Spanish heritage speakers. Bilingualism: Language and Cognition, 12(3), 363-383.
Montrul, S., & Slabakova, R. (2003). Competence similarities between native and near-native speakers. Studies in Second Language Acquisition, 25(03), 351-398.
Moyer, A. 1999. Ultimate attainment in L2 phonology: The critical factors of age, motivation, and instruction. Studies in Second Language Acquisition, 21, 81–108.
Mulder, K., & Hulstijn, J. H. (2011). Linguistic skills of adult native speakers, as a function of age and level of education. Applied linguistics, 32(5), 475-494.
Munro, M. J., Flege, J. E., & MacKay, I. R. (1996). The effects of age of second language learning on the production of English vowels. Applied Psycholinguistics, 17(03), 313-334.
O’Grady, W., M. Lee, and H.–Y Kwak. (2009). Emergentism and second language acquisition. In Ritchie W. and T. Bhatia (Eds.), The New Handbook of second language acquisition (pp. 69–88). Bingley, U.K.: Emerald Press.
Pablos, L. (2006). Pre-verbal structure building in Romance languages and Basque. Unpublished dissertation. The University of Maryland.
Pablos, L. (2008) Filled-Gap Effects in Sentence Processing: different accounts compared, ASJU, Anuario del Seminario de Filología Vasca Julio de Urquijo: International Journal of Basque Linguistics and Philology, 42 (1), 309-332.
Pakulak, E., & Neville, H. J. (2010). Proficiency differences in syntactic processing of monolingual native speakers indexed by event-related potentials. Journal of Cognitive Neuroscience, 22(12), 2728-2744.
193
Papadopoulou, D., & Clahsen, H. (2003). Parsing strategies in L1 and L2 sentence processing. Studies in Second Language Acquisition, 25(04), 501-528.
Phillips, C. (2006). Three Benchmarks for Distributional Approaches to Natural Language Syntax. In R. Zanuttini, H. Campos, E. Erburger, P.H. Portner, (Eds,). Crosslinguistic Research in Syntax and Semantics: Negation, Tense, and Clausal Architecture (pp.13-23). Washington, D.C.: Georgetown University Press.
Phillips, C. (2012). Individual variation and constraints on language learning. Linguistic Approaches to Bilingualism, 2(3), 281-286.
Pickering, M. J., & Garrod, S. (2011). The use of prediction to drive alignment in dialogue. In G. R. Semin, & G. Echterhoff (Eds.), Grounding sociality. Neurons, mind and culture (pp.175–191). New York: Taylor & Francis.
Pickering, M. J., & Garrod, S. (2013). An integrated theory of language production and comprehension. Behavioral and Brain Sciences, 36(4), 329–347.
Pliatsikas, C., & Marinis, T. (2013). Processing empty categories in a second language: When naturalistic exposure fills the (intermediate) gap. Bilingualism: Language and Cognition, 16(01), 167-182.
Prévost, P., & White, L. (2000). Missing surface inflection or impairment in second language acquisition? Evidence from tense and agreement. Second Language Research, 16(2), 103-133.
Rizzi, L. (1982). Issues in Italian syntax . Berlin: Walter de Gruyter.
Rizzi, L. (1997). The fine structure of the left periphery. In: Haegeman L (ed.), Elements of grammar (pp. 281–337). Dordrecht: Kluwer.
Rizzi, Luigi. (2001). On the position ‘Int(errogative)’ in the left periphery of the clause. In G. Cinque & G. Salvi, (Eds.), Current Studies in Italian Syntax: Essays Offered to Lorenzo Renzi, (pp. 287-296). Amsterdam: Elsevier.
Reinhart, T. (2006). Interface strategies: Reference-set computation. Cambridge, MA: MIT Press.
Robinson, P. (2001). Individual differences, cognitive abilities, aptitude complexes and learning conditions in second language acquisition. Second language research, 17(4), 368-392.
Rodriguez‐Fornells, A., De Diego Balaguer, R., & Münte, T. F. (2006). Executive control in bilingual language processing. Language Learning, 56(s1), 133-190.
Roeper, T. (1999). Universal bilingualism. Bilingualism Language and Cognition,2(3),
194
169-186.
Rothman, J. (2007). Sometimes they use it, sometimes they don't: An epistemological discussion of L2 morphological production and its use as a competence measurement. Applied linguistics, 28(4), 609-614.
Rothman, J. (2009). Pragmatic deficits with syntactic consequences?: L2 pronominal subjects and the syntax–pragmatics interface. Journal of Pragmatics, 41(5), 951-973.
Rothman, J., & Slabakova, R. (2011). The Mind-Context Divide: On acquisition at the linguistic interfaces. Lingua, 121(4), 568-576.
Rossi, S., Gugler, M. F., Friederici, A. D. & Hahne, A. (2006). The Impact of Proficiency on Syntactic Second-language Processing of German and Itahan: Evidence from Event-related Potentials. Journal'of"Cognitive Neuroscience, 18, 2030-2
Saffran, J. R., Aslin, R. N., & Newport, E. L. (1996). Statistical learning by 8-month-old infants. Science, 274(5294), 1926-1928.
Saffran, J. R., Johnson, E. K., Aslin, R. N., & Newport, E. L. (1999). Statistical learning of tone sequences by human infants and adults. Cognition, 70(1), 27-52.
Schwartz, B. D., & Sprouse, R. A. (1996). L2 cognitive states and the full transfer/full access model. Second Language Research, 12(1), 40–72.
Schwartz A.I., & Kroll J.F. (2006). Language processing in bilingual speakers. In: Traxler M, Gernsbacher MA, editors. Handbook of Psycholinguistics, (pp. 967–999), 2nd Edition; Amsterdam: Elsevier
Scovel, T. (1978). The effect of affect on foreign language learning: A review of the anxiety research. Language Learning, 28(1), 129-142.
Seidenberg, M. S., & McClelland, J. L. (1989). A distributed, developmental model of word recognition and naming. Psychological Review, 96(4), 523.
Seliger, H.W., & Shohamy, E. (1989) Second language research methods. New York: Oxford University Press.
Selinker, L. (1972). Interlanguage. IRAL-International Review of Applied Linguistics in Language Teaching, 10(1-4), 209-232.
Skehan, P. (1991). Individual differences in second language learning. Studies in second language acquisition, 13(02), 275-298.
Skehan, P. (2009). Modeling second language performance: Integrating complexity,
195
accuracy, fluency, and lexis. Applied Linguistics, 30(4), 510-532.
Slabakova, R. (2008). Meaning in the second language. Berlin: Mouton de Gruyter.
Slabakova, R., Kempchinsky, P., & Rothman, J. (2012). Clitic-doubled left dislocation and focus fronting in L2 Spanish: A case of successful acquisition at the syntax–discourse interface. Second Language Research, 28(3), 319-343.
Slabakova, R., Leal Méndez, T., & Liskin-Gasparro, J. (in review). We have moved on: Current concepts and positions in generative SLA. Applied Linguistics, Forum Piece.
Slabakova, R., Rothman, J., & Kempchinsky, P. (2011). Gradient competence at the Syntax-Discourse Interface. EUROSLA Yearbook, 11(1), 218-243.
Slabakova, R., Rothman, J., Leal Méndez, T., Campos, G., & Kempchinsky, P. (2011). Pragmatic Features at the L2 Syntax-Discourse Interface. BUCLD 35 Proceedings. (pp. 564-576). Somerville, MA: Cascadilla.
Sorace, A., (2003). Near-nativeness. In C. Doughty & M.Long (Eds.), The handbook of second language acquisition, (pp. 130–151). Oxford: Blackwell.
Sorace, A. (2011). Pinning down the concept of “interface” in bilingualism. Linguistic Approaches to Bilingualism, 1(1) 1-34.
Sorace, A., & Filiaci, F. (2006). Anaphora resolution in near-native speakers of Italian. Second Language Research, 22(3), 339-368.
Sorace, A., & Serratrice, L. (2009). Internal and external interfaces in bilingual language development: Beyond structural overlap. International Journal of Bilingualism, 13(2), 195-210.
Spivey, M. J., & Marian, V. (1999). Cross talk between native and second languages: Partial activation of an irrelevant lexicon. Psychological Science,10(3), 281-284.
Street, J. A., & Dąbrowska, E. (2010). More individual differences in language attainment: How much do adult native speakers of English know about passives and quantifiers?. Lingua, 120(8), 2080-2094.
Suppes, P., Smith, R., & Leveillé, M. (1973). The French syntax of a child’s noun phrases. Archives de Psychologie, 42, 207-269.
Swets, B., Desmet, T., Clifton, C., & Ferreira, F. (2008). Underspecification of syntactic ambiguities: Evidence from self-paced reading. Memory & Cognition,36(1), 201-216.
Tanner, D., McLaughlin, J., Herschensohn, J., & Osterhout, L. (2013). Individual
196
differences reveal stages of L2 grammatical acquisition: ERP evidence. Bilingualism, Language and Cognition, 16, 367-382.
Thomas, M. (1994). Assessment of L2 proficiency in second language acquisition research. Language Learning, 44, 307-336.
Thomas, M. (1998). Programmatic ahistoricity in second language acquisition theory. Studies in Second Language Acquisition, 20, 387-405.
Thomas, M. (2006). Research synthesis and historiography: The case of assessment in second language proficiency. In J.M. Norris & L. Ortega, Synthesizing research on language learning and teaching, (pp. 279-298). Amsterdam: John Benhamins Publishing Company.
Tokowicz, N., & MacWhinney, B. (2005). Implicit and explicit measures of sensitivity to violations in second language grammar: An event-related potential investigation. Studies in second language acquisition, 27(02), 173-204.
Torrego, E. (1998). The Dependencies of Objects. Cambridge, MA: MIT Press.
Townsend, D. J., & Bever, T. G. (2001). Sentence comprehension: The integration of habits and rules. Cambridge, MA: MIT Press.
Trenkic, D. (2007). Variability in second language article production: beyond the representational deficit vs. processing constraints debate. Second Language Research, 23(3), 289-327.
Tsimpli, I.M., & Dimitrakopoulou, M. (2008). The Interpretability Hypothesis: Evidence from wh-interrogatives in second language acquisition. Second Language Research 23, 215-242.
Tsimpli, I. M., & Mastropavlou. M. (2007). Feature interpretability in L2 acquisition and SLI: Greek clitics and determiners. The role of formal features in second language acquisition, edited by H. Goodluck, J.Liceras & H.Zobl, 143-183. Routledge.
Ullman, M. T. (2001). The neural basis of lexicon and grammar in first and second language: The declarative/procedural model. Bilingualism: Language and cognition, 4(2), 105-122.
Ullman, M. T. (2004). Contributions of memory circuits to language: The declarative/procedural model. Cognition, 92, 231–270.
Vainikka, A., & Young-Scholten, M. (1996). Gradual development of L2 phrase structure. Second language research, 12(1), 7-39.
197
Valenzuela, E., 2006. L2 end state grammars and incomplete acquisition of the Spanish CLLD constructions. In: Slabakova, R., Montrul, S., Prévost, P. (Eds.), Inquiries in Linguistic Development: In Honor of Lydia White (pp. 283–304). Amsterdam: John Benjamins.
Vallduví, E., & Vilkuna, M. (1998). On rheme and kontrast. In W. Abraham (Ed.) On the Formal Syntax of the Westgermania (pp79-108). Amsterdam: John Benjamins.
van Haaften, T., Smits, R., & Vat, J. (1983). Left Dislocation, Connectedness and Reconstruction. In W. Abraham (Ed.) On the Formal Syntax of the Westgermania, (133-
154). Amsterdam: John Benjamins.
Van Petten, C., & Luka, B. J. (2012). Prediction during language comprehension: Benefits, costs, and ERP components. International Journal of Psychophysiology, 83(2), 176–190
Verhoeven, L., & de Jong, J. H. (Eds.). (1992). The construct of language proficiency: applications of psychological models to language assessment. Amsterdam: John Benjamins Publishing.
Villalba, X. (2000). The syntax of sentence periphery. PhD dissertation, Universitat Autònoma de Barcelona.
Ward, G.L. (1988). The semantics and pragmatics of preposing. Unpublished PhD dissertation, University of Pennsylvania, Philadelphia, PA, USA.
Weiss, J. L., & Schell, R. E. (1991). Estimating WAIS‐R IQ from the Shipley institute of living scale: A replication. Journal of Clinical Psychology, 47(4), 558-562.
West, S. G., Aiken, L. S., & Krull, J. L. (1996). Experimental personality designs: Analyzing categorical by continuous variable interactions. Journal of personality, 64(1), 1-48.
White, L. (1985). The pro-drop parameter in adult second language acquisition. Language Learning, 35, 47-62.
White, L. (2003). Second language acquisition and Universal Grammar. Cambridge: University Press.
White, L., Valenzuela, E., Kozlowska–Macgregor, M., & Leung, Y. K. I. (2004). Gender and number agreement in nonnative Spanish. Applied Psycholinguistics, 25(01), 105-133.
198
Williams, M., Burden, R., & Lanvers, U. (2002). ‘French is the language of love and stuff’: Student perceptions of issues related to motivation in learning a foreign language. British Educational Research Journal, 28(4), 503-528.
Willingham, D. B. (1998). A neuropsychological theory of motor skill learning. Psychological Review, 103(3), 558-584.
Wolfe-Quintero, K., Inagaki, S., & Kim, H. Y. (1998). Second language development in writing: Measures of fluency, accuracy, & complexity (No. 17). National Foreign Language Resource Center.
Yang, C. (2002). Knowledge and learning in natural language. Oxford: Oxford University Press.
Yang, C. (2010). Three factors in language variation. Lingua, 120(5), 1160-1177.
Zachary, R. A., Crumpton, E., & Spiegel, D. E. (1985). Estimating WAIS‐R IQ from the Shipley Institute of Living Scale. Journal of Clinical Psychology, 41(4), 532-540.
Zagona, K. (2002). The syntax of Spanish. Cambridge: Cambridge University Press.
Zubizarreta, M.L. (1998). Prosody, focus, and word order. Cambridge, MA: MIT Press.