34
What can a corpus tell us about lexis & What can a corpus tell us about grammer

What can a corpus tell us about grammar

Embed Size (px)

Citation preview

Page 1: What can a corpus tell us about grammar

What can a corpus tell us about lexis

& What can a corpus tell us about grammer

Page 2: What can a corpus tell us about grammar

Sami Ullah 13131502-103 Usama Shoukat13131502-065 Sarmad shakoor 13131502-058

Page 3: What can a corpus tell us about grammar

What can a corpus tell us about lexis

Page 4: What can a corpus tell us about grammar

Lexis and the lexicon

Lexis can be searched simply through exploring individual lexical items and their behaviour.

Or By using corpus data to examine the lexicon as a whole or to test lexical theory.

Page 5: What can a corpus tell us about grammar

THE GENERAL LEXICON

.What do corpora tell us about the English lexicon?

How many words comprise the main vocabulary of a language, its central lexicon?

“Exact rank orderings of words and rates of occurrence vary

according to corpus composition and lemmatisation policies”

Page 6: What can a corpus tell us about grammar

WORD FORMATION

General corpora also provide information about derivation and compounding, helping establish with potential words are actually institutionalised .

For example, Formations such as colo(u)rous , miscolo(u)r are morphologically possible but not found .

Page 7: What can a corpus tell us about grammar

Phraseology and phrases

It is better to look at words in their corpus than looking at them in isolation.

COLLOCATION AND PATTERNING What can we learn about a word from looking at its collocates, thewords which it co-occurs?

Page 8: What can a corpus tell us about grammar

FIXED EXPRESSIONS AND IDIOMS

Multi-words vs. idioms

Multi-words: Have a wide range

Great varieties of behaviour·

Idioms: Infrequent Appear mainly in journalism & fiction.

Page 9: What can a corpus tell us about grammar

Meaning

CONTEXT AND MEANING

Corpus concordances makes us aware of how far the meanings of words are derived from context

Do words have independence meaning at all?

What can corpora tell us about the meanings of words?

Page 10: What can a corpus tell us about grammar

POLYSEMY

How many different senses or uses words have?

How these are distinguished in context?

Page 11: What can a corpus tell us about grammar

METAPHOR, CONNOTATION & IDEOLOGY

Does corpora inform us about metaphors? How does it do it?

How is connotational meaning analyzed?

How is ideology analyzed?

Page 12: What can a corpus tell us about grammar

Sets and synonyms

LEXICAL SETS Is it useful to divide a corpus by fields?

SYNONYMS AND ANTONYMS

How does corpora analyze synonyms?

Antonyms also be explored with corpus data?

Page 13: What can a corpus tell us about grammar

Lexis in spoken language

How lexis in spoken interaction is analyzed or explored?

PHRASEOLOGY What types of features can appear when analyzing spoken lexis? What distinctions can be found between speech and writing lexis?

Page 14: What can a corpus tell us about grammar

MEANING AND USAGE

Do words and phraseologies fulfil pragmatic functions?

How the meaning has to be explored?

Page 15: What can a corpus tell us about grammar

What can a corpus tell us about grammar?

Page 16: What can a corpus tell us about grammar

Outline

Understanding grammar through patterns and contexts: moving from correct/incorrect to likely/unlikely

Methodological principles in corpus-based grammar analysis Types of grammatical patterns

Grammar-vocabulary associations (lexico-grammar) Grammar co-text

Discourse-level factors Context of the situation

Investigating multiple features/conditions simultaneously The grammar of speech

Page 17: What can a corpus tell us about grammar

Understanding grammar through patterns and contexts. DICHOTOMOUS PERSPECTIVE From this perspective, to describe the grammar of a language, all a

researcher needs is a native speaker because any native speaker can judge grammaticality.

Works well for certain grammatical features. However many grammatical choices cannot be made on the basis of correct/incorrect.

Empirical analyses. The analyses must cover numerous data in order to tell which

language choices are widespread, which occur predictably although under rare circumstances, and which are more idiosyncratic.

Page 18: What can a corpus tell us about grammar

Understanding grammar through patterns and contexts CORPUS LINGUISTICS: Increases researchers’ ability to systematically study the variation in

a large collection of texts. Common/uncommon typical/untypical. Show the correspondence between the use of a grammatical feature

and some other factor in the discourse or situational context O’Keeffe et al. (2007) explain, corpus analyses lead us to describing

grammar not just in structural terms, but in probabilistic terms – describing the typical social and discourse circumstances associated with the use of particular grammatical features .

Page 19: What can a corpus tell us about grammar

Understanding grammar through patterns and contexts METHODOLOGICAL PRINCIPLES IN CORPUS-BASED GRAMMAR

ANALYSIS Any analysis of ‘ typical ’ or ‘ probable ’ choices depends on

frequency analysis. McCarthy and Carter (2001) explain the need for fine-grained

distinctions in spoken corpora to describe when ellipsis is and is not common.

Biber et al . (2004) explain, we do not regard frequency data as explanatory. In fact we would argue for the opposite: frequency data identifies patterns that must be explained. The usefulness of frequency data (and corpus analysis generally) is that it identifies patterns of use that otherwise often go unnoticed by researchers.

Page 20: What can a corpus tell us about grammar

Types of grammatical patterns

GRAMMAR-VOCABULARY ASSOCIATIONS Grammar and lexis were not as distinct as traditionally presented. One type of lexico-grammatical relationship concerns the lexical items that

tend to occur with a particular grammatical structure ( that -clause objects). Biber et al. (1999) find that think , say and know are by far the most

common verbs with that -clauses in both British and American conversation. The structure is less common overall with any verb in academic prose, but suggest and show are most common.

Rather than reporting thoughts and feelings, the verb + that -clause structures in academic prose are used to report previous research.

Another type of lexico-grammatical relationship: verb tense. A grammatical description would explain the form of tenses.

Page 21: What can a corpus tell us about grammar

Types of grammatical patterns

GRAMMAR-VOCABULARY ASSOCIATIONS No empirical investigation. A modern corpus-based reference

grammar can provide that information. The verbs most strongly associated with present tense convey

mental, emotional and logical states: E.g.: bet, doubt, know, matter, mean, mind, reckon, suppose, thank The verbs most strongly associated with past tense, convey events or

activities, especially body movements and speech: E.g. : exclaim, eye, glance, grin, nod, pause, remark, reply, shrug,

sigh

Page 22: What can a corpus tell us about grammar

Types of grammatical patterns

GRAMMAR-VOCABULARY ASSOCIATIONS Certain structures tend to be associated with certain types of

meaning, such as positive or negative circumstances. This is called ‘ semantic prosody ’ (Sinclair 1991).

O’Keeffe et al. (2007: 106–14) provide an extended corpus-based analysis of get -passives (e.g. he got arrested). They show that the get -passive is usually used to express unfortunate incidences.

O’Keeffe et al. (2007) discuss the type of subjects usually found with get-passives and he lack of adverbials in these clauses.

Page 23: What can a corpus tell us about grammar

Types of grammatical patterns

GRAMMATICAL CO-TEXT Grammatical descriptions in traditional textbooks sometimes make

claims about the grammatical co-text of features. Example: Frazier investigation (2003) Investigated: would- clauses of hypothetical or counterfactual

conditionals. e.g.: If water boils, it will turn to steam. Concerned about: the way that ESL grammars virtually always

present the would- clause as an adjacent to an if- clause.

Page 24: What can a corpus tell us about grammar

Types of grammatical patterns

GRAMMATICAL CO-TEXT Frazier investigation (2003) Examined: the extent to which this was true in combination of spoken

and written corpora over a million words. Found: almost 80% of the hypothetical would -clauses were not

adjacent to an if -clause. Some of them were part of continuing discourse that had been framed with an if -clause at a lengthy distance from the would -clause.

The co-occurring grammatical features include infinitives and gerunds.

e.g.: Letting the administration … hands would give them.

Page 25: What can a corpus tell us about grammar

GRAMMATICAL CO-TEXT

Types of grammatical patterns Frazier's systematic corpus analysis thus highlights two aspects of

grammatical co-text for hypothetical/counter factual would -clauses: The traditional claim that they usually occur with if -clauses is NOT

true. They do often occur with infinitives and gerunds. Looking at grammatical co-occurrence patterns can also help to

explain when rare constructions occur.

Page 26: What can a corpus tell us about grammar

GRAMMATICAL CO-TEXT

Types of grammatical patterns Constructions with that - and the fact that - subject position clauses

occur about twenty to forty times per million words in academic prose and newspapers, and almost never in conversations.

that -clauses in other positions occur over 2,000 to 7,000 times per million words in different registers.

These subject position clauses are obviously harder for listeners or readers to process.

Page 27: What can a corpus tell us about grammar

DISCOURSE-LEVEL FACTORS

Types of grammatical patterns Because many people ’ s introduction to corpus linguistics is with

simple concordance searches, they sometimes believe that corpus linguistics has little to offer discourse-level studies.

That is not the case because for example, determining the semantic prosody of get -passives required considering discourse context.

When that -clauses are in subject position, they tend to restate information that has already been mentioned in the previous discourse.

Page 28: What can a corpus tell us about grammar

DISCOURSE-LEVEL FACTORS

Example: It became surprisingly apparent that all meteorites are of the same

age, somewhere in the vicinity of 4.5 billion years old … That there are no meteorites of any other age, regardless of when they fell to Earth , suggests strongly that… ”

Analysis of discourse-level factors affecting grammar often requires interpreting meaning, organisation, and information structure in texts. Such analysis is part of the more qualitative, interpretative side of corpus study, focusing on how a grammatical structure is used in context.

Page 29: What can a corpus tell us about grammar

CONTEXT OF THE SITUATION

Number of factors associated with a grammatical feature: Audience Purpose Participant roles Formality of the situation Most common perspective on variation = registers or genres.

Page 30: What can a corpus tell us about grammar

CONTEXT OF THE SITUATION

Cambridge Grammar of English (comparisons between the use of features in speech and writing).

Use of Michigan Corpus of Academic Spoken English . Biber compares grammatical features across ten spoken and written

registers from four American universities. It is misleading to characterise the frequency and use of a

grammatical feature in only one way if focused on general settings. Variables like social class, ethnic group and age have been less

studied in corpus-based grammar research. In the future, new corpus will facilitate more study of the

sociolinguistic variation of grammatical features.

Page 31: What can a corpus tell us about grammar

Investigating multiple features/conditions simultaneously Difficult to focus on only one pattern when explaining grammatical

choices. Example: omitting that in a (that -complement clause) Omitting that

is associated with some factors: Lexico-grammatical: the verb in the main clause is say or think . Grammatical co-text: main and complement clause have co-

referential subjects; that -clause has a personal pronoun subject. <Situational context: that is omitted more often in conversation than in newspaper writing.

Page 32: What can a corpus tell us about grammar

Investigating multiple features/conditions simultaneously Two approaches: To consider a functional system within a language and describe

factors that influence the grammatical features that are used to realise the system.

To study the grammar of a variety. The focus shifts from describing grammar to describing the variety.

Page 33: What can a corpus tell us about grammar

The grammar of speech

Unplanned spoken language was neglected for its 'incomplete' clauses,

messy repairs and non-standard forms. Spoken grammar is studied as a legitimate grammar. The common choice of though rather than however as a contrastive

connector in conversation. [Watching a football game, discussing a penalty call] A: Oh, that's outrageous. B: Well, he did put his foot out though . Speaker B disagrees with A, but the use of though (along with well )

downplays the disagreement.

Page 34: What can a corpus tell us about grammar

. The grammar of speech

The grammar of conversation has been thought to be simple compared to writing but there is complexity in conversation at a clausal level.

[The trouble is [[if you're the only one in the house] [he follows you] [and you're looking for him] [so you can't find him.]]] [I thought [I wonder [where the hell he's gone]]] [I mean he was immediately behind me.]] (Biber et al. 1999: 1068) Corpus also provides frequency information as evidence that certain

complex structures are more common in conversation than writing.