Syntax Analysis - WordPress.com · 1. To check syntax (= string recognizer) – And to report...

Syntax AnalysisSyntax Analysis

Syntax Analysis

LexicalAnalyzerLexical

Analyzer

Parserand rest offront-end

SourceProgram

Token,tokenval

Get nexttoken

Intermediaterepresentation

Position of a Parserin the Compiler Model

LexicalAnalyzer

Parserand rest offront-end

Symbol TableSymbol Table

Get nexttoken

Lexical error Syntax errorSemantic error

The Role Of Parser

• A parser implements a C-F grammar• The role of the parser is twofold:1. To check syntax (= string recognizer)

– And to report syntax errors accurately

2. To invoke semantic actions– For static semantics checking, e.g. type checking of

expressions, functions, etc.– For syntax-directed translation of the source code to an

intermediate representation

Syntax-Directed Translation

• One of the major roles of the parser is to produce anintermediate representation (IR) of the sourceprogram using syntax-directed translation methods

• Possible IR output:– Abstract syntax trees (ASTs)– Control-flow graphs (CFGs) with triples, three-address

code, or register transfer list notation– WHIRL (SGI Pro64 compiler) has 5 IR levels!

Error Handling

• A good compiler should assist in identifying andlocating errors– Lexical errors: important, compiler can easily recover and

continue– Syntax errors: most important for compiler, can almost

always recover– Static semantic errors: important, can sometimes recover– Dynamic semantic errors: hard or impossible to detect at

compile time, runtime checks are required– Logical errors: hard or impossible to detect

Viable-Prefix Property

• The viable-prefix property of LL/LR parsersallows early detection of syntax errors– Goal: detection of an error as soon as possible

without further consuming unnecessary input– How: detect an error as soon as the prefix of the

input does not match a prefix of any string in thelanguage

…for (;)…

…DO 10 I = 1;0…

Error isdetected here

Prefix Prefix

Error Recovery Strategies

• Panic mode– Discard input until a token in a set of designated

synchronizing tokens is found• Phrase-level recovery

– Perform local correction on the input to repair the error• Error productions

– Augment grammar with productions for erroneousconstructs

• Global correction– Choose a minimal sequence of changes to obtain a global

least-cost correction

Grammars (Recap)

• Context-free grammar is a 4-tupleG = (N, T, P, S) where– T is a finite set of tokens (terminal symbols)– N is a finite set of nonterminals– P is a finite set of productions of the form

where (NT)* N (NT)* and (NT)*– S N is a designated start symbol

Notational Conventions Used

• Terminalsa,b,c,… Tspecific terminals: 0, 1, id, +

• NonterminalsA,B,C,… Nspecific nonterminals: expr, term, stmt

• Grammar symbolsX,Y,Z (NT)

• Strings of terminalsu,v,w,x,y,z T*

• Strings of grammar symbols,, (NT)*

Derivations (Recap)

• The one-step derivation is defined by A

where A is a production in the grammar• In addition, we define

– is leftmostlm if does not contain a nonterminal– is rightmostrm if does not contain a nonterminal– Transitive closure* (zero or more steps)– Positive closure+ (one or more steps)

• The language generated by G is defined byL(G) = {w T* | S+ w}

Derivation (Example)

Grammar G = ({E}, {+,*,(,),-,id}, P, E) withproductions P = E E + E

E E * EE ( E )E - EE id

E - E - idE - E - id

E* EE* E

E+ id * id + idE+ id * id + id

Erm E + Erm E + idrm id + idErm E + Erm E + idrm id + id

Example derivations:Example derivations:

E* id + idE* id + id

Chomsky Hierarchy: LanguageClassification

• A grammar G is said to be– Regular if it is right linear where each production is of the

formA w B or A w

or left linear where each production is of the formA B w or A w

– Context free if each production is of the formA

where A N and (NT)*– Context sensitive if each production is of the form A

where A N, ,, (NT)*, || > 0– Unrestricted

formA w B or A w

Chomsky Hierarchy

L(regular) L(context free) L(context sensitive) L(unrestricted)L(regular) L(context free) L(context sensitive) L(unrestricted)

Where L(T) = { L(G) | G is of type T }That is: the set of all languages

generated by grammars G of type T

L1 = { anbn | n 1 } is context freeL1 = { anbn | n 1 } is context free

L2 = { anbncn | n 1 } is context sensitiveL2 = { anbncn | n 1 } is context sensitive

Every finite language is regular! (construct a FSA for strings in L(G))Every finite language is regular! (construct a FSA for strings in L(G))

Examples:Examples:

Parsing

• Universal (any C-F grammar)– Cocke-Younger-Kasimi– Earley

• Top-down (C-F grammar with restrictions)– Recursive descent (predictive parsing)– LL (Left-to-right, Leftmost derivation) methods

• Bottom-up (C-F grammar with restrictions)– Operator precedence parsing– LR (Left-to-right, Rightmost derivation) methods

• SLR, canonical LR, LALR

Top-Down Parsing

• LL methods (Left-to-right, Leftmost derivation)and recursive-descent parsing

Grammar:E T + TT ( E )T - ET id

Leftmost derivation:Elm T + Tlm id + Tlm id + id

• LL methods (Left-to-right, Leftmost derivation)and recursive-descent parsing

Grammar:E T + TT ( E )T - ET id

Leftmost derivation:Elm T + Tlm id + Tlm id + id

• Productions of the formA A

are left recursive• When one of the productions in a grammar is

left recursive then a predictive parser loopsforever on certain inputs

Left Recursion (Recap)

General Left Recursion EliminationMethod

Arrange the nonterminals in some order A1, A2, …, Anfor i = 1, …, n do

for j = 1, …, i-1 doreplace each

Ai Aj with

Ai 1 | 2 | … | k where

Aj 1 | 2 | … | kenddoeliminate the immediate left recursion in Ai

Ai Aj with

Ai 1 | 2 | … | k where

Ai Aj with

Ai 1 | 2 | … | k where

Ai Aj with

Ai 1 | 2 | … | k where

Immediate Left-Recursion EliminationMethod

Rewrite every left-recursive productionA A

| | | A

into a right-recursive production:A AR

| ARAR AR

| | | A

| ARAR AR

| | | A

| ARAR AR

| | | A

| ARAR AR

Example Left Recursion Elim.A B C | aB C A | A bC A B | C C | a

A B C | aB C A | A bC A B | C C | a

Choose arrangement: A, B, C

i = 1: nothing to doi = 2, j = 1: B C A | A b

B C A | B C b | a b(imm) B C A BR | a b BR

BR C b BR | i = 3, j = 1: C A B | C C | a

C B C B | a B | C C | ai = 3, j = 2: C B C B | a B | C C | a

C C A BR C B | a b BR C B | a B | C C | a(imm) C a b BR C B CR | a B CR | a CR

CR A BR C B CR | C CR |

BR C b BR | i = 3, j = 1: C A B | C C | a

Left Factoring

• When a nonterminal has two or more productionswhose right-hand sides start with the same grammarsymbols, the grammar is not LL(1) and cannot beused for predictive parsing

• Replace productionsA 1 | 2 | … | n |

withA AR | AR1 | 2 | … | n

Predictive Parsing

• Eliminate left recursion from grammar• Left factor the grammar• Compute FIRST and FOLLOW• Two variants:

– Recursive (recursive calls)– Non-recursive (table-driven)

FIRST (Revisited)

• FIRST() = { the set of terminals that begin allstrings derived from }

FIRST(a) = {a} if a TFIRST() = {}FIRST(A) = A FIRST() for A PFIRST(X1X2…Xk) =

if for all j = 1, …, i-1 : FIRST(Xj) thenadd non- in FIRST(Xi) to FIRST(X1X2…Xk)

if for all j = 1, …, k : FIRST(Xj) thenadd to FIRST(X1X2…Xk)

FOLLOW• FOLLOW(A) = { the set of terminals that can

immediately follow nonterminal A }

FOLLOW(A) =for all (B A ) P do

add FIRST()\{} to FOLLOW(A)for all (B A ) P and FIRST() do

add FOLLOW(B) to FOLLOW(A)for all (B A) P do

add FOLLOW(B) to FOLLOW(A)if A is the start symbol S then

add $ to FOLLOW(A)

• FOLLOW(A) = { the set of terminals that canimmediately follow nonterminal A }

add $ to FOLLOW(A)

First Set (2)

S aSeS BB bBeB CC cCeC d

Red : A Blue :

First Set (2)

• First (SaSe) = First(a) ={a}• First (SB) = First(B)• First (B bBe) = First(b)={b}• First (B C) = First(C)• First (C cCe) = First(c) ={c}• First (C d) = First(d)={d}

Red : A Blue :

Step 1:• First (SaSe) = First(a) ={a}• First (SB) = First(B)• First (B bBe) = First(b)={b}• First (B C) = First(C)• First (C cCe) = First(c) ={c}• First (C d) = First(d)={d}

First Set (2)

• First (SaSe) = {a}• First (SB) = First(B)• First (B bBe) = {b}• First (B C) = First(C)• First (C cCe) = {c}• First (C d) = {d}

Red : A Blue :

Step 1:• First (SaSe) = {a}• First (SB) = First(B)• First (B bBe) = {b}• First (B C) = First(C)• First (C cCe) = {c}• First (C d) = {d}

Step First Set

S B C a b c d

Step 1 {a}∪First(B) {b}∪First(C) {c, d}

First Set (2)

• First (SaSe) = {a}• First (SB) = First(B) = {b}∪First(C)• First (B bBe) = {b}• First (B C) = First(C)• First (C cCe) = {c}• First (C d) = {d}

Red : A Blue :

Step 2:• First (SaSe) = {a}• First (SB) = First(B) = {b}∪First(C)• First (B bBe) = {b}• First (B C) = First(C)• First (C cCe) = {c}• First (C d) = {d}

Step First Set

S B C a b c d

First Set (2)

• First (SaSe) = {a}• First (SB) = {b}∪First(C)• First (B bBe) = {b}• First (B C) = First(C)• First (C cCe) = {c}• First (C d) = {d}

Red : A Blue :

Step 2:• First (SaSe) = {a}• First (SB) = {b}∪First(C)• First (B bBe) = {b}• First (B C) = First(C)• First (C cCe) = {c}• First (C d) = {d}

Step First Set

S B C a b c d

Step 2 {a}∪ {b}∪First(C)

First Set (2)

• First (SaSe) = {a}• First (SB) = {b}∪First(C)• First (B bBe) = {b}• First (B C) = First(C) = {c, d}• First (C cCe) = {c}• First (C d) = {d}

Red : A Blue :

Step 2:• First (SaSe) = {a}• First (SB) = {b}∪First(C)• First (B bBe) = {b}• First (B C) = First(C) = {c, d}• First (C cCe) = {c}• First (C d) = {d}

Step First Set

S B C a b c d

Step 2 {a}∪ {b}∪First(C)

First Set (2)

• First (SaSe) = {a}• First (SB) = {b}∪First(C)• First (B bBe) = {b}• First (B C) = {c, d}• First (C cCe) = {c}• First (C d) = {d}

Red : A Blue :

Step 2:• First (SaSe) = {a}• First (SB) = {b}∪First(C)• First (B bBe) = {b}• First (B C) = {c, d}• First (C cCe) = {c}• First (C d) = {d}

Step First Set

S B C a b c d

Step 2 {a}∪ {b}∪First(C) {b}∪{c, d} = {b,c,d} {c, d}

First Set (2)

• First (SaSe) = {a}• First (SB) = {b}∪First(C) = {b}∪ {c, d}• First (B bBe) = {b}• First (B C) = {c, d}• First (C cCe) = {c}• First (C d) = {d}

Red : A Blue :

Step 3:• First (SaSe) = {a}• First (SB) = {b}∪First(C) = {b}∪ {c, d}• First (B bBe) = {b}• First (B C) = {c, d}• First (C cCe) = {c}• First (C d) = {d}

Step First Set

S B C a b c d

First Set (2)

• First (SaSe) = {a}• First (SB) = {b, c, d}• First (B bBe) = {b}• First (B C) = {c, d}• First (C cCe) = {c}• First (C d) = {d}

Red : A Blue :

Step 3:• First (SaSe) = {a}• First (SB) = {b, c, d}• First (B bBe) = {b}• First (B C) = {c, d}• First (C cCe) = {c}• First (C d) = {d}

Step First Set

S B C a b c d

{a}∪First(B) {b}∪First(C) {c, d}

{a}∪ {b}∪First(C) {b}∪{c, d} = {b,c,d} {c, d}

{a}∪ {b}∪{c, d} = {a,b,c,d} {b}∪{c, d} = {b,c,d} {c, d}

First Set (2)

Red : A Blue :

Step 3:

If no more change…The first set of a terminalsymbol is itself

Step First Set

S B C a b c d

Step 3 {a}∪ {b}∪{c, d} = {a,b,c,d} {b}∪{c, d} = {b,c,d} {c, d} {a} {b} {c} {d}

Another Example….

First Set (2)

S ABcA aA B bB

Red : A Blue :

S ABcA aA B bB

First Set (2)S ABcA aA B bB

Red : A Blue :

• First (SABc) = First(ABc)• First (Aa) = First(a)• First (A ) = First()∪First()• First (B b) = First(b)• First (B ) = First()∪First()

Step 1:• First (SABc) = First(ABc)• First (Aa) = First(a)• First (A ) = First()∪First()• First (B b) = First(b)• First (B ) = First()∪First()

Red : A Blue :

• First (SABc) = First(ABc)• First (Aa) = {a}• First (A ) = {}• First (B b) = {b}• First (B ) = {}

Step 1:• First (SABc) = First(ABc)• First (Aa) = {a}• First (A ) = {}• First (B b) = {b}• First (B ) = {}

Step First Set

S A B a b c

Step 1 First(ABc) {a, } {b, }

Red : A Blue :

• First (SABc) = First(ABc) = {a, }= {a, } - {} ∪ First(Bc)= {a} ∪ First(Bc)

• First (Aa) = {a}• First (A ) = {}• First (B b) = {b}• First (B ) = {}

Step 2:• First (SABc) = First(ABc) = {a, }

= {a, } - {} ∪ First(Bc)= {a} ∪ First(Bc)

• First (Aa) = {a}• First (A ) = {}• First (B b) = {b}• First (B ) = {} Step First Set

S A B a b c

Step 2 {a} ∪ First(Bc) {a, } {b, }

Red : A Blue :

• First (SABc) = {a} ∪ First(Bc)= {a} ∪{b, }= {a} ∪{b, } - {} ∪First(c)= {a} ∪{b,c}

Step 3:• First (SABc) = {a} ∪ First(Bc)

= {a} ∪{b, }= {a} ∪{b, } - {} ∪First(c)= {a} ∪{b,c}

Step First Set

S A B a b c

Step 2 {a} ∪ First(Bc) {a, } {b, }

Step 3 {a} ∪ {b, c}= {a,b,c} {a, } {b, }

Red : A Blue :

• First (SABc) = {a,b,c}• First (Aa) = {a}• First (A ) = {}• First (B b) = {b}• First (B ) = {}

Step 3:• First (SABc) = {a,b,c}• First (Aa) = {a}• First (A ) = {}• First (B b) = {b}• First (B ) = {}

Step First Set

S A B a b c

First(ABc) {a, } {b, }

{a} ∪ First(Bc) {a, } {b, }

{a} ∪ {b, c}= {a,b,c} {a, } {b, } {a} {b} {c}

LL(1) Grammar

• A grammar G is LL(1) if it is not left recursive and foreach collection of productions

A1 | 2 | … | nfor nonterminal A the following holds:

1. FIRST(i) FIRST(j) = for all i j2. if i* then

2.a. j* for all i j2.b. FIRST(j) FOLLOW(A) =

for all i j

Non-LL(1) Examples

Grammar Not LL(1) because:S S a | a Left recursiveS a S | a FIRST(a S) FIRST(a)

S a S | a FIRST(a S) FIRST(a) S a R | R S | For R: S* and * S a R aR S |

For R:FIRST(S) FOLLOW(R)

Recursive Descent Parsing (Recap)

• Grammar must be LL(1)• Every nonterminal has one (recursive) procedure

responsible for parsing the nonterminal’s syntacticcategory of input tokens

• When a nonterminal has multiple productions, eachproduction is implemented in a branch of a selectionstatement based on input look-ahead information

Using FIRST and FOLLOW to Write aRecursive Descent Parser

expr term restrest + term rest

| - term rest|

term id

| - term rest|

term id

procedure rest();begin

if lookahead in FIRST(+ term rest) thenmatch(‘+’); term(); rest()

else if lookahead in FIRST(- term rest) thenmatch(‘-’); term(); rest()

else if lookahead in FOLLOW(rest) thenreturn

else error()end;

| - term rest|

term id

| - term rest|

term id

else error()end;

FIRST(+ term rest) = { + }FIRST(- term rest) = { - }FOLLOW(rest) = { $ }

Non-Recursive Predictive Parsing:Table-Driven Parsing

• Given an LL(1) grammar G = (N, T, P, S)construct a table M[A,a] for A N, a T anduse a driver program with a stack

Predictive parsingprogram (driver)

Parsing tableM

a + b $

output

Constructing an LL(1) PredictiveParsing Table

for each production A do

for each a FIRST() doadd A to M[A,a]

if FIRST() then

for each b FOLLOW(A) doadd A to M[A,b]

enddoendif

enddoMark each undefined entry in M error

if FIRST() then

enddoendif

if FIRST() then

enddoendif

if FIRST() then

enddoendif

Example Table

E T ERER + T ER | T F TRTR * F TR | F ( E ) | id

A FIRST() FOLLOW(A)E T ER ( id $ )

ER + T ER + $ )ER $ )

T F TR ( id + $ )TR * F TR * + $ )

TR + $ )F ( E ) ( * + $ )

id + * ( ) $E E T ER E T ER

ER ER + T ER ER ER

T T F TR T F TR

TR TR TR * F TR TR TR

F F id F ( E )

F ( E ) ( * + $ )F id id * + $ )

LL(1) Grammars are Unambiguous

Ambiguous grammarS i E t S SR | aSR e S | E b

A FIRST() FOLLOW(A)S i E t S SR i e $

S a a e $SR e S e e $

Ambiguous grammarS i E t S SR | aSR e S | E b

a b e i t $S S a S i E t S SR

SR e S SR

SR e S e e $SR e $E b b t

Error: duplicate table entry

Predictive Parsing Program (Driver)push($)push(S)a := lookaheadrepeat

X := pop()if X is a terminal or X = $ then

match(X) // moves to next token and a := lookaheadelse if M[X,a] = X Y1Y2…Yk then

push(Yk, Yk-1, …, Y2, Y1) // such that Y1 is on top… invoke actions and/or produce IR output …

else error()endif

until X = $

push($)push(S)a := lookaheadrepeat

else error()endif

until X = $

else error()endif

until X = $

else error()endif

until X = $

Example Table-Driven ParsingStack$E$ERT$ERTRF$ERTRid$ERTR$ER$ERT+$ERT$ERTRF$ERTRid$ERTR$ERTRF*$ERTRF$ERTRid$ERTR$ER$

Stack$E$ERT$ERTRF$ERTRid$ERTR$ER$ERT+$ERT$ERTRF$ERTRid$ERTR$ERTRF*$ERTRF$ERTRid$ERTR$ER$

Inputid+id*id$id+id*id$id+id*id$id+id*id$

+id*id$+id*id$+id*id$

id*id$id*id$id*id$

*id$*id$

id$id$

id*id$id*id$id*id$

*id$*id$

id$id$

Production appliedE T ERT F TRF id

TR ER + T ER

T F TRF id

TR * F TR

TR ER + T ER

T F TRF id

TR * F TR

Stack$E$ERT$ERTRF$ERTRid$ERTR$ER$ERT+$ERT$ERTRF$ERTRid$ERTR$ERTRF*$ERTRF$ERTRid$ERTR$ER$

id*id$id*id$id*id$

*id$*id$

id$id$

id*id$id*id$id*id$

*id$*id$

id$id$

TR ER + T ER

T F TRF id

TR * F TR

TR ER + T ER

T F TRF id

TR * F TR

Panic Mode Recovery

FOLLOW(E) = { ) $ }FOLLOW(ER) = { ) $ }FOLLOW(T) = { + ) $ }FOLLOW(TR) = { + ) $ }FOLLOW(F) = { + * ) $ }

Add synchronizing actions toundefined entries based on FOLLOWAdd synchronizing actions toundefined entries based on FOLLOW

Pro: Can be automatedCons: Error messages are neededPro: Can be automatedCons: Error messages are needed

id + * ( ) $E E T ER E T ER synch synchER ER + T ER ER ER

T T F TR synch T F TR synch synchTR TR TR * F TR TR TR

F F id synch synch F ( E ) synch synch

FOLLOW(E) = { ) $ }FOLLOW(ER) = { ) $ }FOLLOW(T) = { + ) $ }FOLLOW(TR) = { + ) $ }FOLLOW(F) = { + * ) $ }

synch: the driver pops current nonterminal A and skips input tillsynch token or skips input until one of FIRST(A) is found

Pro: Can be automatedCons: Error messages are needed

Phrase-Level Recovery

Change input stream by inserting missing tokensFor example: id id is changed into id * idChange input stream by inserting missing tokensFor example: id id is changed into id * id

Can then continue here

Pro: Can be automatedCons: Recovery not always intuitivePro: Can be automatedCons: Recovery not always intuitive

T T F TR synch T F TR synch synchTR insert * TR TR * F TR TR TR

insert *: driver inserts missing * and retries the production

Can then continue here

Error Productions

E T ERER + T ER | T F TRTR * F TR | F ( E ) | id

Add “error production”:TR F TR

to ignore missing *, e.g.: id id

Add “error production”:TR F TR

to ignore missing *, e.g.: id id

Pro: Powerful recovery methodCons: Cannot be automatedPro: Powerful recovery methodCons: Cannot be automated

T T F TR synch T F TR synch synchTR TR F TR TR TR * F TR TR TR

E T ERER + T ER | T F TRTR * F TR | F ( E ) | id Pro: Powerful recovery method

Cons: Cannot be automatedPro: Powerful recovery methodCons: Cannot be automated

Bottom-Up Parsing

• LR methods (Left-to-right, Rightmostderivation)– SLR, Canonical LR, LALR

• Other special cases:– Shift-reduce parsing– Operator-precedence parsing

Operator-Precedence Parsing

• Special case of shift-reduce parsing• We will not further discuss (you can skip

textbook section 4.6)

Shift-Reduce Parsing

Grammar:S a A B eA A b c | bB d

Shift-reduce correspondsto a rightmost derivation:Srm a A B erm a A d erm a A b c d erm a b b c d e

Reducing a sentence:a b b c d ea A b c d ea A d ea A B eS

Shift-reduce correspondsto a rightmost derivation:Srm a A B erm a A d erm a A b c d erm a b b c d e

Reducing a sentence:a b b c d ea A b c d ea A d ea A B eS

a b b c d e

These matchproduction’s

right-hand sides

These matchproduction’s

right-hand sides

Handles

A handle is a substring of grammar symbols in aright-sentential form that matches a right-hand side

of a production

A handle is a substring of grammar symbols in aright-sentential form that matches a right-hand side

of a productiona b b c d ea A b c d ea A d ea A B eS

a b b c d ea A b c d ea A d ea A B eS

Handle

NOT a handle, becausefurther reductions will fail

(result is not a sentential form)

a b b c d ea A b c d ea A A e… ?

a b b c d ea A b c d ea A d ea A B eS

Stack Implementation ofShift-Reduce Parsing

Stack$$id$E$E+$E+id$E+E$E+E*$E+E*id$E+E*E$E+E$E

Inputid+id*id$

+id*id$+id*id$

id*id$*id$*id$

id$$$$$

Inputid+id*id$

+id*id$+id*id$

id*id$*id$*id$

id$$$$$

Actionshiftreduce E idshiftshiftreduce E idshift (or reduce?)shiftreduce E idreduce E E * Ereduce E E + Eaccept

Grammar:E E + EE E * EE ( E )E id

How toresolve

conflicts?

Stack$$id$E$E+$E+id$E+E$E+E*$E+E*id$E+E*E$E+E$E

Inputid+id*id$

+id*id$+id*id$

id*id$*id$*id$

id$$$$$

Inputid+id*id$

+id*id$+id*id$

id*id$*id$*id$

id$$$$$

Actionshiftreduce E idshiftshiftreduce E idshift (or reduce?)shiftreduce E idreduce E E * Ereduce E E + Eaccept

Grammar:E E + EE E * EE ( E )E id

Find handlesto reduce

How toresolve

conflicts?

Conflicts

• Shift-reduce and reduce-reduce conflicts arecaused by– The limitations of the LR parsing method (even

when the grammar is unambiguous)– Ambiguity of the grammar

Shift-Reduce Parsing:Shift-Reduce Conflicts

Stack$…$…if E then S

Input…$

else…$

Input…$

else…$

Action…shift or reduce?

Ambiguous grammar:S if E then S

| if E then S else S| other

Stack$…$…if E then S

Input…$

else…$

Action…shift or reduce?

Resolve in favorof shift, so else

matches closest if

Resolve in favorof shift, so else

matches closest if

Shift-Reduce Parsing:Reduce-Reduce Conflicts

Stack$$a

Inputaa$

Actionshiftreduce A a or B a ?

Grammar:C A BA aB a

Stack$$a

Inputaa$

Actionshiftreduce A a or B a ?

Grammar:C A BA aB a

Resolve in favorof reduce A a,

otherwise we’re stuck!

Resolve in favorof reduce A a,

otherwise we’re stuck!

LR(k) Parsers: Use a DFA forShift/Reduce Decisions

0start

aState I1:S C•State I1:S C• State I4:

C A B•State I4:C A B•goto(I0,C)

Grammar:S CC A BA aB a

State I0:S •CC •A BA •a

State I1:S C•

State I2:C A•BB •a

State I3:A a•State I3:A a•

State I4:C A B•State I4:C A B•

State I5:B a•State I5:B a•

goto(I0,C)

goto(I0,a)

goto(I0,A)

goto(I2,a)

goto(I2,B)

Can onlyreduce A a(not B a)

DFA for Shift/Reduce Decisions