The Pennsylvania State University
The Graduate School
College of Agricultural Sciences
PREPARATION AND CHARACTERIZATION OF LIGNIN-
PROTEIN COVALENT LINKAGES
A Dissertation in
Biorenewable Systems
by
Brett Galen Diehl
©2014 Brett Galen Diehl
Submitted in Partial Fulfillment
of the Requirements
for the Degree of
Doctor of Philosophy
May 2014
ii
The dissertation of Brett Galen Diehl was reviewed and approved* by the following:
Nicole R. Brown
Associate Professor of Wood Chemistry
Dissertation Adviser
Chair of Committee
John E. Carlson
Professor of Molecular Genetics
Jeffrey M. Catchmark
Associate Professor of Agricultural and Biological Engineering
Emmanuel Hatzakis
Director of NMR facility
John Ralph
Special Member
Professor of Biochemistry
University of Wisconsin at Madison
Paul Smith
Head of Biorenewable Systems department
*Signatures are on file in the Graduate School.
iii
Abstract
Lignin is a natural aromatic polymer that is bio-synthesized in the cell walls of almost all
land plants. Great strides have been made in understanding lignin’s biological origins and
chemical and physical properties. However, many unanswered questions remain. For example,
the extent to which lignin interacts with other cell wall components, such as proteins, is largely
unknown. In order to help address this question, the preparation and characterization of lignin-
protein covalent linkages is reported here for the first time. Chapter 1 provides a more detailed
introduction, justification, and literature review.
Chapter 2 focuses on the preparation of low molecular weight lignin-protein model
compounds. The compounds were not prepared under biomimetic conditions. Instead, the
primary focus of this study was on the characterization of the model compounds, leading to the
identification of diagnostic lignin-protein NMR chemical shifts.
Chapter 3 describes the characterization of lignin-protein linkages prepared under
biomimetic conditions of lignin DHP formation. NMR showed that cysteine and tyrosine
containing peptides covalently crosslink with lignin, while other amino acids do not. IR and EDS
were useful for showing the general incorporation of protein into the lignin, but were incapable
of distinguishing covalent and non-covalent interactions.
Chapter 4 describes the interaction between lignin and gelatin protein. It was found, using
EDS and IR, that gelatin was incorporated into lignin DHP. However, a lack of diagnostic NMR
signatures revealed that the crosslinking was likely dominated by non-covalent interactions such
as physical entanglement. This seems likely, as gelatin is lacking in both cysteine and tyrosine
residues, which were shown to be the only reactive amino acids towards lignin.
Chapter 5 details attempts at identifying lignin-protein linkages in wild type Arabidopsis.
Arabidopsis was grown to maturity, then lignin was extracted from cell wall material using
acidified dioxane. Elemental analysis was used to show that the lignin was contaminated with
about 3.75% protein; however, NMR was not able to identify lignin-protein covalent linkages.
Chapter 6 details some future experiments that could be used to explore lignin-protein
linkages, and it is hoped that this work will pave the way for such studies.
iv
TABLE OF CONTENTS
List of Figures…………………………………………………………………………………....vii
List of Tables……………………………………………………………………………………viii
Abbreviations……………………………………………………………………………………..ix
Acknowledgements…………………………………………………………………………..........x
Chapter 1. Introduction ...………………………………………………………………………... 1
1.1. Problem statement ...………………………………………………………………… 1
1.2. Literature review ……………………………………………………………………. 1
1.2.1. Lignin biosynthesis ……………………………………………………….. 1
1.2.2. Plant cell wall structural proteins …………………………………………. 6
1.2.3. Evidence for lignin-protein linkages …………………………………….. 10
1.3. Methods for investigating lignin-protein linkages ………………………………… 12
1.3.1. Preparation of lignin-protein compounds ……………………………….. 12
1.3.1. Characterization of lignin-protein compounds ………………………….. 16
1.4. References …………………………………………………………………………. 21
Chapter 2. Towards lignin-protein crosslinking: Amino acid adducts of a lignin model quinone
methide …………………………………………………………………………………………. 25
2.1. Abstract ……………………………………………………………………………. 25
2.2. Introduction ………………………………………………………………………... 25
2.3. Experimental ………………………………………………………………………. 28
2.3.1. Materials ………………………………………………………………… 28
2.3.2. Model compound preparations ………………………………………….. 28
2.3.3. Model compound properties …………………………………………….. 29
2.3.4. Nuclear magnetic resonance spectroscopy ……………………………… 42
2.3.5. Mass spectrometry ………………………………………………………. 42
2.3.6. Computational methods …………………………………………………. 43
2.4. Results and discussion …………………………………………………………….. 44
2.4.1. Preparation of quinone methide-amino acid adducts ……………………. 44
2.4.2. Solution-state NMR of compounds 3-9 and density functional theory
calculations for compounds 10 and 11 …………………………………. 46
2.4.3. Adduct isomer determination ……………………………………………. 50
2.5. Conclusions ………………………………………………………………………... 50
2.6. Acknowledgements ………………………………………………………………... 51
2.7. References …………………………………………………………………………. 51
Chapter 3. Lignin crosslinks with peptides under biomimetic conditions ……………………... 55
3.1. Abstract ……………………………………………………………………………. 55
3.2. Introduction ………………………………………………………………………... 55
3.3. Experimental ………………………………………………………………………. 57
v
3.3.1. Materials ………………………………………………………………… 57
3.3.2. Synthesis of lignin DHP and lignin-peptide adducts ……………………. 57
3.3.3. Scanning electron microscopy and energy dispersive X-ray spectroscopy 57
3.3.4. Nuclear magnetic resonance spectroscopy ……………………………… 58
3.3.5. Fourier-transform infrared spectroscopy ………………………………... 58
3.4. Results and discussion …………………………………………………………….. 58
3.4.1. Preparation and yields of the lignin-peptide adducts ……………………. 58
3.4.2. Lignin-peptide morphology ……………………………………………... 59
3.4.3. Lignin-peptide linkage identification ……………………………………. 60
3.4.4. Supporting techniques for lignin-peptide characterization ……………… 64
3.5. Conclusions ………………………………………………………………………... 66
3.6. Acknowledgments …………………………………………………………………. 67
3.7. References …………………………………………………………………………. 67
Chapter 4. Preparation and characterization of lignin-gelatin complexes ……………………... 71
4.1. Abstract ……………………………………………………………………………. 71
4.2. Introduction ………………………………………………………………………... 71
4.3. Experimental ………………………………………………………………………. 73
4.3.1. Materials ………………………………………………………………… 73
4.3.2. DHP and DHP-Gel syntheses …………………………………………… 74
4.3.3. Fourier-transform infrared spectroscopy ………………………………... 74
4.3.4. X-ray photoelectron spectroscopy ………………………………………. 74
4.3.5. Scanning electron microscopy and energy dispersive X-ray spectroscopy 75
4.3.6. Nuclear magnetic resonance spectroscopy ……………………………… 75
4.4. Results and discussion …………………………………………………………….. 75
4.4.1. Preparation of DHP-Gel adducts ………………………………………... 75
4.4.2. Fourier-transform infrared spectroscopy of DHP-Gel adducts ………….. 76
4.4.3. Morphology and nitrogen content of DHP-Gel adducts ………………… 77
4.4.4. Nuclear magnetic resonance spectroscopy of DHP-Gel adducts ………... 80
4.5. Conclusions ………………………………………………………………………... 82
4.6. Acknowledgments …………………………………………………………………. 82
4.7. References …………………………………………………………………………. 83
Chapter 5. Searching for lignin-protein linkages in Arabidopsis ……………………………… 86
5.1. Abstract ……………………………………………………………………………. 86
5.2. Introduction ………………………………………………………………………... 86
5.3. Experimental ………………………………………………………………………. 87
5.3.1. Growth and lignin extraction from Arabidopsis ………………………… 87
5.3.2. Elemental analysis of Arabidopsis lignin ……………………………….. 88
5.3.3. Nuclear magnetic resonance spectroscopy of Arabidopsis lignin ………. 88
5.4. Results and discussion …………………………………………………………….. 88
vi
5.4.1. Lignin extractions from Arabidopsis ……………………………………. 88
5.4.2. Protein content of Arabidopsis extracts …………………………………. 89
5.4.3. Nuclear magnetic resonance spectroscopy of Arabidopsis lignin ………. 90
5.5. Conclusions ………………………………………………………………………... 91
5.6. Acknowledgments …………………………………………………………………. 92
5.7. References …………………………………………………………………………. 92
Chapter 6. Conclusions ………………………………………………………………………… 93
6.1. Research summary ………………………………………………………………… 93
6.2. Future endeavors …………………………………………………………………... 94
6.3. References …………………………………………………………………………. 97
vii
List of Figures
1.1. Three ‘common’ and three ‘uncommon’ monolignols …………………………………...….3
1.2. Resonance forms of monolignol radicals ………………………………………………….....3
1.3. Typical lignin inter-unit linkages …………………………………………………….……....4
1.4. Formation via radical coupling of β-ether QMs during lignin polymerization ……………...5
1.5. Nucleophilic amino acids that could potentially react with lignin QMs …………………….5
1.6. Tyrosine radicals and cross-coupled products ……………………………………………….7
1.7. Lignin-protein complex formed via lignin-carbohydrate linkage ....…………………………9
1.8. Preparation of a lignin β-ether model compound and its corresponding QM analog ...…….13
1.9. Preparation of lignin DHP ...………………………………………………………………..14
1.10. General structure of peptides added to lignin DHP preparations ...……………………….14
1.11. 1H NMR spectrum of lignin DHP ...……………………………………………………….17
1.12. HSQC spectrum of lignin DHP ...………………………………………………………….18
1.13. FT-IR ATR spectrum of lignin DHP ...……………………………………………………19
1.14. SEM image of lignin DHP ...………………………………………………………………20
2.1. Formation of β-ether QMs via radical coupling, and their rearomatization ...……………...26
2.2. Guaiacylglycerol-β-guaiacyl ether 1 and its derived quinone methide (QM) 2 ...………….27
2.3. QM-AA model compounds …………………………………………………………………45
2.4. Overlaid HMQC side chain regions of compounds 3 and 5 ...……………………………...48
2.5. HSQC NMR spectrum of lignin DHP with overlaid α- and β-correlation data of 3-11 ...….49
3.1. Lignin-peptide crosslinking mechanism ……………………………………………………56
3.2. SEM images of DHP and lignin-peptide adducts ...………………………………………...60
3.3. HSQC NMR of lignin-CGG adduct ...………………………………………………………61
3.4. HSQC NMR of lignin-YGG adduct ...……………………………………………………...62
3.5. HSQC NMR of lignin-HGG adduct ..………………………………………………………63
3.6. FT-IR spectra of DHP and lignin-peptide adducts …………………………………………65
4.1. FT-IR of neat DHP and DHP-Gel adducts …………………………………………………77
4.2. SEM of neat DHP and DHP-Gel adducts …………………………………………………..78
4.3. Morphology and nitrogen atomic percentages for DHP-Gel adducts ……………………...79
4.4. HSQC NMR spectrum of DHP-Gel1 ……………………………………………………….81
5.1. Optical microscopy of solvents extracted and ball milled Arabidopsis cell wall material …89
5.2. HSQC NMR spectrum of Arabidopsis lignin ………………………………………………81
6.1. Cell wall models ……………………………………………………………………………95
6.2. Synthetic route to α-13C coniferyl alcohol ………………………………………………….96
6.3. Standard lignin α-shifts and α-shifts of lignin-protein linkages ……………………………97
viii
List of Tables
2.1. 1H and 13C NMR chemical shifts for lignin-amino acid adducts …………………………...47
2.2. Observed and DFT calculated α-13C NMR chemical shifts for compound 3 ………………50
3.1. Yield data for the DHP and lignin-peptide adducts ………………………………………...59
3.2. Inter-unit linkage ratios of the DHP and lignin-peptide adducts …………………………...64
3.3. EDS elemental analysis data for DHP and the lignin-peptide adducts ……………………..66
4.1. Nucleophilic amino acid abundance (g/100 g dry, ash-free protein) in gelatin …………….72
4.2. Preparation and yields of DHP and DHP-Gel adducts ……………………………………..76
4.3. Inter-unit linkage ratios of DHP and DHP-Gel adducts ……………………………………81
5.1. Estimated protein content of Arabidopsis and extracted lignin …………………………….89
5.2. Inter-unit linkage ratios of Arabidopsis acidic dioxane lignin ……………………………...90
ix
Abbreviations
AA – amino acid
AGP – arabinogalactan protein
ATR – attenuated total reflectance
DFT – density functional theory
DHP – dehydrogenation polymer
EDS – energy dispersive X-ray spectroscopy
FT-IR – Fourier-transform infrared spectroscopy
GRP – glycine-rich protein
HRGP – hydroxyproline-rich glycoprotein
NMR – nuclear magnetic resonance
PRP – proline-rich protein
QM – quinone methide
SEM – scanning electron microscopy
XPS – X-ray photoelectron spectroscopy
x
Acknowledgements
Here, at the end of my doctoral dissertation, I would like to take the opportunity to thank the
people, without whose assistance this work would not have been possible. This list is not
exhaustive, and I apologize greatly for any unintentional omissions.
First, I would like to thank my advisor, Dr. Nicole Brown, and my committee members, Drs.
John Ralph, Jeff Catchmark, Emmanuel Hatzakis, and John Carlson. I would also like to thank
Professor Emeritus Alan Benesi, who graciously served on my committee until his happy and
healthy retirement. Without guidance and patience from these individuals, this work would not
have been possible.
I would like to thank Dr. John Ralph’s entire research group, especially Matt Regner and Yuki
Tobimatsu, who not only helped with my research but also made life immensely enjoyable when
I visited John’s lab in May of 2012. I plan to revisit Madison as often as possible.
I would like to thank the seemingly countless number of individuals who have helped me with
myriad technical matters throughout the course of my PhD. To the folks in the Materials
Characterization Lab, Josh Stapleton, Trevor Clark, Vince Bojan, Tim Tighe, Julie Anderson,
Melisa Yashinski, and Joe Stitt, for assistance in collecting and interpreting an endless tide of
spectral data, my deepest thanks. To Wenbin Luo and the members of the Scott Showalter group,
sincere thanks for assistance with all manner of NMR technical support.
My funding sources and the programs and research they fostered were instrumental toward the
completion of this dissertation. I would like to acknowledge the USDA National Needs
Fellowship, which provided tuition and research funding support for several years. A very
special thanks is warranted to the DOE sponsored Center for Lignocellulose Structure and
Formation (CLSF) and all of the members therein. The center provided not only funding and
facilities to support this research, but also the breadth and depth of intellectual power necessary
to inspire all its members to perform to their highest capabilities. A very special thanks is also
warranted to the NSF CarbonEARTH fellowship program. This program provided me with
financial support, but more importantly, opportunities and memories that will last a lifetime.
Many thanks to all of my fellow CarbonEARTH’ers for advice, support, and fun times.
I would like to thank my lab mates, Paul Munson and Curtis Frantz, for companionship
throughout the seemingly endless process of graduate school, and for research related insights.
Finally, I would like to thank my parents for their love and patience, and for instilling in me the
skills I need to make it through life’s challenges. I am indebted to them in ways that can never be
repaid.
1
Chapter 1
Introduction
1.1. Problem statement
The purpose of this research was to investigate the reactivities of amino acids and
possibly proteins toward lignin, ultimately resulting in the formation of lignin-protein crosslinks.
It has been suggested that proteins located in plant cell walls may interact with lignin in many
ways. For example, enzymatic proteins such as peroxidases and laccases are necessary for lignin
polymerization. Furthermore, structural proteins (non-enzymatic proteins that assist in cell wall
scaffolding) may assist in the initial stages of lignin deposition, which occurs in the cell wall
corner region. Several mechanisms could be envisioned, with the structural proteins playing a
relatively passive role, or an active one in which they template lignin polymerization, perhaps
influencing the inter-unit linkage sequence of the final lignin polymer. Lignin-protein linkages
may also play a role in genetically engineered plant lines. Research has shown that plants with
up-regulated cell wall protein expression sometimes exhibit altered physical and chemical
properties, including enhanced lignin extractability, which may be due to increased levels of
lignin-protein linkages.
In spite of the potential implications of lignin-protein crosslinks in both wild type and
mutant plant lines, there have been few studies addressing the fundamental aspects of lignin-
protein linkages and their formation. For example, prior to this work, it was largely unknown
which amino acids (if any) were reactive toward lignin, how stable lignin-protein linkages were,
and how the linkages could be identified using standard analytical tools. The goal of this work
was to address and answer some of those questions, mostly through in vitro studies, while trying
to keep in mind the future necessity of lignin-protein identification in vivo.
The remainder of this chapter provides a literature review, which focuses on lignin
biosynthesis, plant cell wall structural proteins, evidence for lignin-protein linkages, and a
section detailing the methods used in this study for lignin-protein linkage preparation and
characterization. The second, third, and fourth chapters are presented in manuscript form and
should be considered stand-alone publications. The final chapter presents a summary of pertinent
findings, discusses limitations, and provides suggestions for future work.
1.2. Literature review
1.2.1. Lignin biosynthesis
Plant cell walls are composed of a network of interacting polymers, namely cellulose,
hemicelluloses, pectins, lignin, and structural proteins (Cosgrove, 2005; McQueen-Mason and
Cosgrove, 1994). Of these, lignin is the most abundant aromatic biopolymer, and the second
most abundant biopolymer overall (Boerjan et al., 2003). The lignin polymer is unique among
2
the plant cell wall polymers in that it is composed of phenylpropanoid monomers known as the
so-called monolignols, which undergo radical polymerization via a mechanism that is apparently
not under biological control beyond the generation of the lignin radicals themselves. Lignin
polymerization exhibits incredible plasticity among and within species. It is thought that lignin
benefits the plant by providing strength and rigidity to the cell wall, enhanced water
conductivity, and pathogen resistance. Pathogen resistance in particular is provided due to the
recalcitrance of lignin towards degradation. Unfortunately, this recalcitrance negatively impacts
human efforts to effectively use plant cell wall materials as biorenewable resources. Specifically,
lignin recalcitrance affects the pulp and paper industry, the developing biofuels industry, the
agricultural industries, and the chemical industries, all of whom seek higher value products from
lignin (Chapple et al., 2007; Chen and Dixon, 2008; Jung and Allen, 1995; Jung, 1989; Li et al.,
2008; Stewart et al., 2006). Thus, a greater understanding of lignin chemistry and biochemistry is
desirable towards controlling and minimizing its recalcitrance, as well as engineering lignin-
based products.
The process of lignification begins with the biosynthesis of the monolignols. Typically,
monolignols are biosynthesized from phenylalanine via a series of enzymatic steps (Boerjan et
al., 2003). There is evidence that the monolignols may be stored and/or transported to the cell
wall as monolignol glucosides (i.e., the phenolic hydroxyl of the monolignol is blocked by 4-O-
glycosylation); however, this may not always be the case. In addition, the mode of monoglignol
transport into the cell wall is unknown (i.e., golgi-derived vesicles versus plasma membrane
pumps) (Boerjan et al., 2003). As noted above, the process of lignification exhibits plasticity, and
this is evidenced by the variability of monolignol biosynthesis and expression. The three most
common lignin monomers are shown in Fig 1.1. The expression of these monolignols varies
among plant taxa. For example, gymnosperm lignin is almost entirely composed of G-units (i.e.,
coniferyl alcohol based) with some traces of H-units (p-coumaryl alcohol based), dicotyledonous
angiosperm lignin is mainly composed of G- and S-units (sinapyl alcohol based), and
monocotyledonous angiosperm lignin is composed of all three units, as well as ferulates,
sinapates, and p-coumarates. Other monolignols are biosynthesized and incorporated into the
lignins of both wildtype and mutant plant lines. For example, lignin found in the seed coats of
some wildtype vanilla orchids and cacti is almost completely composed of caffeyl alcohol (Chen
et al., 2012). In caffeic acid/5-hydroxyconiferaldehyde O-methyltransferase (COMT) deficient
mutants, 5-hydroxyconiferyl alcohol is incorporated into the lignin polymer (Li et al., 2000;
Ralph et al., 2001). And in cinnamyl alcohol dehydrogenase (CAD) deficient mutants,
coniferaldehyde and other aldehydes are incorporated into the lignin polymer (Ralph et al.,
2001).
3
Fig 1.1. Three ‘common’ and three ‘uncommon’ monolignols. From left to right: p-coumaryl
alcohol, coniferyl alcohol, sinapyl alcohol, caffeyl alcohol, 5-hydroxyconiferyl alcohol, and
coniferaldehyde. The side chain carbons of the monolignols are often referred to as α, β, and γ-
positions (see leftmost structure). This nomenclature will be used throughout the document.
Once the monolignols are shuttled to the cell wall, polymerization occurs via enzymatic
dehydrogenation followed by radical recombination. Glycosyl hydrolases are implicated in the
removal of the glucose residue from monolignol glucosides (Boerjan et al., 2003).
Dehydrogenation is then catalyzed by peroxidases and/or laccases. The exact peroxidase and/or
laccase isozymes responsible for monolignol oxidation have yet to be elucidated and may vary
among species (Boerjan et al., 2003). Hydrogen peroxide is necessary for peroxidase catalyzed
monolignol oxidation, and the source of this peroxide is uncertain, though NADPH oxidases may
play a role. Again, further research in this area is necessary. The monolignol radical is stabilized
by resonance (Fig 1.2), a direct consequence of which is the multiple lignin inter-unit linkage
types that are observed (Fig 1.3).
Fig 1.2. Resonance forms of monolignol radicals. R typically represents H or OCH3.
There is currently no evidence for enzymatic control over the process of monolignol
radical recombination (Ralph et al., 2008). Instead, the relative ratios of lignin inter-unit linkages
can vary substantially and can be influenced by many factors, including but not limited to,
monolignol composition (i.e., which monolignols are present), monolignol concentration,
oxidant concentration (e.g., H2O2), catalyst/enzyme concentration, pH, the polymerization matrix
(i.e., is the lignin polymerizing in a hemicellulose-rich, pectin-rich, or protein-rich environment,
or bulk water, etc.), and other physical and chemical concerns (Boerjan et al., 2003; Cathala et
al., 1998). In general though, the predominant inter-unit linkage type in native lignins is the so-
called β-ether (β-O-4) linkage, with varying quantities of other linkages, including
phenylcoumaran (β-5), resinol (β-β), dibenzodioxocin (5-5/β-O-4/α-O-4), spirodienone (β-1),
4
biphenyl ether (4-O-5), biphenyl (5-5), and β-ether/α-aryl ether (β-O-4/α-O-4) (Boerjan et al.,
2003; Capanema et al., 2005; Vanholme et al., 2010).
Fig 1.3. Typical lignin inter-unit linkages. Linkage ratios vary and are influenced by many
factors. Linkage ratios depicted here are not indicative of ratios observed in native lignins.
In the case of the predominant β-ether linkage, radical recombination results in the
formation of an unstable quinone methide (QM) intermediate (Fig 1.4) that cannot be trapped
intramolecularly, but instead must be trapped by an external nucleophile (in contrast to β-5 and
β-β QMs, which can be trapped intramolecularly). The nucleophile is most often water, yielding
the β-ether/α-OH structure. However, other cell wall nucleophiles are known to quench the QM.
For example, lignin has long been understood to covalently crosslink with plant cell wall
components such as hemicelluloses through nucleophilic reactions (via hydroxyl or carboxylic
acid groups) with the α-carbon of lignin QMs (Balakshin et al., 2011; Leary, 1980; Miyagawa et
al., 2012; Ralph et al., 2009; Toikka et al., 1998; Yuan et al., 2011).
5
Fig 1.4. Formation via radical coupling of β-ether QMs during lignin polymerization. L = lignin
polymer, Nuc = nucleophile, R = H or OCH3.
The crosslinking of lignin with cell wall constituents other than hemicelluloses has not
been well investigated. Cell wall structural proteins contain amino acid residues with
nucleophilic side chains that could react with lignin QMs (Harrak et al., 1991; Jose and
Puigdomenech, 1993; Ryser et al., 1997; Kieliszewski et al., 2011). Specifically, the amino acids
cysteine (Cys), lysine (Lys), histidine (His), aspartic acid (Asp), glutamic acid (Glu), serine
(Ser), threonine (Thr), tyrosine (Tyr) and hydroxyproline (Hyp) (Fig 1.5) all contain nucleophilic
side chain groups. Cell wall proteins containing these amino acids vary in quantity among
species and cell types, ranging from as low as 1-2% to 20% dry weight basis (Albersheim et al.,
2010; Cassab et al., 1988). They have previously been postulated to crosslink with lignin, and it
has been suggested that they may serve as nucleation sites or templates during lignification, but
this has not been adequately tested (Boerjan et al., 2003; Cassab et al., 1988; Harrak et al., 1991;
Albersheim et al., 2010; Beat et al., 1989). If true, this mechanism could provide spatial and
temporal control over lignin deposition and architecture (Beat et al., 1989). The following
section will discuss the various classes of cell wall structural proteins that contain nucleophilic
amino acids and are likely to be in close spatial proximity to lignin within the cell wall.
Fig 1.5. Nucleophilic amino acids (nuc side chain groups are highlighted in green) that could
potentially react with lignin QMs. In their free amino acid forms (shown here), the α-amine and
α-acid groups could also be nucleophilic; therefore, these groups were blocked to prevent
competing reactions in the studies described below. From left to right, starting at the top:
6
cysteine (Cys), lysine (Lys), histidine (His), aspartic acid (Asp), glutamic acid (Glu), serine
(Ser), threonine (Thr), tyrosine (Tyr), hydroxyproline (Hyp). Amino acid stereochemistry is not
shown; L-isomers dominate in nature.
1.2.2. Plant cell wall structural proteins
Plant cell wall structural proteins account for a relatively small percentage (dry weight
basis) of the total cell wall material in mature tissues. Early studies showed that primary cell
walls of dicots typically contain 5-10% protein and 2% hydroxyproline (Hyp), which originates
primarily from extensins (Lamport, 1974; Talmadge et al., 1973). Once secondary walls are
deposited, the relative protein content drops. The following section describes the classes of cell
wall proteins that may potentially interact with lignin, as well as proposed interaction
mechanisms. Proteomics of specific plant species of interest are discussed in a later section.
There are two broad classes of cell wall structural proteins that seem most likely to
interact with lignin: glycine-rich proteins (GRPs), and the proline/hydroxyproline-rich
glycoproteins, which are often further subdivided into the proline-rich proteins (PRPs),
hydroxyproline-rich glycoproteins (HRGPs), and arabinogalactan-proteins (AGPs). These
protein classes are evolutionarily related, resulting in structural and functional similarities. Some
evidence has indicated that these proteins may interact with lignin, or even serve as nucleation
sites for lignification in the cell corners and/or the general compound middle lamella. However,
conclusive evidence for lignin-protein linkages has yet to be described.
Glycine-rich proteins (GRPs) are a diverse group of proteins that are often expressed in
plant cell walls. As their moniker implies, they are glycine-rich and typically contain between
60% and 70% glycine, which is much higher than most other enzymatic or structural proteins
found in plants or animals. They most commonly occur in tracheary elements of protoxylem and
metaxylem tissues, and are involved in diverse cellular processes during plant development and
adaptation to environmental change (Chen et al., 2007; Ringli et al., 2001). Their function varies
among cell types, as does their structure, which is the basis for the most current GRP
classification system. Class I GRPs may contain a signal peptide followed by a highly conserved
(GGX)n region, where X is often Ala, Ser, Val, His, Phe, Tyr or Glu. Class II GRPs may also
contain a characteristic cysteine-rich C-terminal. Class III GRPs typically contain fewer glycine-
rich regions compared to other GRPs. Class IV GRPs are RNA-binding and contain either an
RNA-recognition motif or a cold-shock domain. And class V GRPs are glycine-rich with mixed
glycine repeat patterns that are not typically observed in the other classes (Mangeon et al., 2010).
For in-depth information regarding GRP tissue expression pattern, subcellular localization,
structure, and function, three excellent reviews are Sachetto-Martins et al. (2000), Ringli et al.
(2001), and Mangeon et al. (2010).
Based on the amino acid composition of GRPs, two modes of lignin-GRP crosslinking
may be envisioned. The first mode of crosslinking is through QM-nucleophile reactions, the
7
chemistry of which was discussed in a preceding section. In GRPs, the amino acids most likely
to react with lignin in this manner are His, Glu, Ser, and Tyr. Another potential lignin-GRP
crosslinking mechanism is through oxidative coupling of lignin with amino acid moieties,
specifically tyrosine. It has been shown that GRPs are often tyrosine-rich (up to 10% Tyr), and
they crosslink in an intra- and inter-peptide manner via peroxidase mediated reactions (Ringli et
al., 2001; Ryser et al., 2004). The tyrosine radical and experimentally observed tyrosine cross-
coupled products are shown in Fig 1.6. Such intra- and inter-peptide linkages are also observed
with PRPs and HRGPs, as discussed below. When lignin is in close proximity to GRPs, lignin-
tyrosine crosslinking via this oxidative mechanism may result. Alternatively, lignin-tyrosine
radical coupling may be discouraged if the oxidation potentials of the monolignols and tyrosine
are quite different. This seems likely, given that monolignols exhibit radical delocalization over
five resonance forms (Fig 1.2), while tyrosine only exhibits four (Fig 1.6) (Cong et al., 2013).
The work described here mainly focuses on the preparation and characterization of lignin-peptide
linkages formed through QM-nucleophile chemistry, but some attempts were made to identify
putative lignin-tyrosine radical mediated linkages, as well. More work in this area is warranted.
Fig 1.6. Tyrosine radicals and cross-coupled products. Top row: tyrosine radical resonance
forms. Middle row: isodityrosine, dityrosine, and pulcherosine. Bottom row: di-isodityrosine.
8
Of the GRPs, those filling cell wall structural functions may be in closest spatial
proximity to lignin. Previous research has shown that GRPs may interact with lignin, though
covalent linkage formation has not been clearly demonstrated. In 1989, Beat et al. noted that
GRPs and lignin were localized to the same cell types within Phaseolus vulgaris (common
bean), and it was hypothesized that the GRPs might provide nucleation sites for lignification via
tyrosine residues. The benefits to the plant would include spatial and temporal control of various
lignin properties including density and three-dimensional pattern (Beat et al., 1989). Similar
results were obtained by Ye and Varner in 1991, this time with regards to soybean (Ye and
Varner, 1991b). In 2004, Ryser et al. demonstrated that GRPs act as linkages between secondary
cell wall thickenings, mainly composed of lignin, in protoxylem elements of seed plants as the
cells passively expand following apoptosis (Ryser et al., 2004). Yet no attempt was made to
determine how the GRPs anchor to the lignin-rich thickenings. Interestingly, in 2007, Chen et al.
showed that an Arabidopsis GRP (AtGRP9) exhibits subcellular localization comparable with
that of AtCAD5, a major Arabidopsis cinnamyl alcohol dehydrogenase localized to the cell wall.
Yeast two-hybrid analysis also revealed that the two proteins interacted strongly, suggesting that
GRPs may play a role in lignin monolignol synthesis, which occurs prior to lignin
polymerization.
Proline-rich proteins (PRPs) display great heterogeneity in their amino acid sequences,
but they all contain amino acids with nucleophilic side chains such as Lys, His, Glu, Ser, and Tyr
(Jose and Puigdomenech, 1993), potentially allowing for QM-nucleophile crosslinking or lignin-
tyrosine oxidative crosslinking. Ryser et al. (1997) stated, "localization of PRPs in lignified
secondary walls and the secretion of the protein during lignification support the hypothesis of Ye
et al. (1991a) that PRP localization is related to the pattern of lignification." They also made the
bold claim that, "it may be speculated that PRPs function as a scaffold for lignin deposition via
their tyrosine groups followed by oxidative cross-linking of lignin monomers" (Ryser et al.,
1997). A similar conclusion was reached with regards to primary cell walls by Harrak et al.
(1999), as it was found that a certain PRP located in wild tomato is down-regulated in response
to drought, as is lignin production. The authors concluded that lignin and protein potentially
interact with one another on the basis that they are up-regulated and down-regulated together and
are located within the same cellular compartment (Harrak et al., 1999).
Hydroxyproline-rich glycoproteins (HRGPs) contribute to tissue integrity and tensile
strength. The most abundant and well-studied HRGPs are the extensins, which are defined by
Ser-Hyp4 glycomodules. The proline hydroxyl groups and glycomodules (typically consisting of
one through four arabinose residues) are post-translationally added and their placement and
abundance is determined by the sequence of the peptide chain (Cannon et al., 2008; Kieliszewski
et al., 2011). Extensins are generally tyrosine-rich, enabling them to crosslink via extensin
peroxidase. These networks involve short motifs, where isodityrosine forms very short
intramolecular crosslinks. This isodityrosine moiety may then react with a tyrosine residue to
form pulcherosine, or react with another isodityrosine residue to form the tyrosine tetramer, di-
9
isodityrosine (Fig 1.6) (Kieliszewski et al., 2011). It is conceivable that lignin could crosslink
with these tyrosine residues via the radical mechanism described previously. In addition,
nucleophilic amino acids are abundant in HRGPs, and include Cys, Lys, His, Tyr, Thr, Asp, and
Ser and Hyp residues that remain un-glycosylated, which may allow for QM-nucleophile
crosslinking. It is also possible that the hydroxyproline-bound arabinose groups may crosslink
with lignin, as the primary hydroxyl of arabinose has been shown to react with lignin QMs in
vitro (Toikka et al., 1998). If this occurs in vivo, then lignin might be indirectly coupled to
HRGPs via lignin-carbohydrate linkages (Fig 1.7). Observing this scenario may be difficult using
standard lignin characterization techniques such as HSQC NMR, and warrants further
investigation.
Fig 1.7. Hypothetical lignin-protein complex formed via lignin-carbohydrate linkage. Protein
fragment sequence is Ser-Hyp-Hyp-Hyp, with varying degrees of arabinose glycosylation.
Lignin-carbohydrate linkage forms through reaction of the arabinose primary hydroxyl (C6-OH)
with the electrophilic α-carbon of the lignin QM.
The arabinogalactan-proteins (AGPs) are much more highly glycosylated than the PRPs
and HRGPs, with type II arabino-3,6-galactans (5 – 25 kDa) accounting for 90% to 98% (w/w)
of the AGP (Ellis et al., 2010). The miniscule protein component is often rich in Hyp, Pro, Ala,
Ser, and Thr. Of these, Hyp, Ser, and Thr could potentially be reactive toward lignin QMs.
However, it seems unlikely that lignin-AGP crosslinking would occur via addition of
nucleophilic amino acids to the QM, both because the protein component is so insignificant and
because the oligosaccharides likely encase the protein, shielding it from inter-polymer
interactions. Crosslinking between lignin and AGPs would likely occur through the mechanism
shown in Fig 1.7.
GRPs, PRPs, HRGPs, and AGPs are abundant plant cell wall structural proteins. Liyama
et al. (1994) stated, "there is evidence that both HRGPs and Gly-rich proteins are associated with
lignin and possibly act as foci for lignin polymerization. However, no information as to the
10
nature of possible covalent linkages or their biosynthetic route is available". There may be at
least two mechanisms for lignin-protein crosslinking in plant cell walls. One mechanism
involves radical crosslinking, perhaps via lignin and tyrosine moieties, while the second
mechanism involves reactions of nucleophilic amino acid side chains with lignin QM
intermediates. The latter mechanism is the primary focus of the research described here, but
lignin-protein oxidative coupling will also be studied where possible.
1.2.3. Evidence for lignin-protein linkages
In the previous section it was shown that lignin and cell wall structural proteins are often
co-localized within the plant cell wall, leading some researchers to speculate on the formation of
lignin-protein complexes. These lignin-protein linkages have proven difficult to detect
conclusively, especially in vivo. Nevertheless, evidence (which is largely anecdotal) suggests that
lignin-protein linkages may occur. The prevailing theory of lignin biosynthesis supports this
hypothesis. Under the prevailing theory, monolignol radicals couple to form lignin inter-unit
linkages under conditions that are free of enzymatic control. This results in the formation of the
predominant β-ether linkage and the subsequent QM intermediate (Fig 1.4), which reacts quickly
with the most abundant and/or most chemically compatible nucleophile. Because the quenching
of the QM is under chemical control, the QM could be expected to react with any nearby
nucleophile, including nucleophiles located on proteins. Indeed, the quenching of QMs by
nucleophiles that are often present on amino acids has been studied in non-lignin systems. For
example, the thiol group of glutathione reacts with an o-QM generated from the flavonoid
quercetin (Awad et al., 2000), the thiol group of cysteine reacts with the relatively unreactive p-
QM, 2,6-di-tert-butyl-4-methylene-2,5-cyclohexadienone (Bolton et al., 1997), and thiols and
thiolates react with QMs derived from anthracyclines (Ramakrishnan and Fisher, 1983).
Similarly, amines (but not amino acids) have been shown to trap lignin QMs (Ralph and Young,
1983). A wide array of acid and hydroxyl-containing reagents react with p-QMs (Leary et al.,
1977). And primary (and to a much lesser extent, secondary) hydroxyl groups of carbohydrates
react with lignin QMs (Toikka et al., 1998). Thus, given the general ability of soft (and even
relatively hard) nucleophiles to quench QMs, and given that lignin QM quenching is under
simple chemical control, it seems plausible that similar reactions could occur in vivo between
lignin QMs and nucleophilic amino acids.
In vitro experiments have provided some evidence for lignin-protein coupling. In 1978
and 1982, F. W. Whitmore published three articles regarding lignin-protein interactions
(Whitmore, 1978a, 1978b, and 1982). Whitmore isolated cell walls of Pinus elliottii (slash pine)
in such a way that native peroxidase enzymes were left intact and active. Lignin dehydrogenation
polymer was then added to one group of cell walls (control), and coniferyl alcohol was added to
another (experimental). Upon extraction, the experimental lignin contained significantly more
protein than the control, providing evidence that proteins were incorporated into lignin during
polymerization and not merely physically entangled in lignin following polymerization.
Whitmore then determined that hydroxyproline interacted more strongly with the lignin than
11
other amino acids, perhaps by forming ether linkages. He hypothesized that extensin was most
responsible for lignin-protein crosslinking (Whitmore, 1978, 1982). However, failure to directly
observe the proposed lignin-protein linkage rendered the results inconclusive. With quantitative
1D and 2D NMR experiments now commonplace, it is perhaps time to revisit these experiments
in order to more accurately ascertain the exact nature of the lignin-protein interactions.
More recently, evidence for lignin-protein interactions has been obtained through the use
of dynamic mechanical analysis (DMA) and Fourier-transform infrared spectroscopy (FT-IR).
Salmen and Petterson (1995) found that only one glass transition was observed for protein and
lignin within the primary cell wall, indicating an association that is roughly homogenous in
nature. Upon treatment with a protease, the glass transition temperature increased due to removal
of protein and subsequent increase in the relative concentration of the thermally stable lignin
polymer. Between 2006 and 2008, Stevanic and Salmen used DMA and FT-IR to study the
primary cell walls of Norway spruce, resulting in three publications. The first article found that,
"strong interactions were evident between lignin and protein, between cellulose and xyloglucan,
and between cellulose and pectin" (Stevanic and Salmen, 2006). A similar conclusion was
reached in the second publication, with the authors stating, "to a certain extent, all the polymers
in the surface material...took part in the stress transfer...indicating an intimately linked network
structure" (Stevanic and Salmen, 2008a). Finally, the third publication reported similar findings,
namely that there appear to be lignin-protein and lignin-pectin interactions within the primary
cell wall (Stevanic and Salmen, 2008b). DMA and dynamic FT-IR can indicate that polymer-
polymer interactions exist, but the exact nature of these interactions cannot be determined using
these methods, so further studies are warranted. It has been shown that horseradish peroxidase
enzymes can crosslink, or at least strongly interact, with a growing lignin polymer. This may be
why active peroxidases persist in lignified plant cells even after apoptosis (Evans and
Himmelsbach, 1991). Kaewtip et al. (2010) showed an interaction between wheat gluten and
lignin, and postulated that thiol groups on cysteine residues reacted with the double bonds of
lignin to form lignin-protein linkages. Unfortunately, they were unable to conclusively confirm
such linkages. It is interesting to note that, using FT-IR, blood plasma protein was observed to
hydrogen bond to lignin (Polus-Ratajczak et al., 2003). It is important to keep in mind that in
addition to covalent crosslinking, non-covalent interactions between lignin and protein could
play an important role in the structure and function of plant cell walls.
In summary, previous work has shown that a variety of nucleophiles react with non-lignin
QMs, indicating the possibility for lignin-protein linkages to form via QM-nucleophile
chemistry. Furthermore, evidence has shown that lignin interacts with proteins under in vitro
conditions as well as in native plant cell walls. Yet there has been no attempt to directly observe
in vitro or in vivo lignin-protein linkages using modern techniques such as multidimensional
NMR. Given the economic importance of lignin and its ubiquitous nature within the biosphere,
increased knowledge of its structure and function should be a priority. The work described here
12
extends our fundamental understanding of lignin chemistry by characterizing lignin-protein
covalent linkages as well as lignin-protein non-covalent interactions.
1.3. Methods for investigating lignin-protein linkages
1.3.1. Preparation of lignin-protein compounds
Lignin-protein model compounds were first prepared and characterized under relatively
simple, in vitro conditions. The simplest lignin-protein model compounds (in terms of chemical
structure and molecular weight) were prepared by reacting single nucleophilic amino acids with
a lignin model quinone methide (QM). A nucleophile (meaning, “nucleus loving”) is broadly
defined as a chemical group containing a partial negative charge that is relatively free to react
with a complementary group of opposite charge called an electrophile (meaning, “electron
loving”). As described above, some amino acids contain nucleophilic side chains (Fig 1.5), as
well as nucleophilic α-amine and α-acid groups. In order to prevent side reactions, these α-amine
and α-acid groups were chemically blocked, resulting in the side chain groups becoming the sole
nucleophilic species in the amino acids. It was hypothesized that the amino acids would react
with a lignin QM, which is an unstable electrophile that forms during lignin polymerization
according to the mechanism shown in Fig 1.4. The model lignin QM used here (Fig 1.8) was
chosen because it can be prepared cleanly, it is relatively small and simple (chemically
speaking), and it is structurally representative of QMs that form in native guaiacyl-based lignins
(Kawai et al., 1999; Landucci et al., 1981; Ralph and Young, 1983). Cross-coupling reactions
were carried out in dichloromethane to obtain the desired lignin-protein model compounds and to
prevent addition of nucleophilic solvent to the QM. Chapter 2 provides detailed descriptions of
the preparation and characterization of these lignin-amino acid compounds.
13
Fig 1.8. Preparation of a lignin β-ether model compound and its corresponding QM analog. The
amino acids shown in Fig 1.5 were then reacted with the QM to form lignin-protein model
compounds via reaction of the amino acid nucleophilic side chain with the electrophilic α-carbon
of the lignin QM.
In order to explore lignin-protein coupling under more biomimetic conditions, tripeptides
were added to lignin dehydrogenation polymer (DHP) during the lignin polymerization process.
Lignin DHP has been used for decades to approximate the natural lignification process. It is
usually prepared by slowly combining lignin monomer, peroxidase enzymes, and hydrogen
peroxide over the course of hours or days (Fig 1.9). This results in a synthetic lignin that is
chemically similar to native lignin, though DHP typically exhibits increased resinol and
phenylcoumaran structures and a corresponding reduction in β-ether structures compared to
native lignins (Freudenberg, 1968; Terashima et al., 1995). For this study, DHP was prepared
according to previously published methods using coniferyl alcohol as the sole lignin monomer,
dilute hydrogen peroxide as initiator, and horseradish peroxidase as enzymatic catalyst
(Terashima et al., 1995). The pH of the DHP solution was 6.5, which is standard for DHP
preparations and only slightly higher than biologically relevant pH (4.5 - 6.0) (Cosgrove, 2005).
14
Fig 1.9. Preparation of lignin DHP. Over the course of several days, a peristaltic pump combines
coniferyl alcohol and horseradish peroxidase (and in this case, peptides) with dilute hydrogen
peroxide, forming lignin DHP (cream-colored solution in flask on left).
Peptides were added to the lignin polymerization reaction with the general formula of
XGG (Fig 1.10), where X was any of the amino acids shown in Fig 1.5. The C-termini and N-
termini of the peptides were blocked via amidation and esterification, respectively, to ensure that
the amino acid of interest (i.e., residue X) was the only nucleophilic moiety. Glycine was chosen
as the "place holder" residue due to its expected inertness towards lignin. Peptide length was
limited to three residues in order to inhibit the formation of large lignin-peptide complexes that
may have been insoluble and thus difficult to characterize (e.g., liquid state NMR may have
become impractical). Peptides were added in 25% mol/mol ratio to the lignin monomer
(coniferyl alcohol) because it was previously reported that lignin DHPs contain between 20 and
30% β-ether linkages (Tobimatsu, 2012). Thus, the ratio of nucleophilic residues to lignin β-
ether QMs was expected to be approximately 1:1 over the course of the polymerization reaction.
In summary, this experiment was designed to explore the ability of amino acids to outcompete
water and other nucleophiles for addition to the QM under aqueous conditions. Chapter 3
provides detailed descriptions of the preparation and characterization of these lignin-peptide
compounds.
Fig 1.10. General structure of peptides added to lignin DHP preparations. X represents the amino
acid nucleophilic side chain.
15
Lignin DHP was prepared in the presence of gelatin protein under conditions similar to
those described above. Though gelatin is of animal origin, the lignin-gelatin complex was
expected to be informative for several reasons. First, gelatin is both glycine and hydroxyproline-
rich, as are many plant cell wall structural proteins. Second, gelatin has a rather high molecular
weight (20 kDa – 100 kDa depending on gelatin type), and is thus more similar in size to cell
wall structural proteins compared to tripeptides. And third, gelatin was previously shown to
interact with lignin, though the presence or absence of covalent linkages was not definitively
determined (Whitmore, 1978b). Gelatin contains amino acids that could potentially be
nucleophilic towards lignin (see Chapter 4); however, two key amino acids, namely cysteine and
tyrosine, are almost entirely lacking. Chapter 4 provides a detailed description of the preparation
and characterization of these lignin-gelatin complexes.
Finally, in an attempt to identify lignin-protein linkages formed under natural conditions
of lignin biosynthesis, Arabidopsis (wild-type Columbia-0) plants were grown to maturity (8
weeks), then lignin was extracted from the inflorescence stems and characterized. The cell wall
proteome of Arabidopsis has been studied more extensively than most other plant species, with
20 published papers and 500 proteins with predicted signal peptide identified (Albenne et al.,
2013). Inconsistencies surrounding the Arabidopsis cell wall proteome remain, and much more
work is needed. For example, the size of the cell wall proteome for five-day-old cell suspension
cultures has been estimated at anywhere between 33 and 96 proteins (Chivasa et al., 2002; Feiz
et al., 2006; Kwon et al., 2005; Robertson et al., 1997), while one study, which characterized
three-day-old cell suspension cultures, estimated the proteome at 792 (Bayer et al., 2006)! It has
been estimated that structural proteins account for 1.6% of the Arabidopsis cell wall proteome
(Albenne et al., 2013). The quantity of structural protein in mature Arabidopsis cell wall, in
terms of dry weight percentage, is unclear. In order to estimate the protein content of
Arabidopsis and extracted Arabidopsis lignin, nitrogen analysis was performed on various
Arabidopsis extracts. This allowed for protein estimation by multiplying the nitrogen percentage
by a factor of 6.25, assuming that all nitrogen in the sample was due to protein (Chang et al.,
2008; Fukushima and Hatfield, 2001).
Lignin was extracted from Arabidopsis following a previously described acidic dioxane
method (Fukushima and Hatfield, 2004). In short, Arabidopsis inflorescence stem material was
pre-ground in a Wiley mill, extracted (water, ethanol, chloroform, and acetone) in a Soxhlet
apparatus, then ball milled in a cryomill. This cell wall material was then extracted by refluxing
with 90:10 dioxane/2 M HCl, to obtain a crude lignin extract. The crude lignin extract was
“purified” by precipitation in water followed by multiple washings with diethyl ether to yield
~30-35 mg lignin per g of Arabidopsis cell wall material. It has been postulated that this
extraction method selectively cleaves α-ether linkages, which should raise concerns regarding
the cleavage of putative lignin-protein linkages, as well. However, this method was deemed
useful for several reasons. First, it was not possible to extract lignin using the typical milled
wood lignin procedure of refluxing the sample in 96:4 dioxane/water. This method has been
16
employed for decades; however, during preliminary investigations with Arabidopsis, only ~2 mg
of lignin was extracted per 1 g of Arabidopsis cell wall material, which is extremely inefficient
and yields far too little lignin for effective characterization. Furthermore, lignin-protein linkages
are expected to be low in quantity in wild type plants, so observing the putative linkages in
cellulolytic enzyme lignins or whole cell walls seems unlikely due to very low signal to noise.
Chapter 5 provides a detailed description of the extraction and characterization of the
Arabidopsis lignin.
1.3.2. Characterization of lignin-protein compounds
The lignin-protein model compounds and Arabidopsis lignin extracts were characterized
using a variety of complementary methods. Perhaps the single most useful of these, at least in
terms of ability to directly detect lignin-protein covalent linkages, is nuclear magnetic resonance
(NMR) spectroscopy. NMR relies on exploiting the quantum mechanical property of spin. When
atomic nuclei with an odd number of protons and/or neutrons are placed in a magnetic field the
magnetic nuclear spins align with the field. A radio frequency (RF) pulse is then applied to the
sample and the nuclear spins align perpendicular to the magnetic field. The nuclear spins
spontaneously relax, realigning with the magnetic field in a finite amount of time through a
series of complex relaxation phenomena based on their local environment. In doing so, they re-
emit radio frequencies at slightly different wavelengths than the original RF pulse, determined by
the local chemical environment of each nucleus. This leads (following Fourier-transform) to the
generation of the NMR spectrum, expressed in ppm.
As noted above, any atomic nucleus with an odd number of protons and/or neutrons is, in
principle, NMR active, though in reality active isotopes exhibit varying degrees of sensitivity to
the NMR technique, and the natural abundance of the varying isotopes is also of critical
importance. For the study of lignin-protein linkages, the most useful atomic isotopes are proton
(1H), carbon-13 (13C, because the most abundant isotope of carbon, 12C, is not NMR active), and
potentially nitrogen-15 (15N, because 14N gives broad NMR peaks). Lignin and proteins also
contain oxygen; however, the NMR active isotope of oxygen (17O) is extremely low in natural
abundance and is quite insensitive to the NMR technique. Thus, 17O NMR is almost never
employed.
There are many NMR techniques, based on the various active nuclei as well as various
pulse programs. Furthermore, NMR data can be acquired as 1-dimensional (1D), 2-dimensional
(2D), or higher dimensional spectra, and in either the solid or liquid state. For the study of lignin-
protein linkages, 1D and 2D liquid state spectra are likely the most useful, but require solubility,
which is sometimes limited. The simplest NMR experiments (in terms of pulse programs, not
necessarily in terms of spectral interpretation) for the study of lignin are the 1D 1H and 13C
experiments. 1H spectra can be collected within minutes, and provide information on functional
groups within a range of ~0-12 ppm. This technique is very useful for the study of small, simple
molecules. However, for complex molecules such as lignin, the relatively narrow ppm range
17
results in significant chemical shift degeneracy (Fig 1.11). The 13C NMR experiment exhibits a
broad chemical shift range of ~0-200 ppm and is therefore more diagnostic for determining
lignin chemical structure compared to the 1H experiment. However, the low sensitivity of the 13C
experiment due to the low natural abundance and the low magnetogyric ratio of the 13C nucleus,
means that relatively large sample quantities (tens of milligrams) are required. This, coupled
with extremely long 13C T1 relaxation times, result in experimental times of many hours or even
days to collect high resolution, quantitative spectra. Despite these disadvantages, the usefulness
of quantitative 13C NMR in determining lignin structure has been well documented (Capanema et
al., 2004; Capanema et al., 2005; Holtman and Kadla, 2004; Holtman et al., 2006).
Fig 1.11. 500 MHz 1H NMR spectrum of lignin DHP collected in DMSO-d6/pyridine-d5.
In addition to 1D NMR, 2D NMR has proven quite useful for elucidating lignin chemical
structure. For example, the heteronuclear single quantum coherence (HSQC) technique (Fig
1.12) shows peaks that correspond to direct 1H-13C coupling, and it has the advantage of
relatively high sensitivity while at the same time largely eliminating the chemical shift
degeneracy that arises in 1D spectra. The HSQC technique has been used, sometimes in
conjunction with quantitative 13C NMR, to identify novel lignin structures and/or interpolymer
crosslinking, for example in the case of the so-called lignin-carbohydrate complexes that arise
from lignin-polysaccharide coupling (Balakshin et al., 2007; Balakshin et al., 2011; Chen et al.,
2012; Kim and Ralph, 2010; Mansfield et al., 2012). The following chapters show that this
technique is also quite useful for the investigation of lignin-protein coupling. Other 2D NMR
techniques useful for investigating lignin-protein linkages include heteronuclear multiple
18
quantum coherence (HMQC), which shows direct 1H-13C coupling but uses a different pulse
program than HSQC, heteronuclear multiple bond correlation (HMBC), which shows long-range
(typically 2 and 3-bond) through-bond 1H-13C coupling, correlation spectroscopy (COSY) or
total correlation spectroscopy (TOCSY), which show long-range through-bond 1H-1H coupling,
and nuclear Overhauser effect spectroscopy (NOESY), which shows 1H-1H through-space
interactions.
Fig 1.12. 500 MHz 1H-13C HSQC spectrum of lignin DHP collected in DMSO-d6/pyridine-d5.
Each shift is indicative of a unique inter-unit linkage type or other functional group, which
collectively represent the lignin polymer.
Fourier-transform infrared (FT-IR) spectroscopy is another useful lignin characterization
technique. It has the advantages of quick spectral acquisition (seconds to minutes) with just a few
milligrams of sample, especially if an attenuated total reflectance (ATR) accessory is used. In
attenuated total reflectance, the IR beam passes through a crystal (typically germanium, zinc
selenide, silicon, or diamond) and total internal reflection occurs. An evanescent wave, which
penetrates several microns into the sample, is established at the boundary of the crystal. The
sample absorbs some wavelengths of IR radiation stronger than others, resulting in the IR
spectrum. An FT-IR ATR spectrum of lignin DHP is shown in Fig 1.13, and spectral
19
assignments are shown where possible (Faix and Beinhoff, 1988). IR is useful for showing
protein incorporation into lignin because proteins exhibit unique IR signatures. The most
diagnostic of these occur near 1540 and 1658 cm-1, which are attributed to N-H deformation with
C-N stretching, and C=O stretching, respectively (Socrates, 2001). In addition, the overall shape
of the OH/NH region is altered upon protein incorporation, generally becoming sharper, and
sometimes exhibiting an enhanced shoulder at 3200 cm-1, attributed to N-H stretching in amide
functional groups (Socrates, 2001). Unfortunately, unlike NMR, direct detection of lignin-protein
linkages may not be possible with IR. This is because IR shifts diagnostic of lignin-protein
linkages are likely to be of very low intensity and located within the crowded fingerprint region.
Thus, IR may be useful for showing protein incorporation into lignin, but not necessarily capable
of elucidating the mechanism of lignin-protein interaction (i.e., covalent vs. non-covalent).
Fig 1.13. FT-IR ATR spectrum of lignin DHP.
Scanning electron microscopy (SEM) can be used to determine how protein incorporation
affects the physical morphology of lignin. Transmission electron microscopy (TEM) can also be
used, but SEM has the advantage of negligible sample preparation. Furthermore, advantages of
TEM, such as the ability to collect diffraction spectra, are nullified by the amorphous nature of
lignin. An SEM image of lignin DHP is shown in Fig 1.14. SEM has been used in the past to
show that lignin morphology is altered by the presence of cellulose (Micic et al., 2003), and to
investigate native lignin morphology within the cell wall (Terashima et al., 2004; Terashima and
Yoshida, 2006).
20
Fig 1.14. SEM image of lignin DHP. Scale = 1 µm.
Elemental analysis, in various forms, is an important analytical tool for characterizing
lignin. Due to the chemical structures of the monolignol constituents, neat lignin contains only
the elements carbon, oxygen, and hydrogen. These three elements also compose the lignin-
carbohydrate complexes, which often form in planta. However, in addition to these three
elements, proteins also contain nitrogen. Thus, if a lignin contains nitrogen, then protein
incorporation/contamination should be suspected. It is common to perform bulk elemental
analyses on extracted lignins to determine protein content (N% is multiplied by a factor of 6.25
to obtain protein percentage, assuming all nitrogen in the sample is from protein) (Chang et al.,
2008; Fukushima and Hatfield, 2001). In addition to purely bulk elemental analyses, energy
dispersive X-ray spectroscopy (EDS) and X-ray photoelectron spectroscopy (XPS) can be used
to obtain elemental data. In EDS, elemental composition is determined by bombarding the
sample with electrons, then analyzing characteristic X-rays emitted from the sample. One of the
advantages of EDS is that it can be collected in the SEM instrument while imaging the sample.
This allows for comparison of sample morphology and elemental composition across the sample
on the micron scale (the EDS spot size can be ~1 mm in diameter with an information depth of
~1-2 µm depending upon e- accelerating voltage). XPS is essentially the reverse process of EDS,
as it determines elemental composition by bombarding the sample with X-rays then observing
ejected electrons with characteristic energy levels. Because electrons have a far shorter mean free
path than X-rays, the information depth of XPS is only about 10 nm. This allows for elemental
analysis of the surface region. Comparison of the EDS and XPS data can then be used to show
variations in elemental composition throughout the samples.
The following chapters will show that the experiments and characterization techniques
described above are useful for investigating lignin-protein linkages, specifically under in vitro
conditions. Future research should address the possibility of lignin-protein linkage formation in
native plant systems.
21
1.4. References
Albenne, C.; Canut, H.; Jamet, E. Frontiers in Plant Science 2013, 4, article 111.
Albersheim, P.; Darvill, A.; Roberts, K.; Sederoff, R.; Staehelin, A. Principles of Cell Wall
Architecture and Assembly, in: Plant Cell Walls. 2010. Garland Science, New York, New York,
pp. 227-272.
Awad, H.M.; Boersma, M.G.; Vervoort, J.; Rietjens, I.M.C.M. Arch. Biochem. Biophys. 2000,
378, 224.
Balakshin, M.; Capanema, E.; Chang, H. Holzforschung 2007, 61, 1.
Balakshin, M.; Capanema, E.; Gracz, H.; Chang, H.; Jameel, H. Planta 2011, 233, 1097.
Bayer, E.M.; Bottrill, A.R.; Walshaw, J.; Vigouroux, M.; Naldrett, M.J.; Thomas, C.L.; et al.
Proteomics 2006, 6, 301-311.
Beat, K.; Templeton, M.D.; Lamb, C.J. Proc. Natl. Acad. Sci. USA 1989, 86, 1529.
Boerjan, W.; Ralph, J.; Baucher, M. Annu. Rev. Plant Biol. 2003, 54, 519.
Bolton, J.L.; Turnipseed, S.B.; Thompson, J.A. Chem.-Biol. Interact. 1997, 107, 185.
Cannon, M. C.; Terneus, K.; Hall, Q.; Tan, L.; Wang, Y.; Wegenhart, B.L.; Chen, L.; Lamport,
D.T.A.; Chen, Y.; Kieliszewski, M.J. PNAS 2008, 105(6), 2226.
Capanema, E.A.; Balakshin, M.Y.; Kadla, J.F. J. Agric. Food Chem. 2004, 52, 1850.
Capanema, E.A.; Balakshin, M.Y.; Kadla, J.F. J. Agric. Food Chem. 2005, 53, 9639.
Cassab, I.G.; Varner, J.E. Ann. Rev. Plant Physiol. Plant Mol. Biol. 1988, 39, 321.
Cathala, B.; Saake, B.; Faix, O.; Monties, B. Poly. Deg. and Stab. 1998, 59, 65.
Chang, X.F.; Chandra, R.; Berleth, T.; Beatson, R.P. J. Agric. Food Chem. 2008, 56, 6825.
Chapple, C.; Ladisch, M.; Meilan, R. Nat. Biotechnol. 2007, 25, 746.
Chen, F.; Dixon, R.A. In Vitro Cell. Dev. Biol.: Anim. 2008, 44, S28.
Chen, A.; Zhong, N.; Qu, Z.; Wang, F.; Liu, N.; Xia, G. J. Plant Res. 2007, 120, 337.
Chen, F.; Tobimatsu, Y.; Havkin-Frenkel, D.; Dixon, R.A.; Ralph, J. PNAS USA 2012, 109(5),
1772.
Chivasa, S.; Ndimba, B.K.; Simon, W.J.; Robertson, D.; Yu, X.L.; Knox, J.P., et al.
Electrophoresis 2002, 23, 1754-1765.
22
Cong, F.; Diehl, B.G; Hill, J.L.; Brown, N.R.; Tien, M. Phytochem. 2013, 96, 449-456.
Cosgrove, D.J. Nat. Rev. Mol. Cell Biol. 2005, 6, 850.
Ellis, M.; Egelund, J.; Schultz, C.J.; Bacic, A. Plant Phys. 2010, 153, 403.
Evans, J.J.; Himmelsbach, D.S. J. Agrc. Food Chem. 1991, 39, 830.
Faix, O. J. of Wood Chem. and Tech. 1988, 8(4), 505.
Feiz, L.; Irshad, M.; Pont-Lezica, R.F.; Canut, H.; Jamet, E. Plant Methods 2006, 2, 10.
Freudenberg, K. The constitution and biosynthesis of lignin, in: Freudenberg, K., Neish, A.C.
(Eds) Constitution and Biosynthesis of Lignin. 1968. Springer-Verlag, Berlin, Germany.
Fukushima, R.S.; Hatfield, R.D. J. of Ag. and Food Chem. 2001, 49(7), 3133.
Harrak, H.; Chamberland, H.; Plante, M.; Bellemare, G.; Lafontaine, J.G.; Tabaeizadeh, Z. Plant
Phys. 1999, 121, 557.
Holtman, K.M.; Chang, H.; Jameel, H.; Kadla, J.F. J. of Wood Chem. and Tech. 2006, 26, 21.
Holtman, K.M.; Kadla, J.F. J. Agric. Food Chem. 2004, 52(4), 720.
Jose, M.; Puigdomenech, P. New Phytol. 1993, 125, 259.
Jung, H.G. Agron. J. 1989, 81, 33.
Jung, H.G.; Allen, M.S. J. Anim. Sci. 1995, 73, 2774.
Kaewtatip, K.; Menut, P.; Auvergne, R.; Tanrattanakul, V.; Morel, M.; Guilbert, S. J. of Ag. and
Food Chem. 2010, 58, 4185.
Kawai, S.; Okita, K.; Sugishita, K.; Tanaka, A.; Ohashi, H. J. Wood Sci. 1999, 45, 440.
Kieliszewski, M.; Lamport, D.T.A.; Tan, L.; Cannon, M.C. Annu. Plant Rev. 2011, 41, 321.
Kim, H.; Ralph, J. Org. Biomol. Chem. 2010, 8, 576.
Kwon, H.K.; Yokoyama, R.; Nishitani, K. Plant Cell Physiol. 2005, 46, 843-857.
Lamport, D.T.A. 1974. 30th Symp. Soc. Dev. Biol., pp. 113-130.
Landucci, L.L.; Geddes, S.A.; Kirk, T.K. Holzforschung 1988, 35, 66.
Leary, G.J. Wood Sci. Technol. 1980, 14, 21.
Leary, G.; Miller, I.J.; Thomas, W.; Woolhouse, A.D. J. Chem. Soc., Perkin Trans. 2 1977, 13,
1737.
Li, L.; Popko, J.L.; Umezawa, T.; Chiang, V.L. J. Biol. Chem. 2000, 275, 6537.
23
Li, X.; Weng, J.K.; Chapple, C. Plant J. 2008, 54, 569.
Liyama, K.; Lam, T.B.; and Stone, B.A. Plant Phys. 1994, 104, 315.
Mangeon, A.; Junqueira, R.M.; Sachetto-Martins, G. Plant Signaling and Behavior 2010, 5(2),
99.
Mansfield, S.D.; Kim, H.; Lu, F.; Ralph, J. Nat. Prot. 2012, 7(9), 1579.
McQueen-Mason, S.; Cosgrove, D.J. PNAS USA 1994, 91, 6574.
Micic, M.; Radotic, K.; Jeremic, M.; Leblanc, R.M. Macromol. Biosci. 2003, 3, 100.
Miyagawa, Y.; Takemoto, O.; Takano, T.; Kamitakahara, H.; Nakatsubo, F. Holzforschung 2012,
66, 459.
Polus-Ratajczak, I.; Mazela, B.; and Golinkski, P. Annals of Warsaw Agricultural University,
Forestry and Wood Technology 2003, 53, 296.
Ralph, J.; Brunow, G.; Harris, P.J.; Dixon, R.A.; Schatz, P.F.; Boerjan, W. Chapter 2.
Lignification: are lignins biosynthesized via simple combinatorial chemistry or via proteinaceous
control and template replication. In Recent Advances in Polyphenol Research. 2008. Blackwell
Publishing.
Ralph, J.; Lapierre, C.; Marita, J.M.; Kim, H.; Lu, F.; Hatfield, R.D.; Ralph, S.; Chapple, C.;
Franke, R.; Hemm, M.R.; Van Doorsselaere, J.; Sederoff, R.R.; O’Malley, D.M.; Scott, J.T.;
MacKay, J.J.; Yahiaoui, N.; Boudet, A.; Pean, M.; Pilate, G.; Jouanin, L.; Boerjan, W.
Phytochem. 2001, 57(6), 993.
Ralph, J.; Schatz, P.F.; Lu, F.; Kim, H.; Akiyama, T.; Nelsen, S.F. Quinone Methides in
Lignification, in: Rokita, S.E. (Ed.), Quinone Methides. 2009. John Wiley & Sons, Hoboken,
New Jersey, pp. 385-420.
Ralph, J.; Young, R.A. J. Wood Chem. Technol. 1983, 3(2), 161.
Ralph, S.A.; Ralph, J.; Landucci, L.L. NMR Database of Lignin and Cell Wall Model
Compounds. Available at URL http://ars.usda.gov/Services/docs.htm?docid=10491 (November
2004).
Ramakrishnan, K.; Fisher, J. J. Am. Chem. Soc. 1983, 105, 7187.
Ringli, C.; Keller, B.; Ryser, U. Cell. and Mol. Life Sci. 2001, 58, 1430.
Ryser, U.; Schorderet, M.; Guyot, R.; Keller, B. J. of Cell Sci. 2004, 117, 1179.
Ryser, U.; Schorderet, M.; Zhao, G.; Studer, D.; Ruel, K.; Hauf, G.; Keller, B. The Plant J. 1997,
12(1), 97.
24
Sachetto-Martins, G.; Franco, L.O.; Oliveira, D.E. Biochimica et Biophysica Acta 2000, 1492,
10.
Salmen, L.; Petterson, B. Cellulose Chem. and Tech. 1995, 29, 331.
Socrates, G. Infrared and Raman characteristic group frequencies, third edition. 2001. George
Wiley and Sons, LTD. West Sussex, England.
Stevanic, J.S.; Salmen, L. Cellulose Chem. and Tech. 2006, 40(9-10), 761.
Stevanic, J.S.; Salmen, L. Cellulose 2008, 15, 285.
Stevanic, J.S.; Salmen, L. J. of Pulp and Paper Sci. 2008, 34(2), 107.
Stewart, J.J.; Kadla, J.F.; Mansfield, S.D. Holzforschung 2006, 60, 111.
Talmadge, K.W.; Keegstra, K.; Bauer, W.D.; Albersheim, P. Plant Physiol. 1973, 51, 158-173.
Terashima, N.; Atalla, R.H.; Ralph, S.A.; Landucci, L.L.; Lapierre, C.; Monties, B.
Holzforschung 1995, 49, 521.
Terashima, N.; Awano, T.; Takabe, K.; Yoshida, M. C. R. Biologies 2004, 327, 903.
Terashiam, N.; Yoshida, M. Cell. Chem. and Tech. 2006, 40(9-10), 727.
Tobimatsu, Y.; Elumalai, S.; Grabber, J.H.; Davidson, C.L.; Pan, X.; Ralph, J. ChemSusChem.
2012, 5(4), 676.
Toikka, M.; Jussi, S.; Teleman, A.; Brunow, G. J. Chem. Soc., Perkin Trans. 1 1998, 1, 3813.
Vanholme, R.; Demedts, B.; Morreel, K.; Ralph, J.; Boerjan, W. Plant Phys. 2010, 153, 895.
Whitmore, F.W. Phytochemistry 1978a, 17, 421.
Whitmore, F.W. Plant Science Letters 1978b, 13, 241.
Whitmore, F.W. Phytochemistry 1982, 21(2), 315.
Ye, Z.; Song, Y.; Marcus, A.; Varner, J. E. The Plant J. 1991, 1(2), 175.
Ye, Z.; Varner, J.E. The Plant Cell 1991, 3, 23.
Yuan, T.; Sun, S.; Xu, F.; Sun, R. J. Agric. Food Chem. 2011, 59, 10604.
25
Chapter 2
Towards lignin-protein crosslinking: Amino acid adducts of a lignin model quinone
methide
(Published in Cellulose, available here)
2.1. Abstract
The polyaromatic structure of lignin has long been recognized as a key contributor to the rigidity
of plant vascular tissues. Although lignin structure was once conceptualized as a highly
networked, heterogeneous, high molecular weight polymer, recent studies have suggested a very
different configuration may exist in planta. These findings, coupled with the increasing attention
and interest in efficiently utilizing lignocellulosic materials for green materials and energy
applications, have renewed interest in lignin chemistry. Here we focus on quinone methides—
key intermediates in lignin polymerization—that are quenched via reaction with cell-wall-
available nucleophiles. Reactions with alcohol and uronic acid groups of hemicelluloses, for
example, can lead to lignin-carbohydrate crosslinks. Our work is a first step toward exploring
potential quinone methide (QM) reactions with nucleophilic groups in cell wall proteins. We
conducted a model compound study wherein the lignin model compound guaiacylglycerol-β-
guaiacyl ether 1, was converted to its QM 2, then reacted with amino acids bearing nucleophilic
side-groups. Yields for the QM-amino acid adducts ranged from quantitative in the case of QM-
lysine 3, to zero (no reaction) in the cases of QM-threonine 10 and QM-hydroxyproline 11. The
structures of the QM-amino acid adducts were confirmed via 1D and 2D nuclear magnetic
resonance (NMR) spectroscopy and density functional theory calculations, thereby extending the
lignin NMR database to include amino acid crosslinks. Some of the QM-amino acid adducts
formed both syn- and anti-isomers, whereas others favored only one isomer. Because the QM-
threonine 10 and QM-hydroxyproline 11 compounds could not be experimentally prepared under
conditions described here but could potentially form in vivo, we used density functional theory to
calculate their NMR shifts. Characterization of these model adducts extends the lignin NMR
database to aid in the identification of lignin-protein linkages in more complex in vitro and in
vivo systems, and may allow for the identification of such linkages in planta.
2.2. Introduction
Plant cell walls are composed of a network of interacting polymers, namely cellulose,
hemicelluloses, pectins, lignin, and structural proteins (Cosgrove 2005; McQueen-Mason and
Cosgrove 1994). Of these, lignin is the major aromatic component, derived from monolignols—
phenylpropanoid units whose biosynthesis exhibits incredible plasticity (Boerjan et al. 2003;
Ralph et al. 2004; Vanholme et al. 2010). Lignin’s mode of polymerization is unique among the
cell wall polymers. Resonance stabilized radicals are enzymatically generated from the
monolignols, and as the radical-bearing structures couple combinatorially, a heterogeneous
polymer containing many types of inter-unit linkages forms. The variety of the inter-unit
26
linkages contributes notable recalcitrance to the plant cell wall, stymying not only natural
degradation, but also affecting the economics of many industrial sectors, including the pulp and
paper industry, the developing biofuels industry, agricultural industries, and chemical industries,
which all seek higher value products from lignin (Chapple et al. 2007; Chen and Dixon 2008;
Jung 1989; Jung and Allen 1995; Li et al. 2008; Stewart et al. 2006).
Inter-unit linkages are not, however, the sole factor influencing lignin’s recalcitrance in
planta. Lignin may be crosslinked with other polymers in the plant wall. Hydroxyl and uronic
acid groups of polysaccharides bear mildly nucleophilic groups that can react with a key lignin
intermediate—the α-carbon of quinone methides (QMs) (Balakshin et al. 2011; Leary 1980;
Miyagawa et al. 2012; Ralph et al. 2009; Toikka et al. 1998; Yuan et al. 2011). These QMs form
each time a monolignol radical couples at its β-position and, because β-coupling is prevalent, the
importance of QMs in lignin structure cannot be understated. In certain cases, particularly β-5-
and β-β-coupling, QM intermediates are quickly trapped intramolecularly, producing
phenylcoumaran and resinol units (Leary 1980; Ralph et al. 2009). However, in the case of the
predominant β-O-4-coupling, which produces β-ether linkages, the QM’s α-carbon becomes
susceptible to external nucleophilic attack (Fig 2.1) (Leary 1980; Ralph et al. 2009). This
reactivity of the QM is the focus of the current study.
Fig 2.1. Formation of β-ether QMs via radical coupling, and their rearomatization during lignin
polymerization. L = lignin polymer, Nuc = nucleophile (e.g., H2O, and also here Cys, Lys, His,
Asp, Glu, Tyr or Ser), R = H or OCH3
The crosslinking of lignin with cell wall constituents other than hemicelluloses has not been
well investigated. Cell wall structural proteins, including glycine-rich proteins (GRPs), proline-
rich proteins (PRPs), and hydroxyproline-rich glycoproteins (HRGPs), all contain amino acid
residues with nucleophilic side-chains that could react with lignin QMs (Harrak et al. 1991; Jose
and Puigdomenech 1993; Kieliszewski et al. 2011; Ryser et al. 1997). Cell wall proteins vary in
quantity among species and cell types, ranging from as low as 1-2% to 20% on a dry weight
basis in wild type plants (Albersheim et al. 2010; Cassab and Varner 1988). In 1978 and 1982,
Whitmore showed evidence for the formation of lignin-protein linkages in isolated cell walls of
slash pine. Further literature sources suggest that structural proteins may crosslink with lignin, or
possibly even nucleate, or provide a template for, lignin structure, but these ideas have not been
adequately tested (Albersheim et al. 2010; Beat et al. 1989; Boerjan et al. 2003; Cassab and
27
Varner 1988; Harrak et al. 1991). If true, this mechanism could provide spatial and temporal
control over lignin deposition and architecture (Beat et al. 1989). Furthermore, it has recently
been suggested that over-expression of cell wall proteins could result in increased lignin-protein
linkage formation, which may affect cell wall physical and chemical properties, for example
increased sugar extractability (Liang et al. 2008; Xu et al. 2013). However, identifying such
linkages in planta would be difficult without first determining diagnostic lignin-protein
spectroscopic signatures under simpler, more controlled conditions.
As a first step toward investigating potential lignin-protein linkages in planta, we conducted
a model compound study to characterize products formed when the lignin model compound
guaiacylglycerol-β-guaiacyl ether 1 was converted to its QM 2 (Fig 2.2), then reacted with amino
acids bearing nucleophilic side-groups. Thiols, amines, acids and alcohols have been shown to
quench QMs in a diverse array of systems. The thiol group of glutathione reacts with an o-QM
generated from the flavonoid, quercetin (Awad et al. 2000); the thiol group of cysteine reacts
with the relatively unreactive p-QM, 2,6-di-tert-butyl-4-methylene-2,5-cyclohexadienone
(Bolton et al. 1997); and thiols and thiolates react with QMs derived from anthracyclines
(Ramakrishnan and Fisher 1983). Similarly, amines have been shown to trap lignin QMs (Ralph
and Young 1983). A wide array of acid- and hydroxyl-containing compounds react with p-QMs
(Leary et al. 1977), and primary (and to a much lesser extent, secondary) hydroxyl groups of
carbohydrates may react with QM 2 (Toikka et al. 1988). However, similar nucleophile-QM
adducts have not been characterized in lignin-protein systems.
Fig 2.2. Guaiacylglycerol-β-guaiacyl ether 1 and its derived quinone methide (QM) 2
The nucleophilic amino acids investigated here—cysteine (Cys), lysine (Lys), histidine (His),
aspartic acid (Asp), glutamic acid (Glu), tyrosine (Tyr), serine (Ser), threonine (Thr) and
hydroxyproline (Hyp)—occur in plant cell wall structural proteins and may react to form lignin-
protein crosslinks in vivo (Jose and Puigdomenech 1993; Kieliszewski et al. 2011). Because cell
wall proteins are thought to exist in the wall prior to lignification, the α-amine and α-acid groups
of the amino acids were protected to mimic their inclusion within a peptide. This allowed
reactions of the nucleophilic side-chains to be determined without the complication of competing
reactions from the terminal α-amine and α-acid groups. The QM-amino acid adducts (Fig 2.3)
were characterized by nuclear magnetic resonance (NMR) spectroscopy, density functional
28
theory (DFT), mass spectrometry, and UV/Visible (UV/Vis) spectrophotometry. The
characterization of these model adducts extends the lignin NMR database to aid in the
identification of lignin-protein linkages in more complex in vitro and in vivo systems (Ralph et
al. 2004).
2.3. Experimental
2.3.1 Materials
All chemicals used in the preparation of compounds 1 and 2, and lignin dehydrogenation
polymer (DHP), were purchased from Sigma. All amino acids used in the preparation of
compounds 3-9 were purchased from Sigma with the exception of Boc-L-histidine methyl ester,
which was purchased from Indofine Chemical Company.
2.3.2. Model compound preparations
Compound 1 was prepared according to previous methods, as was its QM analog (2)
(Kawai et al. 1999; Landucci et al. 1981; Ralph and Young 1983). Protected amino acids (1.05
eq) were added directly to the anhydrous solution of 2 in dichloromethane at room temperature.
In the case of Lys, which was obtained as Nα-acetyl-L-lysine methyl ester hydrochloride,
triethylamine (~5 eq) was added in order to deprotonate the terminal amine and facilitate
dissolution. NMR was used to show that triethylamine was not reactive towards the QM. A stir
bar was added, the flask was stoppered, and the atmosphere was rendered inert by alternating
between vacuum and dry nitrogen several times. The reaction was monitored visually;
dissipation of the yellow hue indicated consumption of the QM. Intermittently, the reaction was
also monitored by TLC (1:1 ethyl acetate/hexanes). Lys and His reacted with the QM within
minutes. Other amino acids reacted more slowly with the QM and were allowed to stir overnight
(Cys, Asp, Glu, Tyr) or for several days (Ser, Thr, Hyp), again, with intermittent monitoring by
TLC. When TLC revealed that the reaction had reached equilibrium the mixture was evaporated
to dryness. In the case of Lys, the reaction went to completion (complete consumption of the
QM) and the crude products were evaporated to dryness and submitted to NMR without further
purification. QM reactions with other amino acids did not go to completion. In the case of QM-
Thr and QM-Hyp, TLC and NMR showed no reaction even over the course of several weeks.
The products were purified via flash chromatography using silica gel and 1:1 ethyl
acetate/hexanes as eluent. The purified products were then characterized using nuclear magnetic
resonance (NMR) spectroscopy, mass spectrometry, and density functional theory (DFT). In the
case of QM-His (5), the product could not be chromatographically separated (a range of eluent
solvent systems were attempted) from α-O-aryl products formed presumably due to self-
dimerization of the QM (2); however, mass spec and 2D NMR techniques were still able to
confirm the identity of the QM-His product. In the case of QM-Ser (8), the product could not be
fully separated from unreacted serine. The neat serine shifts as well as the shifts of compound 8
are labeled in the NMR spectra (see below).
29
Lignin guaiacyl-based dehydrogenation polymer (DHP) was synthesized according to a
previously published method (Terashima et al. 1995). The DHP was characterized via HSQC
NMR as described below and was found to contain shifts typical of native lignin and DHP
(Capanema et al. 2004; Kim and Ralph 2010).
2.3.3. Model compound properties
QM-Cys, 3 (2-tert-Butoxycarbonylamino-3-[3-hydroxy-1-(4-hydroxy-3-methoxy-phenyl)-2-(2-
methoxy-phenoxy)-propylsulfanyl]-propionic acid methyl ester). Pale white oil (yield: 73% after
purification). Theoretical mass: 537.20 g/mol (+ H+: 538.21 g/mol). Observed m/z + H+: 538.21.
Major isomer (86%): 1H NMR (400 MHz, acetone-d6): δ = 1.39 (9H, s, H7), 2.82 (2H, m,
H1), 3.53 (1H, m, Hγ), 3.67 (3H, s, H4), 3.74 (1H, m, Hγ), 3.80 (3H, s, OMeB), 3.87 (3H, s,
OMeA), 4.34 (1H, t, J = 5.78, Hα), 4.42 (1H, m, H2), 4.65 (1H, m, Hβ), 6.77 (1H, m, HA5), 6.85
(1H, m, HB6), 6.90 (1H, m, HB4), 6.94 (1H, m, HA6), 6.97 (1H, m, HB5), 7.05 (1H, m, HB3),
7.34 (1H, m, HA2). Minor isomer (14%): 1H NMR (400 MHz, acetone-d6): δ = 1.42 (9H, s,
H7), 2.74 (2H, m, H1), 3.58 (1H, m, Hγ), 3.65 (1H, s, H4), 3.72 (1H, m, Hγ), 3.79 (3H, s,
OMeB), 3.86 (3H, s, OMeA), 4.55 (1H, m, Hβ). Major isomer (86%): 13C NMR (75.5 MHz,
acetone-d6): δ = 28.50 (C7), 33.70 (C1), 51.41 (Cα), 52.40 (C4), 54.13 (C2), 56.11 (OMeA),
56.25 (OMeB), 62.01 (Cγ), 79.54 (C6), 83.87 (Cβ), 113.51 (CA2), 113.78 (CB5), 114.80 (CA5),
117.64 (CB3), 121.61 (CB6), 122.79 (CB4), 123.37 (CA6), 130.87 (CA1), 146.78 (CA4), 148.03
(CA3), 149.07 (CB1), 151.50 (CB2), 155.95 (C5), 172.20 (C3). Minor isomer (14%): 13C
NMR (75.5 MHz, acetone-d6): δ = 113.63 (CA2), 115.41 (CA5), 117.88 (CB3), 121.79 (CB6),
131.00 (CA1), 156.33 (C5). Major isomer (86%): 1H NMR (400 MHz, DMSO-d6/pyridine-
d5): δ = 1.34 (9H, s, H7), 2.66-2.86 (2H, m, H1), 3.43 (1H, m, Hγ), 3.56 (3H, s, H4), 3.58 (1H,
m, Hγ), 3.70 (3H, s, OMeA), 3.78 (3H, s, OMeB), 4.18-4.27 (1H, m, H2), 4.33 (1H, m, Hα), 4.70
(1H, m, Hβ), 5.08 (1H, s, γ-OH), 6.76 (1H, m, HA5), 6.85 (1H, m, HB5), 6.88 (1H, m, HB4),
6.91 (1H, m, HA6), 6.92 (1H, m, HB6), 7.10 (1H, m, HB3), 7.34 (1H, m, HA1), 7.39 (1H, d, J =
8.07, NH), 9.19 (1H, s, A4-OH). 13C NMR (75.5 MHz, DMSO-d6/pyridine-d5): δ = 28.09
(C7), 32.23 (C1), 50.27 (Cα), 51.93 (C4), 53.47 (C2), 55.35 (OMeB), 55.74 (OMeA), 60.68 (Cγ),
81.78 (Cβ), 112.74 (CB6), 113.53 (CA2), 114.60 (CA5), 115.34 (CB3), 120.78 (CB5), 121.44
(CB4), 122.48 (CA6), 129.22 (CA1), 130.90 (C6), 146.05 (CA4), 147.43 (CA3), 148.01 (CB1),
149.87 (CB2), 155.37 (C5), 171.78 (C3). Minor isomer (14%): 1H NMR (400 MHz, DMSO-
d6/pyridine-d5): δ = 4.35 (1H, m, Hα), 4.61 (1H, m, Hβ). 13C NMR (75.5 MHz, DMSO-
d6/pyridine-d5): δ = 28.12 (C7), 32.42 (C1), 50.60 (Cα), 51.97 (C4), 53.69 (C2), 60.96 (Cγ),
81.65 (Cβ), 155.63 (C5), 171.74 (C3).
31
QM-Lys, 4 (2-Acetylamino-6-[3-hydroxy-1-(4-hydroxy-3-methoxy-phenyl)-2-(2-methoxy-
phenoxy)-propylamino]hexanoic acid methyl ester). Pale white oil (yield: quantitative, no
purification necessary). Theoretical mass: 504.25 g/mol (+ H+: 505.25 g/mol). Observed m/z +
H+: 505.25. 1H NMR (300 MHz, acetone-d6): δ = 1.4 (4H, m, H2, H3), 1.52 (2H, m, H4), 1.90
(3H, s, H7), 2.41 (2H, m, H1), 3.50 (1H, m, Hγ), 3.63 (3H, s, H9), 3.69 (1H, m, Hγ), 3.79 (3H, s,
OMeA), 3.86 (3H, s, OMeB), 3.97 (1H, d, J = 6.82 Hz, Hα), 4.18 (1H, m, Hβ), 4.38 (1H, m, H5),
6.77 (1H, m, HB6), 6.85 (1H, m, HA6), 6.93 (1H, m, HA5), 6.96 (1H, m, HB5), 6.96 (1H, m,
HB4), 7.09 (1H, m, HA2), 7.14 (1H, m, HB3), 7.39 (1H, d, J = 7.47, acetyl-NH). 13C NMR
(75.5 MHz, acetone-d6): δ = 22.55 (C7), 24.09 (C3), 30.24 (C2), 32.34 (C4), 47.34 (C1), 52.03
(C9), 52.97 (C5), 56.09 (OMeA), 56.09 (OMeb), 62.19 (Cγ), 64.62 (Cα), 87.12 (Cβ), 111.96
(CA2), 113.16 (CB5), 115.35 (CB6), 118.98 (CB3), 121.75 (CA6), 121.75 (CB4), 122.99 (CA5),
133.12 (CA1), 146.70 (CA4), 148.33 (CA3), 149.45 (CB1), 151.63 (CB2), 170.11 (C6), 173.60
(C8). 1H NMR (300 MHz, DMSO-d6/pyridine-d5): δ = 1.21 (4H, m, H2, H3), 1.58 (2H, m,
H4), 1.88 (3H, s, H7), 2.31 (2H, m, H1), 3.45 (1H, m, Hγ), 3.59 (3H, s, H9), 3.67 (1H, m, Hγ),
3.71 (3H, s, OMeA), 3.79 (3H, s, OMeB), 3.90 (1H, d, J = 6.82 Hz, Hα), 4.21 (1H, m, Hβ), 4.27
(1H, m, H5), 5.00 (1H, s, γ-OH), 6.76 (2H, m, HA5, HA6), 6.84 (1H, m, HB5), 6.90 (1H, m,
HB4), 6.95 (1H, m, HB6), 6.98 (1H, m, HA2), 7.15 (1H, m, HB3), 8.30 (1H, d, J = 7.47, acetyl-
NH), 9.12 (1H, s, A4-OH). 13C NMR (75.5 MHz, DMSO-d6/pyridine-d5): δ = 23.24 (C7),
23.26 (C3), 29.21 (C2), 30.94 (C4), 46.61 (C1), 51.62 (C9), 52.04 (C5), 55.41 (OMeA), 55.46
(OMeb), 60.82 (Cγ), 63.00 (Cα), 85.89 (Cβ), 111.66 (CA2), 112.29 (CB6), 115.15 (CA5), 116.99
(CB3), 120.71 (CA6), 120.78 (CB5), 121.57 (CB4), 131.4 (CA1), 145.83 (CA4), 147.58 (CA3),
148.60 (CB1), 149.93 (CB2), 169.56 (C6), 172.96 (C8).
32
QM-His, 5a (2-tert-Butoxycarbonylamino-3-{3-[3-hydroxy-1-(4-hydroxy-3-methoxy-phenyl)-2-
(2-methoxy-phenoxy)-propyl]-3H-imidazol-4-yl}-propionic acid methyl ester) and QM-His, 5b
(2-tert-Butoxycarbonylamino-3-{1-[3-hydroxy-1-(4-hydroxy-3-methoxy-phenyl)-2-(2-methoxy-
33
phenoxy)-propyl]-1H-imidazol-4-yl}-propionic acid methyl ester) (Note: Some of the NMR shift
assignments for this compound were based on interpretation of the 2D HMQC and HMBC NMR
spectra because, as noted above, the QM-His products could not be chromatographically
separated from a lignin QM dimer, leading to some shift degeneracy in the 1H and 13C 1D
spectra.) Pale white oil (yield: 45% by NMR). Theoretical mass: 571.25 g/mol (+ H+: 572.26
g/mol). Observed m/z + H+: 572.26. 1H NMR (400 MHz, acetone-d6): 1.35 (9H, s, H12), 2.97
(2H, m, H6), 3.44 (1H, m, Hγ), 3.54 (3H, s, H9), 3.56 (1H, m, Hγ), 3.63 (6H, s, OMe), 4.37 (1H,
m, H7), 4.96 (1H, m, Hβ), 5.66 (1H, m, Hα), 6.74 (1H, m, HA5), 6.77 (1H, m, HB3), 6.86 (1H,
m, HB4), 6.88 (1H, m, HA6), 6.89 (1H, m, HB5), 6.90 (1H, m, H5), 7.06 (1H, m, HB6), 7.11
(1H, m, HA2), 7.55 (1H, s, H5), 7.81 (1H, s, H2). 13C NMR (75.5 MHz, acetone-d6): 27.80
(C12), 29.40 (C6), 51.00 (C9), 54.06 (C7), 55.60 (OMe), 59.90 (Cγ), 60.97 (Cα), 78.40 (C11),
81.45 (Cβ), 111.40 (CB6), 111.60 (CA2), 112.60 (CB5), 115.00 (CA5), 116.56 (C5), 117.30
(C4), 120.20 (CA6), 120.80 (CB3), 122.40 (CB4), 129.80 (CA1), 135.20 (C5), 138.00 (C2),
147.50 (CA3), 147.60 (CB1), 147.80 (CB4), 150.80 (CB2), 155.40 (C10), 172.60 (C8). 1H NMR
(400 MHz, DMSO-d6/pyridine-d5): 1.33 (9H, m, H12), 2.89 (2H, m, H6), 3.46 (3H, s, OMeA),
3.47 (2H, m, Hγ), 3.47 (3H, s, H9), 3.67 (3H, s, OMeB), 4.37 (1H, m, H7), 5.02 (1H, m, Hβ),
5.70 (1H, m, Hα), 6.76 (1H, m, HA5), 6.76 (1H, m, H5), 6.82 (1H, m, HA6), 6.89 (1H, m, HB5),
7.05 (1H, m, HB3), 7.08 (1H, m, HB6), 7.13 (1H, m, HA2), 7.13 (1H, m, HB4), 7.62 (1H, s,
H2), 7.81 (1H, d, J = 8.42 Hz, acetyl-0NH), 9.23 (1H, s, A4-OH). 13C NMR (75.5 MHz,
DMSO-d6/pyridine-d5): 28.04 (C12), 29.77 (C6), 51.20 (C9), 53.98 (C7), 55.45 (OMeA), 55.63
(OMeβ), 59.31 (Cγ), 60.20 (Cα), 78.29 (C11), 80.20 (Cβ), 111.62 (CB6), 111.81 (CA2), 112.47
(CB5), 114.80 (C5), 114.83 (CA5), 116.60 (CB3), 120.69 (CA6), 130.00 (CA1), 134.98 (C2),
136.44 (C4), 147.20 (CA4), 147.88 (CB1), 148.05 (CA3), 150.66 (CB2), 155.41 (C10), 172.63
(C8).
QM-Asp, 6 (2-tert-Butoxycaronylamino-succinic acid 1-benzyl ester 4-[3-hydroxy-1-(4-
hydroxy-3-methoxy-phenyl)-2-(2-methoxy-phenoxy)-propyl] ester). Pale white oil (yield: 58%
after purification). Theoretical mass: 625.25 g/mol (+ Na+: 648.24 g/mol). Observed m/z + Na+:
648.24. Major isomer (74%): 1H NMR (400 MHz, acetone-d6): δ = 1.37 (9H, s, H6), 2.92
34
(2H, m, H2), 3.6 (1H, m, H3), 3.65 (1H, m, Hγ), 3.75 (1H, m, Hγ), 3.80 (3H, s, OMeB), 3.84
(3H, s, OMeA), 4.61 (1H, m, Hβ), 5.10 (2H, s, H8), 6.06 (1H, d, J = 4.42, Hα), 6.82 (1H, m,
HA5), 7.03 (1H, m, HB3), 6.84 (1H, m, HB5), 6.94 (1H, m, HA6), 6.94 (1H, m, HB4), 6.96 (1H,
m, HB6), 7.17 (1H, m, HA2), 7.34 (5H, m, H10, H11, H12, H13, H14), 7.65 (1H, s, A4-OH). 13C NMR (75.5 MHz, acetone-d6): δ = 28.42 (C6), 37.13 (C2), 51.23 (C3), 56.20 (OMeA),
56.16 (OMeB), 61.18 (Cγ), 67.37 (C8), 75.82 (Cα), 79.57 (C5), 83.60 (Cβ), 112.22 (CA2),
113.48 (CB6), 115.14 (CA5), 119.10 (CB3), 121.65 (CA6), 121.65 (CB5), 128.69 (CB4), 128.72
(CA1), 129.18 (C10-C14), 136.81 (C9), 147.32 (CA3), 147.97 (CA4), 151.75 (CB1), 151.80
(CB2), 156.11 (C4), 169.98 (C1), 171.71 (C7). Minor isomer (26%): 1H NMR (400 MHz,
acetone-d6): δ = 4.51 (1H, m, Hβ), 6.14 (1H, m, Hα). 13C NMR (75.5 MHz, acetone-d6): δ =
76.41 (Cα), 84.62 (Cβ), 119.27 (CB3), 169.81 (C1), 171.67 (C7). Major isomer (74%): 1H
NMR (400 MHz, DMSO-d6/pyridine-d5): δ = 1.33 (9H, s, H6), 2.82 (2H, m, H2), 3.57 (1H, m,
Hγ), 3.66 (1H, m, Hγ), 3.70 (3H, s, OMeB), 3.76 (3H, s, OMeA), 4.55 (1H, m, H3), 4.67 (1H, m,
Hβ), 5.09 (2H, s, H8), 6.04 (1H, m, Hα), 6.81 (1H, m, HA5), 6.83 (1H, m, HB3), 6.86 (1H, m,
HB5), 6.89 (1H, m, HA6), 6.91 (1H, m, HB6), 7.09 (1H, m, HA2), 7.11 (1H, m, HB4), 7.37 (5H,
m, H10, H11, H12, H13, H14), 7.48 (1H, d, J = 3.42, NH), 9.36 (1H, s, A4-OH). 13C NMR (75.5
MHz, DMSO-d6/pyridine-d5): δ = 27.98 (C6), 35.60 (C2), 50.24 (C3), 55.44 (OMeA), 55.54
(OMeB), 59.63 (Cγ), 65.98 (C8), 74.75 (Cα), 78.45 (C5), 81.39 (Cβ), 111.96 (CA2), 112.76
(CB6), 114.84 (CA5), 116.77 (CB4), 120.69 (CA6), 120.69 (CB3), 120.69 (CB5), 127.19 (CA1),
127.53 (C12), 127.66 (C13), 127.78 (C11), 127.95 (C14), 128.29 (C10), 135.98 (C9), 144.32
(C4), 146.65 (CA4), 147.30 (CA3), 147.67 (CB1), 150.12 (CB2), 169.12 (C1), 171.79 (C7).
Minor isomer (26%): 1H NMR (400 MHz, DMSO-d6/pyridine-d5): δ = 4.55 (1H, m, Hβ),
6.11 (1H, m, Hα). 13C NMR (75.5 MHz, DMSO-d6/pyridine-d5): δ = 75.08 (Cα), 82.70 (Cβ),
112.58 (CB6), 116.87 (CB4), 147.27 (CA3), 150.16 (CB2), 168.92 (C1).
35
QM-Glu, 7 (2-tert-Butoxycaronlyamino-pentanedioic acid 1-tert-butyl ester 5-[3-hydroxy-1-(4-
hyroxy-3-methoxy-phenyl)-2-(2-methoxy-phenoxy)-propyl] ester). Pale white oil (yield: 47%
after purification). Theoretical mass: 605.28 g/mol (+ Na+: 628.27 g/mol). Observed m/z + Na+:
628.25. Major isomer (74%): 1H NMR (400 MHz, acetone-d6): δ = 1.42 (9H, s, H10), 1.46
(9H, s, H7), 1.92, 2.08 (2H, m, H3), 2.47 (2H, t, J = 8.02, H2), 3.69, 3.79 (2H, m, Hγ), 3.84 (3H,
s, OMeB), 3.88 (3H, s, OMeA), 4.08 (1H, m, H4), 4.62 (1H, m, Hβ), 5.78 (1H, s, γ-OH), 6.06
(1H, d, J = 5.11, Hα), 6.79 (1H, m, HA5), 6.85 (1H, m, HB5), 6.94 (1H, m, HA6), 6.94 (1H, m,
HB4), 6.96 (1H, m, HB6), 7.02 (1H, m, HB3), 7.17 (1H, m, HA2). 13C NMR (75.5 MHz,
acetone-d6): δ = 27.69 (C3), 28.07 (C10), 28.49 (C7), 31.19 (C2), 54.57 (C4), 56.21 (OMeA),
56.21 (OMeB), 61.27 (Cγ), 75.23 (Cα), 79.17 (C9), 81.46 (C6), 83.89 (Cβ), 112.29 (CA2),
113.55 (CB6), 115.15 (CA5), 119.02 (CB3), 121.68 (CB5), 121.68 (CA6), 123.31 (CB4), 129.32
(CA1), 147.32 (CA4), 147.96 (CA3), 148.97 (CB1), 151.78 (CB2), 156.40 (C5), 171.93 (C1),
174.01 (C8). Minor isomer (26%): 1H NMR (400 MHz, acetone-d6): δ = 4.53 (1H, m, Hβ),
6.13 (1H, m, Hα). 13C NMR (75.5 MHz, acetone-d6): δ = 75.93 (Cα), 84.85 (Cβ), 115.53
36
(CA5), 119.26 (CB3), 129.72 (CA1), 147.53 (CA4), 148.20 (CA3), 172.10 (C1). Major isomer
(74%): 1H NMR (400 MHz, DMSO-d6/pyridine-d5): δ = 1.34 (9H, s, H10), 1.36 (9H, s, H7),
1.83 (2H, m, H3), 2.39 (2H, m, H2), 3.58, 3.66 (2H, m, Hγ), 3.71 (3H, s, OMeB), 3.76 (3H, s,
OMeA), 3.93 (1H, m, H4), 4.67 (1H, m, Hβ), 5.10 (1H, s, γ-OH), 6.02 (1H, m, Hα), 6.81 (1H, m,
HA5), 6.82 (1H, m, HB3), 6.85 (1H, m, HB5), 6.89 (1H, m, HA6), 6.92 (1H, m, HB6), 7.08 (1H,
m, HA1), 7.10 (1H, m, HB4), 7.28 (1H, d, J = 7.86, NH), 9.34 (1H, s, A4-OH). 13C NMR (75.5
MHz, DMSO-d6/pyridine-d5): δ = 26.04 (C3), 27.48 (C10), 28.05 (C7), 30.36 (C2), 53.60
(C4), 55.45 (OMeA), 55.57 (OMeB), 59.71 (Cγ), 74.23 (Cα), 78.09 (C6), 80.37 (C9), 81.58 (Cβ),
111.91 (CA2), 112.77 (CB6), 114.90 (CA5), 116.73 (CB4), 120.02 (CB3), 120.56 (CB5), 120.70
(CA6), 127.53 (CA1), 146.61 (CA4), 147.30 (CA3), 147.72 (CB1), 150.13 (CB2), 155.65 (C5),
171.20 (C8), 171.38 (C1). Minor isomer (26%): 1H NMR (400 MHz, DMSO-d6/pyridine-d5):
δ = 1.97 (2H, m, H3), 4.56 (1H, m, Hβ), 6.08 (1H, m, Hα). 13C NMR (75.5 MHz, DMSO-
d6/pyridine-d5): δ = 26.31 (C3), 30.13 (C2), 53.86 (C4), 74.68 (Cα), 78.00 (C6), 80.24 (C9),
82.64 (Cβ), 111.35 (CA2), 115.21 (CA5), 128.32 (CA1), 146.72 (CA4), 147.47 (CA3), 148.42
(CB1), 149.98 (CB2), 154.74 (C5), 171.65 (C1).
37
QM-Ser, 8 (2-Benzyloxycarbonylamino-3-[3-hydroxy-1-(4-hydroxy-3-methoxy-phenyl)-2-(2-
methoxy-phenoxy)-propoxy]-propionic acid methyl ester). Pale white oil (39% after
purification). Theoretical mass: 555.21 g/mol (+ Na+: 578.20 g/mol). Observed m/z + Na+:
578.20. Major isomer (68%): 1H NMR (400 MHz, acetone-d6): δ = 3.65 (2H, m, H1), 3.70
(3H, s, H4), 3.76 (3H, s, OMeB), 3.81 (3H, s, OMeA), 3.81 (2H, m, Hγ), 4.32 (1H, m, Hβ), 4.43
(1H, m, H2), 4.60 (1H, m, Hα), 5.09 (2H, m, H6), 6.80 (1H, m, HB6), 6.82 (1H, m, HB3), 6.88
(1H, m, HA5), 6.89 (1H, m, HA6), 6.89 (1H, m, HB4), 6.92 (1H, m, HB5), 7.00 (1H, m, HA2),
7.15, (1H, m, H10), 7.38 (5H, m, H8, H9, H11, H12). 13C NMR (75.5 MHz, acetone-d6): δ =
52.35 (C4), 55.59 (C2), 56.20 (OMe), 61.74 (Cγ), 66.87 (C6), 69.92 (C1), 82.25 (Cα), 85.28
(Cβ), 111.70 (C10), 112.00 (CA2), 113.49 (CB5), 115.23 (CB6), 119.27 (CA5), 121.73 (CA6),
122.03 (CB3), 123.09 (CB4), 128.67 (C8, C12), 129.22 (C9, C11), 130.62 (CA1), 138.05 (C7),
147.23 (CA4), 148.17 (CA3), 149.20 (CB1), 151.76 (CB2), 157.06 (C5), 171.87 (C3). Minor
isomer (32%): 1H NMR (400 MHz, acetone-d6): δ = 3.72 (2H, m, Hγ), 3.79 (3H, s, OMeB),
3.83 (3H, s, OMeA), 4.55 (1H, m, Hα). 13C NMR (75.5 MHz, acetone-d6): δ = 55.42 (C2),
61.33 (Cγ) 69.47 (C1), 82.40 (Cα), 85.68 (Cβ), 111.94 (CA2), 119.15 (CA5), 130.19 (CA1),
147.29 (CA4), 148.95 (CB1), 156.95 (C5). Major isomer (68%): 1H NMR (400 MHz, DMSO-
d6/pyridine-d5): δ = 3.39 (1H, m, Hγ), 3.56-3.65 (2H, m, H1), 3.65 (3H, s, H4), 3.67 (1H, m,
Hγ), 3.65 (3H, s, OMeB), 3.72 (3H, s, OMeA), 4.41 (1H, m, H2), 4.46 (1H, m, Hβ), 4.59 (1H, m,
Hα), 5.08 (2H, s, H6), 6.73-6.83 (1H, m, HA5), 6.74-6.80 (1H, m, HA6), 6.80-6.87 (1H, m
HB4), 6.81 (1H, m, HB6), 6.82-6.93 (1H, m, HB5), 6.94-7.07 (1H, m, HA2), 6.97-7.06 (1H, m,
HB3), 7.25-7.38 (5H, m, H7-H12), 7.81 (1H, d, J = 8.42, NH), 9.23 (1H, s, A4-OH). 13C NMR
(75.5 MHz, DMSO-d6/pyridine-d5): δ = 54.20 (C1), 55.29 (OMeA), 55.38 (OMeB), 51.81 (C4)
60.08 (Cγ), 65.80 (C6), 67.90 (C1), 80.91 (Cα), 82.50 (Cβ), 111.48 (CA2), 112.75 (CB5), 114.82
(CA5), 115.06 (CB6), 116.38 (CB4), 120.57 (CA6), 120.59 (CB3), 127.80-128.38 (C7-C12),
128.72 (CA1), 137.05 (C7), 146.36 (CA4), 147.84 (CB1), 149.55 (CB2), 149.93 (CA3), 156.30
(C5), 170.81 (C3). Minor isomer (32%): 1H NMR (400 MHz, DMSO-d6/pyridine-d5): δ =
38
4.42 (1H, m, Hβ), 4.55 (1H, m, Hα). 13C NMR (75.5 MHz, DMSO-d6/pyridine-d5): δ = 80.69
(Cα), 82.90 (Cβ), 111.84 (CA2), 115.90 (CB4), 128.54 (CA1).
39
QM-Tyr, 9 (2-tert-Butoxycarbonlyamino-3-{4-[3-hydroxy-1-(4-hydroxy-3-methoxy-phenyl)-2-
(2-methoxy-phenoxy)-propoxy]-phenyl}-propionic acid methyl ester). Pale white oil (yield: 45%
after purification). Theoretical mass: 597.26 g/mol (+ H+: 598.27 g/mol). Observed m/z + H+:
598.29. 1H NMR (300 MHz, acetone-d6): δ = 1.33 (9H, d, J = 3.28 Hz, H13), 2.88 (1H, m, H7),
3.00 (1H, m, H7), 3.62 (3H, d, J = 3.74, H10), 3.78 (3H, s, OMeB), 3.81 (3H, s, OMeA), 3.81
(1H, m, Hγ), 3.91 (1H, m, Hγ), 4.31 (1H, m, H8), 4.55 (1H, m, Hβ), 5.13 (1H, d, J = 5.13, Hα),
6.07 (1H, d, J = 7.66, γ-OH), 6.78 (1H, m, HA5), 6.81 (1H, m, HB5), 6.87 (2H, m, H3, H5), 6.93
(1H, m, HB4), 6.95 (2H, m, HB3, HB6), 6.97 (1H, m, HA6), 7.06 (2H, m, H2, H6), 7.17 (1H, m,
HA2), 7.55 (1H, s, Ph-OH). 13C NMR (100 MHz, acetone-d6): δ = 28.42 (C13), 37.29 (C7),
52.07 (C10), 56.02 (C8), 56.17 (OMe), 61.19 (Cγ), 79.22 (C12), 79.42 (Cα), 85.40 (Cβ), 112.09
(CA2), 113.50 (CB6), 115.29 (CA5), 116.81 (C3, C5), 119.21 (CB5), 121.43 (CA6), 121.70
(CB3), 123.22 (CB4), 130.32 (CA1), 130.82 (C2, C4, C6), 147.12 (CA4), 148.13 (CA3), 149.10
(CB1), 151.82 (CB2), 156.10 (C11), 157.77 (C1), 173.22 (C9). 1H NMR (400 MHz, DMSO-
d6/pyridine-d5): δ = 1.27 (9H, s, H13), 2.71-2.95 (2H, m, H7), 3.62 (3H, s, H4), 3.67 (3H, s,
OMeB), 3.71 (3H, s, OMeA), 3.73 (1H, m, Hγ), 3.55 (3H, s, H10), 3.79 (1H, m, Hγ), 4.18 (1H, m,
H8), 4.66 (1H, m, Hβ), 5.08 (1H, s, γ-OH), 5.49 (1H, d, J = 4.20, Hα), 6.76 (1H, m, HA5), 6.81
(3H, m, HB3, H3, H5), 6.84 (1H, m, HB5), 6.89 (2H, m, HA6, HB6), 7.05 (2H, m, H2, H6), 7.06
(1H, m, HB4), 7.11 (1H, m, HA2), 9.21 (1H, s, A4-OH). 13C NMR (75.5 MHz, DMSO-
d6/pyridine-d5): δ = 28.10 (C13), 35.50 (C7), 51.60 (C10), 55.47 (OMeB), 55.50 (C8), 55.60
(OMeA), 59.78 (Cγ), 78.26 (C12), 78.30 (Cα), 82.69 (Cβ), 112.02 (CA2), 112.81 (CB6) 115.01
(CA5), 115.79 (C3, C5), 116.43 (CB4), 120.73 (CA6), 120.75 (CB3), 121.57 (CB5), 128.42
(CA1), 129.65 (C2, C6), 129.96 (C4), 146.35 (CA4), 147.37 (CA3), 148.03 (CB1), 150.06
(CB2), 156.30 (C1), 172.56 (C9).
41
QM-Thr, 10 (2-tert-Butoxycarbonylamino-3-[3-hydroxy-1-(4-hydroxy-3-methoxy-phenyl)-2-(2-
methoxy-phenoxy)-propoxy]-butyric acid methyl ester). Not experimentally observed. DFT-
calculated 1H NMR (syn-isomer, DMSO force field): 0.8 (H1), 3.2 (Hβ), 3.3 (Hγ), 3.3 (H3),
3.5 (OMeB), 3.6 (Hγ), 3.6 (H2), 3.6 (OMeA), 4.4 (Hα), 6.1 (HB6), 6.7 (HA5), 6.7 (HB3), 6.7
(HB5), 6.9 (HA2), 6.9 (HA6), 7.0 (HB4). DFT-calculated 13C NMR (syn-isomer, DMSO force
field): 15.2 (C1), 52.1 (OMeA), 52.1 (OMeB), 55.9 (Cγ), 60.7 (C3), 69.2 (C2), 70.7 (Cα), 88.5
(Cβ), 109.0 (CA2), 112.4 (CB3), 113.4 (CA5), 121.5 (CB5), 123.6 (CA6), 124.5 (CB6), 125.7
(CB4), 131.4 (CA1), 144.5 (CA4), 145.8 (CA3), 145.9 (CB1), 151.9 (CB2). DFT-calculated 1H
NMR (anti-isomer, DMSO force field): 0.9 (H1), 3.1 (Hγ), 3.2 (H3), 3.3 (H2), 3.3 (OMeA), 3.4
(Hβ), 3.5 (OMeB), 3.9 (Hγ), 4.3 (Hα), 4.8 (HB6), 6.4 (HB5), 6.6 (HB3), 6.8 (HA5), 6.8 (HA6),
6.8 (HB4), 7.2 (HA2). DFT-calculated 13C NMR (anti-isomer, DMSO force field): 51.9
(OMeB), 52.6 (OMeA), 73.7 (Cα), 60.9 (Cγ), 90.4 (Cβ), 112.1 (CB3), 113.9 (CA5), 114.3 (CA2),
121.2 (CA6), 121.5 (CB5), 122.4 (CB6), 123.8 (CB4), 130.1 (CA1), 144.8 (CA4), 146.1 (CA3),
149.3 (CB2), 149.3 (CB1).
QM-Hyp, 11 (4-[3-Hydroxy-1-(4-hydroxy-3-methoxy-phenyl)-2-(2-methoxy-phenoxy)-
propoxy]-pyrrolidine-1,2-dicarboxylic acid 1-tert-butyl ester 2-methyl ester). Not experimentally
observed. DFT-calculated 1H NMR (syn-isomer, DMSO force field): 3.2 (Hβ), 3.5 (OMeB),
3.6 (OMeA), 4.7 (Hα), 3.4 (Hγ), 3.5 (Hγ), 6.0 (HB6), 6.1 (H2), 6.6 (HA5), 6.7 (HB3), 6.7 (HB5),
6.8 (HA6), 7.0 (HB4), 7.2 (HA2), 7.2 (H4). DFT-calculated 13C NMR (syn-isomer, DMSO
force field): 52.0 (OMeB), 52.2 (OMeA), 56.3 (Cγ), 80.5 (Cα), 87.7 (Cβ), 105.1 (C3), 109.8
(CA2), 112.4 (CB3), 112.5 (C1), 113.0 (CA5), 119.1 (C4), 121.6 (CB5), 122.3 (CA6), 124.5
(CB6), 125.5 (CB4), 132.7 (CA1), 144.1 (CA4), 145.3 (CA3), 147.1 (CB1), 147.6 (C2), 151.2
(CB2). DFT-calculated 1H NMR (anti-isomer, DMSO force field): 3.4 (OMeA), 3.4 (Hγ), 3.5
(OMeB), 3.6 (Hβ), 3.7 (Hγ), 4.0 (H2), 4.6 (Hα), 4.8 (HB6), 6.1 (H4), 6.5 (HB5), 6.6 (HB3), 6.8
(HA5), 6.8 (HA6), 6.9 (HB4), 7.2 (HA2). DFT-calculated 13C NMR (anti-isomer, DMSO
42
force field): 51.9 (OMeB), 52.4 (OMeA), 60.9 (Cγ), 78.7 (Cα), 90.9 (Cβ), 95.7 (C3), 99.8 (C1),
112.2 (CB3), 112.3 (C4), 113.0 (CA2), 113.9 (CA5), 119.5 (CA6), 121.8 (CB5), 122.4 (CB6),
124.1 (CB4), 131.8 (CA1), 144.5 (CA4), 145.9 (CA3), 139.0 (C2), 149.1 (CB1), 149.2 (CB2).
2.3.4. Nuclear magnetic resonance spectroscopy
NMR spectra were collected in both acetone-d6 (spectra shown above) and DMSO-
d6/pyridine-d5 (4:1 v/v, 500 ul). DMSO-d6/pyridine-d5 was chosen because it is a preferred
solvent for NMR of lignin DHP, milled wood lignin (MWL), and whole cell walls; using the
same solvent system allows for accurate shift comparisons (Kim and Ralph 2010). In general,
negligible shift migration was observed between the two solvent systems. NMR spectra were
acquired on Bruker DPX-300 (300 MHz 1H resonance freq.), DRX-400 (400 MHz 1H resonance
freq.), AV-III-500 (500 MHz 1H resonance freq.) with a cryogenically-cooled probe and inverse
probe geometry (i.e. proton coils closest to sample), AV-III-600 (500 MHz 1H resonance freq.)
with a cryogenically-cooled probe, and AV-III-850 (850 MHz 1H resonance freq.) with a
cryogenically-cooled probe. Spectral processing was performed in Bruker's Topspin 3.1
software. Standard Bruker pulse programs were employed: 1H (8-16 scans), 13C (5k-10k scans),
HMQC (Bruker pulse program 'inv4gptp’, 64 scans), and HMBC (Bruker pulse program
'inv4gslplrnd’, 64 scans). Spectra were calibrated to the central solvent peaks (acetone: 2.05/29.8
ppm; dimethyl sulfoxide: 2.50/39.5 ppm). In the case of lignin DHP, NMR spectra were acquired
on a Bruker Biospin (Billerica, MA, USA) AVANCE 500 (500 MHz 1H resonance freq.)
spectrometer fitted with a cryogenically-cooled probe having inverse geometry, i.e., with the
proton coils closest to the sample. Spectra were processed with Bruker’s Topspin 3.1 software,
using the central solvent peak as internal reference (δH/δC: dimethyl sulfoxide (DMSO),
2.50/39.5 ppm). The synthetic lignin DHP (~50 mg) was placed in an NMR tube (ID: 4.1 mm),
swollen homogeneously in DMSO-d6/pyridine-d5 (4:1 v/v, 500 ul) with the aid of
ultrasonication (~3h), and then subjected to adiabatic 2D-HSQC (‘hsqcetgpsisp2.2’) experiments
using the parameters described by Mansfield et al. (2012). Processing used typical matched
Gaussian apodization in F2 (LB = -0.3, GB = 0.001), and squared cosine-bell and one level of
linear prediction (32 coefficients) in F1 (Mansfield et al. 2012).
2.3.5. Mass spectrometry
Exact masses for compounds 3-6 (see online resource) were calculated using
ChemBioDraw Ultra 13.0. Mass spectrometric analysis was performed on a Waters LCT Premier
time-of-flight (TOF) mass spectrometer (Waters Corporation (Micromass Ltd.), Manchester,
UK), using MassLynx™ software Version 4.0. Samples were introduced using a Waters 2695
high performance liquid chromatograph. Sample analysis utilized flow injection analysis (FIA).
The mobile phase used was 90% acetonitrile (LC-MS grade) and 10% aqueous ammonium
acetate (10mM). The flow rate was 0.25 mL/min. The nitrogen drying gas temperature was set to
300 °C at a flow of 7 L/min. The capillary voltage was 2200 V. The mass spectrometer was set
to scan from 100-1000 m/z in positive ion mode, using electrospray ionization (ESI).
43
2.3.6. Computational methods
Eight conformational isomers of QM-Cys, QM-Thr, and QM-Hyp, and sixteen
conformational isomers of QM-His were built using Materials Studio 6.0 (Accelrys Inc., San
Diego, CA). Eight of the QM-His models exhibited a CαQM-N1His bond and eight models
exhibited CαQM-N3His bond; these models allowed us to determine which CαQM-NHis bond was
occurring and to determine if an observed chemical shift (α13C) at 78.8 ppm was due to a C-N
bond. Each set of eight models (i.e., compound 3 (QM-Cys), compound 5a (QM-His(N3)),
compound 5b (QM-His(N1)), compound 10 (QM-Thr), or compound 11 (QM-Hyp)) contained
two of each of the stereoisomers (R,R), (SS), (R,S), and (S,R), where the former two
stereoisomers are syn and the latter two stereoisomers are anti. These models were built to
determine if the calculated NMR chemical shifts could differentiate the observed shifts for the
syn and anti stereoisomers of QM-His and QM-Cys. Experimental NMR shifts for QM-Thr and
QM-Hyp were not obtained because Thr and Hyp did not react with the QM; however, we
reported the calculated shifts for these compounds (below) as potential references for other
researchers to use.
Each model was energy minimized without symmetry or atomic constraints using the
density functional theory (DFT) method M05-2X, coupled with the 6-311++G(2df,2p) basis set
using the program Gaussian 09 (Curtiss et al. 2001; Frisch et al. 2009; Hohenberg and Kohn
1964; Kohn and Sham 1965; Zhao et al. 2006). Following the geometry optimization
calculations, frequency calculations assured that each model attained a potential energy surface
(PES) minimum, where no imaginary frequencies were present (Frisch et al. 2009).
Subsequent gauge-independent atomic orbital (GIAO) calculations using Gaussian 09 at
the mPW1PW91/6-31G(d) theory level provided the NMR magnetic shielding tensors (α13C and
α1H) for the energy-minimized structures (Adamo and Barone 1998; Buhl et al. 1999;
Cheeseman et al. 1996; Karadakov 2008; Lodewyk et al. 2012; Schreckenbach and Ziegler 1995;
Wolinski et al. 1990). Because our experiments were conducted in dimethylsulfoxide (DMSO),
the GIAO calculations were also performed in a dielectric continuum of DMSO using a self-
consistent reaction field (SCRF) and the integral equation formalism variant of the polarized
continuum model (IEFPCM) (Cances et al. 1997; Gogonea 1998). Note that the structures were
not energy minimized within the polarized continuum because prior work showed that doing so
did not improve the precision of the calculations (Watts et al. 2011). A multi-standard NMR
method using benzene for sp2-hybridized C- and H-atoms, and methanol for sp3-hybridized C-
and H-atoms led to the α13C and α1H results (Sarotti and Pellegrinet 2009; Sarotti and Pellegrinet
2012; Watts et al. 2011). Benzene and methanol were energy minimized using M05-2X/6-
311++G(2df,2p) and underwent subsequent GIAO calculations using mPW1PW91/6-31G(d).
The precision of the multi-standard method versus the single-standard method (e.g.,
tetramethylsilane as the standard) is illustrated when comparing single-standard results recently
reported by Mostaghni et al. (2013) with the multi-standard results of Watts et al. (2011). Both
44
groups reported the δ13C for β-O-4 linkages in lignin model compounds; however, the mean
unsigned errors, root-mean-squared errors, and maximum errors reported by Mostaghni et al.
(2013) were approximately 10, 12, and 23 ppm, while those reported by Watts et al. (2011) were
approximately 2, 3, and 8 ppm. Therefore, the multi-standard method produced results that were
more precise than those produced by the single-standard method for lignin model compounds
with β-O-4 linkages.
For each C- or H-nucleus, we used δxcalc= σref - σcalc + δref to calculate the chemical shift
of each H- and C-nucleus of interest (δxcalc) in the GG-amino acid models (Sarotti and Pellegrinet
2009; Sarotti and Pellegrinet 2012). Here, σref is the calculated tensor of the C- or H- nucleus of
the standard (i.e., methanol or benzene), σcalc is the calculated tensor of the nucleus of interest
from the GG-amino acid model, and δref is the experimental chemical shift of the C- and H-
nuclei in benzene or methanol dissolved in DMSO (Gottlieb et al. 1997). The chemical shifts for
each C- and H-nucleus was thermodynamically weighted using the relative, calculated Gibbs
free energy of each model to account for the thermodynamic abundance of each model (Barone
et al. 2002). The calculated δ13C and δ1H results were then correlated with their respective NMR
data.
2.4. Results and discussion
2.4.1. Preparation of quinone methide-amino acid adducts
A lignin β-ether QM 2 was prepared cleanly from guaiacylglycerol-β-guaiacyl ether 1, as
previously described (Kawai et al. 1999; Landucci et al. 1981; Ralph and Young 1983). One of
nine amino acids bearing a nucleophilic side-group was then added to the QM, with each
reaction monitored by thin layer chromatography. It was observed that amino acids with amine-
containing side-chains (Lys and His) reacted with the QM quickly (within minutes), whereas
thiol-, acid-, and hydroxyl-containing amino acids reacted slowly (over hours or days). In the
case of the secondary hydroxyl-containing amino acids (Thr and Hyp) no cross-coupling was
observed (i.e., compounds 10 and 11 did not form), despite attempts to catalyze the cross-
coupling reaction (refer to the electronic supplement for detailed reaction protocols). Products
were purified via column chromatography and yields ranged from quantitative in the case of
compound 3 (QM-Lys) to zero (no reaction) in the cases of compounds 10 and 11 (QM-Thr and
QM-Hyp). Cross-coupling reactions were carried out in dichloromethane to produce the desired
lignin-protein adducts.
46
Fig 2.3. QM-AA model compounds. Lignin-cysteine (QM-Cys) 3, lignin-lysine (QM-Lys) 4,
lignin-histidine (QM-His) 5, lignin-aspartic acid (QM-Asp) 6, lignin-glutamic acid (QM-Glu) 7,
lignin-serine (QM-Ser) 8, lignin-Tyrosine (QM-Tyr) 9, lignin-threonine (QM-Thr) 10, and
lignin-hydroxyproline (QM-Hyp) 11 adducts derived from QM 2
2.4.2. Solution-state NMR of compounds 3-9 and density functional theory calculations for
compounds 10 and 11
Reaction products were characterized using solution-state 1D 1H and 13C NMR, as well as
2D heteronuclear multiple quantum coherence (HMQC) and heteronuclear multiple-bond
correlation (HMBC) experiments. Full spectral assignments for compounds 3-9 are given in the
methods sections (sections 2.3.3 and 2.3.4). Interpretation of these results is consistent with
structures 3-9 (Fig 2.3), indicating that Cys, Lys, His, Asp, Glu, Ser and Tyr all add to QM 2 in
vitro. Density functional theory (DFT) was used to predict NMR shifts for compounds 10 (QM-
Thr) and 11 (QM-Hyp), which did not form under the synthetic conditions employed here.
Table 2.1 shows the lignin α and β 1H and 13C shifts for compounds 3-11. The γ-shifts of
these compounds are almost entirely degenerate and are therefore considered non-diagnostic.
Because threonine and hydroxyproline are abundant in cell wall structural proteins (especially
hyp, which can account for up to 33% of the amino acid profiles of some structural proteins), the
authors perceived that estimations of the QM-Thr and QM-Hyp NMR chemical shifts could still
be useful. Thus, NMR shifts for compounds 10 and 11 were calculated using DFT. As a control,
DFT was also used to calculate NMR shifts for compounds 3 and 5 (Fig 2.4), showing
comparison to experimental results. Calculated 13C shifts were generally in agreement with
experimentally observed shifts. For example, calculated 13C α-shifts overestimated the observed
shifts by only 0.8-3.1 ppm. Calculated 13C β-shifts overestimated the observed shifts by 5.4-9.8
ppm. Similar discrepancies in DFT calculated β-shifts of β-ether compounds have been
previously reported, and further work is necessary to refine these calculations (Watts et al. 2011).
Calculated 1H shifts consistently underestimated the experimentally observed shifts by about 0.5-
1 ppm. Thus, the calculated 1H shifts for compounds 10 and 11 are not reproducing the observed 1H shifts; however it could be possible with future work to develop a method to correlate the
calculated and experimental 1H shifts, because of the consistent underestimation of the
experimental 1H shifts by the calculated shifts. Lodewyk et al. (2012) described a method for
using empirical scaling factors to obtain improved correlation between experimental and
calculated 1H and 13C shifts; however, doing so is beyond the scope of the present work. In
addition to the use of scaling factors, further research to develop multi-standard methods that are
based on DFT results is necessary. This work could require the development and assessment of
DFT methods, as well as basis sets to obtain methods to calculate 1H shifts more precisely.
47
Table 2.1. 1H and 13C NMR chemical shifts for lignin-amino acid adducts.
α-shifts β-shifts
Experimental Calculated Experimental Calculated
Compound 1H/13C 1H/13C 1H/13C 1H/13C
3 (QM-Cys)a 4.3/50.3
4.4/50.6
3.8/53.4
3.5/52.9
4.7/81.8
4.6/81.7
3.7/89.6
4.0/87.1
4 (QM-Lys) 3.9/63.0 4.2/85.9
5 (QM-His)b 5.7/60.2 5.0/61.0 5.0/80.2 3.9/90.0
6 (QM-Asp)a 6.0/74.8
6.1/75.1
4.7/81.4
4.6/82.7
7 (QM-Glu)a 6.0/74.2
6.1/74.7
4.7/81.6
4.6/82.6
8 (QM-Ser)a 4.6/80.9
4.6/80.7
4.4/82.5
4.4/82.9
9 (QM-Tyr) 5.5/78.3 4.7/82.7
10 (QM-Thr)c n/a 4.4/70.7
4.3/73.7 n/a
3.2/88.5
3.4/90.4
11 (QM-Hyp)c n/a 4.7/80.5
4.6/78.7 n/a
3.2/87.7
3.6/90.9
Key: a, products exhibited two stereoisomers, shifts for the major isomer are shown first; b, only
the calculated shifts of anti-5b are shown, see the electronic supplement for calculated shifts of
additional isomers of 5; c, syn-isomer shifts are shown first.
48
Fig 2.4. Overlaid HMQC side chain regions of compounds 3 and 5. The α- and β-shifts are
labeled; methoxyl and γ-shifts are not labeled due to substantial shift degeneracy. Grey shifts are
non-diagnostic. DFT calculated α-shifts (red squares) and β-shifts (blue squares) are shown for
compounds 3, 5a and 5b (both threo and erythro stereoisomers are shown). Calculated 13C shifts
correlate relatively well with experimentally observed 13C shifts, though not well enough to
allow for assignment of stereochemistry in the experimentally observed product shifts.
Calculated 1H shifts are underestimated by about 0.5-1.0 ppm. Further research is necessary to
refine the predicative abilities of DFT for 1H shifts of lignin compounds.
Fig 2.5 highlights the location of diagnostic HMQC NMR peak contours of the lignin-amino
acid adducts overlaid on the spectrum of a synthetic lignin (a so-called dehydrogenation
polymer, or DHP). Differences in chemical shifts among the lignin-amino acid adducts are most
salient for the α-positions and, as expected, less for those from the β-positions. Most of the
lignin-amino acid shifts are readily distinguishable from correlations of native structures in
lignin; however, the α-shifts of compound 4 (QM-Lys) are degenerate with phenylcoumaran γ-
shifts. In this case, identifying a lignin-lysine crosslink may be possible by observing the lignin-
lysine β-shifts. The α- and β-shifts of compound 9 (QM-Tyr) are degenerate with benzyl aryl
ether linkages (so called α-O-aryl linkages) sometimes observed in synthetic and native lignin
polymers. These lignin-lignin linkages form when QMs are quenched by phenolic moieties, and
degeneracy is not surprising given the structural similarities among tyrosine and the lignin
monomers, p-coumaryl, coniferyl, and sinapyl alcohols. This may make it difficult to distinguish
lignin-tyrosine crosslinking from lignin-lignin α-O-aryl linkages in native lignins.
Though not depicted graphically, the lignin-peptide linkages described herein are largely free
from overlap with previously described polysaccharide shifts in both angiosperms and
49
gymnosperms. However, a few of the lignin-amino acid shifts may overlap with signatures
attributed to lignin-carbohydrate linkages. For example, the α-shifts of compounds 6 and 7
exhibit degeneracy with lignin-carbohydrate benzyl esters (α-shifts at 6.1/75.0 ppm) due to
structural similarity (Balakshin et al. 2011; Toikka et al. 1998). Likewise, the α-shifts of 8 and 11
exhibit degeneracy with lignin-carbohydrate benzyl ethers (α-shifts located at 4.6/80.5 ppm)
(Balakshin et al. 2011; Toikka et al. 1998). Thus, caution should be exercised when attempting to
discern certain lignin-protein and lignin-carbohydrate linkages using 1D and 2D NMR
techniques. The results of the current study indicate that NMR identification of lignin-protein
linkages, especially linkages of the benzyl thioether and benzyl amine types, should be possible
in whole cell walls or lignin extracts provided the linkages are adequately abundant (Kim and
Ralph 2010; Mansfield et al. 2012).
Fig 2.5. HSQC NMR spectrum of a lignin DHP with overlaid α- and β-correlation data from
compounds 3-11 represented by red (α) and blue squares (β)
50
2.4.3. Adduct isomer determination
Of purely fundamental interest, we attempted to resolve the stereochemistry of the products
by the use of DFT, but these efforts were largely unsuccessful. For example, in the case of QM-
Cys, 3, the root mean-squared error (RMSE) between experimental and calculated shifts was too
large to reliably assign the isomers (Table 2.2). Although it may have been possible to improve
the DFT results through the addition of conformational isomers, the added computational cost
may not have reduced the calculated RMSE to experimental uncertainty levels. Hence additional
attempts to resolve stereoisomers (compounds 6, 7, 8) were abandoned; likewise, DFT was not
used to identify which stereoisomer was produced in 4 and 9 (only one product was observed in
each case). Previously, addition of primary amines were shown (via diagnostic NMR of
tetrahydro-1,3-oxazine derivatives) to strongly (>90%) favor formation of the syn-isomer (Ralph
and Young 1983), so product 4 is likely syn.
Table 2.2. Observed and DFT calculated α-13C NMR chemical shifts for compound 3.
α-13C chemical shifts (ppm)
Observed Calculated RMSE
50.27 52.90 - (3, syn) 2.7
50.60 53.40 - (3, anti) 2.8
In the case of 5 (QM-His), one α-shift was observed, occurring at 5.7/60.2 ppm. The His
system is an interesting one to consider given the tautomerization in the His imidazole group and
the potential for various regio-isomeric products (Nagy et al. 2005). In the HMQC and HMBC
spectra (see methods section) the α-1H shows correlations to positions 2 and 5 of the imidazole
ring, though α-1H correlations to position 5 are weak and partially degenerate with correlations to
carbon A6. The NMR results suggest the formation of both compounds 5a and 5b, resulting from
either N1 or N3 addition, but quantification of these compounds via NMR was rendered
impossible due to the aforementioned shift degeneracy. The Gibbs free energy-based Boltzmann
factors in the gas-phase suggested that compound 5b is thermodynamically prevalent relative to
compound 5a (93.5% to 6.5%, respectively), and prior work by Watts et al. (2011) suggested that
models with greater thermodynamic abundance generally provided α-13C results that were better
correlated with experimental NMR data.
2.5. Conclusions
This study is the first to report on the synthesis of lignin-protein model compounds and
contributes to the growing lignin NMR database. QM-amino acid adducts were synthesized and
characterized. Namely, Cys, Lys, His, Asp, Glu, Ser, Tyr, Thr, and Hyp were reacted with a
lignin model quinone methide—an important intermediate in lignification. The selected quinone
methide 2 represents the structure and reactivity of QMs native to lignin. The amino acids were
51
selected because of their nucleophilic side-groups; furthermore, these amino acids are common
in plant cell wall structural proteins and represent functional groups (amines, thiols, acids, and
alcohols) that are known to react with quinone methides (Awad et al. 2000; Bolton et al. 1997;
Ramakrishnan and Fisher 1983). The selected amino acids quenched the QM with varying
efficiencies (in general, amine > thiol > acid > hydroxyl) under neutral organic solvent
conditions. The secondary alcohols (Thr, Hyp) did not react under the selected conditions.
Using the results from these model compounds to identify any lignin-protein crosslinks in
planta is our goal. Based on the results herein, lignin-protein NMR shifts should be well
dispersed and, in most cases, distinct even within the complex NMR spectra of polymerized
lignin (Fig 2.5). This suggests that the linkages may be detectable in planta if they exist in
significant quantities.
Although density functional theory was used to predict NMR chemical shifts of lignin-
protein crosslinks, the calculated chemical shifts did not display the level of accuracy required to
distinguish stereoisomers. Future studies are needed to improve the correlation between these
DFT calculations and experimentally observed shifts.
2.6. Acknowledgements
This research was supported as part of The Center for Lignocellulose Structure and
Formation, an Energy Frontier Research Center funded by the U.S. Department of Energy,
Office of Science, Office of Basic Energy Sciences under Award Number DE-SC0001090, and
the DOE Great Lakes Bioenergy Research Center (DOE Office of Science BER DE-FC02-
07ER64494). The authors would like to thank and acknowledge the Center for Lignocellulose
Structure and Formation (CLSF) and the members thereof. Student fellowships were provided by
the USDA National Needs Program and the National Science Foundation. The authors would
like to thank Dr. Alan Benesi and Dr. Wenbin Luo for assistance in acquiring NMR spectra of
the lignin model compounds, Dr. James Miller for acquiring mass spec data, and Dr. Josh
Stapleton for providing assistance with UV/Vis. The primary author would also like to
acknowledge Paul Munson and Curtis Frantz for valuable discussion, and valuable interactions
with Dan Gall and other members of the Wisconsin lab.
2.7. References
Adamo, C.; Barone, V. J. Chem. Phys. 1998, 108, 664–675.
Albersheim, P.; Darvill, A.; Roberts, K.; Sederoff, R.; Staehelin, A. 2010. Principles of Cell Wall
Architecture and Assembly, in: Plant Cell Walls. Garland Science, New York, New York, pp.
227-272.
Awad, H. M.; Boersma, M. G.; Vervoort, J.; Rietjens, I. M. C. M. Arch. Biochem. Biophys. 2000,
378, 224-233.
52
Barone, G.; Duca, D.; Silvestri, A.; Gomez-Paloma, L.; Riccio, R.; Bifulco, G. Chem.--Eur. J.
2002, 8(14), 3240–3245.
Beat, K.; Templeton, M. D.; Lamb, C. J. Proc. Natl. Acad. Sci. USA 1989, 86, 1529-1533.
Balakshin, M.; Capanema, E.; Gracz, H.; Chang, H.; Jameel, H. Planta 2011, 233, 1097-1110.
Boerjan, W.; Ralph, J.; Baucher, M. Annu. Rev. Plant Biol. 2003, 54, 519-546.
Bolton, J. L.; Turnipseed, S. B.; Thompson, J. A. Chem.-Biol. Interact. 1997, 107, 185-200.
Buhl, M.; Kaupp, M.; Malkina, O.L.; Malkin, V.G. J. Comput. Chem. 1999, 20, 91–105.
Cances, E.; Mennucci, B.; Tomasi, J. J. Chem. Phys. 1997, 107(8), 3032–3041.
Capanema, E.A.; Balakshin, M.Y.; Kadla, J.F. J. Agric. Food Chem. 2004, 52, 1850-1860.
Cassab, I. G.; Varner, J. E. Ann. Rev. Plant Physiol. Plant Mol. Biol. 1988, 39, 321-353.
Chapple, C.; Ladisch, M.; Meilan, R. Nat. Biotechnol. 2007, 25, 746-748.
Cheeseman, J.R.; Trucks, G.W.; Keith, T.A.; Frisch, M.J. J. Chem. Phys. 1996, 104, 5497–5509.
Chen, F.; Dixon, R. A. In Vitro Cell. Dev. Biol.: Anim. 2008, 44, S28-S29.
Cosgrove, D. J. Nat. Rev. Mol. Cell Biol. 2005, 6, 850-861.
Curtiss, L.A.; Redfern, P.C.; Raghavachari, K.; Pople, J.A. J. Chem. Phys. 2001, 114(1), 108–
117.
Frisch, M.J.; Trucks, G.W.; Schlegel, H.B.; Scuseria, G.E.; Robb, M.A.; Cheeseman, J.R.;
Montgomery, Jr., J.A. 2009. Gaussian 09, Revision B.01. Gaussian, Inc., Wallingford,
Connecticut.
Gogonea, V. 1998. Self-Consistent Reaction Field Methods: Cavities, in: Schleyer, P.v.R.,
Schreiner, P.R., Allinger, N.L., Clark, T., Gasteiger, J., Kollman, P., Schaefer III, H.F. (Eds.),
Encyclopedia of Computational Chemistry. Wiley, New York, New York, pp. 2560–2574.
Gottlieb, H.E.; Kotlyar, V.; Nudelman, A. J. Org. Chem. 1997, 62(21), 7512–7515.
Harrak, H.; Chamberland, H.; Plante, M.; Bellemare, G.; Lafontaine, J. G.; Tabaeizadeh, Z. Plant
Phys. 1991, 121, 557-564.
Hohenberg, P.; Kohn, W. Phys. Rev. 1964, 136(3B), B864-B871.
Jose, M.; Puigdomenech, P. New Phytol. 1993, 125, 259-282.
53
Jung, H. G.; Allen, M. S. J. Anim. Sci. 1995, 73, 2774-2790.
Jung, H. G. Agron. J. 1989, 81, 33-38.
Karadakov, P.B. 2008. Ab Initio Calculation of NMR Shielding Constants, in: Webb, G.A. (Ed.),
Modern Magnetic Resonance. Springer, New York, New York, pp. 63–70.
Kawai, S.; Okita, K.; Sugishita, K.; Tanaka, A.; Ohashi, H. J. Wood Sci. 1999, 45, 440-443.
Kieliszewski, M.; Lamport, D. T. A.; Tan, L.; Cannon, M. C. Annu. Plant Rev. 2011, 41, 321-
342.
Kim, H.; Ralph, J. Org. Biomol. Chem. 2010, 8, 576-591.
Kohn, W.; Sham, L.J. Phys. Rev. 1965, 140(4A), A1133–A1138.
Landucci, L. L.; Geddes, S. A.; Kirk, T. K. Holzforschung 1981, 35, 66-69.
Leary, G. J. Wood Sci. Technol. 1980, 14, 21-34.
Leary, G.; Miller, I. J.; Thomas, W.; Woolhouse, A. D. J. Chem. Soc., Perkin Trans. 2 1977, 13,
1737-1739.
Liang, H.; Frost, C.J.; Wei, X.; Brown, N.R.; Carlson, J.E.; Tien, M. Clean 2008, 36(8), 662-
668.
Li, X.; Weng, J. K.; Chapple, C. Plant J. 2008, 54, 569-581.
Lodewyk, M.W.; Siebert, M.R.; Tantillo, D.J. Chem. Rev. 2012, 112(3), 1839–62.
Mansfield, S. D.; Kim, H.; Lu, F.; Ralph, J. Nat. Protoc. 2012, 7(9), 1579-1589.
McQueen-Mason, S.; Cosgrove, D. J. Proc. Natl. Acad. Sci. USA 1994, 91, 6574-6578.
Miyagawa, Y.; Takemoto, O.; Takano, T.; Kamitakahara, H.; Nakatsubo, F. Holzforschung 2012,
66, 459-465.
Mostaghni, F.; Abbas T.; Seyed, A.M. Spectrochimica Acta Part A: Molecular and Biomolecular
Spectroscopy. 2013, 110, 430-436.
Nagy, P. I.; Tejada, F. R.; Messer, W. S. Jr. J. Phys. Chem. 2005, 109, 22588-22602.
Ralph, J.; Lundquist, K.; Brunow, G.; Lu, F.; Kim, H.; Schatz, P. F.; Marita, J. M.; Hatfield, R.
D.; Ralph, S. A.; Christensen, J. H. Phytochem. Rev. 2004, 3, 29-60.
Ralph, J.; Schatz, P. F.; Lu, F.; Kim, H.; Akiyama, T.; Nelsen, S. F. 2009. Quinone Methides in
Lignification, in: Rokita, S. E. (Ed.), Quinone Methides. John Wiley & Sons, Hoboken, New
Jersey, pp. 385-420.
54
Ralph, J.; Young, R. A. J. Wood Chem. Technol. 1983, 3(2), 161-181.
Ralph, S.A., Ralph, J., Landucci, L.L. NMR Database of Lignin and Cell Wall Model
Compounds. Available at URL http://ars.usda.gov/Services/docs.htm?docid=10491 (November
2004).
Ramakrishnan, K.; Fisher, J. J. Am. Chem. Soc. 1983, 105, 7187-7188.
Ryser, U.; Schorderet, M.; Zhao, G.; Studer, D.; Ruel, K.; Hauf, G.; Keller, B. The Plant J. 1997,
12(1), 97-111.
Sarotti, A.M.; Pellegrinet, S.C. J. Org. Chem. 2009, 74(19), 7254–7260.
Sarotti, A.M.; Pellegrinet, S.C. J. Org. Chem. 2012, 77(14), 6059–65.
Schreckenbach, G.; Ziegler, T. J. Chem. Phys. 1995, 99(2), 606–611.
Stewart, J. J.; Kadla, J. F.; Mansfield, S. D. Holzforschung 2006, 60, 111-122.
Terashima, N.; Atalla, R.H.; Ralph, S.A.; Landucci, L.L.; Lapierre, C.; Monties, B.
Holzforschung 1995, 49, 521-527.
Toikka, M.; Jussi, S.; Teleman, A.; Brunow, G. J. Chem. Soc., Perkin Trans. 1. 1998, 1, 3813-
3818.
Vanholme, R.; Morreel, K.; Ralph, J.; Boerjan, W. Plant Phys. 2010, 153, 895-905.
Watts, H. D.; Mohamed, M. N. A.; Kubicki, J. D. J. Phys. Chem. B. 2011, 115(9), 1958–1970.
Wolinski, K.; Hinton, J.F.; Pulay, P. J. Am. Chem. Soc. 1990, 112(23), 8251–8260.
Xu, Y.; Chen, C.; Thomas, T.P.; Azadi, P.; Diehl, B.; Tsai, C.; Brown, N.; Carlson, J.E.; Tien,
M.; Liang, H. Plant Cell Rep. 2013, 32, 1827-1841.
Yuan, T.; Sun, S.; Xu, F.; Sun, R. J. Agric. Food Chem. 2011, 59, 10604-10614.
Zhao, Y.; Schultz, N.E.; Truhlar, D.G. J. Chem. Theory Comput. 2006, 2(2), 364–382.
55
Chapter 3
Lignin crosslinks with peptides under biomimetic conditions
(Target journal for publication is Biomacromolecules)
3.1. Abstract
The work presented here investigates the crosslinking of various nucleophilic amino acids with
lignin under aqueous conditions, thus providing insight as to which amino acids might crosslink
with lignin in planta. Lignin dehydrogenation polymer (DHP) was prepared in aqueous solutions
that contained peptides with the general structure XGG, where X represents an amino acid with a
nucleophilic side chain. Fourier-transform infrared spectroscopy and energy dispersive X-ray
spectroscopy showed that peptides containing cysteine and tyrosine were incorporated into the
DHP, while peptides containing other nucleophilic amino acids were not. Scanning electron
microscopy showed that the physical morphology of the DHP was altered by the presence of
peptides, regardless of peptide incorporation. Nuclear magnetic resonance (NMR) spectroscopy
showed that cysteine-containing peptide crosslinked with lignin at the lignin α-position, whereas
in the case of the lignin-tyrosine adduct the exact crosslinking mechanism could not be
determined. This is the first study to use NMR to confirm crosslinking between lignin and
peptides under biomimetic conditions. The results of this study may indicate the potential for
lignin-protein linkage formation in planta, particularly between lignin and cysteine and/or
tyrosine-rich proteins.
3.2. Introduction
Lignin is an abundant, aromatic biopolymer that forms in the lignocellulosic matrices of
plant cell walls. Its free radical polymerization mechanism and heterogeneous nature make it
unique within the plant kingdom. Lignin is economically important to the pulp and paper
industries, the agricultural industries, and the biofuels and biorenewables industries, all of whom
are hampered by its recalcitrance against extraction and/or degradation (Boerjan, 2003; Stewart,
2006; Chen, 2008; Li, 2008; Chapple, 2007; Jung, 1989; Jung, 1995). Many aspects of
lignification are still poorly understood, in spite of its abundance and economic relevance. For
example, the extent to which lignin interacts with surrounding cell wall polymers, particularly
proteins, is largely unknown.
It is understood that lignin forms covalent crosslinks with plant cell wall components,
particularly hemicelluloses (Balakshin, 2011; Miyagawa, 2012; Toikka, 1998; Yuan, 2011). One
prevalent mechanism for lignin-carbohydrate linkage formation is through the reaction of a
nucleophilic moiety (e.g., an hydroxyl or carboxylic acid group) with the electrophilic α-carbon
of the lignin quinone methide (QM) intermediate (Leary, 1980; Ralph, 2009). The crosslinking
of lignin with other cell wall components, such as proteins, has not been well investigated,
despite the fact that lignin-protein linkages may play important roles in wild type and transgenic
plant lines. In most wild type plant lines the pattern of lignin deposition indicates the presence of
so-called nucleation sites within specific regions of the plant cell wall (e.g., the cell corners), but
the nature of these nucleation sites remains unknown (Boerjan et al., 2003). It has been suggested
that nucleation sites may be rich in structural proteins, perhaps leading to lignin-protein
56
crosslinking, but this hypothesis has not been adequately tested. Furthermore, lignin-protein
linkages may affect the physical and chemical properties of transgenic plant lines. For example, a
recently engineered line of Populus secretes a tyrosine-rich peptide into the cell wall. Increased
sugar extractability was observed in these Populus lines upon protease digestion of the walls, and
it was hypothesized that this was due to lignin-protein linkage formation. However, the putative
lignin-protein linkages have yet to be identified (Liang et al., 2008; Xu et al., 2013). Diehl et al.
(2014) recently showed that amino acids bearing nucleophilic side chains, namely Cys, Lys, His,
Asp, Glu, Ser, and Tyr all react with a lignin model QM in dichloromethane. The study identified
diagnostic NMR shifts of lignin-peptide compounds, but did not investigate the propensity for
such linkages to form under biomimetic conditions (i.e., conditions of higher molecular weight
lignin formation with peptides in aqueous media). In order to expand upon these results, the
work described here investigates the propensities for various amino acids (in peptide chains) to
crosslink with lignin dehydrogenation polymer, which is a biomimetic lignin model compound
(Terashima et al., 1995). It is anticipated that this will assist in future studies to help elucidate the
interactions between lignin and proteins in planta.
In order to investigate the propensity for lignin-peptide crosslinking under biomimetic
conditions, lignin dehydrogenation polymer (DHP) was prepared in aqueous solutions containing
peptides. Each peptide had the general structure X-glycine-glycine (XGG), with X being cysteine
(C), lysine (K), histidine (H), aspartic acid (D), glutamic acid (E), serine (S), tyrosine (Y),
threonine (T), or hydroxyproline (Hyp). These amino acids were previously identified as being
reactive (or potentially reactive in the case of T and Hyp) toward lignin QMs (Diehl et al., 2014).
The general peptide structure and predicted mode of lignin-peptide crosslinking is shown in Fig
3.1. The C-termini and N-termini of the peptides were blocked via amidation and esterification,
respectively, to ensure that the amino acid of interest (i.e., residue X) contained the only
nucleophilic moiety. Glycine was chosen as the "place holder" residue due to its expected lack of
reactivity toward lignin. The lengths of the peptides were limited to three residues because
reaction of larger peptides with DHPs results in the formation of lignin-peptide complexes that
are insoluble and thus difficult to characterize (e.g., liquid state NMR becomes impractical)
(results not yet published). Peptides were added in 25% mol/mol ratio to the lignin monomer
(coniferyl alcohol) because it was previously reported that lignin DHPs contain between 20 and
30% β-ether linkages (Tobimatsu, 2012). Thus, the ratio of nucleophilic amino acids to lignin β-
ether QMs was expected to be approximately 1:1 over the course of the polymerization reaction.
57
Fig 3.1. Proposed lignin-peptide crosslinking mechanism.
Lignin-peptide crosslinks form when nucleophilic side chains of amino acids react with quinone
methides formed during lignin β-ether coupling. R = H or OMe, L = lignin.
Fourier-transform infrared spectroscopy (FT-IR), scanning electron microscopy (SEM),
energy dispersive X-ray spectroscopy (EDS), and nuclear magnetic resonance spectroscopy
(NMR) were used to characterize the lignin-peptide adducts. FT-IR and, more recently, NMR,
have become staples of lignin characterization (Capanema et al., 2004; Faix, 1988; Kim and
Ralph, 2010). Multidimensional NMR techniques (e.g., heteronuclear single quantum coherence
(HSQC)) are particularly useful because the shift degeneracies observed in 1D spectra are largely
eliminated. Furthermore, diagnostic NMR shifts of lignin-peptide model compounds have
previously been assigned (Diehl et al., 2014). SEM imaging of synthetic and native lignins has
not garnered much research attention, but the technique was employed here in order to monitor
morphological differences between neat DHP and the lignin-peptide adducts (Micic et al., 2003;
Terashima et al., 2004). It was convenient to also collect EDS elemental analysis data while the
lignin-peptide samples were in the SEM instrument, with the presence of nitrogen suggesting
peptide incorporation because neat lignin contains only carbon, hydrogen, and oxygen. Through
the use of these techniques, this study provides new insights into the propensities and
mechanisms of lignin-peptide linkage formation. It is expected that this will be useful toward the
continued study of lignin formation in both native and mutant plant lines.
3.3. Experimental
3.3.1. Materials
All chemicals necessary for DHP preparation were purchased from Sigma with the
exception of the peptides (>95% purity), which were purchased from Peptide 2.0
(www.peptide2.com).
3.3.2. Synthesis of lignin DHP and lignin-peptide adducts
Guaiacyl-based DHP was synthesized according to a previously published method in
sodium phosphate buffer (pH 6.5) using coniferyl alcohol as the sole lignin monomer
(Terashima, 1995). The DHP crude product was centrifuged (10k g, 20 min, 4 °C) and the pellet
washed four times with distilled water. The DHP product was then lyophilized to yield dry DHP
(typical yields 60-70%), which was characterized via NMR as described below and was found to
contain shifts typical of G-DHP (Capanema, 2004; Kim, 2010).
Lignin-peptide adducts were prepared as above, with the exception that 25% peptide to
coniferyl alcohol (mol/mol basis) was added to the flask containing coniferyl alcohol prior to the
start of the reaction. The crude reaction products were centrifuged and lyophilized as described
above to yield tan powders. These adducts were characterized using IR, SEM, EDS and NMR.
3.3.3. Scanning electron microscopy and energy dispersive X-ray spectroscopy
Scanning electron microscopy images were collected on a field emission SEM (FESEM -
FEI NanoSEM 630) at 2 or 3 kV under high vacuum (1.7 x 10-6 Torr). Samples were not sputter
coated prior to imaging. Characteristic X-rays were collected with an X-Max silicon drift
detector (Oxford Instruments) inside the FESEM at 10 kV under low vacuum conditions (0.6
58
Torr) in order to prevent sample charging. Elements were selected and quantified using Aztec
Energy Analyser Software (Oxford Instruments).
3.3.4. Nuclear magnetic resonance spectroscopy
The neat peptides (25 mg) were dissolved in DMSO-d6/pyridine-d5 (4:1 v/v, 500 ul), and
proton (16 scans), carbon (4k scans), HMQC (64 scans) and HMBC (32 scans) spectra were
collected using standard Bruker pulse programs on a Bruker DRX-400 (400 MHz 1H resonance
freq.) using the central solvent peak [δH/δC: dimethyl sulfoxide (DMSO), 2.50/39.50 ppm] as
internal standard. In the case of DHP and the lignin-peptide adducts, NMR spectra were acquired
on a Bruker Biospin (Billerica, MA, USA) AVANCE 500 (500 MHz 1H resonance freq.)
spectrometer fitted with cryogenically-cooled gradient probes having inverse geometry, i.e., with
the proton coils closest to the sample. Spectra were processed with Bruker’s Topspin 3.1
software, using the central solvent peak as internal reference [δH/δC: dimethyl sulfoxide
(DMSO), 2.50/39.5 ppm]. The DHP or lignin-peptide adducts (~45 mg) were placed in an NMR
tube (ID: 4.1 mm), dissolved in DMSO-d6/pyridine-d5 (4:1 v/v, 500 ul), and subjected to
adiabatic HSQC (‘hsqcetgpsisp2.2’) experiments, and, in the case of DHP-YGG, also subjected
to HMBC (‘hmbcgpndqf’), COSY (‘cosygpqf’), and NOESY (‘noesyesgpph’) experiments in an
attempt to determine the lignin-tyrosine crosslinking mechanism. Processing used typical
matched Gaussian apodization in F2 (LB = -0.3, GB = 0.001), and squared cosine-bell and one
level of linear prediction (32 coefficients) in F1 (Mansfield, 2012). For an estimation of the
various inter-unit linkage types in DHP and DHP-peptide adducts (Table 2; β-ether/α-OH, β-
ether/α-O-aryl, β-ether/α-peptide, phenylcoumaran, pinoresinol, and dibenzodioxocin), the well
resolved Cα-Hα contours were integrated; no correction factors were used.
3.3.5. Fourier-transform infrared spectroscopy
Lignin DHP, neat peptides, and lignin-peptide adducts were analyzed using a Bruker
Vertex V70 Spectrometer (Bruker Optics Billerica MA) equipped with an MVP-Pro diamond
single reflection ATR accessory (Harrick Scientific Pleasantville NY), and 100 scans at 6 cm-1
resolution were averaged for each sample using a DTGS detector and scan frequency of 5 kHz.
In all cases, the spectrum of the clean diamond crystal was used as the reference spectrum. All
spectral manipulations were performed using OPUS 6.0 (Bruker Optics, Billerica MA).
3.4. Results and discussion
3.4.1. Preparation and yields of the lignin-peptide adducts
Lignin DHP was prepared in aqueous solutions containing tripeptides (25%
peptide:coniferyl alcohol mol/mol basis). Each tripeptide contained one nucleophilic amino acid
and blocked N- and C-termini in order to mimic inclusion within a larger protein and to prohibit
potential side reactions. The lignin-peptide adducts were collected via centrifugation, washed,
and characterized via SEM, NMR, EDS, and FT-IR. The results, detailed below, indicate
covalent incorporation of CGG and YGG peptides into the lignin polymer, while other peptides
did not show significant reactivity.
Yields for the DHP and lignin-peptide adducts are shown in Table 3.1. Yield A was
determined by dividing the mass of recovered solids by the total starting mass (i.e., combined
mass of lignin monomer and peptide), while yield B was determined by dividing the mass of
59
recovered solids by the starting mass of lignin monomer only. Thus, yield B is only valid for
lignin-peptide reactions in which peptide incorporation into the lignin was negligible. Notably,
the yields were very high when DHP was prepared in the presence of non-covalently reactive
peptides (i.e., all peptides other than CGG and YGG). The reason for this was unclear.
In the cases of the considerably reactive peptides (i.e., CGG and YGG) the yields of
recoverable DHP were depressed. For DHP-CGG, a likely explanation is that the thiol group of
the CGG peptide inhibited the catalytic ability of horseradish peroxidase, thus hampering
polymerization (Tobimatsu et al, 2009; Veitch, 2004). It is not known why the yield was
depressed in the case of DHP-YGG. The authors perceived that in the cases of DHP-CGG and
DHP-YGG a portion of the lignin-peptide adducts may have been aqueous soluble and held in
solution during the centrifugation process. However, extraction of the aqueous supernatants with
ethyl acetate and chloroform followed by NMR analyses did not show evidence for lignin-
protein complexes. Drying down the aqueous supernatant, re-suspending the solids in DMSO-
d6/pyridine-d5, and analyzing the products via NMR similarly failed to provide evidence for
lignin-protein crosslinking. This confirmed the depression of DHP yields in the cases of DHP-
CGG and DHP-YGG.
Table 3.1. Yield data for the DHP and lignin-peptide adducts
CA (mg) pep (mg) yield (mg) Yield A (%) Yield B (%)
DHP 200.0 0.0 130.0 65.0 65.0
DHP-CGG 200.0 88.4 100.0 34.7 -
DHP-KGG 200.0 83.6 174.5 61.5 87.3
DHP-HGG 200.0 86.1 176.0 61.5 88.0
DHP-DGG 200.0 80.0 180.0 64.3 90.0
DHP-EGG 200.0 83.9 197.9 69.7 99.0
DHP-SGG 200.0 72.2 172.8 63.5 86.4
DHP-YGG 200.0 93.3 114.2 38.9 -
DHP-TGG 200.0 76.1 179.1 64.9 89.6
DHP-HypGG 200.0 79.4 177.4 63.5 88.7
Yield A was determined by dividing the mass of recovered solids by the total starting mass (i.e.,
combined mass of lignin monomer and peptide). Yield B was determined by dividing the mass of
recovered solids by the starting mass of lignin monomer only. CA = coniferyl alcohol, pep =
peptide.
3.4.2. Lignin-peptide morphology
Scanning electron microscopy was used to compare the morphologies of DHP and the
lignin-peptide adducts (Fig 3.2). The DHP particles clumped together to form nearly perfect
spheres, as reported previously (Micic et al, 2003; Micic et al, 2004). Comparatively, spheres of
lignin-peptide adducts tended to form large, amorphous domains. This alteration of morphology
was observed regardless of whether the peptide in question was incorporated into the lignin. This
change in morphology was unexpected but was most likely due to non-covalent interactions
60
occurring between the peptides and the growing lignin chain. Further research is necessary to
determine the influence of non-covalent inter-polymer interactions during lignin polymerization.
Fig 3.2. SEM images of DHP (top), then, proceeding from left to right and top to bottom, DHP-
CGG, DHP-YGG, DHP-HypGG, DHP-DGG, DHP-EGG, DHP-KGG, DHP-HGG, DHP-SGG,
and DHP-TGG. Scale bar: 2 µm.
3.4.3. Lignin-peptide linkage identification
Fig 3.3 shows the heteronuclear single quantum coherence (HSQC) spectrum of DHP-
CGG. This 2D NMR technique is particularly useful for lignin analysis because the shift
degeneracy observed in 1D NMR spectra is largely avoided. Novel shifts are shown in green,
red, and blue, while standard lignin shifts are shown in black (Capanema, 2004; Kim, 2010).
Reference shifts of neat CGG (purple) were added to the spectrum during processing; these shifts
were not observed in the DHP-CGG spectrum. Some peptide shifts migrated as a result of DHP-
CGG crosslinking. For example, the cys-13C/1Hα shift (originally at 4.5/56.0 ppm in neat CGG)
migrated to 4.6/52.8 ppm, and the cys-13C/1Hβ shift (originally at 2.8/26.6 ppm in neat CGG)
migrated to 2.8/32.9 ppm in the DHP-CGG adduct. These shifts migrated upon lignin-peptide
crosslinking due to their proximity to the thiol group, which is the reactive center of the CGG
61
peptide. Shifts of proton and carbon atoms located far from the reactive thiol were largely
unaffected by crosslinking (e.g., shifts at 3.9/42.8 and 3.8/42.5 ppm).
Two novel lignin shifts, found at 4.4/50.1 ppm (Fig 3.3, red peak) and 4.8/81.3
ppm (Fig 3.3, blue peak), confirmed covalent crosslinking of DHP with CGG. Similar lignin α-
shifts (4.3/50.4 ppm) and β-shifts (4.7/81.7 ppm) were previously reported when a single
cysteine residue was reacted with a lignin model quinone methide to yield a structure similar to
that shown in Fig 3.3 (Diehl et al., 2014). The minute differences in shift locations can be
attributed to changes in chemical environment between a small lignin model compound and a
high molecular weight lignin. Volume integration of the HSQC contours showed that
approximately 33% of the β-ether linkages in DHP-CGG exhibited cysteine functionality at the
α-carbon, while the remaining β-ether linkages exhibited typical α-hydroxyl functionality and a
minor fraction of α-aryl ether (α-O-aryl) moieties. This indicated that cysteine was an efficient
trapper of lignin QMs under biomimetic conditions.
Fig 3.3. Side-chain and aromatic regions (inset) of the HSQC NMR spectrum of DHP-CGG.
Black shifts are typical of G-DHPs, green shifts correspond to peptide α- and β-signals, and red
and blue shifts correspond to lignin α- and β-signals in β-ether/α-cysteine structures (top left).
Purple shifts were added during processing to indicate shifts of neat CGG peptide.
Fig 3.4 shows the HSQC spectrum of DHP-YGG. As with the DHP-CGG adduct,
incorporation of YGG peptide into lignin was evidenced by the appearance of diagnostic
chemical shifts (Fig 3.4, green and orange contours). Reference shifts of neat YGG (purple, solid
yellow, and solid orange contours) were added during processing. The authors perceived that
given the similarity of tyrosine and coniferyl alcohol, crosslinking of YGG with DHP may have
occurred via two mechanisms.
62
The first potential mechanism involves oxidation of the phenolic hydroxyl of tyrosine by
horseradish peroxidase, followed by recombination of the tyrosine radical with a radical on the
lignin polymer. This mechanism may be unfavorable because radicals generated on tyrosine
could be shuttled to coniferyl alcohol, which exhibits an additional resonance structure compared
to tyrosine, presumably making it more stable (Cong et al., 2013). In addition to HSQC NMR,
we submitted the DHP-YGG adduct to heteronuclear multiple bond correlation (HMBC),
correlation spectroscopy (COSY), and nuclear Overhauser effect spectroscopy (NOESY)
techniques (spectra not shown), but were unable to conclusively assign NMR shifts of lignin-
tyrosine linkages formed in this manner. This may indicate that the mechanism is not valid under
our experimental conditions, and/or may illustrate the inadequacy of NMR to resolve shift
degeneracy between lignin-tyrosine linkages and typical lignin shifts.
A second crosslinking mechanism is possible when the phenolic hydroxyl of tyrosine
quenches the lignin quinone methide to form the α-aryl ether structure shown in Fig 3.4. Again,
shift degeneracy may complicate the investigation of this mechanism, as a lignin-tyrosine model
compound exhibited similar NMR shifts (α-1H/13C: 5.5/78.3 ppm in DMSO/pyridine) to α-aryl
ether linkages known to occur in neat DHPs (α-13C: 79.01 ppm in DMSO) (Diehl et al., 2014;
Ralph, Ralph and Landucci, 2004). In an attempt to overcome this issue, the well-resolved
HSQC α-signals of neat DHP and DHP-YGG adduct were integrated. It was observed that α-aryl
ether shifts comprised approximately 4.2% of the total α-signal in DHP-YGG but only 1.9% in
neat DHP synthesized under similar conditions. This increase could be due to imprecision in the
HSQC volume integration or random variation among DHP syntheses (other lignin-peptide
adducts displayed similarly high α-aryl ether signals), making it unclear if the structure shown in
Fig 3.4 formed in the DHP-YGG adduct. In summary, the NMR, EDS, and IR data (shown
below) strongly suggest that the YGG peptide crosslinked with lignin DHP; however, the
mechanism of lignin-tyrosine crosslinking is still uncertain.
63
Fig 3.4. Side-chain and aromatic regions (inset) of the HSQC NMR spectrum of DHP-YGG.
Black shifts are typical of G-DHPs, green shifts correspond to peptide α- and β-signals, and red
and blue shifts correspond to lignin α- and β-signals in β-ether/α-tyrosine structures (top left)
and/or lignin-lignin α-O-aryl structures. Purple shifts were added during processing to indicate
shifts of neat YGG peptide. Within the aromatic region, solid yellow and orange shifts (added
during processing) were assigned to the aromatic ring of tyrosine in neat YGG. It can be seen
that the contours (orange) representative of tyrosine ring positions 3 and 5 shift downfield as a
result of lignin-tyrosine crosslinking. The specific lignin-tyrosine crosslinking mechanism could
not be determined by NMR, and the structure shown is one of several possibilities.
It was notable that in the case of lignin-peptide adducts other than DHP-CGG and DHP-
YGG, peptide peaks could always be observed when viewing the HSQC contours quite low (i.e.,
near the signal to noise limit). Fig 3.5 shows the HSQC spectrum of DHP-HGG. This sample
showed the highest concentration of peptide after DHP-CGG and DHP-YGG. A putative lignin-
α-histidine crosslink was observed at 5.7/60.4 ppm, in good agreement with the α-shift of a
lignin-histidine model compound (5.7/60.2 ppm) (Diehl et al., 2014). Volume integration showed
that the abundance of the lignin-α-histidine shift only accounted for ~0.1% of the total lignin α-
signal. It is noteworthy that this low abundance of peptide was detected by HSQC NMR but not
readily detected by IR or EDS, thus illustrating the sensitivity of multidimensional NMR toward
investigating lignin-protein linkages.
Other lignin-peptide adducts exhibited less abundant NMR peptide shifts than DHP-
HGG. This indicated that negligible lignin-peptide crosslinking had occurred, in concurrence
with IR and EDS data (below).
64
Fig 3.5. Side-chain region of the HSQC NMR spectrum of DHP-HGG. Black shifts are typical of
G-DHPs, green shifts correspond to peptide α- and β-signals, and red and blue shifts correspond
to lignin α- and β-signals in β-ether/α-histidine structures (top left). Purple shifts were added
during processing to indicate shifts of neat HGG peptide.
Volume integration of the HSQC contour signals allowed for comparison of the various
lignin inter-unit linkages among DHP and lignin-peptide adducts (Table 3.2). The DHP
contained linkage ratios typical of DHPs (Terashima et al., 1995 and 2009; Tobimatsu et al.,
2012). Linkage ratios varied among the lignin-peptide adducts, but decreased β-ether content
with increased pinoresinol (β-β) content was generally observed. This occurred regardless of
covalent reactivity towards the lignin DHP, demonstrating the ability of a matrix material (in this
case peptides) to influence lignin structure during polymerization.
Table 3.2. Inter-unit linkage ratios of the DHP and lignin-peptide adducts
HSQC signal ratios
β-ether/α-OH β-ether/α-O-aryl β-ether/α-pep β-5 β-β Dibenz.
DHP 27.3 1.9 - 50.3 19.2 1.2
DHP-CGG 8.6 1.5 5.1 50.8 32.4 1.6
DHP-DGG 10.1 5.5 0.1 54.1 30.2 tr
DHP-EGG 13.1 4.6 0.1 54.1 27.2 0.9
DHP-KGG 4.7 3.1 tr 62.3 29.9 tr
DHP-HGG 20.4 0.9 0.1 57.7 20.3 0.6
DHP-SGG 11.1 2.3 tr 52.5 34.1 tr
DHP-YGG 11.5 4.2 51.7 32.5 tr
DHP-TGG 21.8 0.9 tr 54.1 22.2 1.0
DHP-HypGG 17.7 2.7 tr 53.2 26.2 0.2
Lignin inter-unit linkage ratios (as percentage of total α-signal) for DHP and lignin-peptide
adducts. In the case of DHP-YGG the DHP-α-peptide shift was degenerate with standard lignin
α-O-aryl shifts, thus the β-ether/α-O-aryl and β-ether/α-pep quantities were combined. tr, trace
(<0.1%).
3.4.4. Supporting techniques for characterization of lignin-peptide entanglement
In addition to NMR, which can provide direct evidence of covalent crosslinking, other
techniques can be used to show peptide incorporation into lignin. Fig 3.6 shows FT-IR spectra of
neat DHP and the lignin-peptide adducts. The neat DHP IR spectrum exhibited bands typical of
lignin DHPs (Faix, 1988). The DHP-CGG and DHP-YGG spectra exhibited three peaks
indicative of peptide incorporation into the lignin. The shoulder near 3200 cm-1 was attributed to
N-H stretching in amide functional groups, the peak at 1658 cm-1 increased dramatically and was
attributed to increased C=O stretching due to the incorporation of amide functional groups, and
the shoulder at 1540 cm-1 was attributed to N-H deformation with C-N stretching, again
indicating incorporation of amide functionalities (Socrates, 2001). It is notable that these shifts
displayed greater intensity in the DHP-CGG adduct compared to the DHP-YGG adduct,
suggesting greater incorporation of CGG peptide. These peaks were not observed in the IR
spectra of other lignin-peptide adducts, suggesting a lack of peptide incorporation. In the case of
DHP-CGG and DHP-YGG, incorporation of peptide into the lignin polymer caused an increase
65
in the band at 1505 cm-1, which was previously assigned to aromatic skeletal vibrations (Faix,
1988). This increase in peak height was not observed in other lignin-peptide adducts and the
origin of this increased peak intensity is unclear. Though the IR results suggested incorporation
of CGG and YGG peptides into lignin DHP, peaks directly attributable to lignin-peptide linkages
were not identified. These results demonstrate that IR is a quick and reliable technique for
indicating lignin-peptide interactions in general, but may be insufficient for determining the
presence or absence of covalent crosslinks.
Fig 3.6. FT-IR spectra of DHP and lignin-peptide adducts.
66
Because neat lignin contains only carbon, oxygen, and hydrogen, elemental analysis
techniques can be used to show incorporation of proteins into lignin when nitrogen is present
(assuming no inorganic nitrogen contamination). In this case, energy dispersive X-ray
spectroscopy (EDS) was used to determine the elemental compositions of DHP and lignin-
peptide adducts because EDS spectra are readily attainable in the SEM instrument, and can
therefore be carried out in conjunction with morphological studies. The DHP and most lignin-
peptide adducts contained 0% nitrogen (Table 3), which indicated no detectable incorporation of
peptide into the lignin. In comparison, DHP-CGG and DHP-YGG contained 3.5% and 2.0%
nitrogen, respectively. This provided compelling evidence for incorporation of cysteine and
tyrosine-containing peptides into the lignin DHP. It was not possible to determine quantitative
ratios (for example, lignin-to-peptide wt/wt ratio) using this technique; however, it is possible to
roughly compare peptide quantities among samples when necessary. Thus, the EDS data
suggested that cysteine reacted more readily with lignin compared to tyrosine, which is in
agreement with the IR data.
Table 3.3. EDS elemental analysis data for DHP and the lignin-peptide adducts.
Average atomic % (std. dev.)
carbon oxygen nitrogen sulfur
DHP 78.8 (2.8) 21.2 (2.9) 0.0 (0.0) 0.0 (0.0)
DHP-CGG 76.5 (2.0) 18.9 (1.2) 3.5 (0.7) 1.2 (0.3)
DHP-KGG 86.8 (2.5) 13.2 (2.5) 0.0 (0.0) 0.0 (0.0)
DHP-HGG 90.5 (2.1) 9.5 (2.5) 0.0 (0.0) 0.0 (0.1)
DHP-DGG 82.8 (3.8) 17.2 (3.8) 0.0 (0.0) 0.0 (0.0)
DHP-EGG 83.7 (2.0) 15.4 (1.1) 0.0 (0.0) 0.6 (0.8)
DHP-SGG 86.3 (2.9) 13.7 (2.9) 0.0 (0.0) 0.0 (0.0)
DHP-YGG 84.0 (2.7) 14.0 (1.1) 2.0 (1.7) 0.0 (0.0)
DHP-TGG 85.8 (1.1) 14.2 (1.1) 0.0 (0.0) 0.0 (0.0)
DHP-HypGG 84.4 (0.2) 15.6 (0.2) 0.0 (0.0) 0.0 (0.0)
Atomic percentages are reported as averages of three sampling locations. Standard deviations are
shown in parentheses. Trace levels of calcium account for the balance in the case of DHP-EGG.
3.5. Conclusions
Amino acid residues with nucleophilic side chains were previously shown to react with a
lignin model quinone methide in dichloromethane, yielding lignin-α-peptide structures (Diehl et
al., 2014). In the present study, we extended this work by characterizing DHP-peptide covalent
crosslinks and non-covalent effects of peptides on DHP formation, under biomimetic conditions.
Lignin DHP was prepared using coniferyl alcohol as the sole lignin monomer and
peptides were added having the general structure XGG, in which X was an amino acid residue
with a nucleophilic side chain (i.e., C, K, H, D, E, S, Y, T, and Hyp). The lignin was precipitated
via centrifugation to yield DHP-peptide adducts, and analysis using IR, EDS, and NMR showed
that CGG and YGG were significantly reactive toward lignin while other peptides were not. In
the case of DHP-CGG, HSQC NMR showed that crosslinking occurred at the lignin α-position
67
(Fig 3.4). The crosslinking mechanism of DHP with YGG could not be conclusively elucidated.
SEM imaging showed that DHP-peptide adducts exhibited a unique morphology compared to
neat DHP, regardless of peptide incorporation into the lignin. With regards to lignin inter-unit
lignin ratios, the quantity of β-ether linkages was typically depressed in DHPs synthesized in the
presence of peptides, while the quantity of pinoresinol structures increased. The yields of DHP-
CGG and DHP-YGG were depressed compared to neat DHP yields; however, curiously, the
yields increased when DHP was prepared in the presence of other peptides.
We have shown that cysteine and tyrosine crosslink with lignin under biomimetic
conditions. This suggests that similar crosslinking may occur in the cell walls of both native and
transgenic plant lines. Further research is needed to investigate whether this crosslinking does
occur, and to discover how the plant might control and benefit from such crosslinking (i.e., does
it help stiffen the wall, assist water conduction, provide protection from pathogens, etc). In
addition, a better understanding of lignin-protein linkages could lead to genetic manipulation
(up-regulation or down-regulation of the linkages), as already suggested by Liang et al. (2008)
and Xu et al. (2013). This could reduce lignin recalcitrance, which is currently a barrier to using
lignocellulosic materials in developing industries such as biofuels. It is anticipated that this work
will lead to future studies of lignin-protein linkages in planta, and a more thorough
understanding of how such linkages could be tailored and modified.
3.6. Acknowledgements
This material is based upon work supported as part of The Center for Lignocellulose
Structure and Formation, an Energy Frontier Research Center funded by the U.S. Department of
Energy, Office of Science, Office of Basic Energy Sciences under Award Number DE-
SC0001090. Student fellowships were provided by the USDA National Needs Program and the
National Science Foundation via the CarbonEARTH program. Many thanks to Julie Anderson
and Melisa Yashinski (PSU MRI) for acquisition of SEM/EDS data and for valuable discussions.
Thanks also to Dr. John Ralph, Yuki Tobimatsu, and Matt Regner for valuable discussions
regarding multidimensional NMR of lignin.
3.7. References
Awad, H. M.; Boersma, M. G.; Vervoort, J.; Rietjens, I. M. C. M. Peroxidase-catalyzed
formation of quercetin quinone methide-glutathione adducts. Arch. Biochem. Biophys. 2000, 378,
224-233.
Balakshin, M.; Capanema, E.; Gracz, H.; Chang, H.; Jameel, H. Quantification of lignin-
carbohydrate linkages with high-resolution NMR spectroscopy. Planta 2011, 233, 1097-1110.
Boerjan, W.; Ralph, J.; Baucher, M. Lignin Biosynthesis. Annu. Re. Plant Biol. 2003, 54, 519-
546.
Bolton, J. L.; Turnipseed, S. B., Thompson, J. A. Influence of quinone methide reactivity on the
alkylation of thiol and amino groups in proteins: studies utilizing amino acid and peptide models.
Chem. Biol. Interact. 1997, 107, 185-200.
68
Capanema, E. A.; Balakshin, M. Y.; Kadla, J. F. A comprehensive approach for quantitative
lignin characterization by NMR spectroscopy. J. Agric. Food Chem. 2004, 52, 1850-1860.
Chapple, C.; Ladisch, M.; Meilan, R. Loosening lignin's grip on biofuel production. Nat.
Biotechnol. 2007, 25, 746-748.
Chen, F.; Dixon, R. A. Genetic manipulation of lignin biosynthesis to improve biomass
characteristics for agro-industrial processes. In Vitro Cell. Dev. Biol. - Animal. 2008, 44, S28-
S29.
Cong, F.; Diehl, B.G.; Hill, J.L.; Brown, N.R.; Tien, M. Covalent bond formation between amino
acids and lignin: Cross-coupling between proteins and lignin. Phytochem. 2013, 96, 449-456.
Diehl, B.G.; Watts, H.D.; Kubicki, J.D.; Regner, M.R.; Ralph, J.; Brown, N.R. Towards lignin-
protein crosslinking: Amino acid adducts of a lignin model quinone methide. Cellulose 2014,
available online.
Faix O, Beinhoff O. 1988. Journal of Wood Chemistry and Technology, 8 (4): 505-522. "FTIR
spectra of milled wood lignins and lignin polymer models with enhanced resolution obtained by
deconvolution."
Fisher, J.; Abdella, B. R. J.; McLane, K. E. Anthracycline antibiotic reduction by spinach
ferredoxin-NADP+ reductase and ferredoxin. Biochemistry 1985, 24, 3562-3571.
Jung, H. G.; Allen, M. S. Characteristics of plant cell walls affecting intake and digestibility of
forages by ruminants. J. Animal Sci. 1995, 73, 2774-2790.
Jung, H. G. Forage lignins and their effects on fiber digestibility. Agron. J. 1989, 81, 33-38.
Kim, H.; Ralph, J. Solution-state 2D NMR of ball-milled plant cell wall gels in DMSO-
d6/pyridine-d5. Org. Biomol. Chem. 2010, 8, 576-591.
Leary, G. J. Quinone methides and the structure of lignin. Wood Sci. and Tech. 1980, 14, 21-34.
Liang, H.; Frost, C. J.; Wei, X.; Brown, N. R.; Carlson, J. E.; Tien, M. Improved sugar release
from lignocellulosic material by introducing a tyrosine-rich cell wall peptide gene in poplar.
Clean 2008, 36(8), 662-668.
Li, X.; Weng, J. K.; Chapple, C. Improvement of biomass through lignin modification. Plant J.
2008, 54, 569-581.
Mansfield, S. D.; Kim, H.; Lu, F.; Ralph, J. Whole plant cell wall characterization using
solution-state 2D NMR. Nature Protocols. 2012, 7(9), 1579-1589.
Micic, M.; Radotic, K.; Jeremic, M.; Djikanovic, D.; Kammer, S. B. Study of the lignin model
compound supramolecular structure by combination of near-field scanning optical microscopy
and atomic force microscopy. Colloids and Surfaces B: Biointerfaces 2004, 34, 33-40.
Micic, M.; Radotic, K.; Jeremic, M.; Leblanc, R. M. Study of the self-assembly of the lignin
model compound on cellulose model substrate. Macromol. Biosci. 2003, 3 (2), 100-106.
69
Miyagawa, Y.; Takemoto, O.; Takano, T.; Kamitakahara, H.; Nakatsubo, F. Fractionation and
characterization of lignin carbohydrate complexes (LCCs) of Eucalyptus globulus in residues left
after MWL isolation. Part I: Analyses of hemicellulose-lignin fractionation (HC-L).
Holzforschung 2012, 66, 459-465.
Ralph, S.A., Ralph, J., Landucci, L.L. NMR Database of Lignin and Cell Wall Model
Compounds. Available at URL http://ars.usda.gov/Services/docs.htm?docid=10491 (November
2004).
Ralph, J.; Schatz, P. F.; Lu, F.; Kim, H.; Akiyama, T.; Nelsen, S. F. Quinone methides in
lignification. In Quinone Methides; Rokita, S. E., Ed.; John Wiley & Sons , 2009; pp 385-420.
Ramakrishnan, K.; Fisher, J. Nucleophilic trapping of 7,11-dideoxyanthracyclinone quinone
methides. J. Am. Chem. Soc. 1983, 105, 7187-7188.
Richard, J. P.; Toteva, M. M.; Crugeiras, J. Structure-reactivity relationships and intrinsic
reaction barriers for nucleophile additions to a quinone methide: a strongly resonance-stabilized
carbocation. J. Am. Chem. Soc. 2000, 122, 1664-1674.
Socrates, George. Infrared and Raman characteristic group frequencies, third edition. George
Wiley and Sons, LTD. West Sussex, England. 2001.
Stewart, J. J.; Kadla, J. F.; Mansfield, S. D. The influence of lignin chemistry and ultrastructure
on the pulping efficiency of clonal aspen (Populus termuloides Michx). Holzforschung 2006, 60,
111-122.
Terashima, N.; Akiyama, T.; Ralph, S.; Evtuguin, D.; Neto, C.P.; Parkas, J.; Paulsson, M.;
Westermark, U.; Ralph, J. 2D-NMR (HSQC) difference spectra between specifically 13C-
enriched and unenriched protolignin of Ginkgo biloba obtained in the solution state of whole cell
wall material. Holzforschung 2009, 63, 379-384.
Terashima, N.; Atalla, R. H.; Ralph, S. A.; Landucci, L. L.; Lapierre, C.; Monties, B. New
preparations of lignin polymer models under conditions that approximate cell wall lignification.
Holzforschung 1995, 49, 521-527.
Terashima, N.; Awano, T.; Takabe, K.; Yoshida, M. Formation of macromolecular lignin in
gingko xylem cell walls as observed by FESEM. C.R. Biologies 2004, 327, 903-910.
Tobimatsu, Y.; Elumalai, S.; Grabber, J. H.; Davidson, C. L.; Pan, X.; Ralph, J.
Hydroxycinnamate conjugates as potential monolignol replacements: In vitro lignification and
cell wall studies with rosmarinic acid. ChemSusChem. 2012, 5 (4), 676-686.
Tobimatsu, Y.; Takano, T.; Kamitakahara, H.; Nakatsubo, F. Reactivity of syringyl quinone
methide intermediates in dehydrogenative polymerization I: high-yield production of synthetic
lignins (DHPs) in horseradish peroxidase-catalyzed polymerization of sinapyl alcohol in the
presence of nucleophilic reagents. J. Wood Sci. 2009, 56(3), 233-241.
70
Toikka, M.; Jussi, S.; Teleman, A.; Brunow, G. Lignin-carbohydrate model compounds.
Formation of lignin-methyl arabinoside and lignin-methyl galactoside benzyl ethers via quinone
methide intermediates. J. Chem. Soc.; Perkin Trans. 1. 1998, 1, 3813-3818.
Veitch, N. C. Horseradish peroxidase: a modern view of a classic enzyme. Phytochem. 2004, 65,
249-259.
Xu, Y.; Chen, C.; Thomas, T.P.; Azadi, P.; Diehl, B.; Tsai, C.; Brown, N.; Carlson, J.E.; Tien,
M.; Liang, H. Plant Cell Rep. 2013, 32, 1827-1841.
Yuan, T.; Sun, S.; Xu, F.; Sun, R. Charcterization of lignin structures and lignin-carbohydrate
complex (LCC) linkages by quantitative 13C and 2D HSQC NMR spectroscopy. J. Agric. Food
Chem. 2011, 59, 10604-10614.
71
Chapter 4
Preparation and characterization of lignin-gelatin complexes
(Target journal for publication is Journal of Applied Polymer Science)
4.1. Abstract
Lignin dehydrogenation polymer (DHP) was prepared in the presence of gelatin protein to yield
“DHP-Gel adducts.” The DHP-Gel adducts were characterized using Fourier-transform infrared
spectroscopy (FT-IR), scanning electron microscopy (SEM), energy dispersive X-ray
spectroscopy (EDS), X-ray photoelectron spectroscopy (XPS) and heteronuclear single quantum
coherence nuclear magnetic resonance spectroscopy (HSQC NMR). FT-IR, EDS and XPS
showed that gelatin was incorporated into the lignin even when added in small quantities. In
addition, EDS and XPS showed that gelatin was distributed throughout the DHP when added
during lignin polymerization, but adsorbed to the surface of DHP when added following
polymerization. A lack of diagnostic lignin-protein crosslink signatures in the HSQC NMR
spectra suggested that the lignin-gelatin interaction was largely non-covalent in nature. This may
have implications toward lignification in planta, in which lignin is biosynthesized in a pre-
deposited matrix of polysaccharides and proteins. Cell wall structural proteins, which are
common in the cell corner region of the middle lamella (where lignification begins), may help
“nucleate” lignification without necessarily covalently crosslinking with lignin.
Key words: lignin, gelatin, crosslinking, non-covalent, scanning electron microscopy, energy
dispersive X-ray spectroscopy, X-ray photoelectron spectroscopy, nuclear magnetic resonance
spectroscopy.
4.2. Introduction
Lignin is the most abundant aromatic biopolymer on earth (Boerjan et al., 2003). It is
most commonly biosynthesized in the cell walls of land plants from three monolignols (p-
coumaryl, coniferyl, and sinapyl alcohols) that vary in their degree of aromatic ring
methoxylation. It is commonly believed that lignin imparts three main evolutionary advantages
to the plant: structural rigidity, water conductivity, and pathogen resistance. Lignin is
commercially important to the pulp and paper industry, agricultural industries concerned with
forage digestibility, and the developing biofuels industry, in which it is known to foul cellulose
to ethanol conversion processes (Stewart, 2006; Chen, 2008; Li, 2008; Chapple, 2007; Jung,
1989; Jung, 1995). Lignin is also a potential source of renewable carbon for plastics, carbon
fibers, solvents, and low and high value chemicals, to name a few (Gellerstedt, et al., 2010, Chen
and Sarkanen, 2006, Dorrestijn, et al., 2000, Clark, et al., 2009).
Despite decades of research, some details of lignification are still poorly understood. For
example, it has long been known that within most plants, lignin deposition begins in the cell
corner region of the middle lamella. However, the mechanism by which the plant controls this
pattern of lignin deposition is unknown. Lignification initiation sites (sometimes referred to as
“nucleation sites”) have been postulated, with two commonly hypothesized initiation sites being
72
calcium-pectate complexes (which may bind anionic peroxidases necessary for lignin
polymerization) and cell wall structural proteins (especially extensins, which are abundant in the
cell corners) (Albersheim et al., 2010; Boerjan et al., 2003). Neither of these hypotheses has been
adequately investigated in vitro or in vivo. Several studies have shown that proteins have an
affinity to bind lignin; however, the nature of the lignin-protein binding (i.e., covalent vs. non-
covalent) was not elucidated (Whitmore, 1978a; Whitmore, 1978b; Whitmore, 1982). With
regard to lignin-protein covalent crosslinking, it was recently shown that Cys, Lys, His, Asp,
Glu, Ser, and Tyr crosslink with lignin in non-polar solvents, and that Cys and Tyr crosslink with
lignin even under aqueous, biomimetic conditions (Diehl et al., 2014; Diehl and Brown, in
review). Here, we report the preparation and characterization of lignin DHP in the presence of
gelatin, a glycine and hydroxyproline-rich animal protein.
Though gelatin protein does not originate from plants, the lignin-gelatin complex is
interesting and potentially informative for several reasons. Gelatin is both glycine and
hydroxyproline-rich, as are many plant cell wall structural proteins. Gelatin has previously been
shown to interact with lignin, though the presence or absence of covalent crosslinks was not
definitely determined (Whitmore, 1978b). Several potentially nucleophilic amino acid residues
are found in gelatin; however, cysteine and tyrosine residues are almost entirely lacking (Table
4.1) (Eastoe, 1955). While nucleophilic amino acids other than cysteine and tyrosine have been
shown to react with lignin under ideal conditions in non-polar solvents, only cysteine and
tyrosine have been shown to covalently crosslink with lignin under biomimetic conditions of
DHP preparation (Diehl et al., 2014; Diehl and Brown, in review). The affinity for gelatin to
interact with lignin under aqueous conditions is therefore interesting, as it seems most likely to
arise from physical entanglement and/or non-covalent interactions, or from lignin-protein
crosslinkage types that have not previously been observed under conditions of DHP preparation.
Understanding the interactions occurring within the lignin-gelatin complex may provide insights
into lignin-protein interactions in planta, where lignin and cell wall structural proteins could be
expected to be in close spatial proximity.
Table 4.1. Nucleophilic amino acid abundance (g/100 g dry, ash-free protein) in gelatin.
Residue Porcine Gelatin Bovine Gelatin
Cys 0.00 0.05
Lys 4.14 5.20
His 1.01 0.63
Glu 11.30 12.10
Asp 6.70 6.90
Tyr 0.60 0.14
Ser 4.13 2.90
Thr 2.19 2.20
Hyp 13.50 14.40
Total 43.57 44.52
73
In order to elucidate the lignin-gelatin interaction, five lignin-gelatin complexes (DHP-
Gel adducts) were prepared (see Table 4.2 for DHP-Gel preparations). The DHP was prepared by
slowly combining monolignol (e.g., coniferyl alcohol) and hydrogen peroxide solutions to a
horseradish peroxidase solution, as previously described (Terashima et al., 1995). In addition,
various quantities of porcine (high molecular weight) and bovine (low molecular weight) gelatin
were added to the DHP preparations. DHP-Gel1 and DHP-Gel5 contained equivalent quantities
(wt/wt basis) of monolignol and porcine gelatin, with the only difference being that gelatin was
dissolved in the same flask as the coniferyl alcohol in the case of DHP-Gel1, and was thus added
slowly and continuously to the DHP preparation during polymerization, whereas in the case of
DHP-Gel5 gelatin was added to the DHP preparation following completion of polymerization. It
was perceived that covalent crosslinking, if it occurred, might be more likely in the case of DHP-
Gel1 because the gelatin would potentially be in intimate contact with reactive lignin
intermediates (i.e., quinone methides) over the course of the polymerization reaction. It was also
perceived that addition of gelatin at different times might influence the morphology of the DHP-
Gel adducts. In the cases of DHP-Gel2, DHP-Gel3, and DHP-Gel4, decreasing quantities of
bovine gelatin (2.5:1, 5:1, and 10:1 monolignol to gelatin wt/wt basis) were added to the DHP,
again during the polymerization process. These DHP-Gel adducts allowed for investigation of
lignin interactions with small quantities of low molecular weight gelatin.
The DHP-Gel adducts were characterized using several techniques. Fourier-transform
infrared spectroscopy (FT-IR) was employed because it can show incorporation of protein into
lignin, although it is admittedly deficient at identifying lignin-protein covalent crosslinks (Diehl
et al., in review). Scanning electron microscopy (SEM) was used to determine the physical
morphology of the DHP-Gel adducts. Energy dispersive X-ray spectroscopy (EDS) and X-ray
photoelectron spectroscopy (XPS) were used to both confirm the incorporation of protein into
the lignin (via quantification of nitrogen) as well as to elucidate morphological details. Finally,
heteronuclear single quantum coherence (HSQC) nuclear magnetic resonance (NMR)
spectroscopy was used to investigate potential lignin-gelatin covalent crosslinks. Investigation of
the lignin-gelatin complexes reported here may lead to a better understanding of lignin-protein
interactions in native plant systems.
4.3. Experimental
4.3.1. Materials
Coniferyl alcohol was prepared from coniferaldehyde (Sigma Aldrich) as previously
described (Ludley and Ralph, 1996). Horseradish peroxidase (type I), hydrogen peroxide,
sodium phosphate and porcine and bovine gelatin were purchased from Sigma Aldrich. The
bovine gelatin (Sigma #G6650) had a bloom number of 75, with an estimated molecular weight
of 20 to 25 KDa. The porcine gelatin (Sigma #G2500) had an estimated molecular weight of 100
74
KDa. The peristaltic pump used in the DHP synthesis was a Cole-Parmer Masterflex, model
number 77120-52.
4.3.2. DHP and DHP-Gel preparations
Lignin DHP was synthesized according to a published method with a few modifications
(Terashima et al., 1995). Coniferyl alcohol (200 mg) was added to 200 ml warm sodium
phosphate (0.01 M, pH 6.5) buffer. Horseradish peroxidase (HRP) (4 mg) was added to this flask
after the buffer temperature dropped below 40° C. In a second flask, hydrogen peroxide was
added to 200 ml of buffer to a final concentration of 0.025%. A peristaltic pump was used to
combine the contents of the flasks into a single 500 ml flask that initially contained 2 ml of
buffer and 1 mg of HRP. Addition of reactants was performed at a rate of approximately 6
ml/min and the contents of the collection flask were allowed to stir for an additional 24 hours
upon completion of reactant addition.
DHP-Gel adducts were prepared as above, but porcine or bovine gelatin (quantities
shown in Table 4.2) were added to the flask containing coniferyl alcohol prior to the start of the
reaction. In the case of DHP-Gel5, gelatin was added following DHP polymerization (i.e., gelatin
was added approximately 24 hours after the complete addition of coniferyl alcohol and hydrogen
peroxide).
Neat DHP and DHP-Gel adducts were centrifuged at 10k g for 20 min at 4° C. The
supernatants were discarded and the samples were re-suspended in DI water and centrifuged
again. This was repeated for a total of 5 washings. DHP-Gel adducts were then dried under
vacuum at room temperature to obtain yields shown in Table 2. Solutions of neat gelatin and
gelatin that had been subjected to the oxidative conditions of DHP preparation were centrifuged
as described above and were not found to precipitate.
4.3.3. Fourier-transform infrared spectroscopy
Lignin DHP and DHP-Gel adducts were analyzed using a Bruker Vertex V70
Spectrometer (Bruker Optics Billerica MA) equipped with an MVP-Pro diamond single
reflection attenuated total reflectance (ATR) accessory (Harrick Scientific Pleasantville NY), and
100 scans at 6 cm-1 resolution were averaged for each sample using a DTGS detector and scan
frequency of 5 kHz. In all cases, the spectrum of the clean diamond crystal was used as the
reference spectrum. All spectral manipulations were performed using OPUS 6.0 (Bruker Optics,
Billerica MA).
4.3.4. X-ray photoelectron spectroscopy
The spectra were acquired with a Kratos Axis Ultra, using monochromatic Al-Kα X-rays.
Analysis chamber pressures were in the mid-10-8 torr range during measurements. Samples were
mounted on a 7mm x 7mm piece of Scotch Brand 3M double-sided tape (cat #137). The
75
materials covered the tape well enough to prevent exposure of the glue, and the tape was secured
to a piece of OFHC copper which was slightly larger than the tape. All spectra were acquired
with the analyzer set in hybrid mode, with the charge neutralizer on. The Pass Energy was set at
80 eV for surveys and 20 eV for high-resolution scans. Step sizes were 0.5 eV and 0.1 eV for
survey and high-res scans, respectively. The survey scan dwell time was set at 150 ms, while
values for high-resolutions scans varied from 600-2000 ms depending on the peak intensity.
4.3.5. Scanning electron microscopy and energy dispersive X-ray spectroscopy
Scanning electron microscopy (SEM) images were collected on a field emission SEM
(FESEM - FEI NanoSEM 630) at 2 or 3 kV under high vacuum (1.7 x 10-6 Torr). Samples were
sputter coated with iridium prior to imaging. Characteristic X-rays were collected with an X-Max
silicon drift detector (Oxford Instruments) inside the FESEM at 10 kV under low vacuum
conditions (0.6 Torr) in order to prevent sample charging. Samples were not sputter coated prior
to EDS analysis. Elements were selected and quantified using Aztec Energy Analyser Software
(Oxford Instruments).
4.3.6. Nuclear magnetic resonance spectroscopy
NMR spectra were acquired on a Bruker Biospin (Billerica, MA, USA) AVANCE 500
(500 MHz 1H resonance freq.) spectrometer fitted with a cryogenically-cooled gradient probe
having inverse geometry, i.e., with the proton coils closest to the sample. Spectra were processed
with Bruker’s Topspin 3.1 software, using the central solvent peak as internal reference [δH/δC:
dimethyl sulfoxide (DMSO), 2.50/39.5 ppm]. The DHP or DHP-Gel adducts (~50 mg) were
placed in an NMR tube (ID: 4.1 mm), swelled in DMSO-d6/pyridine-d5 (4:1 v/v, 500 ul), and
subjected to adiabatic 2D-HSQC (‘hsqcetgpsisp2.2’) experiments. Processing used typical
matched Gaussian apodization in F2 (LB = -0.3, GB = 0.001), and squared cosine-bell and one
level of linear prediction (32 coefficients) in F1 (Mansfield, 2012). For an estimation of the
various inter-unit linkage types in DHP and DHP-Gel adducts (Table 3; β-ether/α-OH, β-ether/α-
O-aryl, phenylcoumaran, pinoresinol, and dibenzodioxocin), the well resolved Cα-Hα contours
were integrated; no correction factors were used.
4.4. Results
4.4.1. Preparation of DHP-Gel adducts
DHP-Gel adducts were prepared by adding various quantities of porcine or bovine gelatin
to lignin DHP as it polymerized in the cases of DHP-Gel1, DHP-Gel2, DHP-Gel3, and DHP-Gel4,
or following lignin polymerization in the case of DHP-Gel5. Adduct yields following
centrifugation are shown in Table 4.2. The adducts were characterized using IR, EDS, XPS,
SEM, and NMR, as detailed below.
76
Table 4.2. Preparation and yields of DHP and DHP-Gel adducts.
sample CA (mg) PG (mg) BG (mg) yield (mg) yield (%)
DHP 200 0 0 124 62
DHP-Gel1 200 200 0 154 39
DHP-Gel2 200 0 80 139 50
DHP-Gel3 200 0 40 159 66
DHP-Gel4 200 0 20 138 63
DHP-Gel5 200 200 0 112 28
CA: coniferyl alcohol, PG: porcine gelatin, BG: bovine gelatin. Yields were calculated by
dividing the mass of recovered product by the total mass of reactants (i.e., CA + PG or BG). In
the case of DHP-Gel5, porcine gelatin was added following DHP polymerization.
4.4.2. Fourier-transform infrared spectroscopy of DHP-Gel adducts
Fig 4.1 shows FT-IR spectra of neat gelatin, neat DHP, and DHP-Gel adducts. The DHP
FT-IR spectrum exhibited bands typical of lignin DHPs (Faix, 1988). The DHP-Gel adducts
exhibited three peaks indicative of protein incorporation into the lignin. The peaks were located
at approximately 3200 cm-1, 1658 cm-1, and 1540 cm-1, and were previously observed in lignin-
protein adducts prepared by reacting DHPs with tripeptides (Diehl et al., in review). These peaks
were indicative of protein incorporation but were not specifically diagnostic toward covalent
versus non-covalent lignin-protein bonding. Qualitatively, FT-IR spectra showed that DHP-Gel1
apparently contained the most protein, with protein content decreasing through DHP-Gel2, DHP-
Gel3, and DHP-Gel4. Protein content of DHP-Gel5 (gelatin added following lignin
polymerization) appeared to lie between that of DHP-Gel3 and DHP-Gel4.
77
Fig 4.1. FT-IR of neat DHP and DHP-Gel adducts. Y-scaling is arbitrary.
4.4.3. Morphology and nitrogen content of DHP-Gel adducts
Fig 4.2 shows SEM images of neat DHP and DHP-Gel adducts. The neat DHP was
composed of smooth spheres of varying sizes, as previously observed (Micic et al., 2003). In the
cases of DHP-Gel1, DHP-Gel2, DHP-Gel3, and DHP-Gel4, the samples exhibited spherical
morphology, but the sizes and shapes of the spheres varied among samples and within samples.
DHP-Gel4 (lowest concentration of gelatin) exhibited the most variation in particle shape and
size—an affect that was reproducible. The particles within these DHP-Gel adducts exhibited
bumpy surfaces, which can be seen especially well in Fig. 2, DHP-Gel3, inset. Spherical particles
of DHP-Gel5 (gelatin added following polymerization) exhibited smooth surfaces, similar to neat
DHP.
78
Fig 4.2. SEM images of DHP-Gel1 (top left), DHP-Gel2 (top right), DHP-Gel3 (middle left),
DHP-Gel4 (middle right), DHP-Gel5 (bottom left) and neat DHP (bottom right). Bar: 2 µm
(DHP-Gel3 inset bar: 500 nm, DHP-Gel5 inset bar: 1 µm).
It was perceived that gelatin may not have been homogeneously dispersed throughout the
DHP-Gel particles (see Fig 4.3, models A and B). In order to test this hypothesis, elemental
analysis data was obtained using both XPS and EDS. XPS is a surface sensitive technique with
an approximate information depth of only 10 nm. In contrast, EDS has an information depth of
>1 µm when employing an accelerating voltage of 10 kV to a “light” substrate (i.e., an organic
substance such as the lignin-gelatin complex). Because the DHP-Gel particles were generally
several tens or hundreds of nanometers in diameter (Fig 4.2), XPS analyses were expected to
reveal nitrogen content near the surface (i.e., shell) of the particles, while EDS analyses were
expected to reveal nitrogen content throughout multiple particles.
Fig 4.3 shows the atomic nitrogen percentages of the DHP-Gel adducts as determined by
XPS and EDS (averages were determined by sampling three locations per DHP-Gel complex).
79
Two sample t-tests (using unequal variances and α = 0.05) showed that DHP-Gel1, DHP-Gel2,
DHP-Gel3, and DHP-Gel4 contained no significant differences in nitrogen as determined by XPS
and EDS (p-values: DHP-Gel1: 0.215, DHP-Gel2: 0.783, DHP-Gel3: 0.659, DHP-Gel4: 0.212),
suggesting a morphology similar to Fig 4.3, model A. In the case of DHP-Gel5 (gelatin added
following polymerization), the nitrogen content determined by XPS was significantly higher (p =
0.005) than the nitrogen content determined by EDS. This suggested that gelatin preferentially
bound to the lignin surface when added following lignin polymerization, suggesting morphology
similar to Fig 4.3, model B.
Fig 4.3. Morphology and nitrogen atomic percentages for DHP-Gel adducts as determined by
XPS and EDS (averages were determined by sampling three locations per DHP-Gel complex;
error bars are one positive and one negative standard deviation). For DHP-Gel morphological
models (A and B), black circles represent individual lignin modules (thin and thick lines
represent hydrophilic and hydrophobic domains, respectively), which may aggregate to form
macromolecules (Micic et al., 2004). Green lines represent gelatin. XPS and EDS showed that
model A best represented DHP-Gel1, 2, 3, and 4, while model B best represented DHP-Gel5.
Within the plant cell wall, lignification of a given region generally occurs following the
deposition of all other structural wall polymers, including structural proteins (Donaldson, 2001;
Boerjan et al., 2003). In addition, cell wall lignin (especially lignin from the cell corner region of
the middle lamella) adopts a similar morphology to the DHP and DHP-Gel adducts observed
here (Donaldson, 1994; Hafren et al., 2000; Terashima et al., 2004). This suggests that lignin
may surround and/or entangle cell wall proteins during lignification in planta, resulting in
80
morphology similar to Fig 4.3, model A. This physical entanglement, with the potential also for
lignin-protein covalent crosslink formation, may explain why protein is often found as a
contaminant in lignin extractions (Hatfield et al., 1994; Fukushima and Hatfield, 2001).
4.4.3. Nuclear magnetic resonance spectroscopy of DHP-Gel adducts
Fig 4.4 shows the HSQC NMR spectrum of DHP-Gel1, which was representative of the
DHP-Gel adducts. Shifts in orange were added during processing and are representative of
gelatin shifts. Gelatin shifts were visible when viewing the HSQC spectrum at a lower contour
level than shown in Fig 4.4. There are several reasons why the gelatin shifts were less intense
than the lignin shifts. First, the gelatin was the limiting reagent in most cases (except in the cases
of DHP-Gel1 and DHP-Gel5). Second, the gelatin was not completely incorporated into the
lignin. And third, the DHP-Gel adducts were not fully soluble in the NMR solvent system
(DMSO-d6/pyridine-d5). The gelatin shifts could be expected to be depressed in intensity
compared to the lignin shifts if the lignin component is more soluble than the gelatin component.
Diehl et al. (2014) previously identified diagnostic NMR shifts for lignin-protein
crosslinks. None of these shifts were observed in NMR spectra of DHP-Gel adducts, which
suggests that lignin and gelatin did not covalently crosslink. This may not be surprising, as Diehl
et al. (in review) also showed that cysteine and tyrosine were the only amino acid residues to
substantially crosslink with lignin under similar conditions of DHP preparation, and these
residues are almost entirely absent from gelatin (Table 1.1). The apparent lack of DHP-Gel
crosslinking suggested that non-covalent interactions were largely responsible for the observed
DHP-Gel interaction. In the case of DHP-Gel1 and DHP-Gel5, equal quantities of gelatin were
added to the reactions, with the only difference being whether the gelatin was added during
lignin polymerization or after. The DHP-Gel1 (gelatin added during lignin polymerization)
showed greater gelatin incorporation compared to DHP-Gel5 (gelatin added after lignin
polymerization) (Fig 4.1 and Fig 4.3). This increased gelatin incorporation was probably due to
physical entanglement of the gelatin within the lignin, as evidenced by XPS and EDS analyses
(Fig 4.3). Though DHP-Gel covalent crosslinks were not readily identified by NMR it may be
inappropriate to completely rule out the possibility of crosslink formation, albeit in minor
amounts. Isotopically labeled proteins and/or monolignols may be useful toward identifying trace
lignin-protein crosslinkages in further in vitro and in vivo experiments.
81
Fig 4.4. Side chain and aromatic (inset) regions of the HSQC NMR spectrum of DHP-Gel1.
Shifts in orange were added during processing. These shifts are indicative of gelatin and can be
observed when the HSQC spectrum is viewed at a lower contour level than shown here.
Table 4.3 shows estimates of the lignin inter-unit linkage ratios for the DHP and DHP-
Gel adducts as determined by volume integration of the well-resolved HSQC α-shifts. Neat DHP
contained linkage ratios typical of DHPs (Terashima et al., 1995; Terashima et al., 2009;
Tobimatsu et al., 2012). The variation in inter-unit linkage ratios exhibited no clear trend with
regards to the quantity of gelatin added, suggesting that gelatin has no significant effect on the
mechanism of lignin polymerization. The observed variation of inter-unit linkage ratios is most
likely attributable to the fact that DHP syntheses are inherently difficult to reproduce (Cathala et
al., 1998).
Table 4.3. Inter-unit linkage ratios of DHP and DHP-Gel adducts.
HSQC signal ratio (as % of total α-signal)
β-ether/α-
OH
β-ether/α-
aryl β-5 β-β Dibenz.
DHP 27.3 1.9 50.3 19.2 1.3
DHP-Gel1 23.5 0.5 53.4 18.1 4.6
DHP-Gel2 23.2 0.4 55.6 20.3 0.4
DHP-Gel3 13.5 0.7 58.7 25.4 1.7
DHP-Gel4 20.8 4.0 49.2 22.0 4.0
DHP-Gel5 32.8 1.3 45.3 16.4 4.1
82
4.5. Conclusions
DHP-Gel adducts were prepared under biomimetic conditions of lignin polymerization. A
variety of methods was used to characterize the DHP-Gel adducts. FT-IR showed incorporation
of gelatin into lignin DHP, but was unable to definitively show either the presence or absence of
lignin-gelatin covalent linkages. SEM showed that the DHP-Gel adducts generally consisted of
spherical particles ranging from tens to hundreds of nanometers in diameter, with morphological
details varying among samples. XPS and EDS were used in combination to show that gelatin was
relatively evenly dispersed throughout DHP-Gel particles when added during lignin
polymerization (Fig 4.3, model A), but aggregated mostly at the surface of DHP-Gel particles
when added following lignin polymerization (Fig 4.3, model B). It was interesting to note that
covalent crosslinking was not observed by HSQC NMR. This may not be surprising, as Diehl et
al. (in review) previously showed that cysteine and tyrosine were the only amino acid residues to
substantially crosslink with lignin under similar conditions of DHP preparation, and gelat in is
almost entirely lacking in these residues. The observation that gelatin adsorbed to the lignin
surface when added to lignin post-polymerization provided further evidence that covalent
crosslinking was not necessary to account for the lignin-gelatin interaction.
The lignin-gelatin interaction appears to be essentially non-covalent, and gelatin peptide
was dispersed throughout the DHP-Gel particles when it was added over the course of lignin
polymerization. In planta, lignification occurs in a pre-deposited polysaccharide and protein
matrix. Based on the results of this study it seems plausible that cell wall structural proteins may
become surrounded or physically entangled by lignin without necessarily forming covalent
crosslinks. It may be possible for structural proteins to serve as initiation sites of lignification
without lignin-protein covalent bond formation. Based on previous studies, covalent crosslinking
may also occur if cysteine and tyrosine residues are present (Diehl et al., in review).
Alternatively, lignin-protein crosslinking may occur at saccharide residues that are found on
plant cell wall structural proteins (particularly extensins and arabinogalactan proteins) but not on
gelatin (Lamport et al., 2011; Toikka et al., 1998; Wilson and Fry, 1986); the study reported here
did not test such a hypothesis. The intimate physical entanglement of the lignin-protein complex
may make covalent crosslinking favorable by bringing reactive lignin and protein species into
close proximity. The formation of such complexes may not only serve as lignin nucleation sites,
but may also help to rigidify and/or waterproof the cell wall. Further research is needed to
determine the role of covalent and non-covalent lignin-protein complexes in plant cell walls.
4.6. Acknowledgements
This material is based upon work supported as part of The Center for LignoCellulose
Structure and Formation, an Energy Frontier Research Center funded by the U.S. Department of
Energy, Office of Science, under Award Number DE-SC0001090. Student fellowships were
provided by the USDA National Needs Program and the National Science Foundation via the
CarbonEARTH program. Many thanks to Julie Anderson, Melisa Yashinski, and Vince Bojan
83
(Penn State Materials Research Institute) for acquisition of SEM, EDS and XPS data and for
valuable discussions, Jenna Ferraraccio for assistance with figure preparation, and John Ralph,
Yuki Tobimatsu, and Matt Regner for acquisition of NMR spectra and valuable discussions
regarding multidimensional NMR of lignin.
4.7. References
Albersheim, P.; Darvill, A.; Roberts, K.; Sederoff, R.; Staehelin, A. Principles of Cell Wall
Architecture and Assembly. In Plant Cell Walls; Garland Science, 2010; pp 227-27
Boerjan, W.; Ralph, J.; Baucher, M. Lignin Biosynthesis. Annu. Re. Plant Biol. 2003, 54, 519-
546.
Cathala, B.; Saake, B.; Faix, O.; Monties, B. 1998. Evaluation of the reproducibility of the
synthesis of dehydrogenation polymer models of lignin. Polymer Degradation and Stability
59:65-69.
Chapple, C.; Ladisch, M.; Meilan, R. Loosening lignin's grip on biofuel production. Nat.
Biotechnol. 2007, 25, 746-748.
Chen, Y. and S. Sarkanen. 2006. Cellulose Chemistry and Technology, 40(3-4): 149-163. "From
the macromolecular behavior of lignin components to the mechanical properties of lignin-based
plastics."
Chen, F.; Dixon, R. A. Genetic manipulation of lignin biosynthesis to improve biomass
characteristics for agro-industrial processes. In Vitro Cell. Dev. Biol. - Animal. 2008, 44, S28-
S29.
Clark, J. H., Deswarte, F. E. I. and T. J. Farmer. 2009. Biofuels, bioproducts and biorefining, 3:
72-90. "The integration of green chemistry into future biorefineries."
Donaldson, L.A. 1994. Mechanical constraints on lignin deposition during lignification. Wood
Sci. and Technol. 27:111-118.
Donaldson, L.A. 2001. Lignification and lignin topochemistry – an ultrastructural view.
Phytochemistry 57: 859-873.
Dorrestijn, E., Laarhoven, L. J. J., Arends, I. W. C. E. and P. Mulder. 2000. Journal of analytical
and applied pyrolysis, 54: 153-192. "The occurrence and reactivity of phenoxyl linkages in
lignin and low rank coal."
Eastoe, J.E. 1955. The amino acid composition of mammalian collagen and gelatin. Biochemical
Journal 61(4):589-600.
Faix O, Beinhoff O. 1988. Journal of Wood Chemistry and Technology, 8 (4): 505-522. "FTIR
spectra of milled wood lignins and lignin polymer models with enhanced resolution obtained by
deconvolution."
84
Fukushima, R.S.; Hatfield, R.D. 2001. Extraction and isolation of lignin for utilization as a
standard to determine lignin concentration using the acetyl bromide spectrophotometric method.
Journal of Agricultural and Food Chemistry 49(7):3133-3139.
Gellerstedt, G., Sjoholm, E. and I. Brodin. 2010. The Open Agriculture Journal, 3: 119-124.
"The wood-based biorefinery: A source of carbon fiber?"
Hafren, J., Funino, T., Itoh, T., Westermark, U., Terashima, N. 2000. Ultrastructural changes in
the compound middle lamella of Pinus thunbergii during lignification and lignin removal.
Holzforschung 54:234-240.
Hatfield, R.D.; Jung, H.G.; Ralph, J.; Buxton, D.R.; Weimer, P.J. 1994. A comparison of the
insoluble residues produced by the Klason lignin and acid detergent lignin procedures. J. Sci.
Food Agric. 65:51-58.
Jung, H. G.; Allen, M. S. Characteristics of plant cell walls affecting intake and digestibility of
forages by ruminants. J. Animal Sci. 1995, 73, 2774-2790.
Jung, H.G. Forage lignins and their effects on fiber digestibility. Agron. J. 1989, 81, 33-38.
Lamport, D.T.A.; Kieliszewski, M.J.; Chen, Y.; Cannon, M.C. Plant Phys 2011, 156, 11-19.
Li, X.; Weng, J. K.; Chapple, C. Improvement of biomass through lignin modification. Plant J.
2008, 54, 569-581.
Ludley, F. H. and J. Ralph. 1996. Journal of Agricultural and Food Chemistry, 44: 2942-2943.
"Improved preparation of coniferyl and sinapyl alcohols."
Mansfield, S. D.; Kim, H.; Lu, F.; Ralph, J. Whole plant cell wall characterization using
solution-state 2D NMR. Nature Protocols. 2012, 7(9), 1579-1589.
Micic, M.; Radotic, K.; Jeremic, M.; Leblanc, R. M. Study of the self-assembly of the lignin
model compound on cellulose model substrate. Macromol. Biosci. 2003, 3 (2), 100-106.
Micic, M.; Radotic, K.; Jeremic, M.; Djikanovic, D.; Kammer, S.B. 2004. Study of the lignin
model compound supramolecular structure by combination of near-field scanning optical
microscopy and atomic force microscopy. Colloids and Surfaces B: Biointerfaces 34:33-40.
Stewart, J. J.; Kadla, J. F.; Mansfield, S. D. The influence of lignin chemistry and ultrastructure
on the pulping efficiency of clonal aspen (Populus termuloides Michx). Holzforschung 2006, 60,
111-122.
Terashima, N.; Akiyama, T.; Ralph, S.; Evtuguin, D.; Neto, C.P.; Parkas, J.; Paulsson, M.;
Westermark, U.; Ralph, J. 2D-NMR (HSQC) difference spectra between specifically 13C-
enriched and unenriched protolignin of Ginkgo biloba obtained in the solution state of whole cell
wall material. Holzforschung 2009, 63, 379-384.
85
Terashima, N., Atalla, R.H., Ralph, S.A., Landucci L.L., Lapierre, C., Monties, B. 1995.
Holzforschung, 49: 521-527. "New preparations of lignin polymer models under conditions that
approximate cell wall lignification."
Terashima, N., Awano, T., Takabe, K., Yoshida, M. 2004. Formation of macromolecular lignin
in ginkgo xylem cell walls as observed by field emission scanning electron microscopy. C. R.
Biologies 327: 903-910.
Tobimatsu, Y.; Elumalai, S.; Grabber, J. H.; Davidson, C. L.; Pan, X.; Ralph, J.
Hydroxycinnamate conjugates as potential monolignol replacements: In vitro lignification and
cell wall studies with rosmarinic acid. ChemSusChem. 2012, 5 (4), 676-686.
Toikka, M.; Sipila, J.; Teleman, A.; Brunow, Gosta. 1998. Lignin-carbohydrate model
compounds. Formation of lignin-methyl arabinoside and lignin-methyl galactoside benzyl ethers
via quinone methide intermediates. J. Chem. Soc., Perkin Trans. 1 3813-3818.
Whitmore, F.W. 1978a. Lignin-carbohydrate complex formed in isolated cell walls of callus.
Phytochemistry 17:421-425.
Whitmore, F.W. 1978b. Lignin-protein complex catalyzed by peroxidase. Plant Science Letters
13:241-245.
Whitmore, F.W. 1982. Lignin-protein complex in cell walls of Pinus elliottii: Amino acid
constituents. Phytochemistry 21(2):315-318.
Wilson, L.G.; Fry, J.C. 1986. Extensin—A major cell wall glycloprotein. Plant, Cell and
Environment 9:239-260.
86
Chapter 5
Searching for lignin-protein linkages in Arabidopsis
5.1. Abstract
In order to explore lignin-protein linkages in planta, Arabidopsis thaliana was grown to maturity
and lignin was extracted from the inflorescence stems. Elemental analysis was used to estimate
the protein content of the crude Arabidopsis, the Arabidopsis following solvents extraction, and
the Arabidopsis lignin extracts. Nuclear magnetic resonance techniques were then used to search
for putative lignin-protein covalent linkages. No apparent linkages were identified, but further
work is needed in other wild type and mutant plant species, and isotopically labeled monolignols
may be useful toward investigating lignin-protein linkages in both wild type and mutant plants.
5.2. Introduction
The previous chapters have shown that lignin crosslinks with several amino acids under
ideal conditions (i.e., in neutral, non-polar solvents), and that cysteine and tyrosine crosslink with
lignin under biomimetic conditions of lignin polymerization. Finally, in an attempt to identify
lignin-protein linkages formed under natural conditions of lignin biosynthesis, Arabidopsis
thaliana (wild-type Columbia-0) plants were grown to maturity (8 weeks), then lignin was
extracted from the inflorescence stems and characterized.
Arabidopsis lignin is composed mainly of guaiacyl and syringyl type lignin moieties. The
lignin content of mature Arabidopsis inflorescence stems has been estimated at around 14%
using the Klason method (Chang et al., 2008), and 14-16% using the acetyl bromide method
(Yong Bum Park, unpublished data). The quantity of structural protein in mature Arabidopsis
cell wall, in terms of dry weight percentage, is unclear. However, Chang et al. (2008) previously
showed that extracted Arabidopsis lignin (dioxane and Klason lignin) contained about 3.7%
protein contamination. In order to estimate the quantity of proteins in Arabidopsis, nitrogen
content was determined for mature Arabidopsis inflorescence stems at three levels of sample
preparation: crude ball-milled Arabidopsis, Arabidopsis that was ball-milled and solvents
extracted to yield cell wall material, and extracted Arabidopsis lignin. Nitrogen content can then
be used to determine protein percentage by multiplying by 6.25 (assuming all nitrogen is due to
protein) (Chang et al., 2008; Fukushima and Hatfield, 2001), and in this way the protein content
can be monitored throughout the various steps of Arabidopsis lignin extraction.
Lignin was extracted from Arabidopsis following a previously described acidic dioxane
(ADL) method (Fukushima and Hatfield, 2004). It has been postulated that this extraction
method selectively cleaves α-ether linkages, which should raise concerns regarding the cleavage
of putative lignin-protein linkages, as well. However, this method was deemed useful for several
reasons. First, it was not possible to extract lignin using the typical milled wood lignin procedure
of refluxing the sample in 96:4 dioxane/water. This method has been employed for decades;
87
however, during preliminary investigations with Arabidopsis, only ~2 mg of lignin was extracted
per 1 g of Arabidopsis cell wall material, which is extremely inefficient and yields far too little
lignin for effective characterization. Furthermore, lignin-protein linkages are expected to be low
in quantity in wild type plants, so observing the putative linkages in cellulolytic enzyme lignins
or whole cell walls seems unlikely due to very low signal to noise.
Following lignin extraction and protein content estimation, the Arabidopsis lignin was
characterized using 2D nuclear magnetic resonance (NMR) spectroscopy. The two most
important techniques were heteronuclear single quantum coherence (HSQC) and heteronuclear
multiple bond correlation (HMBC) experiments, which were previously shown to be quite useful
toward elucidating lignin-protein linkages.
5.3. Experimental
5.3.1. Growth and lignin extraction from Arabidopsis
Arabidopsis samples were prepared as follows. After 4 days of cold treatment at 4°C,
Arabidopsis thaliana wild-type (Colombia (Col-0) ecotype) seedlings were grown on 1× MS
medium (Murashige and Skoog, 1962) containing 1% sucrose for 1 week, and 500–600 seedlings
were transferred onto soil and grown for 7–8 more weeks under 70 µmol m-2s-1 light intensity
(day/night: 16/8 h, temperature: 22/16°C). The matured Arabidopsis inflorescence stems were
collected and frozen at -80° C. Prior to lignin extraction, Arabidopsis was prepared by grinding
in a blender, followed by freeze-drying. Samples (typically 2-3 g) were then Soxhlet extracted
with water, ethanol, chloroform, and acetone, for eight hours per solvent. Solvents extracted
Arabidopsis was then Wiley milled to pass a 60 mesh screen, then ball milled in a Retsch
cryomill (1.5 g Wiley milled Arabidopsis for 2 hr at 10 Hz). Lignin was extracted following the
previously described acidic dioxane lignin (ADL) method (Fukushima and Hatfield, 2004).
Briefly, 1 g of solvents extracted Arabidopsis was refluxed for 45 min in 20 ml of 90:10
dioxane/2 M HCl (aq). The solubilized lignin was then filtered through a WhatmanTM glass
microfiber filter, and the cell wall residue was rinsed with 96:4 dioxane/water. The dioxane
filtrates were combined and neutralized with sodium bicarbonate, filtered through a 0.45 µm
nylon membrane filter, then concentrated under reduced pressure. The concentrated lignin
solution (~1-2 ml) was added dropwise to ~40 ml DI water in a centrifuge tube, then centrifuged
at 10k g at 5° C for 20 min. The aqueous supernatant was poured off and saved, then the lignin
pellet was resolubilized using a minimum volume of dioxane. Ether was added to the centrifuge
tube and the sample was centrifuged at 10k g at 5° C for 15 min, and this solubilization followed
by ether washing and centrifugation was repeated for a total of two cycles. The ether
supernatants were then discarded and the lignin was freeze dried and stored at 4° C for future
characterization. Typically, lignin yields were 30-35 mg per 1 g of solvents extracted
Arabidopsis cell wall material.
88
5.3.2. Elemental analysis of Arabidopsis lignin
Nitrogen weight percentages were determined using a CE Instruments EA 1110 CHNS-O
elemental analyzer. Approximately 3 mg of sample were massed to the nearest ten-thousandth of
a milligram and analyzed according to the manufacturer’s instruction. Protein content was
determined by multiplying nitrogen percentage by 6.25, as described previously (Chang et al.,
2008; Fukushima and Hatfield, 2001).
5.3.3. Nuclear magnetic resonance spectroscopy of Arabidopsis lignin
NMR spectra were collected in DMSO-d6/pyridine-d5 (4:1 v/v, 500 ul). DMSO-
d6/pyridine-d5 was chosen because it is a preferred solvent for NMR of lignin DHP, milled
wood lignin (MWL), and whole cell walls (Kim and Ralph, 2010). NMR spectra were acquired
on a Bruker Biospin (Billerica, MA, USA) AVANCE 500 (500 MHz 1H resonance freq.)
spectrometer fitted with a cryogenically-cooled gradient probe having inverse geometry, i.e.,
with the proton coils closest to the sample. Spectra were processed with Bruker’s Topspin 3.1
software, using the central solvent peak as internal reference (δH/δC: dimethyl sulfoxide
(DMSO), 2.50/39.5 ppm). Lignin (~20 mg) was placed in an NMR tube (ID: 4.1 mm), swollen
homogeneously in DMSO-d6/pyridine-d5 (4:1 v/v, 500 ul), and then subjected to adiabatic 2D-
HSQC (‘hsqcetgpsisp2.2’) experiments using the parameters described by Mansfield et al.
(2012). Processing used typical matched Gaussian apodization in F2 (LB = -0.3, GB = 0.001),
and squared cosine-bell and one level of linear prediction (32 coefficients) in F1 (Mansfield et
al., 2012).
5.4. Results and discussion
5.4.1. Lignin extractions from Arabidopsis
Arabidopsis was grown to maturity, then inflorescence stems were harvested and
prepared for lignin extraction by solvents extracting and cryomill grinding. The grinding time of
approximately 2 hr per 1.5 g of Arabidopsis resulted in highly variable particle sizes (Fig 5.1).
Longer grinding times may be necessary, although increased sample alteration following
increased grinding times is always cause for concern. The grinding times employed here allowed
for the extraction of ~30-35 mg of acidic dioxane lignin (ADL) per g of Arabidopsis cell wall
material. This was similar to previously reported ADL yields for grassy plants such as alfalfa and
red clover (Fukushima and Hatfield, 2001, 2004). This material was then subjected to protein
content estimation and nuclear magnetic resonance spectroscopy, as described below.
89
Fig 5.1. Optical microscopy of solvents extracted and ball milled Arabidopsis cell wall material.
Particle size distribution is highly variable. Scale bar = 50 µm.
5.4.2. Protein content of Arabidopsis extracts
Protein contents of crude Arabidopsis, solvents extracted Arabidopsis, and Arabidopsis
lignin were estimated by multiplying the measured nitrogen atomic percentages by a factor of
6.25, as previously described (Chang et al., 2008; Fukushima and Hatfield, 2001). It was found
that crude Arabidopsis inflorescence stems contained approximately 5.31% protein, Arabidopsis
that had been solvents extracted contained 4.94% protein, and Arabidopsis ADL contained
approximately 3.75% protein. This was very similar to the protein content determined by Chang
et al. (2008) for dioxane and Klason Arabidopsis lignins, but was considerably higher than that
of Loblolly ADL, which was typically <1%. This may be expected due to the prominent
secondary cell walls in Loblolly and other plants with a strong tree habit, and the general lack of
protein in secondary cell walls. Because of the increased protein content in Arabidopsis (and
presumably grasses and other non-grasses exhibiting a grass-like growth habit), wild type and
mutant Arabidopsis lines may be useful for future investigations of lignin-protein linkages.
Table 5.1. Nitrogen content and estimated protein content of Arabidopsis extracts.
N % Protein %
Crude Arabidopsis 0.85 5.31
Solvents extracted Arabidopsis 0.79 4.94
Arabidopsis acidic dioxane lignin 0.60 3.75
90
5.4.3. Nuclear magnetic resonance spectroscopy of Arabidopsis lignin
Acidic dioxane lignin isolated from Arabidopsis inflorescence stems was analyzed using
HSQC and HMBC NMR techniques in DMSO-d6/pyridine-d5. Table 5.2 shows estimates of the
lignin inter-unit linkage ratios as determined by volume integration of the well-resolved HSQC
α-shifts, and the ratios were typical of dicotyledonous G/S lignins (Capanema et al, 2004). An
HSQC spectrum of Arabidopsis ADL is shown in Fig 5.2; however, the peaks discussed in
further detail below were generally too weak to be observed at the contour levels shown.
Table 5.2. Inter-unit linkage ratios of Arabidopsis ADL.
HSQC signal ratio (as % of total α-signal)
β-ether/α-OH β-ether/α-aryl β-5 β-β Dibenz.
70.6 tr 17.7 9.7 2.1
Estimates of the lignin inter-unit linkage ratios as determined by volume integration of the well-
resolved HSQC α-shifts are shown. tr = trace.
A very weak shift was observed in the HSQC spectrum at 4.4/50.8 ppm. This is close to
the observed α-shift of a lignin-cysteine linkage (4.4/50.6) identified in Chapter 2. However, the
corresponding HMBC spectrum did not identify this shift as an α-shift, and thus the likelihood of
this shift being attributable to a lignin-cysteine linkage seems low. This demonstrates the
importance of using multiple NMR techniques when investigating putative lignin-protein
crosslinks. No other putative lignin-protein shifts were identified in the HSQC and HMBC
spectra. Furthermore, essentially no lignin α-ester and only very minor quantities of α-ether
linkages were observed in the HSQC spectrum. This suggests that even lignin-hemicellulose
linkages were essentially absent from the Arabidopsis ADL. This may have been due to the
harshness of the extraction procedure, and less harsh lignin extraction techniques may be
necessary in order to optimize the chances of lignin-protein linkage identification.
91
Fig 5.2. HSQC NMR of Arabidopsis acidic dioxane lignin. Typical lignin shifts and residual
dioxane and pyridine solvent shifts are labeled. Cinnamyl end groups and dibenzodioxocin
structures were not abundant enough to be seen at the contour levels shown. Lignin-protein
linkages were not apparent even at low contour levels.
5.5. Conclusions
Wild type Arabidopsis was grown to maturity and inflorescence stems were harvested
and lignin extracted using the acidic dioxane (ADL) method. It was found that the ADL
contained a significant protein content (3.75%), so the lignin was characterized using HSQC and
HMBC NMR techniques. No lignin-protein linkages were identified. Furthermore, the lignin
appeared to be essentially free of lignin-carbohydrate linkages. The acidic dioxane extraction
method may have cleaved such linkages if they were in fact present in the wild type Arabidopsis,
and it may be beneficial to use a milder lignin extraction procedure. Alternatively, lignin-protein
linkages may be very low in abundance (or non-existent), and thus below the NMR signal to
92
noise limit. Exploration of lignin-protein linkages in mutant plant lines and/or using isotopically
labeled monolignols may help probe this question.
5.6. Acknowledgements
This material is based upon work supported as part of The Center for LignoCellulose
Structure and Formation, an Energy Frontier Research Center funded by the U.S. Department of
Energy, Office of Science, Office of Basic Energy Sciences under Award Number DE-
SC0001090. Student fellowships were provided by the USDA National Needs Program and the
National Science Foundation via the CarbonEARTH program. Many thanks to Ephraim Govere
for assistance in acquiring elemental analysis data.
5.7. References
Capanema, E.A.; Balakshin, M. Y.; Kadla, J.F. J. Agric. Food Chem. 2004, 52, 1850-1860.
Chang, X.F.; Chandra, R.; Berleth, T.; Beatson, R.P. J. Agric. Food Chem. 2008, 56, 6825-6834.
Fukushima, R.S.; Hatfield, R.D. J. Agric. Food Chem. 2001, 49, 3133-3139.
Fukushima, R.S.; Hatfield, R.D. J. Agric. Food Chem. 2004, 52, 3713-3720.
Kim, H.; Ralph, J. Org. Biomol. Chem. 2010, 8, 576-591.
Mansfield, S.D.; Kim, H.; Lu, F.; Ralph, J. Nat. Protoc. 2012, 7(9), 1579-1589.
Murashige, T.; Skoog, F. Physiol. Plant 1962, 15, 473-497.
93
Chapter 6
Conclusions
6.1. Research summary
Lignin is a heterogeneous, aromatic polymer that is biosynthesized in the cell walls of
almost all lands plants. It is economically relevant to several industries, especially pulp and
paper, agricultural, and biofuels industries, where lignin’s natural recalcitrance to extraction and
degradation pose problems. Additionally, lignin could potentially be used as a feedstock for
many renewable products, including plastics, activated carbons, carbon fibers, solid and liquid
fuels, and other specialty chemicals, but many of these lignin utilization schemes are beyond the
reach of currently available technology. To this end, it is important that we continue to focus on
both fundamental and applied advances in lignin and its related fields.
The plant cell wall is a complex matrix composed of structural polymers such as
cellulose, hemicelluloses, pectins, lignin, and proteins, as well as lower molecular weight organic
compounds. Lignin polymerization occurs within the pre-deposited polysaccharide and protein
matrix, causing lignin to interact with its local chemical environment. This results not only in
non-covalent interactions, but also in covalent crosslinking with hemicelluloses, and perhaps
other matrix components, such as proteins. It has been hypothesized that lignin-protein linkages
may be important in both wild type and mutant plant lines, yet lignin-protein linkages have not
been previously described. This work helps fill that gap in the literature by describing the
preparation and characterization of lignin-protein linkages.
In the first study, a low molecular weight quinone methide, representative of native lignin
quinone methide structures, was reacted with single amino acids. It was found that under these
ideal conditions, a diverse array of amino acids, including those bearing thiol, amine, carboxylic
acid, and hydroxyl functional groups on their side chains, reacted with lignin quinone methides
to form lignin-protein model compounds. Characterization of these compounds with NMR
resulted in the identification of diagnostic lignin-protein crosslink signatures, which were helpful
in identifying lignin-protein linkages in more complex model systems, and should also be helpful
toward identifying lignin-protein linkages in native lignins.
The second study expanded upon the first by exploring the reactivities of amino acids
toward lignin quinone methide intermediates under biomimetic conditions. Specifically, the
amino acids were incorporated into short peptide chains and exposed to lignin dehydrogenation
polymer as it polymerized in aqueous media. Using NMR, it was possible to show that cysteine
and tyrosine-containing peptides were covalently incorporated into the synthetic lignin polymer,
while other amino acids were almost entirely inert under such biomimetic conditions. This
suggests that cysteine and/or tyrosine-rich proteins may be the most likely to covalently crosslink
with lignin in planta. In this study it was also shown that Fourier-transform infrared
94
spectroscopy and elemental analysis (specifically energy dispersive X-ray spectroscopy in this
case) were useful toward showing lignin-protein interactions.
A third study investigated the interactions between lignin DHP and gelatin protein.
Gelatin was chosen because it shares similar characteristics with plant cell wall structural
proteins, but is much more readily available. In addition, gelatin lacks cysteine and tyrosine,
which were the only two amino acids shown in this work to covalently crosslink with lignin in
aqueous media. Thus, the observed interaction between lignin and gelatin was considered worthy
of investigation. SEM, EDS, and FT-IR showed that gelatin was incorporated into lignin DHP,
but a lack of characteristic lignin-protein NMR signatures suggested that the interaction was
largely (or entirely) non-covalent. This indicates that lignin-protein non-covalent interactions
may be important in planta, and further work is necessary.
Finally, in a fourth study, an attempt was made to identify lignin-protein linkages in the
wild type dicot plant, Arabidopsis. Unfortunately, the cell wall protein content was shown to be
very low in this wild type plant, and NMR was unable to reveal lignin-protein linkages in acidic
dioxane lignin extracts. This does not necessarily rule out the formation of these linkages, but
may merely suggest that linkage abundance is below the detection limit of NMR. Furthermore,
this study did not investigate the abundance or importance of linkages in mutant plant lines, and
additional studies are needed.
Though the work described here has allowed for more efficient and reliable
characterization of lignin-protein linkages in model systems, more work is necessary, especially
regarding native plant systems. The final section of this document will outline some ways in
which future investigations could be carried out.
6.2. Future endeavors
To further investigate lignin-protein linkages in more realistic model systems, it may be
useful to lignify model cell walls, or cell walls that have been isolated from native plants, then
characterize these walls using NMR and other techniques. Cybulska et al. (2010) and
Dammstrom et al. (2005) previously described the preparation of model cell walls using either
pure cellulose, or cellulose with the incorporation of hemicelluloses and/or pectins. In addition,
Uraki et al. (2011) demonstrated the preparation of cellulose-based, honey-comb shaped model
cell walls (Fig 6.1).
95
Fig 6.1. Cell wall models. Left: cellulose, pectin, and xyloglucan model (Dammstrom et al.,
2005). Right: honey-comb cellulose model (Uraki et al., 2011). Incorporation of proteins into
these models, followed by lignin DHP polymerization, may allow for investigations into lignin-
protein crosslinking.
I propose the preparation of similar cell walls, but with the added inclusion of proteins of
interest. Peroxidases could be incorporated into the cell wall models during assembly, or flowed
over the models after assembly. Lignin monomers and hydrogen peroxide would then be flowed
over the cell wall models, causing lignin polymerization to take place. The entire cell wall model
would then be characterized without disturbing the micro and macro structures that formed, as
this might provide the most insight into the lignin polymerization mechanisms. Techniques such
as NMR could be used to explore putative lignin-protein covalent linkages, while other
techniques could be used to explore the topology and distribution of the lignin DHP. The
exploration of potential lignin nucleation sites would be particularly interesting (for example,
lignin may nucleate at sites rich in hemicelluloses, pectins/pectates, peroxidases, structural
proteins, or show no particular nucleation pattern at all). Visualization techniques such as SEM
and AFM could be useful in probing this question. Also, the use of fluorescent monolignols (and
perhaps additional labeling of other components) could be beneficial (Tobimatsu et al., 2011).
Monitoring lignin distribution and topology at various time points of polymerization might also
be helpful towards understanding the lignin polymerization mechanism.
In addition to more advanced model compound studies, studies using mutant plant lines
are also warranted. While lignin-protein linkages may exist in wild type plant lines, and studies
involving wild type plants should not be dismissed, mutant plant lines may show the most
promise toward identifying and characterizing lignin-protein linkages. These mutant plant lines
should be engineered to secrete cysteine and/or tyrosine-rich peptides into the plant cell walls, as
these amino acids have been shown to be most reactive toward lignin in model studies. A mutant
plant line meeting these characteristics has already been prepared, although lignin-protein
covalent linkages have yet to be identified at the time of document preparation (Liang et al.,
2008; Xu et al., 2013). It is hoped that lignin-protein linkages will be further explored in mutant
96
plant lines, as practical implications toward lignin extractability have been demonstrated in pilot
studies.
It seems appropriate to briefly address a method that could be useful toward enhancing
the identification of putative lignin-protein linkages via NMR. It has previously been shown that
NMR is a powerful technique for characterizing plant cell walls and their constituent
components, and this work has shown that the technique can also be applied to the identification
of lignin-protein linkages. However, when exploring lignin-protein linkages in complex chemical
systems, such as those that exist in planta, achieving adequate signal to noise is expected to
become a problem. This is because the ratio of lignin-protein linkages to all other chemical
structures in the sample is expected to be exceedingly low. Isotopic labeling experiments may
help address this problem. Previously, α-13C coniferyl alcohol glucoside (coniferin) was prepared
and fed to live ginkgo (Xie and Terashima, 1991), then the lignin was characterized via NMR
and the α-shifts were shown to exhibit increased signal to noise. A possible route to α-13C
coniferyl alcohol is shown in Fig 6.2 (slightly modified compared to Xie and Terashima’s
original route to α-13C coniferin). This labeled coniferyl alcohol could be useful for identifying
lignin-protein linkages in complex in vitro or in vivo systems, because NMR α-shifts of lignin-
protein linkages (as well as standard lignin α-shifts) would exhibit greater signal to noise
compared to spectra of unlabeled lignins (Fig 6.3).
Fig 6.2. Proposed synthetic route to α-13C coniferyl alcohol.
97
Fig 6.3. Standard lignin α-shifts (red) and α-shifts of lignin-protein linkages (red squares). These
shifts would exhibit greater signal to noise compared to standard lignin shifts (black) in spectra
of α-13C-enriched lignin, perhaps allowing for easier identification of putative lignin-protein
linkages.
Efficient extraction and utilization of lignin is of great economic importance. In spite of
this, there is still much about the lignin polymer that remains unknown. This work has
illuminated fundamental aspects of lignin-protein linkage formation in an attempt to better
understand how lignin interacts with the protein component of plant cell walls. It is hoped that
this research will translate into a practical means of reducing lignin’s recalcitrance toward
degradation and extraction.
6.3. References
Cybulska, J.; Vanstreels, E.; Tri Ho, Q.; Courtin, C.M.; Craeyveld, V.V.; Nicolai, B.; Zdunek,
A.; Konstankiewicz, K. Mechanical characteristics of artificial cell walls. J. of Food Eng. 2010,
96, 287-294.
Dammstrom, S.; Salmen, L.; Gatenholm, P. The effect of moisture on the dynamical mechanical
properties of bacterial cellulose/glucuronoxylan nanocomposites. Polymer 2005, 46, 10364-
10371.
Liang, H.; Frost, C.J.; Wei, X.; Brown, N.R.; Carlson, J.E.; Tien, M. Improved sugar release
from lignocellulosic material by inducing a tyrosine-rich cell wall peptide gene in poplar. Clean
2008, 36(8), 662-668.
98
Tobimatsu, Y.; Davidson, C.L.; Grabber, J.H.; Ralph, J. Fluorescence-tagged monolignols:
Synthesis, and application to studying in vitro lignification. Biomac. 2011, 12, 1752-1761.
Uraki, Y.; Tamai, Y,; Hirai, T.; Koda, K.; Yabu, H.; Shimomura, M. Fabrication of honeycomb-
patterned cellulose material that mimics wood cell wall formation processes. Mat. Sci. and Eng.
C 2011, 31, 1201-1208.
Xie, Y.; Terashima, N. Selective carbon 13-enrichment of side chain carbons of ginkgo lignin
traced by carbon 13 nuclear magnetic resonance. Mokuzai Gakkaishi 1991, 37(10), 935-941.
Xu, Y.; Chen, C.; Thomas, T.P.; Azadi, P.; Diehl, B.; Tsai, C.; Brown, N.; Carlson, J.E.; Tien,
M.; Liang, H. Wood chemistry analysis and expression profiling of a poplar clone expressing a
tyrosine-rich peptide. Plant Cell Rep. 2013, 32, 1827-1841.
Vita
Brett Galen Diehl
Education
May, 2014 Ph.D. Biorenewable Systems
Department of Agricultural and Biological Engineering
The Pennsylvania State University, University Park, PA
May, 2009 B.S. Wood Products, Processing and Manufacturing Option
School of Forest Resources
The Pennsylvania State University, University Park, PA
Research
2009-2014 Graduate Research, Penn State, University Park, PA
• Illuminated fundamental aspects of lignin-protein linkages, which may influence
the physical and chemical properties of plant cell walls impacting industries such
as agriculture, pulp and paper, and biofuels.
2008 NSF Research Experience for Undergraduates, Penn State, University Park, PA
• Researched cellulose-producing Acetobacter xylinum and methods of
generating biofuels from cellulosic materials.
2007 Paid Internship, Armstrong World Industries, Lancaster, PA
• Studied exotic hardwood species using scanning electron microscopy.
Publications
B.G. Diehl, H.D. Watts, J.D. Kubicki, M.R. Regner, J. Ralph, N.R. Brown. Towards lignin-
protein crosslinking: Nucleophilic amino acid adducts of a lignin model quinone methide.
Cellulose, accepted January, 2014, not yet published.
B.G. Diehl, N.R. Brown, C.W. Frantz, M.R. Lumadue, F. Cannon. Effects of pyrolysis
temperature on the chemical composition of refined softwood and hardwood lignins. Carbon
(2013), 60: 531-537.
Y. Xu, C. Chen, T.P. Thomas, P. Azadi, B.G. Diehl, C. Tsai, N. Brown, J.E. Carlson, M. Tien,
H. Liang. Wood chemistry analysis and expression profiling of a poplar clone expressing a
tyrosine-rich peptide. Plant Cell Reports (2013), 32(12): 1827-1841.
F. Cong, B.G. Diehl, J.L. Hill, N.R. Brown, M. Tien. Covalent bond formation between amino
acids and lignin: Cross-coupling between proteins and lignin. Phytochemistry (2013), 96: 449-
456.
All data collected, preparing for publication, target journal is Biomacromolecules: B.G. Diehl,
N.R. Brown. Lignin crosslinks with peptides under biomimetic conditions.
All data collected, preparing for publication, target journal is Journal of Applied Polymer
Science: B.G. Diehl, N.R. Brown. Characterization of lignin-gelatin complexes.