Upload
lyphuc
View
219
Download
0
Embed Size (px)
Citation preview
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Thomas Blicher, Center for Biological Sequence Analysis
Introduction to"Protein Structure
Function, evolution & experimental methods
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Learning Objectives
Outline the basic levels of protein structure.
Outline key differences between X-ray crystallography and NMR spectroscopy.
Identify relevant parameters for evaluating the quality of protein structures determined by X-ray crystallography and NMR spectroscopy.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Outline
Protein structure evolution and function Inferring function from structure.
Experimental techniques X-ray crystallography NMR spectroscopy
Structure validation
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Watson, Crick and DNA, 1952
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
"We wish to suggest a structure for the salt of deoxyribose nucleic acid (D.N.A.). This structure has novel features which are of considerable biological interest…. …It has not escaped our notice that the specific pairing we have postulated immediately suggests a possible copying mechanism for the genetic material.""
J.D. Watson & F.H.C. Crick (1953) Nature, 171, 737.
DNA Conclusions
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
“Could the search for ultimate truth really have revealed so hideous and visceral-looking an object?” Max Perutz, 1964, on protein structure
John Kendrew, 1959, with myoglobin model
Once Upon a Time…
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
They provide a detailed picture of interesting biological features, such as active site, substrate specificity, allosteric regulation etc.
They aid in rational drug design and protein engineering.
They can elucidate evolutionary relationships undetectable by sequence comparisons.
Why are Protein Structures so Interesting?
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
In evolution structure is conserved longer than both function and sequence.
Structure > Function > Sequence
Structure & Evolution
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Rhamnogalacturonan acetylesterase
(A. aculeatus) (1K7C)
Platelet activating factor acetylhydrolase
(B. Taurus) (1WAB)
Serine esterase (S. scabies) (1ESC)
Structure & Evolution
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
COOH
NH2
Asp His Ser Topological switchpoint
Inferring biological features from the structure
1DEO
Structure to Function
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Platelet activating factor acetylhydrolase
Serine esterase
Rhamnogalacturonan acetylesterase
Mølgaard, Kauppinen & Larsen (2000) Structure, 8, 373-383.
Structure & Evolution
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Why Fold?
Hydrophobic collapse Hydrophobic residues cluster to
“escape” interactions with water. Indirect effect of attraction between water
molecules.
Polar backbone groups form secondary structure to satisfy hydrogen bonding donors and acceptors.
Initially formed structure is in molten globule state (ensemble).
Molten globule condenses to native fold via transition state
E
U
F
T
ΔG
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Hydrophobic Effect and Folding
Oil and water
Clathrate structures
Entropy
Indirect consequence of attraction between water molecules
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Hydrophobic Core
Hydrophobic side chains go into the core of the molecule – but the main chain is highly polar.
The polar groups (C=O and NH) are neutralized through formation of H-bonds.
Myoglobin
Surface Interior
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Hydrophobic vs. Hydrophilic
Globular protein (in solution)
Membrane protein (in membrane)
Myoglobin Aquaporin
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Hydrophobic vs. Hydrophilic
Globular protein (in solution)
Membrane protein (in membrane)
Myoglobin Aquaporin
Cross-section Cross-section
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Characteristics of Helices
Aligned peptide units Dipolar moment
Ion/ligand binding Secondary and
quaternary structure packing
Capping residues The α helix (i→i
+4) Other helix types!
(310, π) N
C
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Residue Patterns
Helices Helix capping Amphiphilic residue
patterns
Sheets Amphiphilic residue
patterns Residue preferences at
edges vs. middle
Special residues Proline
Helix breaker
Glycine In turns/loops/bends
N
C
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
β-Sheets
Multiple strands sheet Parallel vs. antiparallel Twist
Flexibility Vs. helices
Folding Structure propagation
(amyloids) Other…
Thioredoxin
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
β-Sheets
Multiple strands sheet Parallel vs. antiparallel Twist
Flexibility Vs. helices
Folding Structure propagation
(amyloids) Other…
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
β-Sheets
Multiple strands sheet Parallel vs. antiparallel Twist
Flexibility Vs. helices
Folding Structure propagation
(amyloids) Other…
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
β-Sheets
Multiple strands sheet Parallel vs. antiparallel Twist
Flexibility Vs. helices
Folding Structure propagation
(amyloids) Other…
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
β-Sheets
Multiple strands sheet Parallel vs. antiparallel Twist
Strand interactions are non-local
Flexibility Vs. helices
Folding
Antiparallel Parallel
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Turns, Loops & Bends
Between helices and sheets
On protein surface
Intrinsically “unstructured” proteins
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Structure Levels Primary structure = Sequence
Secondary Structure = Helix, sheets/strands, loops & turns
Structural Motif = Small, recurrent arrangement of secondary structure, e.g. Helix-loop-helix Beta hairpins EF hand (calcium binding motif) Etc.
Tertiary structure = Arrangement of Secondary structure elements
MSSVLLGHIKKLEMGHS…
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Myoglobin
Hemoglobin
Quaternary Structure
Assembly of monomers/subunits into protein complex Backbone-backbone,
backbone-side-chain & side-chain-side-chain interactions: Intramolecular vs.
intermolecular contacts. For ligand binding side
chains may or may not contribute. For the latter, mutations have little effect.
α
α
α
β
β
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Amino Acids Proteins are built from
amino acids
Amino group and acid group
Side chain at Cα
Chiral, only one enantiomer found in proteins (L-amino acids)
20 natural amino acids
N
O
C Cα
Cγ
Cβ
Cε
Sδ
Methionine
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
http://www.ch.cam.ac.uk/magnus/molecules/amino/
The Amino Acids
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Grouping Amino Acids
http://www.dreamingintechnicolor.com/InfoAndIdeas/AminoAcids.gif
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
The Evolution Way
Based on Blosum62 matrix
Measure of evolutionary substitution probability
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Proteins Are Polypeptides The peptide bond A polypeptide chain
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Ramachandran Plot
Allowed backbone torsion angles in proteins
N
H
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Torsion Angles
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Ramachandran Plots
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Experimental Methods
Crystallography &
NMR spectroscopy
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
X-ray crystallography Nuclear Magnetic Resonance (NMR) Modelling techniques
More exotic techniques Cryo electron microscopy (Cryo EM) Small angle X-ray scattering (SAXS) Neutron scattering
Methods for Structure Determination
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
X-ray Crystallography
No size limitation. Protein molecules are
”stuck” in a crystal lattice.
Some proteins seem to be uncrystallizable.
Slow. Especially suited for
studying structural details.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
X-rays
Fourier transform
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
The Importance of Resolution
high
low 4 Å
2 Å
3 Å
1 Å
Molecules/domains Secondary structure
elements Residue side chains
Multiple conformations
Atoms
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Evaluation of X-ray Structures Key parameters:
Resolution
R values Agreement between data and model.
Usually between 0.15 and 0.25, should not exceed 0.30.
Ramachandran plot B factors
Contributions from static and dynamic disorder Well determined ~10-20 Å2, intermediate ~20-30 Å2, flexible 30-50
Å2, invisible >60 Å2.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
NMR Spectroscopy
Upper limit for structure determination currently ~50 kDa.
Protein molecules are in solution. Dynamics, protein folding. Slow.
Especially suited for studies of protein dynamics of small to medium size proteins.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
NMR Basics
NMR is nuclear magnetic resonance
NMR spectroscopy is done on proteins IN SOLUTION
Only atoms 1H, 13C, 15N (and 31P) can be detected in NMR experiments
Proteins up to 30 kDa
Proteins stable at high concentration (0.5-1mM), preferably at room temperature
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
NMR Spectroscopy
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Evaluation of NMR Structures
What regions in the structure are most well-defined?
Look at the pdb ensembles to see which regions are well-defined
1RJH
Nielbo et al, Biochemistry, 2003
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Well-defined structures RMSDs < 0.6 Å
Evaluation of NMR Structures
Atomic backbone RMSD:
Less well-defined structures RMSDs > 0.6 Å
3GF1, Cooke et al. Biochemistry, 1991 1T1H, Andersen et al. JBC, 2004
€
RMSD =xi − xi
'( )2
1
n∑
n
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
NMR versus X-ray Crystallography
Hydrogen atoms are observed!
Only 13C,15N and 1H are observed
Study of proteins in solution
Only proteins up to 30-40 kDa
No total “map” of the structure
Information used is incomplete and used as restraints
An ensemble of structures is submitted to PDB
The solved structure can be used for further dynamics characterization with NMR
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Holdings of the Protein Data Bank (PDB):
The PDB also contains nucleotide and nucleotide analogue structures.
http://www.pdb.org
PDB
Sep. 2001 Oct. 2007 May 2008 X-ray 13116 39706 43378 NMR 2451 6862 7306 Other 338 250 277 Total 15905 46818 50961
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS TECHNICAL UNIVERSITY OF DENMARK DTU
Summary
In evolution structure is conserved longer than both function and sequence.
X-ray crystallography Proteins in crystal
lattice Many details – one
model Resolution, R-values,
Ramchandran plot
NMR spectroscopy Proteins in solution Fewer details – many
models Violations, RMSD,
Ramachandran plot