1
Modeling Arenavirus Nucleocapsid and Z Protein Structures Aristotle M. Mannan, Eric R. May, Roger S. Armen, Ranjan V. Mannige and Charles L. Brooks III Departments of Biophysics and Chemistry, University of Michigan, Ann Arbor, MI Abstract: The Arenaviridae family of viruses, responsible for neurological disease and hemorrhagic fever, is transmitted to humans via rodents. Over 20 different strains have been identified and phylogenetically classified since the first outbreak of the virus in 1933 and new strains are continuing to develop. It is known that Arenaviridae are enveloped and spherical, meaning that they are made up of a single protein copied numerous times, and contain two segments of single stranded RNA [1]. There is no structural information about the major nucleocapsid protein (NP) and other proteins associated with the virus capsid structure. This lack of structural data has hindered the identification of potential drug targets and the development of effective drugs. Currently there are no vaccines or FDA approved drugs for Arenavirus infections. The zinc-finger-like protein (Z) is known to interact with NP to induce budding, the process of viral proliferation. In order to better understand how NPs interact in the context of the spherical virus shells as well as with Z, structures of these two proteins were predicted using homology modeling methods. Amino acid sequences for the Old World Lymphocytic Choriomeningitis Virus (LCMV) and the New World Tacaribe Virus, two strains commonly studied in experimental labs, were used for prediction of tertiary structure. Models for Z were constructed from templates obtained through PSI-Blast and compared to the tertiary structure predictions from web servers. In addition, tertiary structures of all known virus capsids described in Viper DB [2] were used as potential templates for homology modeling of NP. Having constructed models of NP and Z, we identified putative protein-protein interaction sites, which may represent a better anti- viral drug target than the interaction of Z with human proteins. [1] Neuman, B.W., B.D. Adair, J.W. Burns, R.A. Milligan, M.J. Buchmeier, M. Yeager, Complementarity in the Supramolecular Design of Arenaviruses and Retroviruses Revealed by Electron Cryomicroscopy and Image Analysis, J. Virol. 2005, Vol. 79, 3822- 3830 [2] Shephed, C.M., I.A. Borelli. G. Lander. P. Natarajan, V. Siddavanhalli, C. Bajaj, J.E. Johnson, C.L. Brooks, V.S. Reddy, VIPERdb: a relational database for structural virology, Nucleic Acids Res. 2006, Vol. 34, D386-D389 Z Protein: Sequence Only Approach Zinc-finger motif in sequence PSI-Blast provided 10 templates - E-value - Percent identity (~30%) Model criteria: correct zinc binding geometry - Modeller allows for incorporation of ligand geometry Evaluated CHARMM energies Models converged to a favorable fold All models were within 1.5-2.4 Å RMSD 570 482 395 353 NP: Sequence Only Approach HHpred, Mod-Web and LoopP produced HHpred, Mod-Web and LoopP produced poor structures - unfolded/unreasonable poor structures - unfolded/unreasonable Only I-Tasser gave full-length, Only I-Tasser gave full-length, folded and well-packed models folded and well-packed models - no viral protein templates - no viral protein templates - some models had favorable - some models had favorable CHARMM energies CHARMM energies - other models were unreasonable - other models were unreasonable Leucine Rich Repeat (LRR) domain Leucine Rich Repeat (LRR) domain - helical repeat protein - helical repeat protein - unlikely to be a capsid protein - unlikely to be a capsid protein - poor CHARMM energy - poor CHARMM energy Model 1 Model 3 Model 4 Model 2 Model 1 (1-288) Model 1 (289-570) Model 3 (1-353) Model 3 (395-482) Model 3 (483-570) Z=18.2 Z=23.2 Z=22.2 Z=5.5 Z=7.2 Model 1 1 288 570 Model 3 1 I-Tasser models broke I-Tasser models broke down into separate domains down into separate domains Individual domains were Individual domains were submitted to DALI submitted to DALI DALI Z scores indicate that DALI Z scores indicate that these domains occur in these domains occur in other protein structures other protein structures Domains are candidates if Domains are candidates if virus space hypothesis is virus space hypothesis is incorrect incorrect NP: Virus Structure Approach Ran Blast of Arenavirus against all of ViperDB Eight virus families were identified as potential templates Built models using multiple structure alignment from same family Monomer energies were evaluated using CHARMM27 minimization with implicit solvent model Predicted secondary structure was compared to the secondary structure in the model BindN was used to predict strongest RNA binding site on Reoviridae template and model RNA binding sites were found on the inside of the capsid in a reasonable location Microviridae Model 24% Flaviviridae Model 17% Reoviridae Template Reoviridae Model Reoviridae Model 43% Comparison of Z and NP Modeling Z Protein (90 residues) 10 templates from PSI-Blast Multiple methods converged on similar structure LCMV and Tacaribe shared best template (2CKL_B) High confidence in prediction NP (570 residues) No suitable templates from PSI-Blast No consensus between web server predictions and Modeller Restricted search templates to virus structures LCMV and Tacaribe shared best virus template (Reoviridae) Moderate confidence in prediction Protein-protein interactions were inferred from protein binding on Reoviridae inner shell Z protein was matched to putative helix-binding site Helix Binding Site Z Protein Model NP Model Objectives/Methods: Target Protein Sequence Sequence Only Approach Virus Structure Approach 1. PSI-Blast sequence to identify templates - Modeller 2. Structure prediction via web servers - HHpred - LoopP - I-Tasser - Mod-Web 1. Blast ViperDB virus capsid structures against Arena sequence 2. Modeller - multiple structure alignment of same family 1. CHARMM monomer energy 2. CHARMM pentamer energy 3. RNA binding site location 4. Secondary structure 5. Matching features to cryo-EM data Evaluate All Models Build Capsid Model What? What? Create models for NP and Z Create models for NP and Z proteins proteins Why? Why? No structural information No structural information No Vaccines No Vaccines No FDA approved drugs No FDA approved drugs Potential to design new drugs Potential to design new drugs - Z protein is major drug - Z protein is major drug target target How? …. How? …. R Z Z Host Proteins Involved in Vesicle Formation and Membrane Fission Budding TSG 101 Nedd4 PML PPH eIF4E NP Interaction of Z and NP Zinc-Finger Azoic Compounds (Inhibits Viral Budding) RNA Nucleotide Analogs (Inhibits RNA Replication) ANNB ADA (Azodicarbonamide) Myristic Acid Analogs (Inhibits Viral Budding) β-hydroxy-Myristic Acid Myristic Acid Ribavirin High confidence in Z protein modeling High confidence in Z protein modeling Models based on Reoviridae templates Models based on Reoviridae templates are better than other virus-based models are better than other virus-based models based on evaluation criteria based on evaluation criteria Continue working on NP-NP interactions Continue working on NP-NP interactions to build full capsid to build full capsid Improve upon models of Z binding to NP Improve upon models of Z binding to NP Further work could focus on drug Further work could focus on drug development to inhibit Z and NP development to inhibit Z and NP interaction to inhibit budding interaction to inhibit budding Possibility of using I-Tasser domains to Possibility of using I-Tasser domains to build novel Arenavirus fold build novel Arenavirus fold Conclusions/Future Directions Model 1 Model 2 Model 3 Model 4 353 395 482 570 Model 1 (1-288) Model 1 (289-570) Z=18.2 Z=23.2 Model 3 (1-353) Z=22.2 Z=5.5 Model 3 (395-482) Model3 (483-570) Z=7.2 Prediction of Z and NP Protein-Protein Interaction

Modeling Arenavirus Nucleocapsid and Z Protein Structures · Arenavirus infections. The zinc-finger-like protein (Z) is known to interact with NP to induce budding, the process of

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Modeling Arenavirus Nucleocapsid and Z Protein Structures · Arenavirus infections. The zinc-finger-like protein (Z) is known to interact with NP to induce budding, the process of

Modeling Arenavirus Nucleocapsid and Z Protein Structures

Aristotle M. Mannan, Eric R. May, Roger S. Armen, Ranjan V. Mannige and Charles L. Brooks III

Departments of Biophysics and Chemistry, University of Michigan, Ann Arbor, MI

Abstract: The Arenaviridae family of viruses, responsible for neurological disease and hemorrhagic fever, is transmitted to humans via rodents. Over 20 different strains have been identified and phylogenetically classified since the first outbreak of the virus in 1933 and new strains are continuing to develop. It is known that Arenaviridae are enveloped and spherical, meaning that they are made up of a single protein copied numerous times, and contain two segments of single stranded RNA [1]. There is no structural information about the major nucleocapsid protein (NP) and other proteins associated with the virus capsid structure. This lack of structural data has hindered the identification of potential drug targets and the development of effective drugs. Currently there are no vaccines or FDA approved drugs for Arenavirus infections. The zinc-finger-like protein (Z) is known to interact with NP to induce budding, the process of viral proliferation. In order to better understand how NPs interact in the context of the spherical virus shells as well as with Z, structures of these two proteins were predicted using homology modeling methods. Amino acid sequences for the Old World Lymphocytic Choriomeningitis Virus (LCMV) and the New World Tacaribe Virus, two strains commonly studied in experimental labs, were used for prediction of tertiary structure. Models for Z were constructed from templates obtained through PSI-Blast and compared to the tertiary structure predictions from web servers. In addition, tertiary structures of all known virus capsids described in Viper DB [2] were used as potential templates for homology modeling of NP. Having constructed models of NP and Z, we identified putative protein-protein interaction sites, which may represent a better anti-viral drug target than the interaction of Z with human proteins.

[1] Neuman, B.W., B.D. Adair, J.W. Burns, R.A. Milligan, M.J. Buchmeier, M. Yeager, Complementarity in the Supramolecular Design of Arenaviruses and Retroviruses Revealed by Electron Cryomicroscopy and Image Analysis, J. Virol. 2005, Vol. 79, 3822-3830[2] Shephed, C.M., I.A. Borelli. G. Lander. P. Natarajan, V. Siddavanhalli, C. Bajaj, J.E. Johnson, C.L. Brooks, V.S. Reddy, VIPERdb: a relational database for structural virology, Nucleic Acids Res. 2006, Vol. 34, D386-D389

Z Protein: Sequence Only Approach

• Zinc-finger motif in sequence

• PSI-Blast provided 10 templates

- E-value

- Percent identity (~30%)

• Model criteria: correct zinc binding

geometry

- Modeller allows for incorporation

of ligand geometry

• Evaluated CHARMM energies

• Models converged to a favorable

fold

• All models were within 1.5-2.4 Å

RMSD

570482395353

NP: Sequence Only Approach

● ● HHpred, Mod-Web and LoopP produced HHpred, Mod-Web and LoopP produced

poor structures - unfolded/unreasonablepoor structures - unfolded/unreasonable

● ● Only I-Tasser gave full-length, Only I-Tasser gave full-length,

folded and well-packed modelsfolded and well-packed models

- no viral protein templates- no viral protein templates

- some models had favorable - some models had favorable

CHARMM energiesCHARMM energies

- other models were unreasonable- other models were unreasonable

● ● Leucine Rich Repeat (LRR) domainLeucine Rich Repeat (LRR) domain

- helical repeat protein- helical repeat protein

- unlikely to be a capsid protein- unlikely to be a capsid protein

- poor CHARMM energy - poor CHARMM energy

Model 1

Model 3

Model 4

Model 2

Model 1 (1-288) Model 1 (289-570)

Model 3 (1-353) Model 3 (395-482) Model 3 (483-570)

Z=18.2 Z=23.2

Z=22.2 Z=5.5 Z=7.2

Model 1

1 288 570

Model 3

1

● ● I-Tasser models broke I-Tasser models broke

down into separate domainsdown into separate domains

● ● Individual domains wereIndividual domains were

submitted to DALIsubmitted to DALI

● ● DALI Z scores indicate thatDALI Z scores indicate that

these domains occur inthese domains occur in

other protein structuresother protein structures

● ● Domains are candidates ifDomains are candidates if

virus space hypothesis isvirus space hypothesis is

incorrectincorrect

NP: Virus Structure Approach• Ran Blast of Arenavirus against all of ViperDB

• Eight virus families were identified as potential templates

• Built models using multiple structure alignment from same family

• Monomer energies were evaluated using CHARMM27 minimization with implicit solvent model

• Predicted secondary structure was compared to the secondary structure in the model

• BindN was used to predict strongest RNA binding site on Reoviridae template and model

• RNA binding sites were found on the inside of the capsid in a reasonable location

Microviridae Model

24%

Flaviviridae Model 17%Reoviridae Template

Reoviridae Model

Reoviridae Model43%

Comparison of Z and NP ModelingZ Protein (90 residues)

• 10 templates from PSI-Blast

• Multiple methods converged on

similar structure

• LCMV and Tacaribe shared best

template (2CKL_B)

• High confidence in prediction

NP (570 residues) • No suitable templates from PSI-Blast

• No consensus between web server

predictions and Modeller

• Restricted search templates to virus

structures

• LCMV and Tacaribe shared best

virus template (Reoviridae)

• Moderate confidence in prediction

• Protein-protein interactions were inferred from protein binding on Reoviridae inner shell

• Z protein was matched to putative helix-binding site

Helix Binding Site

Z Protein Model

NP Model

Objectives/Methods:

Target Protein Sequence

Sequence Only Approach Virus Structure Approach

1. PSI-Blast sequence to identify templates - Modeller2. Structure prediction via web servers - HHpred - LoopP - I-Tasser - Mod-Web

1. Blast ViperDB virus capsid structures against Arena sequence2. Modeller - multiple structure alignment of same family

1. CHARMM monomer energy2. CHARMM pentamer energy3. RNA binding site location4. Secondary structure5. Matching features to cryo-EM data

Evaluate All Models

Build Capsid Model

What? What? ● ● Create models for NP and ZCreate models for NP and Z

proteinsproteins

Why? Why? ● ● No structural informationNo structural information● ● No VaccinesNo Vaccines● ● No FDA approved drugsNo FDA approved drugs● ● Potential to design new drugsPotential to design new drugs

- Z protein is major drug- Z protein is major drug targettarget

How? ….How? ….

R

Z Z

Host Proteins Involved in Vesicle Formation and Membrane Fission

Budding

TSG 101

Nedd4

PML

PPH

eIF4ENP

Interaction of Z and NP

Zinc-Finger Azoic Compounds (Inhibits Viral Budding)

RNA Nucleotide Analogs (Inhibits RNA Replication)

ANNB ADA (Azodicarbonamide)

Myristic Acid Analogs (Inhibits Viral Budding)

β-hydroxy-Myristic Acid

Myristic Acid

Ribavirin

● ● High confidence in Z protein modelingHigh confidence in Z protein modeling ● ● Models based on Reoviridae templatesModels based on Reoviridae templates are better than other virus-based modelsare better than other virus-based models based on evaluation criteriabased on evaluation criteria ● ● Continue working on NP-NP interactionsContinue working on NP-NP interactions to build full capsidto build full capsid

● ● Improve upon models of Z binding to NPImprove upon models of Z binding to NP ● ● Further work could focus on drugFurther work could focus on drug development to inhibit Z and NPdevelopment to inhibit Z and NP interaction to inhibit buddinginteraction to inhibit budding ● ● Possibility of using I-Tasser domains toPossibility of using I-Tasser domains to build novel Arenavirus foldbuild novel Arenavirus fold

Conclusions/Future Directions

Model 1 Model 2

Model 3

Model 4

353 395 482 570

Model 1 (1-288) Model 1 (289-570)

Z=18.2 Z=23.2

Model 3 (1-353)

Z=22.2 Z=5.5

Model 3 (395-482) Model3 (483-570)

Z=7.2

Prediction of Z and NP Protein-Protein Interaction