Upload
novia
View
27
Download
0
Tags:
Embed Size (px)
DESCRIPTION
A Data Management and Analysis Software Platform for Phospho -Proteomics Data . Larry Lam Southern California Bioinformatics Summer Institute 2009 Graeber Lab – Crump Institute for Molecular Imaging UCLA . Outline. Graeber Lab Background Project Objective - PowerPoint PPT Presentation
Citation preview
Larry Lam Southern California Bioinformatics Summer Institute 2009Graeber Lab – Crump Institute for Molecular Imaging UCLA
A Data Management and Analysis Software Platform for
Phospho-Proteomics Data
Outline
• Graeber Lab Background• Project Objective• My Experimental Project (Example
Dataset) • Software Design• Software Demo • Conclusion / Future Work• Acknowledgements
Systems Biology of Cancer Signaling
• Lab Goals– Understand Cancer Signaling
Through Systems Biology Approaches
– [long term] Improve Cancer Treatment
• Signaling Pathway Modeling Through– Kinetics– Phospho-Profiling– Adaptor Complex Analysis
Project Objective
Develop a Software Platform for Convenient Storage and Analysis of Large-Scale Data Sets
- Design Database to Collect and Store Large Scale Proteomic Data Sets
- Allow for Comprehensive Meta Information - Simplify Access to Multiple Data Sets- Simplify The Use of Common Tools of Analysis
BCR/Abl Leukemia
• BCR/Abl fusion protein found in - 90% - 95% of chronic myleoid leukemia- 20% of adult acute lymphoblastic leukemia- 5% of children acute lymphoblastic leukemia
• Analyze the adaptor proteins in BCR/Abl signaling- Adaptor proteins mediate protein interactions
http://www.annals.org/cgi/content/full/138/10/819
BaitPrey Prey Complex Capture Protein
Interacting Protein
Experimental WorkflowExperimental
Protocol
Mass SpectometryQuantitation
Pipeline
IPI Proteomics Database
[Complex] NS Filter/
Consolidation
Complex
Phospho Profiling Quantitation Output File
Manual Organization/
Analysis
Purification
Current Workflow
Identifying Interactions of the Crk Adaptor Proteins
1. Genetic modification of pro-B-lymphocytes (Baf3) • Express adaptor + streptavidin binding peptide(SBP)
2. Culture 3. Lyse each culture for protein complex purification
Crk I Lysate Crk L Lysate Crk II Lysate NTAP Lysate
1. Separation of protein complex with streptavidin beads
2. Trypsin digestion from proteins to peptides
3. Separation of phosphorylated peptides with Fe(III)-NTA beads
4. Liquid Chromotography + Mass Spectometry
5. Quantitation Pipeline
Protein Complex Purification
P P
P
P
Quantitation Output File
Consolidation of quantified peptides and associated proteins per sample
• All peptides identified• All adaptor proteins used • Phosphorylation position within the peptide [optional]
Peptide Sequence Description/IPI Accession
Crk I Crk L Crk II NTAP
K.ADAAEFWR.K CBLIPI00027269
16568291 1802395 2821019 0
R.QEAVALLQGQR.H Isoform Crk-II IPI00307991
2859381909 924541 466328113 0
NS Filter/ConsolidationQuantitation Output File
Collapse Peptides To
Protein Quantity
Remove Insignificant
Proteins
Heatmap Analysis
Remove Known Contaminants
Peptide Sequence Description/IPI Accession
Crk I Crk L Crk II NTAP
K.ADAAEFWR.K CBLIPI00027269
170685828 10127151 0 0
K.ALVIAHNNIEMAK.N CBLIPI00027269
134461139 897107 51499 0
R.QEAVALLQGQR.H Isoform Crk-II IPI00307991
793418583 145308 139088540 85523
K.IHYLDTTTLIEPVAR.S Isoform Crk-II IPI00307991
16062973521 349417 3223960034 4617083
• Quantity Is Normalized For Each Row
NS Filter/ConsolidationQuantitation Output File
Collapse Peptides To
Protein Quantity
Remove Insignificant
Proteins
Heatmap Analysis
Remove Known Contaminants
NS Filter/ConsolidationQuantitation Output File
Collapse Peptides To
Protein Quantity
Remove Insignificant
Proteins
Heatmap Analysis
Remove Known Contaminants
Protein Enrichment Factor = (Median – NTAP Median)/ s + s
Protein NTAP
NS Filter/ConsolidationQuantitation Output File
Collapse Peptides To
Protein Quantity
Remove Insignificant
Proteins
Heatmap Analysis
Remove Known Contaminants
• Configuration File of Known Contaminants
Statistical Analysis:Peptide Quantity Heatmap
Java TreeView
High Quantity
Low Quantity
Crk I Crk L CrkII NTAP
• Cbl Peptides• Crk I Peptides
Experimental WorkflowExperimental
Protocol
Mass SpectometryQuantitation
Pipeline
IPI Proteomics Database
[Complex] NS Filter/
Consolidation
Complex
Phospho Profiling Quantitation Output File
Manual Organization/
Analysis
Purification
Current Workflow
Quantitation Import
Local DB
Statistical Analysis
External SourcesExternal
SourcesExternal Sources
New Workflow
Program Design
C# GUI Application
Quantitation Output File
DATA IMPORT
MySQL Database
DATA QUERY
QuantitationData Set
R Statistical Function
• Programming Language: C#• Database: MySQL
– Free• Statistical Computing: R
– Free, Accessible to C#
Data Import Methodology1. Define Meta Data (Descriptors) And Relationships About The
Quantitation Values2. Create The Tables In MySQL3. Access Using MySQL Connector/Net
http://dev.mysql.com/downloads/connector/
Statistical AnalysisMethodology
• R Language and Environment for Statistical Computing and Graphics- Modeling- Statistical Tests- Clustering- Heatmaps
• Develop a Graphical User Interface To R Functions- Access R Functions Through R-(D)COM Interface
http://cran.r-project.org/contrib/extra/dcom/
Software Demo
Conclusion
• Management Software– Standardized approach in maintaining lab data
• Analyze Data Sets – Analysis tools highly accessible to biologists of
various technical levels• Combine Data Sets
– Potentially lead to new discoveries
Future Work
• Add More Links To External Database• Enhance Data Query• Include More Analysis Functions
Acknowledgments• Graeber Lab Members
– Dr. Thomas Graeber– Dr. Björn Titz
• SoCalBSI Faculty and Members– Dr. Jamil Momand – Dr. Sandy Sharp – Dr. Nancy Warter-Perez – Dr. Wendie Johnston – Dr. Beverly Krilowicz – Ronnie Cheng
• Funding
Main Window
Main Window: Options
Batch Import
Batch Information
Sample Information
Sample Information: Technical Replicates
Feature Type
Features
Project Assignment
Batch Prtotocol Assignment
Biological System Assignment
Import
Batch Query
Feature Type Selection
Matrix/Heatmap Dialog
Heatmap Options
Heatmap Options
Data Import Design Methodology
Batch
Feature
LabelDescriptionExperimenterDate
LabelDescriptionFeature Type
Sample
LabelDescriptionQuality
1. Define Meta Data (Descriptors) About The Quantitation Values- Define Relationships
2. Create The Tables In MySQL3. Develop Support for MySQL Access
- MySQL Connector
Feature Value
ValueValue Type
V
V
VV