Improving the Sensitivity of Peptide Identification With Meta-Search and Machine Learning

Poster produced by Faculty & Curriculum Support (FACS), Georgetown University Medical Center

Peptide sequence databases, meta-search engine,

machine-learning combiner available from:

http://edwardslab.bmcb.georgetown.edu Application of enumeration, meta-search, and

machine-learning can significantly improve the

sensitivity of peptide identification.

Improving the Sensitivity of Peptide Identification With Meta-Search and Machine LearningNathan J. Edwards1, Xue Wu2, Chau-Wen Tseng2

Introduction

All peptide sequences from: Six-frame translation of EST and HTC sequences; Three-frame translation of mRNA sequences; All IPI, RefSeq, Genbank, Vega, EMBL, HInvDB,

SwissProt and TrEMBL proteins; SwissProt variants, splices, conflicts, mature isoforms

grouped by gene-cluster & compressed, as FASTA.

1Georgetown University Medical Center; 2University of Maryland, College Park

Peptide Sequence Databases

PepSeqDB Release 1.2

Peptide Identification Meta-Search HMMatch Spectral Matching

Conclusions

References

We use a variety of techniques, from sequence

enumeration and meta-search to machine learning

to increase the number high-confidence peptide

identifications from large tandem mass-spectra

datasets. These techniques seek to improve the number of

peptide identifications made at a given level of

statistical significance. We show that these techniques can improve

identification sensitivity significantly.

Georgetown University

1. Edwards. Novel Peptide Identification using Expressed Sequence

Tags and Sequence Database Compression. Mol. Sys. Biol. 2007.

2. Wu, Tseng, Edwards. HMMatch: Peptide Identification by Spectral

Matching of Tandem Mass Spectra using Hidden Markov Models.

J. Comp. Biol. 2007.

3. Wu, Tseng, Rudnick, Balgley, Edwards. PepArML: An Unsupervised,

Model-Free, Combining, Peptide Identification Arbiter for Tandem

Mass Spectra via Machine Learning. In preparation.

Organism Size (AA) Size (Entries)

Human 209Mb 75,043

Mouse 151Mb 55,929

Rat 67Mb 43,211

Zebra-fish 90Mb 47,922

Schedule: Automated rebuild every few months.

Coming soon: Fast peptide to gene and source sequence

mapping using suffix-trees and gene sequence-groups.

Annual Meeting, 2008

PepArML - Unsupervised Machine-Learning Combiner

NSF TeraGrid1000+ CPUs

UMIACS250+ CPUs

Edwards LabScheduler &48+ CPUs

Meta-search with four search engines;Target & decoy searches automatically.

Web-service API for all data

Securecommunication

Heterogeneouscompute resources

Simple search descriptionScales to 100’s of

simultaneous searchesFree, instantregistration

Iteration

Legend: Heuristic: H; Classifier w/ 5-fold-CV: C-T, C-M, C-O, C-TM, C-TO, C-MO, C-TMO; Unsupervised classifier w/ 5-fold-CV: U-TMO; Unsupervised classifier w/ no-CV: U*-TMO.

False Positive Rate

HC-TMO

U*-TMO

I0 b1 I1 I2 I3 I4 I5 I6y1 b2 y2 b3 y3

11% 17% 6% 94% 8% 0% 11% 86% 17% 0% 6% 92% 19%

Improving the Sensitivity of Peptide Identification With Meta-Search and Machine Learning

Documents

ANYGEN - prokcssmedia.blob.core.windows.net€¦ · the peptide synthesis. Custom Peptide Catalog Peptide CMO Service Peptide Service - ISO 9001, 14001 - GMP. 4 ANYGEN 5 Custom Peptide

Peptide and protein analysis with mass spectrometrymolecular weight information on peptides and proteins is quite unambiguous. In general the sensitivity of mass spectrometry is excellent

SpyLigase peptide peptide ligation polymerizes affibodies to … · SpyLigase peptide–peptide ligation polymerizes affibodies to enhance magnetic cancer cell capture Jacob O. Fierer1,

Application HR/AM Targeted Peptide Quantitation on a Note ...ELISAs, yet several major challenges remain: • The balance between duty cycle and sensitivity limits the number of peptide

Practical Guide to Significantly Improve Peptide Identification Sensitivity and Accuracy Bin Ma, CTO Bioinformatics Solutions Inc. June 5, 2011

Towards Visible Light Switching of Peptide-DNA and Peptide

Sensitivity of 2D IR Spectra to Peptide Helicity: A ...mukamel.ps.uci.edu/publications/pdfs/648.pdf · Sensitivity of 2D IR Spectra to Peptide Helicity: A Concerted Experimental and

(NH/ 15 N) (ppm) Peptide II Peptide I Peptide III Peptide IV Peptide V Efb residues A29 – R165 Figure 3.10 Chemical shift perturbation of Efb upon titration

BIRO1, a Cell-Permeable BH3 Peptide, Promotes ... · BH3 peptide–based approach uses the prodeath BH3 min-imal death domains to re-establish mitochondrial sensitivity in tumoral

Improving the Sensitivity of Peptide Identification for Genome Annotation

SpyLigase peptide–peptide ligation polymerizes affibodies to

Peptide unprotected methods Peptide...Butoxycarbonyl (Boc)- and fluorenylmethoxycarbonyl (Fmoc)-aminoacid derivativeswerepurchasedfrom Bachem. Solid-Phase Peptide Synthesis. Peptides,

Introduction to Protein/Peptide Quantitation Using Normal ... Three... · Introduction to Protein/Peptide Quantitation Using Normal Flow LC/MS High Sensitivity Protein/Peptide Quantitation

ABIOTIC PEPTIDE SYNTHESIS Solid Phase Peptide Synthesis … · ABIOTIC PEPTIDE SYNTHESIS Solid Phase Peptide Synthesis (SPPS) ... His, N-terminal end of the peptide Amino acids with

Innovative Peptide Solutions - JPT Peptide Technologies · Innovative Peptide Solutions Peptide Tools for Immunotherapy Immune Monitoring Vaccine Development ... optimization for

Practical Guide to Significantly Improve Peptide Identification Sensitivity and Accuracy

Sensitivity questions for electric machines and answers ......Sensitivity questions for electric machines and answers via meta-models 9 4th CADFEM ANSYS Simulation Conference Ireland,

Peptide asparaginyl ligases—renegade peptide bond makers

Improving the Sensitivity of Peptide Identification Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology Georgetown University Medical

PEPTIDE-FOCUSED STREAMS: OLIGONUCLEOTIDE-FOCUSED … · 2019. 12. 6. · peptide delivery strategies, oral delivery of peptide-based therapeutics and developing stable peptide formulations