29
Biocomputatio n: Comparative Genomics Tanya Talkar Lolly Kruse Colleen O’Rourke

Biocomputation : Comparative Genomics

  • Upload
    naida

  • View
    53

  • Download
    1

Embed Size (px)

DESCRIPTION

Biocomputation : Comparative Genomics. Tanya Talkar Lolly Kruse Colleen O’Rourke. DNA . Junk DNA. Conserved DNA. What is Biocomputation ?. Four Main Parts. Biomolecular computation Biological Computation Computational Biology Bioinformatics. Bioinformatics:. Sequence Analysis. - PowerPoint PPT Presentation

Citation preview

Page 1: Biocomputation : Comparative Genomics

Biocomputation: Comparative Genomics

Tanya TalkarLolly KruseColleen O’Rourke

Page 2: Biocomputation : Comparative Genomics

DNA

Page 3: Biocomputation : Comparative Genomics
Page 4: Biocomputation : Comparative Genomics

JunkDNA

ConservedDNA

Page 5: Biocomputation : Comparative Genomics

What is Biocomputation?

Statistics

Computer Science

Molecular Biology

Page 6: Biocomputation : Comparative Genomics

Four Main Parts Biomolecular computation Biological Computation Computational Biology Bioinformatics

Page 7: Biocomputation : Comparative Genomics

Bioinformatics:

Biology

Computer Science

Information

Technology

Page 8: Biocomputation : Comparative Genomics

Sequence Analysis Very Functional! Compare DNA between Species Small Fragments Return full sequence

Page 9: Biocomputation : Comparative Genomics

Computational Genomics Needleman – Wunsch

Not used much More Mapped Genomes =

Computational Genomics!

Page 10: Biocomputation : Comparative Genomics

Alignment

Page 11: Biocomputation : Comparative Genomics

Global Alignment:Needleman - Wunsch O(N3) Fewest edit operations Similar strings

Page 12: Biocomputation : Comparative Genomics

Local AlignmentSmith - Waterman O(N2) Dissimilar strings Find high similarity regions

Page 13: Biocomputation : Comparative Genomics

Comparison

Page 14: Biocomputation : Comparative Genomics

S1 P Q R A X A B C S T V Q

S2 X Y A X B A C S L T

A X A B C S

A X B A C S

Page 15: Biocomputation : Comparative Genomics

S1 A X A B _ C S

S2 A X _ B A C S

Score 2 2 -1 2 -1 2 2

Page 16: Biocomputation : Comparative Genomics

Advantages:Global Alignment

Page 17: Biocomputation : Comparative Genomics

Advantages:Local Alignment

Page 18: Biocomputation : Comparative Genomics

BLAST• Basic Local Alignment Search Tool• FASTA

Page 19: Biocomputation : Comparative Genomics

Improvements Increased Speed Locate initial alignment hot spots Statistical significance

Page 20: Biocomputation : Comparative Genomics

Terminology Segment Pairs Locally maximal segment pairs Maximal segment pairs

Page 21: Biocomputation : Comparative Genomics

How it works Query sentence, P Database

Must have score over C! Multiple segment pairs combined

A B C D E F G

A G C B F D EB E D G A F BG F B E D C A

Page 22: Biocomputation : Comparative Genomics

How it works Extends each hit Done efficiently Truncates Doesn’t find all pairs

Page 23: Biocomputation : Comparative Genomics

Proteins Fixed length, W Words above threshold Each hit extended

Page 24: Biocomputation : Comparative Genomics

DNA Word List Exact matches NOT dynamic programming

Page 25: Biocomputation : Comparative Genomics

Scoring Blosum62 Matrix Match (+2), Mismatch (-3),

Gaps penalized

Page 26: Biocomputation : Comparative Genomics

Substitution Matrix Represents Scoring Functions

Page 27: Biocomputation : Comparative Genomics

Multiple Sequence Alignment

Page 28: Biocomputation : Comparative Genomics

Methods of MSA Progressive Alignment Construction Iterative Methods Hidden Markov Models Genetic Algorithms and Simulated

Annealing

Page 29: Biocomputation : Comparative Genomics

Comparative Genomics Compare Species

Find Evolutionary Significances! Low Level High Level

Importance of Non Coding DNA