Fundamentals of Bioinformatics: computation, biology, computational biology

Vasilis J. Promponas
Bioinformatics Research Laboratory
Department of Biological Sciences
University of Cyprus

01-05/12/2014 Computational Metagenomics Workshop, University of Mauritius



A short self-introduction

● Vasilis, pronounced: “Vass`ilis”
● A Frog Physicist turned into a Biologist
  – Coincidence: it all happened around 1995-96
  – Computational approach
● PhD in Biology (2004, Computational Biology/Bioinformatics), University of Athens, Greece
● 2005 – Moved to Cyprus


Cyprus?

Source: http://maps.google.com


University of Cyprus


Overview

● Introduction
  – Some definitions and concepts from (Molecular) Biology
  – The rapid growth of Biological Data
● The advent of the Genome Era (a paradigm shift in Biology?)
● Bioinformatics and Computational Biology: Fundamental Problems – Concepts – Applications
● Discussion


Introduction


Introduction: Bio::revenge

● Biology IS the science of the 21st century
  – Used to be a QUALITATIVE scientific domain
    ● Exceptions have been the Rule
  – Turning into QUANTITATIVE
  – An Information Rich field
● Impact in every aspect of (human) lives
  – Food production and Quality Control
  – Environment (e.g. Ecology, Monitoring, Management)
  – Human activities/welfare (e.g. sports, cosmetics, health)
  – ...


Introduction: Key actors

● Genome(s)
● Chromosome(s)
● Gene(s)
● Protein(s)

Genome(s)

✔ But what about some viral genomes?
✔ Contains heritable(?) information

Source: http://www.google.com

Chromosome(s)

✔ Tightly packaged (DNA/RNA/proteins)
✔ 3D-structural organization
✔ Contains “functional” regions (a.k.a. genes) and regions of (yet) unknown function

Gene(s)

✔ A chromosomal region encoding mRNAs, tRNAs, etc.
✔ Useful keyword: Transcription
✔ mRNAs: non-terminal

Protein(s)

✔ Main components of the (cellular) toolkit
✔ Linear polymers
✔ Interact with other biological molecules
✔ Useful keyword: Translation


Information Flow in Biological Systems: The “Central Dogma”

DNA → RNA → PROTEIN

● Replication: DNA → DNA
● Transcription: DNA → RNA
● Translation: RNA → PROTEIN


Coupled with a “Universal” genetic code

The Genetic Code is Degenerate!
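To make the last two slides concrete, here is a minimal Python sketch of transcription, translation and the degeneracy of the code. It assumes the standard genetic code written in the conventional T, C, A, G base order; the function names and the toy gene are illustrative and not part of the original slides.

```python
from collections import Counter
from itertools import product

# Standard genetic code in the conventional T, C, A, G base order
# (first base varies slowest, third base fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def transcribe(coding_strand: str) -> str:
    """DNA coding strand -> mRNA (same sequence, with U in place of T)."""
    return coding_strand.upper().replace("T", "U")


def translate(mrna: str) -> str:
    """mRNA -> protein, reading codons from position 0 up to the first stop."""
    dna = mrna.upper().replace("U", "T")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Degeneracy: how many codons encode each amino acid (or the stop signal).
CODONS_PER_AMINO_ACID = Counter(CODON_TABLE.values())

if __name__ == "__main__":
    mrna = transcribe("ATGGCCTGGTAA")  # hypothetical toy gene, not from the slides
    print(mrna, "->", translate(mrna))
    print(CODONS_PER_AMINO_ACID.most_common())
```

The counter shows why the code is called degenerate: 64 codons map onto 20 amino acids plus a stop signal, so most amino acids are encoded by more than one codon.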


… some more complexity ...

[Figure: levels of organization, from Genome and Chromosomes, to DNA Sequence (ORF1, ORF2, ORF3, ORF4), to Gene (fine) Structure (E1, E2, E3), to Amino Acid Sequence]
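The ORF labels in the figure can be illustrated with a deliberately simplified open reading frame scan. The sketch below assumes an ORF is an ATG followed in frame by the first stop codon, looks only at the given strand, and ignores gene fine structure altogether; `find_orfs` and `min_codons` are hypothetical names chosen for this example.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna: str, min_codons: int = 2) -> list:
    """Return sorted (start, end) index pairs of ATG...stop stretches.

    Only the three reading frames of the given strand are scanned; `end`
    is exclusive and includes the stop codon. `min_codons` counts the
    codons before the stop (the ATG included).
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i                      # open a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None                   # close it and keep scanning
    return sorted(orfs)


if __name__ == "__main__":
    toy = "CCATGAAATTTTAGGG"                   # hypothetical toy sequence
    for start, end in find_orfs(toy):
        print(start, end, toy[start:end])
```

A realistic gene finder would also scan the reverse strand and use statistical models of coding sequence; this sketch only shows what "reading frame" means operationally.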


...ACTGTCTGACCGGCAGCA...

...TGACAGACTGGCCGTCGT...

DNA stores information ...
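The two strands shown above are base-paired: each A faces a T and each C faces a G. A short Python sketch (the helper names are assumptions for this example) checks this on the slide's own fragment:

```python
# Watson-Crick pairing: A<->T, C<->G.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(strand: str) -> str:
    """Base-by-base complement, written in the same left-to-right order."""
    return strand.upper().translate(COMPLEMENT)


def reverse_complement(strand: str) -> str:
    """The opposite strand read in its own 5'->3' direction."""
    return complement(strand)[::-1]


TOP = "ACTGTCTGACCGGCAGCA"      # first strand of the fragment on the slide
BOTTOM = "TGACAGACTGGCCGTCGT"   # second strand of the fragment on the slide

if __name__ == "__main__":
    print(complement(TOP) == BOTTOM)
    print(reverse_complement(TOP))
```

Because either strand can be rebuilt from the other, the same information is stored twice, which is what makes replication and repair possible.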


Proteins get the dirty job done …


Proteins

● Assembled from one or more polypeptide chains (homo-/hetero-polymers)
● The functional “toolkit”
  – Enzymes
  – Transport-Storage
  – Motion
  – Binding
  – Molecular Recognition
  – Signal Transduction
  – Structural Proteins
  – Energy Production
  – Cell Regulation and Differentiation
  – ... (...)


Yet, some more complexity (PTMs)

Pig Insulin Dimer (PDB_ID: 4INS)

Chain A: GIVEQCCTSICSLYQLENYCQ
Chain B: FVNQHLCGSHLVEALYLVCGERGFFYTPKA

[Figure: S-S (disulfide) bridges connecting cysteine residues of the chains]

Pig Insulin Precursor

MALWTRLLPLLALLALWAPAPAQAFVNQHLCGSHLVEALYLVCGERGFFYTPKARREAENPQAGAVELGGGLGGLQALALEGPPQKRGIVEQCCTSICSLYQLENYCN
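One way to see the processing step is to map the mature chains back onto the precursor listed above. The sketch below is only an illustration under stated assumptions: chain A is located by its first 20 residues, because the final residue differs between the two listings on the slide, and the segment names in the returned dictionary are labels chosen for this example.

```python
PRECURSOR = (
    "MALWTRLLPLLALLALWAPAPAQAFVNQHLCGSHLVEALYLVCGERGFFYTPKA"
    "RREAENPQAGAVELGGGLGGLQALALEGPPQKRGIVEQCCTSICSLYQLENYCN"
)
CHAIN_B = "FVNQHLCGSHLVEALYLVCGERGFFYTPKA"
CHAIN_A_PREFIX = "GIVEQCCTSICSLYQLENYC"  # first 20 residues of chain A


def processing_map(precursor: str) -> dict:
    """Split the precursor into the segments kept or removed on maturation."""
    b_start = precursor.find(CHAIN_B)
    a_start = precursor.find(CHAIN_A_PREFIX)
    if b_start < 0 or a_start < 0:
        raise ValueError("mature chains not found in precursor")
    b_end = b_start + len(CHAIN_B)
    return {
        "removed N-terminal segment": precursor[:b_start],
        "chain B": precursor[b_start:b_end],
        "removed connecting segment": precursor[b_end:a_start],
        "chain A": precursor[a_start:],
    }


def cysteine_count(segments: dict) -> int:
    """Cysteines in the two mature chains (candidates for S-S bridges)."""
    return segments["chain A"].count("C") + segments["chain B"].count("C")


if __name__ == "__main__":
    segments = processing_map(PRECURSOR)
    for name, seq in segments.items():
        print(f"{name:28s} {len(seq):3d}  {seq}")
    print("cysteines in mature chains:", cysteine_count(segments))
```

The two removed segments correspond to what is usually called the signal peptide and the connecting (C-) peptide, so the mature hormone is considerably shorter than the gene product it is made from.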


Back to the “Central dogma”

For (almost) all proteins:

Sequence → (determines) → 3D-structure → (determines) → Function

Sequence: ..VEQCCTSICSLYQL..
Function:
• Glucose Uptake Pathway
• Glycogen Synthesis Pathway
• Formation of triglycerides

Again, this “genetic code” is redundant


But, where is the computation in biology??


Bioinformatics

[Figure: Bioinformatics shown among its contributing fields: Biology; Statistics, Mathematics; Informatics, Computer Science; Physics, Chemistry, Engineering, Linguistics, ...]


Bio – related fields

● Computational Molecular Biology
● Bioinformatics
● Theoretical Biology
● Biomedical Informatics
● ...

● Where are the limits?


A (fuzzy?) definition of Bioinformatics

● Bioinformatics is the “computational handling and processing of genetic information”

Ouzounis & Valencia, 2003


Handling Genetic Information

● Apply existing (or develop custom) efficient methods for
  – Describing and Visualizing
  – Storing
  – Retrieving
  – Integrating
● Large volumes of complex and inhomogeneous data*

*some still call it “Designing and Building Biological Databases”
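As a toy illustration of the storing, retrieving and describing steps, the following sketch keeps a couple of sequences in an in-memory SQLite database using Python's standard `sqlite3` module. The schema, the `demo:` accessions and the function names are all hypothetical and chosen for this example; real biological databases are far richer, and integration across sources is not shown.

```python
import sqlite3

# Hypothetical minimal schema: one row per sequence record.
SCHEMA = """
CREATE TABLE sequences (
    accession   TEXT PRIMARY KEY,
    molecule    TEXT NOT NULL CHECK (molecule IN ('DNA', 'RNA', 'protein')),
    description TEXT NOT NULL,
    residues    TEXT NOT NULL
)
"""

# Made-up accessions; the sequences themselves are taken from earlier slides.
DEMO_RECORDS = [
    ("demo:1", "protein", "Pig insulin, chain B", "FVNQHLCGSHLVEALYLVCGERGFFYTPKA"),
    ("demo:2", "DNA", "Example DNA fragment", "ACTGTCTGACCGGCAGCA"),
]


def build_store(records):
    """Storing: create an in-memory database and load the records."""
    conn = sqlite3.connect(":memory:")
    conn.execute(SCHEMA)
    conn.executemany("INSERT INTO sequences VALUES (?, ?, ?, ?)", records)
    conn.commit()
    return conn


def retrieve(conn, accession):
    """Retrieving: fetch one record by accession (None if absent)."""
    return conn.execute(
        "SELECT molecule, description, residues FROM sequences WHERE accession = ?",
        (accession,),
    ).fetchone()


def describe(conn):
    """Describing: record count and total length per molecule type."""
    return conn.execute(
        "SELECT molecule, COUNT(*), SUM(LENGTH(residues)) "
        "FROM sequences GROUP BY molecule ORDER BY molecule"
    ).fetchall()


if __name__ == "__main__":
    store = build_store(DEMO_RECORDS)
    print(retrieve(store, "demo:1"))
    print(describe(store))
```

Using parameterized queries (the `?` placeholders) rather than string formatting keeps retrieval safe when accessions come from user input.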


Handling Genetic Information (part II)

● Particular attention:
  – Origin and Quality of Biological Data
  – Data Annotation [Expert-based, (semi-)automatic]
  – Interconnectivity
  – Friendly to the end-user


Processing Genetic Information

● Analysing biological data
  – AIM I: ADDRESSING BIOLOGICAL questions.
    ● What makes Frodo Baggins (the Hobbit) differ from Spiderman? (consider that Spiderman's kitsch costume is not a valid answer)
    ● Does molecule A interact with molecule B?
    ● What is the 3D structure adopted by X?
    ● How does the 3D structure of a molecule specify its function?
  – AIM II: ADDRESSING other SCIENTIFIC or TECHNICAL questions


Processing Genetic Information (part II)

● Other questions???
  – Which is the optimal way to store genome data in a database?
  – How can I represent sequences belonging to a family with a statistical model?
  – How can I obtain the optimal pairwise DNA/RNA/Protein sequence alignment?
  – Is there any statistical measure for indicating the significance of a sequence comparison score?
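Two of the questions above, optimal pairwise alignment and the significance of its score, can be sketched in a few lines. The example below uses Needleman-Wunsch global alignment by dynamic programming with an assumed toy scoring scheme (match +1, mismatch -1, gap -1), and estimates significance empirically by re-scoring against shuffled copies of one sequence. These are illustrative choices, not the methods of any particular tool; production software typically uses substitution matrices and analytical score statistics.

```python
import random


def nw_score(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -1) -> int:
    """Optimal global alignment score (Needleman-Wunsch), keeping one DP row."""
    prev = [j * gap for j in range(len(b) + 1)]   # aligning "" against b[:j]
    for i, x in enumerate(a, start=1):
        curr = [i * gap]                          # aligning a[:i] against ""
        for j, y in enumerate(b, start=1):
            diag = prev[j - 1] + (match if x == y else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def shuffle_p_value(a: str, b: str, n_shuffles: int = 200, seed: int = 0) -> float:
    """Empirical p-value: share of shuffled versions of b scoring >= observed.

    Shuffling preserves the composition of b but destroys its order; the +1
    terms keep the estimate strictly above zero.
    """
    rng = random.Random(seed)
    observed = nw_score(a, b)
    letters = list(b)
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(letters)
        if nw_score(a, "".join(letters)) >= observed:
            hits += 1
    return (hits + 1) / (n_shuffles + 1)


if __name__ == "__main__":
    chain_b = "FVNQHLCGSHLVEALYLVCGERGFFYTPKA"   # insulin chain B from the slides
    print(nw_score("GATTACA", "GCATGCG"))
    print(shuffle_p_value(chain_b, chain_b))
```

The dynamic programme guarantees the best score under the chosen scheme; the shuffling test then asks how often sequences of the same composition would reach that score by chance.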


A parenthesis (...) for solving a common misunderstanding

● Traditional biologists often see Bioinformatics as a “Black box”
  – i.e., predict, then go back in the lab to confirm with experiment …
● However,
  – the computational approach to addressing biological problems is an experimental field on its own
  – a single difference: experiments are performed in silico.


And finally ... what do you mean by “Genetic Information”?

● It can be quite generic
  – Nucleotide and amino acid sequences
  – Three dimensional molecular structures (proteins, DNA, RNA, sugars, drugs, ...)
  – Gene expression data
  – Molecular interaction networks
  – Complex biological systems (cells, tissues, organisms, ...)
  – ... even text in the biomedical literature ...


OMICS

● GenOMICS
● TranscriptOMICS
● ProteOMICS
● MetabolOMICS
● KinOMICS
● PhylogenOMICS
● EpitOMICS

even more ...
  – BibliOMICS
  – DegradOMICS

Also be aware: ??? cOMICS ???

A comprehensive list may be found at the URL http://www.genomicglossaries.com/content/omes.asp


Importantly ...

● Freely available data
● Accessible software [free/open software]


شكر · thanks · આભાર · merci · शुक्रिया · ευχαριστώ