18
Bioinformatics For MNW 2 nd Year Jaap Heringa FEW/FALW Integrative Bioinformatics Institute VU (IBIVU)

Bioinformatics For MNW 2 nd Year

Embed Size (px)

DESCRIPTION

Bioinformatics For MNW 2 nd Year. Jaap Heringa FEW/FALW Integrative Bioinformatics Institute VU (IBIVU) [email protected], www.cs.vu.nl/~ibivu, Tel. 47649, Rm R4.41. Current Bioinformatics Unit. Jens Kleinjung (1/11/02) Victor Simosis – PhD (1/12/02) Radek Szklarczyk - PhD (1/01/03) - PowerPoint PPT Presentation

Citation preview

Page 1: Bioinformatics  For MNW 2 nd  Year

Bioinformatics For MNW 2nd Year

Jaap Heringa

FEW/FALW

Integrative Bioinformatics Institute VU (IBIVU)

[email protected], www.cs.vu.nl/~ibivu, Tel. 47649, Rm R4.41

Page 2: Bioinformatics  For MNW 2 nd  Year

Current Bioinformatics Unit

• Jens Kleinjung (1/11/02)

• Victor Simosis – PhD (1/12/02)

• Radek Szklarczyk - PhD (1/01/03)

• John Romein (1/12/02, Henri Bal)

Page 3: Bioinformatics  For MNW 2 nd  Year

Bioinformatics course 2nd year MNW spring 2003

• Pattern recognition– Supervised/unsupervised learning– Types of data, data normalisation, lacking data– Search image– Similarity tables– Clustering– Principal component analysis– Discriminant analysis

Page 4: Bioinformatics  For MNW 2 nd  Year

Bioinformatics course 2nd year MNW spring 2003

• Protein– Folding– Structure and function– Protein structure prediction– Secondary structure– Tertiary structure– Function– Post-translational modification – Prot.-Prot. Interaction -- Docking algorithm– Molecular dynamics/Monte Carlo

Page 5: Bioinformatics  For MNW 2 nd  Year

Bioinformatics course 2nd year MNW spring 2003

• Sequence analysis– Pairwise alignment– Dynamic programming (NW, SW, shortcuts)– Multiple alignment– Combining information– Database/homology searching (Fasta, Blast,

Statistical issues-E/P values)

Page 6: Bioinformatics  For MNW 2 nd  Year

Bioinformatics course 2nd year MNW spring 2003

• Gene structure and gene finding algorithm• Omics

– DNA makes RNA makes protein– Expression data, Nucleus to ribosome, translation, etc.– Metabolomics– Physiomics– Databases

• DNA, EST• Protein sequence• Protein structure

Page 7: Bioinformatics  For MNW 2 nd  Year

Bioinformatics course 2nd year MNW spring 2003

o Microarray data

o Protein structure (PDB)

o Proteomics

o Mass spectrometry/NMR/X-ray?

Page 8: Bioinformatics  For MNW 2 nd  Year

Bioinformatics course 2nd year MNW spring 2003

• Bioinformatics method development• IPR issues• Programming and scripting languages• Web solutions• Computational issues

– NP-complete problems– CPU, memory, storage problems– Parallel computing

• Bioinformatics method usage/application• Molecular viewers (RasMol, MolMol, etc.)

Page 9: Bioinformatics  For MNW 2 nd  Year

Gathering knowledge

• Anatomy, architecture

• Dynamics, mechanics

• Informatics(Cybernetics – Wiener, 1948) (Cybernetics has been defined as the science of control in machines and animals, and hence it applies to technological, animal and environmental systems)

• Genomics, bioinformatics

Rembrandt, 1632

Newton, 1726

Page 10: Bioinformatics  For MNW 2 nd  Year

MathematicsStatistics

Computer ScienceInformatics

BiologyMolecular biology

Medicine

Chemistry

Physics

Bioinformatics

Bioinformatics

Page 11: Bioinformatics  For MNW 2 nd  Year

Bioinformatics

“Studying informational processes in biological systems” (Hogeweg, early

1970s)• No computers necessary• Back of envelope OK

Applying algorithms with mathematical formalisms in biology (genomics) -- USA

“Information technology applied to the management and analysis of biological data” (Attwood and Parry-Smith)

Page 12: Bioinformatics  For MNW 2 nd  Year

Bioinformatics in the olden days• Close to Molecular Biology:

– (Statistical) analysis of protein and nucleotide structure

– Protein folding problem– Protein-protein and protein-nucleotide

interaction

• Many essential methods were created early on (BG era)– Protein sequence analysis (pairwise and

multiple alignment)– Protein structure prediction (secondary, tertiary

structure)

Page 13: Bioinformatics  For MNW 2 nd  Year

Bioinformatics in the olden days (Cont.)

• Evolution was studied and methods created– Phylogenetic reconstruction (clustering – NJ

method

Page 14: Bioinformatics  For MNW 2 nd  Year

The Human Genome -- 26 June 2000

Page 15: Bioinformatics  For MNW 2 nd  Year

The Human Genome -- 26 June 2000

Dr. Craig Venter

Celera Genomics

-- Shotgun method

Sir John Sulston

Human Genome Project

Page 16: Bioinformatics  For MNW 2 nd  Year

Human DNA

• There are about 3bn (3 109) nucleotides in the nucleus of almost all of the trillions (3.5 1012 ) of cells of a human body (an exception is, for example, red blood cells which have no nucleus and therefore no DNA) – a total of ~1022 nucleotides!

• Many DNA regions code for proteins, and are called genes (1 gene codes for 1 protein in principle)

• Human DNA contains ~30,000 expressed genes • Deoxyribonucleic acid (DNA) comprises 4 different

types of nucleotides: adenine (A), thiamine (T), cytosine (C) and guanine (G). These nucleotides are sometimes also called bases

Page 17: Bioinformatics  For MNW 2 nd  Year

Human DNA (Cont.)

• All people are different, but the DNA of different people only varies for 0.2% or less. So, only 2 letters in 1000 are expected to be different. Over the whole genome, this means that about 3 million letters would differ between individuals.

• The structure of DNA is the so-called double helix, discovered by Watson and Crick in 1953, where the two helices are cross-linked by A-T and C-G base-pairs (nucleotide pairs – so-called Watson-Crick base pairing).

Page 18: Bioinformatics  For MNW 2 nd  Year

Tot hier 03/02/2003 – 10.45-12.30