26
CNIT Final Presentation Chris Thompson April 18 th , 2013 CNIT 227

Cnit final presentation

Embed Size (px)

Citation preview

Page 1: Cnit final presentation

CNIT Final PresentationChris ThompsonApril 18th, 2013

CNIT 227

Page 2: Cnit final presentation

Introduction

Materials

Methods

Results and Conclusion

Table of Contents

Page 3: Cnit final presentation

INTRODUCTION

Page 4: Cnit final presentation

Bioinformatics

Bioinformatics – an interdisciplinary field that develops and improves upon methods for storing, retrieving, organizing, and analyzing biological data.

Bioinformatics is important because without the technologies produced and developed through it, many of the experiments and assays we do today would not be possible.

Page 5: Cnit final presentation

CNIT

CNIT is the bioinformatics course at Purdue, focused on annotating the genome of mycobacteriophages.

Overall goal is to annotate the genome of the RiverMonster phage, so other researchers can use it in the future.

Page 6: Cnit final presentation

Bacteriophages• A virus that infects and replicates in bacteria• One of the most common and populous

organism in existence • Many have a mosaic genome• Unlimited potential usage• Mycobacteriophages infect M.smegmatis

Page 7: Cnit final presentation

Clusters

• System to organize bacteriophages• Phages sorted by factors such as genome

length, presence of certain genes, organization of genome, GC content, and plaque size and characteristics

• A, B, C, D, E, F, G, H, I, J, K, L, M, N, O, P, Q, R, S, Singleton, and T

Page 8: Cnit final presentation

RiverMonster

• Discovered in 2010 in West Lafayette• Mycobacteriophage• Cluster E• 144 genes in total• Many protein products are unknown• Overall geographical presence is unknown

Through CNIT and bioinformatics we are trying to answer some of the unknowns about RiverMonster

Page 9: Cnit final presentation

MATERIALS

Page 10: Cnit final presentation

Bioinformatics Tools

• DNA Master• Phamerator• Glimmer• GeneMark• NCBI and BLAST• EverNote

Page 11: Cnit final presentation

DNA Master• Designed and written by Dr. Jeffrey Lawrence• Annotation program• Can auto-annotate entire genomes• Uses information from Glimmer and GeneMark• Can locally BLAST genes

Page 12: Cnit final presentation

Phamerator

• Developed in 2011• Linux-based bioinformatic program• Used for comparative phage genomics• Can visualize entire phage genomes• Separates phages into “phams”

Page 13: Cnit final presentation

Glimmer

• Stands for Gene Locator and Interpolated Markov ModelER

• Used for finding genes in microbial DNA• Uses models and algorithms to distinguish between

coding and non-coding DNA

Page 14: Cnit final presentation

GeneMark

• A family of gene prediction programs developed at the Georgia Institute of Technology

• Determines the protein-coding potential of a DNA sequence

• Uses many of the same algorithms and models as GIimmer

Page 15: Cnit final presentation

NCBI and BLAST• National Center for Biotechnology Information• Basic Local Alignment Search Tool• Program that compares DNA sequences with a large

database of known sequences• Used to find similar gene sequences

Page 16: Cnit final presentation

EverNote

• Started in 2008• Designed for note-taking and archiving• Used as an online lab notebook for CNIT

Page 17: Cnit final presentation

METHODS

Page 18: Cnit final presentation

Organization

• Genome split into two sections• Genes 0 to 65 by Jon and Bill• Genes 66 to 144 by Chris and Nyema• Split again into four sections• 0 to 23 by Jon• 24 to 65 by Bill• 100 to 123 by Chris• 124 to 144 by Nyema

Page 19: Cnit final presentation

Process

• Document the auto-annotated gene call• Ran the Shine-Delgarno Test• BLASTed gene and compared scores• Compared homologous genes in Phamerator• Made final call

Page 20: Cnit final presentation

First Section

• Genes 66 to 144• Split up evens and odds• I had even numbered genes• No outstandingly tricky gene calls• Gene 88 seems to be a family of Kinases, many of

them hypothetical• Gene 92 is a family of RNA ligases• Gene 94 is Transcription factor WhiB

Page 21: Cnit final presentation

Second Section

• Genes 101 to 123• Every gene• Gene 101 is a protease family• Gene 112 contains genes for polymerases• Genes 116 and 117 were reverse genes• 117 had many inconsistencies and was difficult to call

Page 22: Cnit final presentation

RESULTS AND CONCLUSION

Page 23: Cnit final presentation

Accomplishments

• Personally called 39 genes• Called 144 genes as a class• Analyzed protein products• Completed a final draft of the RiverMonster genome

Page 24: Cnit final presentation

Significance

• Genome can be used by future scientists• Proves validity of undergraduate research• Learned about bioinformatics, bacteriophages,

genomes, annotation, and biotechnology

Page 25: Cnit final presentation

Future Work

• Check and finalize all gene calls• Compilation of DNA Master file• Send to HHMI and SEA Phages to be put in

Phamerator

Page 26: Cnit final presentation

The End