Biomedical Information Technology 2.771J BEH.453J HST.958J Spring 2005 Lecture 5 April 2005 Gel Electrophoresis © cfdewey 2005


Biomedical Information Technology

2.771J BEH.453J HST.958J Spring 2005

Lecture 5 April 2005

Gel Electrophoresis

© cfdewey 2005


Gel Electrophoresis

• Statement of experimental problem
• Defining experimental information objects
• The origins of ExperiBase
• Design of database schema
• Keeping raw image data
• Interaction with external analysis packages
• Creating individual tables and functions
• User query functions for applications


Statement of the experimental problem

• Identify the key experimental results and their natural electronic form (text, image, ...)
• Identify all parameters that define the experimental conditions
• Create a complete set of parameters to define the sample that is being analyzed
• Design the information object to accommodate changes in the experimental protocol


Generalize your customer

• What new items will the next researcher have?
• What will the next generation of equipment demand?
• What queries will be run against the data?
• Will you support metadata (computed results) as well?


Define representative use cases

• What procedure will an experimenter perform?
• Who will query the data?
• What queries will be run against the data?
• What user interfaces are available?
• What user interface functionality is required?


Development process (from NCICB)

• Initial high-level Use Cases were developed for gene components.
• Additional high-level Use Cases were added as additional functional areas were mapped (e.g. pathways, therapies, microarray objects, mouse models).
• Detailed Use Cases are derived from working with domain experts in requirements gathering.


[Use case diagram. Actor: RESEARCHER. Use cases: View gene list; Link to external site; Find gene by attribute; Browse CGAP/GAI site; View gene information; Find genes by sequence; Find gene by name; Search gene information; Browse CGAP web site.]

Figure by MIT OCW.


Background on ExperiBase

ExperiBase is conceived as an integrated software platform capable of supporting a variety of experimental methods found in modern biology. It provides: comprehensive database features for searching, classifying, and supporting; web-based client services; data import and export capabilities to accommodate other data repositories; and direct support for metadata produced by analysis programs.

Because ExperiBase includes many different experimental methods, and the semantic contexts that support them, the creation of this software platform will provide a major contribution to normalizing semantic support for these different methods.


Gel electrophoresis: the first modality of ExperiBase


Manipulation of the gel images is required

More contrast

Less intensity
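
The two adjustments named on this slide can be illustrated with a simple linear pixel transform. The sketch below is not taken from the lecture: it assumes an 8-bit grayscale image held as nested lists, and the function name `adjust` and the sample pixel values are invented for illustration. It returns a new image, so the raw data stays untouched.

```python
from typing import List

Image = List[List[int]]  # assumed representation: rows of 8-bit gray values


def adjust(image: Image, contrast: float = 1.0, intensity: int = 0) -> Image:
    """Scale each pixel about mid-gray (128), shift it, and clamp to 0..255."""
    def transform(p: int) -> int:
        value = round(contrast * (p - 128) + 128 + intensity)
        return max(0, min(255, value))
    # Build a new image so the raw pixel data is never modified in place.
    return [[transform(p) for p in row] for row in image]


gel = [[100, 128, 160], [30, 200, 250]]      # made-up pixel values
more_contrast = adjust(gel, contrast=1.5)    # spreads values away from 128
less_intensity = adjust(gel, intensity=-40)  # darkens every pixel
```

Keeping the raw image and recording only the adjustment parameters is one way to make the manipulation repeatable; whether ExperiBase stores it this way is not stated in the slides.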


Work flow for gel electrophoresis

[Workflow diagram labels: Image analyzed; Analysis saved with object; Database]

Photo of SDS-PAGE gel plate removed for copyright reasons.


Classes of information for gel electrophoresis

• Sample information
  - Source
  - Description
• Instrument description
  - Manufacturer & model
  - Settings: time, gel, voltage, etc.
• Preconditioning
  - Process method
  - Process materials
  - Process duration
• Raw image data
• Post-processing protocols
• Derived data from image
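
One way to picture these classes of information is as a nested record. The sketch below is an assumption-laden illustration, not the ExperiBase schema: every class name, field name, and sample value is hypothetical. Settings and derived data are kept as open mappings so that a changed protocol can add entries without a schema change.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Sample:
    source: str
    description: str


@dataclass
class Instrument:
    manufacturer: str
    model: str
    # Free-form settings (time, gel, voltage, ...) so new ones need no new field.
    settings: Dict[str, str] = field(default_factory=dict)


@dataclass
class Preconditioning:
    method: str
    materials: List[str]
    duration_min: float


@dataclass
class GelExperiment:
    sample: Sample
    instrument: Instrument
    preconditioning: Preconditioning
    raw_image: bytes  # raw image data is kept alongside everything else
    post_processing: List[str] = field(default_factory=list)
    derived_data: Dict[str, float] = field(default_factory=dict)


# All values below are invented for illustration.
experiment = GelExperiment(
    sample=Sample(source="hypothetical cell lysate", description="example only"),
    instrument=Instrument("ExampleCo", "Model X", {"voltage": "120 V", "time": "45 min"}),
    preconditioning=Preconditioning("denaturation", ["SDS"], 5.0),
    raw_image=b"\x00\x01\x02",
)
experiment.post_processing.append("contrast adjustment")
experiment.derived_data["band_count"] = 3.0
```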


Platform-independent Web-enabled user interface


Architecture diagram labels:

• Request from Client (JSP and Java Servlet)
• Clients and Stubs (CORBA Interface)
• Database Server and Authentication (CORBA Interface in Java)
• Class Mappers and JDBC
• Query in Databases
• Output to Client (JSP and Java Servlet)
• Core Engine
  - Interpreter/Authentication
  - Encoder/Format
• XML/SOAP Miscellaneous Servers
  - Excel Spreadsheet Parser (Perl)
  - XML/HTML Producer
  - ......
• Image Classes
  - Image Converter
  - Image Reader
  - Image Writer


Excel to XML Parser

Diagram labels:

• Clients and Stubs (CORBA Interface)
• XML/SOAP Excel Server (CORBA Interface in Perl)
  - Excel Spreadsheet Parser (Perl)
  - XML/HTML Producer
• XML Parser (parses XML to elements)
• Database Server and Authentication (CORBA Interface in Java)
• Class Mappers and JDBC
• Database (data_xml, data_html, data_excel)

Based on Spreadsheet-ParseExcel-0.26 by Kawai Takanori and Xerces Java Parser by xml.apache.org
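
The spreadsheet-to-XML step can be sketched in a few lines. The slides name Perl's Spreadsheet-ParseExcel and the Xerces Java parser. The sketch below swaps in Python's standard `xml.etree.ElementTree` purely for illustration and assumes the cells have already been read into lists. The element and attribute names (`sheet`, `row`, `cell`, `index`, `column`) are hypothetical, not the format ExperiBase produced.

```python
import xml.etree.ElementTree as ET
from typing import List


def sheet_to_xml(sheet_name: str, rows: List[List[str]]) -> str:
    """Wrap already-parsed spreadsheet cells in a simple XML tree and serialize it."""
    sheet = ET.Element("sheet", name=sheet_name)
    for r, row in enumerate(rows, start=1):
        row_el = ET.SubElement(sheet, "row", index=str(r))
        for c, value in enumerate(row, start=1):
            cell = ET.SubElement(row_el, "cell", column=str(c))
            cell.text = value
    return ET.tostring(sheet, encoding="unicode")


rows = [["Time (min)", "Akt (norm)"], ["0", "1.00"]]  # made-up cell values
xml_text = sheet_to_xml("kinase_assay", rows)

# Parsing the text back gives the element-level view a database loader could walk.
root = ET.fromstring(xml_text)
first_row = [cell.text for cell in root.find("row")]
```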


Customized Data Selection: choose interesting columns and rows

Diagram labels:

• Excel Spreadsheets
• Keyword Collection
• Selected Keyword as Row and Column’s Name
• Results

data_excel Table (uniqueid, subid, sheetname, row#, column#, data)
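
A cell-per-row table like data_excel lets a keyword act as a column selector. The sketch below is a guess at how such a selection could work, using an in-memory SQLite database. The column names `row_num` and `column_num` stand in for `row#` and `column#` (which are not plain SQL identifiers), the column types are assumed, and the function `select_column` and all cell values are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """CREATE TABLE data_excel (
           uniqueid   INTEGER,
           subid      INTEGER,
           sheetname  TEXT,
           row_num    INTEGER,  -- stands in for row#
           column_num INTEGER,  -- stands in for column#
           data       TEXT
       )"""
)
cells = [  # made-up spreadsheet: a header row followed by two data rows
    (1, 1, "Sheet1", 1, 1, "Time (min)"),
    (1, 1, "Sheet1", 1, 2, "Akt (norm)"),
    (1, 1, "Sheet1", 2, 1, "0"),
    (1, 1, "Sheet1", 2, 2, "1.00"),
    (1, 1, "Sheet1", 3, 1, "5"),
    (1, 1, "Sheet1", 3, 2, "1.85"),
]
conn.executemany("INSERT INTO data_excel VALUES (?, ?, ?, ?, ?, ?)", cells)


def select_column(conn, keyword):
    """Return the values stored below the header cell whose text equals keyword."""
    query = """
        SELECT d.data
        FROM data_excel AS h
        JOIN data_excel AS d
          ON d.uniqueid = h.uniqueid
         AND d.subid = h.subid
         AND d.sheetname = h.sheetname
         AND d.column_num = h.column_num
         AND d.row_num > h.row_num
        WHERE h.data = ?
        ORDER BY d.row_num
    """
    return [row[0] for row in conn.execute(query, (keyword,))]


akt_values = select_column(conn, "Akt (norm)")
time_values = select_column(conn, "Time (min)")
```

Storing one cell per row trades query simplicity for flexibility: spreadsheets with any layout fit the same table, at the cost of a self-join to reassemble a column.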


Keyword Collections for Excel Spreadsheets

protocol       word                wordcount
kinase_assay   Akt (norm)          4
kinase_assay   ERK2 (norm)         4
kinase_assay   JNK1 (norm)         4
kinase_assay   Replicate #         4
kinase_assay   Time (min)          4
westernblot    Dec7 correction     1
westernblot    Height              14
westernblot    High/MEK            2
westernblot    IFN 24h             62
westernblot    IFN 24h, TNF 12h    62
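
A keyword collection of this shape could be built by tallying header words per protocol. The slides do not say how wordcount was computed, so the sketch below simply assumes it is the number of times a word occurs across the spreadsheets of a protocol. The function name `collect_keywords` and the sample observations are hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def collect_keywords(headers: Iterable[Tuple[str, str]]) -> List[Tuple[str, str, int]]:
    """Count (protocol, word) pairs and return (protocol, word, wordcount) rows, sorted."""
    counts = Counter(headers)
    return sorted((protocol, word, n) for (protocol, word), n in counts.items())


observed = [  # made-up header occurrences, one tuple per spreadsheet cell
    ("kinase_assay", "Time (min)"),
    ("kinase_assay", "Time (min)"),
    ("kinase_assay", "Akt (norm)"),
    ("westernblot", "Height"),
]
keyword_rows = collect_keywords(observed)
```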


Customized Data Selection



Results of customized data selection



Users can view Excel spreadsheets in HTML and XML formats


HTML format of Excel spreadsheet


XML format of Excel spreadsheet


List All Experiments
