Encode RNA Dashboard

Preview:

DESCRIPTION

The ENCODE project has produced a massive amount of transcriptome data, made possible by the collaboration of a world wide consortium of laboratories. During the project it was critical to immediately know what data was being produced by which lab. The ENCODE RNA dashboard kept the researchers informed about new results, even before they were officially registered with the ENCODE Data Coordination Center (DCC). It was instrumental for management to have direct insight into the current state of the project at any given point in time. Collaborators could quickly proceed with their own analysis steps once the raw data and processed results were published on the dashboard by other groups. The dashboard was also enriched with direct links to additional summary statistics that had been published using the Grape RNA-Seq pipeline. Now that the project has yielded its results, the ENCODE dashboard still remains the only central place collecting all the RNA data produced by the ENCODE project. The international research community can explore the wide range of experiments, and quickly find and download the exact data sets they need for their own data analysis. The dashboard is not only useful for web access, but command line users will enjoy the friendly batch processing capabilities. There is a huge demand to provide the same kind of dashboard for additional ENCODE projects, and with the new version of our dashboard software package, the system can even be extrapolated to any other bioinformatics project having to deal with a lot of data. For example, the ENCODE Mouse (Mus musculus) dashboard is one of the upcoming dashboards, replicating the success of the ENCODE hg19 (Homo sapiens) dashboard.

Citation preview

ENCODE RNA Dashboard

HUB5 - Heidelberg21. March 2013

Maik RöderPython Consultantroeder@berg.net

http://genome.crg.es/encode_RNA_dashboard

Thursday, March 21, 13

ENCODE RNA Dashboard• The ENCODE project has produced

massive amount of transcriptome data

• How to keep up to date during production, even before submission to the ENCODE Data Coordination Center (DCC)?

• Direct links to additional summary statistics produced by the Grape RNA-Seq pipeline

Thursday, March 21, 13

USCS Genome Browser - ENCODE

• First, a look at the current UCSC genome browser Experiment Matrix for ENCODE

• http://genome.ucsc.edu/ENCODE/

Thursday, March 21, 13

ENCODE Experiment Matrix (UCSC)

RNA-Seq

Files

Thursday, March 21, 13

Search + List (UCSC)

Thursday, March 21, 13

Mouse Experiment Matrix (UCSC)

Thursday, March 21, 13

Search + List (UCSC)

Thursday, March 21, 13

ENCODERNA dashboard

• http://genome.crg.es/encode_RNA_dashboard/hg19/

• Summary of transcriptome data production

• Overview

• Exploration

• Batch Download

Thursday, March 21, 13

ENCODE RNA DashboardHuman (hg19)

Exploration

Thursday, March 21, 13

ExplorationClick to make file details appear in

place

Click again to make list disappear and continue exploring

Thursday, March 21, 13

ENCODE RNA Dashboard Mouse (mm9)

Thursday, March 21, 13

Exploration

Thursday, March 21, 13

Today• ENCODE dashboard still remains the only

central place collecting all the RNA data produced by the ENCODE project

• The international research community can explore the wide range of experiments

• Quickly find and download the exact data sets they need for their own data analysis.

• command line users will enjoy the friendly batch processing capabilities

Thursday, March 21, 13

Upcoming

• Dashboard generation tool

• Reimplementation in Python using Pandas

• Methods Paper to be published soon

Thursday, March 21, 13

Thanks• This work has been supported by the Centre for

Genomic Regulation (CRG) in Barcelona

• Roderic Guigó

• Julien Lagarde

• Learn about Pandas at the Python Meetup:

• Monday, April 15th

• http://www.meetup.com/HeidelbergPython

Thursday, March 21, 13

Recommended