30
Unlocking breeding potential of African crops through data management an example with CASSAVABASE Guillaume Bauchet Plant and Animal Genome Conference San Diego January 2016 [email protected]

Cassavabase general presentation PAG 2016

Embed Size (px)

Citation preview

Page 1: Cassavabase general presentation PAG 2016

Unlocking breeding potential of African crops through data management an

example with CASSAVABASE

Guillaume Bauchet

Plant and Animal Genome Conference San Diego January 2016

[email protected]

Page 2: Cassavabase general presentation PAG 2016

OUTLINE

http://nextgencassava.org/

CASSAVABASE , What  for?

CASSAVABASE , a  user  perspective

CASSAVABASE , search,  manage,  analyze

CASSAVABASE , a  view

Page 3: Cassavabase general presentation PAG 2016

The  Central  data  store  for  NEXTGEN CASSAVA :Genomic  selection  in  African  cassava  breeding  programs

http://nextgencassava.org/

Page 4: Cassavabase general presentation PAG 2016

NEXTGEN CASSAVA

Page 5: Cassavabase general presentation PAG 2016

What are the major challenges?

Page 6: Cassavabase general presentation PAG 2016

● Multi trait and Multi breeding environments for cassava phenotypic data collection

● Large scale production of genomic data using GBS

● Integrate Genomic Selection tool via web interface

What are the major challenges?

● Make the most of this resource for cassava breeders: speed up the analysis and decision making

Page 7: Cassavabase general presentation PAG 2016

What are the needs?● Search various data types (phenotypes and germplasm) in a large datastore

●Manage data and daily breeding activity through comprehensive interface

●Analyse and retrieve data for genomic assisted breeding

What are our solutions?● Integrate phenomic & genomic data with breeding tools

●Use Perl with the Bio::Chado::Schema and Natural Diversitymodule as database architecture

●Retrieve genomic information

●Sequence visualization ●Open source

https://github.com/solgenomics/

Page 8: Cassavabase general presentation PAG 2016

http://cassavabase.org/

Page 9: Cassavabase general presentation PAG 2016

New search bar

Navigation bar always visible on top Expandable search box

Page 10: Cassavabase general presentation PAG 2016

Caroussel

Page 11: Cassavabase general presentation PAG 2016

New responsive design

Page 12: Cassavabase general presentation PAG 2016

CASSAVABASEby numbers

2016: + 80,000 accessions, 2,5 billion genetic observations

2014:

+360 registered users

Page 13: Cassavabase general presentation PAG 2016

From Phenotype to Genotype to Breeding: Harvesting the fruits of CASSAVABASE

Page 14: Cassavabase general presentation PAG 2016

CASSAVABASE, an Office perspective: Search

Search breeding program, location, trial, trait, year, accession

Page 15: Cassavabase general presentation PAG 2016

CASSAVABASE, a field perspective: Manage Phenotypes

Define phenotypic traits via Cassava trait dictionaryin CASSAVABASE

Data collection

via FieldBookapp*

Design trials, barcodes & field maps

in CASSAVABASE*

Data uploading in CASSAVABASE

via .xls and .txt file **See Alex Ogbonna PAG presentation

“Managing Phenotypic Data through Cassavabase with Fieldbook App”“

Data analysis in CASSAVABASE

-Sum. stat-ANOVA-BLUP-GSIn CASSAVABASE

Page 16: Cassavabase general presentation PAG 2016

Design genotyping Trial in CASSAVABASE

TASSEL pipeline

Data filtering &

imputationGBS data uploading

In CASSAVABASE

GS Analysis & Visualization

in CASSAVABASE

GBS facility @ Cornell

CASSAVABASE, a lab perspective: Manage Genotypes

Page 17: Cassavabase general presentation PAG 2016

CASSAVABASE an office perspective: ManageBreeding programs, trial, accession

Page 18: Cassavabase general presentation PAG 2016

CASSAVABASE : Analyze with SolGS

Phenotypic values Population Structure GEBV vs phenotypes

See Isaak Tecle PAG presentation & poster 342“solGS: A Web-based Solution for Genomic Selection”

GEBV

Page 19: Cassavabase general presentation PAG 2016

CASSAVABASE : Analyze with SolGS

Page 20: Cassavabase general presentation PAG 2016

CASSAVABASE from the Office: Analyze phenotypesQC to phenotypes

Single trial

Page 21: Cassavabase general presentation PAG 2016

CASSAVABASE from the Office: Analyze phenotypesQC to phenotypes

Single trial

Page 22: Cassavabase general presentation PAG 2016

CASSAVABASE tools: Analyze pedigree

Page 23: Cassavabase general presentation PAG 2016

CASSAVABASE from the Office: Analyze phenotypes

data_2011_B1

4 6 8 10

r= 0.68

p<0.001

r= 0.66

p<0.001

4 6 8 10 14

r= 0.70

p<0.001

46

810

12

r= 0.63

p<0.001

46

810

data_2011_B2

r= 0.76

p<0.001

r= 0.79

p<0.001

r= 0.73

p<0.001

data_2011_B3

r= 0.76

p<0.001

46

810

r= 0.68

p<0.001

46

810

14 data_2012_B1

r= 0.75

p<0.001

4 6 8 10 12 4 6 8 10 4 6 8 12

46

812

data_2012_B2

30 31 32 33 34 35 36 37

-1.5

-0.5

0.5

1.5

Fitted values

Residuals

Residuals vs Fitted

26

9

15

-2 -1 0 1 2

-10

12

Theoretical Quantiles

Sta

ndar

dize

d re

sidu

als

Normal Q-Q

26

9

15

30 31 32 33 34 35 36 37

0.0

0.4

0.8

1.2

Fitted values

Standardized residuals

Scale-Location269

15

0.0 0.1 0.2 0.3 0.4 0.5

-2-1

01

2

Leverage

Sta

ndar

dize

d re

sidu

als

Cook's distance

Residuals vs Leverage

9

2615

ANOVA, h2,

BLUP, GxE

QC phenotypesMultiple trials

Page 24: Cassavabase general presentation PAG 2016

JBrowse

CASSAVABASE tools: Analyze sequence

Variant effects

prediction

Page 25: Cassavabase general presentation PAG 2016

VIGS tool

CASSAVABASE tools: Analyze sequence

BLAST

Page 26: Cassavabase general presentation PAG 2016

CASSAVABASE, a User perspective: support & interaction

Page 27: Cassavabase general presentation PAG 2016

CASSAVABASE, a User perspective: support & interaction

-> Provide support on technical issues ( data management)-> Gather user request for tool improvement and new developments (pedigree queries, VIGS)-> 2016: Install Mirror site @ IITA Ibadan, Nigeria

Weekly meetings with users in Africa: Wiki, FB pages & mailing list:

Page 28: Cassavabase general presentation PAG 2016

CASSAVABASE Upcoming developments

Search: Integrate trait & values in the wizard search

Manage: extract data subset according to their phenotypicvalues, conditionnal choices

Analyze: -Phenotypic analysis developments (ANOVA, GxE)-Pedigree analysis-Jbrowse: Mutation prediction of genetic variants-SolGS: Jobs queuing, trial selection improvement

Page 29: Cassavabase general presentation PAG 2016

LukasMueller

AlexOgbonna

BryanEllerbrock

NaamaMenda

IsaakTecle

NickMorales

AKNOWLEDGEMENTS

Jeremy Edwards

BMGF

ChiedozieEgesi

PeterKulakow

Robert Kawuki

IsmailRabbi

Page 30: Cassavabase general presentation PAG 2016

Questions?