Upload
valeria-pesce
View
409
Download
0
Embed Size (px)
Citation preview
A global linked and open data infrastructure for agricultural
development
Valeria PesceGlobal Forum on Agricultural ResearchFood and Agriculture Organization of
the United nations
BDE proposed infrastructure
• ICT infrastructure• Computing infrastructure
• One re-depolyable generic infrastructure, n “Domain-specific Big Data Integrator Instances”
• These tools may all be thought of as generic tools and they will be available as options in the generic plat-form. However, individual domains are likely to need more specialised tools and datasets
The ag-data context
• Different actors have built several infrastructural components over the years under the umbrellas of different initiatives and projects
• Mostly vocabularies, authority data, online tools, APIs; mostly for non-big data
• Little work on computational services
IGAD group
“Positioning” this infrastructure within the ag-data context
• Open data for agricultural development is big and non-big data
• Beyond all the additional features (the 3 Vs) of being “big”, big data still also have the same features as other non-big data
• A dedicated big data infrastructure should be aware of and interlink with other existing infrastructural elements in a distributed and inter-linked ecosystem
Re-usablesoftware
Interoperable vocabularies
Authority
Some existing infrastructural components
5
Registries
Some existing infrastructural components
6
Interoperable vocabularies
Authority
Registries
Re-usable software
Registries
Description vocabulariesDarwin CoreINSPIREDCATetc.
7
Interoperable vocabulariesRe-usable
software
Some existing infrastructural components
Registries
AGROVOCNALTGACSGene OntologySoil TermsLocal KOSsControlled listsetc.
Description vocabularies
KOSs
Darwin CoreINSPIREDCATetc.
8
Interoperable vocabulariesRe-usable
software
Some existing infrastructural components
Registries
AGROVOCNALTGACSGene OntologySoil TermsLocal KOSsControlled listsetc.
Description vocabularies
KOSs
Darwin CoreINSPIREDCATetc.
9
Interoperable vocabulariesRe-usable
software
Authority
PersonsInstitutionsProjectsetc.
Au
tho
rity
Some existing infrastructural components
APIs
Registries
AGROVOCNALTGACSGene OntologySoil TermsLocal KOSsControlled listsetc.
Description vocabularies
KOSs
Darwin CoreINSPIREDCATetc.
10
Interoperable vocabulariesRe-usable
software
Vocabulary tools
VocBench
Authority
PersonsInstitutionsProjectsetc.
Au
tho
rity
Vocabulary APIs
Agrovoc WSClimate Tagger
Some existing infrastructural components
APIs
Registries
AGROVOCNALTGACSGene OntologySoil TermsLocal KOSsControlled listsetc.
Description vocabularies
KOSs
Darwin CoreINSPIREDCATetc.
11
Interoperable vocabulariesRe-usable
software
Vocabulary tools
VocBench
Authority
PersonsInstitutionsProjectsetc.
Au
tho
rity
Vocabulary APIs
Agrovoc WSClimate Tagger
Some existing infrastructural components
APIs
Registries
AGROVOCNALTGACSGene OntologySoil TermsLocal KOSsControlled listsetc.
Description vocabularies
KOSs
Darwin CoreINSPIREDCATetc.
12
Interoperable vocabulariesRe-usable
software
Authority
PersonsInstitutionsProjectsetc.
Au
tho
rity
Vocabulary APIs
Agrovoc WSClimate Tagger
Cloud / SaaS tools
Vocabulary tools
CM / DM tools
VocBench
Some existing infrastructural components
APIs
Registries
AGROVOCNALTGACSGene OntologySoil TermsLocal KOSsControlled listsetc.
Description vocabularies
KOSs
Darwin CoreINSPIREDCATetc.
13
Interoperable vocabulariesRe-usable
software
Authority
PersonsInstitutionsProjectsetc.
Au
tho
rity
Vocabulary APIs
Agrovoc WSClimate Tagger
Cloud / SaaS tools
Vocabulary tools
CM / DM tools
VocBench
Processing APIs
agINFRABioCatalogue
APIs
Vocabulary APIs
Agrovoc WSClimate Tagger
Some existing infrastructural components
APIs
Registries
AGROVOCNALTGACSGene OntologySoil TermsLocal KOSsControlled listsetc.
Description vocabularies
KOSs
Darwin CoreINSPIREDCATetc.
14
Interoperable vocabulariesRe-usable
software
Authority
PersonsInstitutionsProjectsetc.
Au
tho
rity
Vocabulary APIs
Agrovoc WSClimate Tagger
Cloud / SaaS tools
Vocabulary tools
CM / DM tools
VocBench
Processing APIs
agINFRABioCatalogue
APIs
Vocabulary APIs
Agrovoc WSClimate Tagger
Datasets
Vocabularies
APIs
Some existing infrastructural components
APIs
Registries
Cloud / SaaS tools
Shared URIs
AGROVOCNALTGACSGene OntologySoil TermsLocal KOSsControlled listsetc.
Description vocabularies
KOSs
Darwin CoreINSPIREDCATetc.
15
Interoperable vocabularies
Datasets
Vocabularies
APIs
Re-usable software
Vocabulary tools
CM / DM tools
VocBench
Authority
PersonsInstitutionsProjectsetc.
Au
tho
rity
Processing APIs
agINFRABioCatalogue
APIs
Vocabulary APIs
Agrovoc WSClimate Tagger
Some existing infrastructural components
APIs
Registries
Cloud / SaaS tools
Shared URIs
Grid jobsGrid workflows
AGROVOCNALTGACSGene OntologySoil TermsLocal KOSsControlled listsetc.
Description vocabularies
KOSs
Darwin CoreINSPIREDCATetc.
16
Interoperable vocabularies
Datasets
Vocabularies
APIs
Re-usable software
Vocabulary tools
agINFRA
CM / DM tools
VocBench
Authority
PersonsInstitutionsProjectsetc.
Au
tho
rity
Processing APIs
agINFRABioCatalogue
APIs
Vocabulary APIs
Agrovoc WSClimate Tagger
Some existing infrastructural components
APIs
Registries
Cloud / SaaS tools
Shared URIs
Grid jobsGrid workflows
AGROVOCNALTGACSGene OntologySoil TermsLocal KOSsControlled listsetc.
Description vocabularies
KOSs
Darwin CoreINSPIREDCATetc.
17
Interoperable vocabularies
Datasets
Vocabularies
APIs
Re-usable software
Vocabulary tools
agINFRA
CM / DM tools
VocBench
Authority
PersonsInstitutionsProjectsetc.
Au
tho
rity
Processing APIs
agINFRABioCatalogue
APIs
Vocabulary APIs
Agrovoc WSClimate Tagger
Some existing infrastructural components
BDE software stack
BDE in context
• An infrastructure that minimises the disruption to current workflows
• An adaptable, easy to deploy and use solution, which will allow the interest-ed user groups and stakeholders to extend their Big Data solutions or introduce Big Data technology to their business processes
Big Data Aggregator platform
APIs
Registries
Cloud / SaaS tools
Shared URIs
Infrastructural components
Grid jobsGrid workflows
AGROVOCNALTGACSGene OntologySoil TermsLocal KOSsControlled listsetc.
Description vocabularies
KOSs
Darwin CoreINSPIREDCATetc.
20
Interoperable vocabularies
Datasets
Vocabularies
APIs
Re-usable software
Vocabulary tools
agINFRA
CM / DM tools
VocBench
Authority
PersonsInstitutionsProjectsetc.
Au
tho
rity
Processing APIs
agINFRABioCatalogue
APIs
Vocabulary APIs
Agrovoc WSClimate Tagger
BDE platform
BDE platform in ag-data infrastructure
APIs
Cloud / SaaS tools
Shared URIs
Re-usable software
Vocabulary tools
CM / DM tools
VocBench
Processing APIs
agINFRABioCatalogue
APIs
Vocabulary APIs
Agrovoc WSClimate Tagger
Registries
Grid jobsGrid workflows
agINFRA Inte
rop
erab
le
voca
bu
lari
esA
uth
ori
ty
BDE approach
• One re-depolyable generic infrastructure, n “Domain-specific Big Data Integrator Instances”
• Provide a comprehensive test-beds for the evaluation of the BDE Aggregator Platform according to the requirements of the respective societal domain;
• Carefully select pilot use cases, across different domains, so that they are adequate test beds but self-sustainable systems beyond the action’s end;
• These tools may all be thought of as generic tools and they will be available as options in the generic plat-form. However, individual domains are likely to need more specialised tools and datasets
Conclusion
• We would like the new big data infrastructure to be interlinked with existing infrastructural components
Thank you
Valeria PesceGlobal Forum on Agricultural ResearchFood and Agriculture Organization of
the United nations