A global linked and open data infrastructure for agricultural development
Valeria Pesce
Global Forum on Agricultural Research
Food and Agriculture Organization of the United Nations
BDE proposed infrastructure
• ICT infrastructure
• Computing infrastructure
• One re-deployable generic infrastructure, n “Domain-specific Big Data Integrator Instances”
• These tools may all be thought of as generic tools and they will be available as options in the generic platform. However, individual domains are likely to need more specialised tools and datasets
The ag-data context
• Different actors have built several infrastructural components over the years under the umbrellas of different initiatives and projects
• Mostly vocabularies, authority data, online tools, APIs; mostly for non-big data
• Little work on computational services
IGAD group
“Positioning” this infrastructure within the ag-data context
• Open data for agricultural development is big and non-big data
• Beyond all the additional features (the 3 Vs) of being “big”, big data still also have the same features as other non-big data
• A dedicated big data infrastructure should be aware of and interlink with other existing infrastructural elements in a distributed and inter-linked ecosystem
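As a rough illustration of what "interlinking" could look like in practice, the sketch below builds a DCAT-style dataset description that points at shared concept and authority URIs instead of free-text labels. It is only a minimal sketch under stated assumptions: the function name `describe_dataset`, the `example.org` URIs and the concept identifier `c_0000` are hypothetical placeholders, and the AGROVOC base URI is assumed rather than taken from the slides.

```python
import json

# Real, published namespaces for DCAT and Dublin Core Terms.
DCAT = "http://www.w3.org/ns/dcat#"
DCT = "http://purl.org/dc/terms/"

# Assumed base URI for AGROVOC concepts; check the vocabulary's own
# documentation before relying on it.
AGROVOC_BASE = "http://aims.fao.org/aos/agrovoc/"


def describe_dataset(dataset_uri, title, concept_ids, publisher_uri):
    """Build a JSON-LD-style record that links a dataset to shared URIs.

    Hypothetical helper: themes come from a shared vocabulary and the
    publisher from an authority file, both referenced by URI.
    """
    return {
        "@context": {"dcat": DCAT, "dct": DCT},
        "@id": dataset_uri,
        "@type": "dcat:Dataset",
        "dct:title": title,
        "dcat:theme": [{"@id": AGROVOC_BASE + cid} for cid in concept_ids],
        "dct:publisher": {"@id": publisher_uri},
    }


record = describe_dataset(
    "http://example.org/dataset/yield-observations",  # placeholder
    "Yield observations (illustrative)",
    ["c_0000"],  # placeholder concept identifier, not a real concept
    "http://example.org/authority/institution/1",  # placeholder
)
print(json.dumps(record, indent=2))
```

The point of the design is that a registry, a vocabulary service and a big data platform can all resolve the same URIs, so a record like this stays meaningful outside the system that produced it.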
Some existing infrastructural components
• Interoperable vocabularies: KOSs (AGROVOC, NALT, GACS, Gene Ontology, Soil Terms, local KOSs, controlled lists, etc.); description vocabularies (Darwin Core, INSPIRE, DCAT, etc.)
• Authority: persons, institutions, projects, etc.
• Registries: datasets, vocabularies, APIs
• APIs: vocabulary APIs (Agrovoc WS, Climate Tagger); processing APIs (agINFRA, BioCatalogue)
• Re-usable software: vocabulary tools (VocBench); CM / DM tools
• Cloud / SaaS tools: grid jobs, grid workflows (agINFRA)
• Shared URIs
BDE software stack
BDE in context
• An infrastructure that minimises the disruption to current workflows
• An adaptable, easy to deploy and use solution, which will allow the interested user groups and stakeholders to extend their Big Data solutions or introduce Big Data technology to their business processes
Big Data Aggregator platform
Infrastructural components
Diagram: the existing components (interoperable vocabularies, authority data, registries, APIs, cloud / SaaS tools, shared URIs, re-usable software) shown together with the Big Data Aggregator platform.
BDE platform
BDE platform in ag-data infrastructure
Diagram: the BDE platform placed among the existing components: APIs (processing APIs such as agINFRA and BioCatalogue; vocabulary APIs such as Agrovoc WS and Climate Tagger), cloud / SaaS tools, shared URIs, re-usable software (vocabulary tools such as VocBench; CM / DM tools), registries, grid jobs and grid workflows (agINFRA), interoperable vocabularies and authority data.
BDE approach
• One re-deployable generic infrastructure, n “Domain-specific Big Data Integrator Instances”
• Provide comprehensive test beds for the evaluation of the BDE Aggregator Platform according to the requirements of the respective societal domain;
• Carefully select pilot use cases, across different domains, so that they are adequate test beds but also self-sustainable systems beyond the action’s end;
• These tools may all be thought of as generic tools and they will be available as options in the generic platform. However, individual domains are likely to need more specialised tools and datasets
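The "one generic infrastructure, n domain-specific instances" idea can be sketched as a small data model. This is a hedged illustration only, not the BDE design: the class names `GenericPlatform` and `DomainInstance`, and every tool and dataset name used below, are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class GenericPlatform:
    """The single re-deployable platform and the generic tools it offers."""
    optional_tools: frozenset


@dataclass
class DomainInstance:
    """One domain-specific instance built on top of the generic platform."""
    domain: str
    platform: GenericPlatform
    selected_tools: set = field(default_factory=set)
    specialised_tools: set = field(default_factory=set)
    datasets: set = field(default_factory=set)

    def __post_init__(self):
        # Generic tools can only be picked from what the platform offers.
        unknown = self.selected_tools - self.platform.optional_tools
        if unknown:
            raise ValueError(f"not offered by the platform: {sorted(unknown)}")

    def toolset(self):
        """Generic options chosen by the domain plus its own specialised tools."""
        return sorted(self.selected_tools | self.specialised_tools)


# Hypothetical tool names, for illustration only.
platform = GenericPlatform(frozenset({"storage", "batch-processing", "semantic-indexing"}))

agriculture = DomainInstance(
    domain="agriculture",
    platform=platform,
    selected_tools={"storage", "semantic-indexing"},
    specialised_tools={"vocabulary-tagger"},
    datasets={"bibliographic-records"},
)
print(agriculture.toolset())
```

Each further domain would be another `DomainInstance` sharing the same `platform` object, which mirrors the split between generic options and domain-specific additions described in the bullets above.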
Conclusion
• We would like the new big data infrastructure to be interlinked with existing infrastructural components
Thank you
Valeria Pesce
Global Forum on Agricultural Research
Food and Agriculture Organization of the United Nations