Presentations by Johannes Keizer is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
Dr. Johannes Keizer, Office of Knowledge Exchange, Research and Extension, Food and Agriculture Organization of the UN
CIARD - creating a global framework for information sharing in agricultural research and innovation – Role of VIVO
Talk at the VIVO 2011 conference
“... FAO’s principle task is to work to ensure that the world’s knowledge of food and agriculture is available to those who need it when they need it and in a form which they can access and use ...”
Johannes Keizer, http://aims.fao.org
More scientific data will be generated in the next 5 years than in the entire history of humankind
Contribution and Participation in Science
Territory size shows proportion of scientific papers published in 2001 by authors living there. Copyright SASI Group (University of Sheffield) and Mark Newman (University of Michigan)
Global Trends in Publishing
The Internet!
Aggregation States of Knowledge
Data and Information in Agricultural Research and Extension
Distributed Repositories
• stats
• gene banks
• GIS data
• blogs
• journals
• open archives
• raw data
• technologies
• learning objects
• ………..
Task 1: making services
? ? ?
Task 2: getting knowledge
? ? ?
Task 3: working together
? ? ?
How can I get, in real time, all the specimen data on useful insects from everyone researching them onto my desktop? How can I share my own data in real time with colleagues working on the same topic?
http://www.ciard.net
The Project: agINFRA
• Enforce web publishing of data
• Produce linked open data from all datasets
• Use common reference vocabularies to interlink data sets
• Don’t wait! Wrap the legacy
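As a rough illustration of "wrapping the legacy", the sketch below turns one legacy record into N-Triples and replaces a free-text keyword with a shared vocabulary URI where one is known. The record, the `http://example.org/record/` namespace, the `wrap_record` helper and the keyword-to-URI table are all hypothetical; only the Dublin Core terms namespace and the AGROVOC URI pattern come from real vocabularies. It is not the agINFRA implementation.

```python
# Hypothetical legacy record, e.g. one row exported from an old bibliographic database.
legacy_row = {
    "id": "rec-001",                         # made-up local identifier
    "title": "Maize pests in East Africa",   # made-up title
    "subject": "maize",                      # free-text keyword
}

DCT = "http://purl.org/dc/terms/"            # Dublin Core terms namespace
BASE = "http://example.org/record/"          # placeholder namespace (assumption)

# Illustrative lookup from local keywords to common reference vocabulary URIs.
KEYWORD_TO_URI = {"maize": "http://aims.fao.org/aos/agrovoc/c_12332"}


def escape_literal(text):
    """Escape the characters that N-Triples does not allow raw in a literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def wrap_record(row):
    """Return the record as a list of N-Triples lines."""
    subject = "<" + BASE + row["id"] + ">"
    title = escape_literal(row["title"])
    triples = [subject + " <" + DCT + 'title> "' + title + '" .']
    concept = KEYWORD_TO_URI.get(row["subject"].strip().lower())
    if concept:
        # Known keyword: point at the shared concept URI so data sets interlink.
        triples.append(subject + " <" + DCT + "subject> <" + concept + "> .")
    else:
        # Unknown keyword: keep it as a plain literal rather than drop it.
        keyword = escape_literal(row["subject"])
        triples.append(subject + " <" + DCT + 'subject> "' + keyword + '" .')
    return triples


for line in wrap_record(legacy_row):
    print(line)
```

The legacy database stays untouched; only its export is republished, which is the point of wrapping rather than migrating.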
The Infrastructure elements
• RING: routemap to information nodes and gateways
• Tools: LOD-enabled software
• VocBench: vocabulary server; concepts and entities triples
• LOD Generator: triplifier, concept and entity identifier
• Data Services: Webservices + APIs to triple stores
• Cloud: storage for RDF triples
LOD Generator: process
Data Services: process
Under Construction!
• VocBench
• AGROVOC Linked Open Data
• AgroTagger
• Triplifying AGRIS
• Linking Data!
• Drupal front ends for triple stores
• The CIARD R.I.N.G
• “AgriVIVO”
So…
Where does VIVO fit in?
Background
There are many agricultural information communities: how can we bring together an international network of researchers in this domain?
A precedent exists for a community-based open source solution
Many Agriculture Communities
• E-Agriculture site has 7111 members from 150 countries
• CIARD site has 6038 members, seeded from the E-Agriculture site
• AIMS site has 280 users
• SIDALC has 150+ institutions in 22 LAC countries
• IAALD has 400+ members in 80 countries
• None of these user communities “talk” to each other (though there is a lot of overlap)
Precedent for Open Source Applications
AgriDrupal – A content management system based on Drupal with Agriculture Information Systems customizations
AgriOceanDSpace – An institutional repository platform, customized for the Agriculture domain
How about “AgriVivo”?
An open source semantic web application, customized for managing information about people who work in the Agriculture domain (and their activities).
Enabling an International Network of Agriculture researchers…
[Diagram: the AgriVivo triple store connected to AIMS, E-AGRICULTURE, AGRIS, SIDALC and IAALD]
AgriVivo: Areas of Work
Multi Language Support
Integration of Agricultural Ontologies
Development of Harvesters for ingest of Agricultural documents (AGRIS)
Community Search of AgriVivo Sites
Linking Data
Serendipity linking
Semantic Linking
http://aims.fao.org/aos/agrovoc/c_7825
http://eurovoc.europa.eu/218754
http://agclass.nal.usda.gov/nalt/2011.xml#1780
Linking data through common URIs
AGROVOC: http://aims.fao.org/aos/agrovoc/c_7825
AGRIS record: http://agris.fao.org/agris-search/search/display.do?f=1996/TR/TR96001.xml;TR9600026
Eurovoc (TOXIC SUBSTANCES): http://eurovoc.europa.eu/218754
EUR-Lex document: http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2010:202:0011:0015:EN:PDF
UNBIS (Toxic Substances)
UNBISnet record: http://unbisnet.un.org:8080/ipac20/ipac.jsp?session=128F308557F34.283092&profile=bib&uri=full=3100001~!685149~!1&ri=1&aspect=subtab124&menu=search&source=~!horizon
NALT: http://agclass.nal.usda.gov/nalt/2011.xml#1780
AgNIC record: http://www.agnic.org/search/CAT85822953
Link types shown: owl:sameAs (http://aims.fao.org/aos/agrovoc/c_12332 owl:sameAs http://eurovoc.europa.eu/219871) and skos:exactMatch
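A minimal sketch of what linking through common URIs makes possible: once concept URIs from different vocabularies are declared equivalent, a query on one concept can reach records indexed with any of them. The URIs are taken from the slide, but treating these three concepts as exact matches of one another, and pairing each record with a particular concept, are assumptions made here for illustration. The function names are hypothetical.

```python
from collections import defaultdict

AGROVOC = "http://aims.fao.org/aos/agrovoc/c_7825"
EUROVOC = "http://eurovoc.europa.eu/218754"
NALT = "http://agclass.nal.usda.gov/nalt/2011.xml#1780"

# Assumed mapping statements (e.g. skos:exactMatch) between the vocabularies.
MATCHES = [(AGROVOC, EUROVOC), (AGROVOC, NALT)]

# Assumed indexing: which concept URI each record was tagged with.
INDEXED = {
    "http://agris.fao.org/agris-search/search/display.do?f=1996/TR/TR96001.xml;TR9600026": AGROVOC,
    "http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2010:202:0011:0015:EN:PDF": EUROVOC,
    "http://www.agnic.org/search/CAT85822953": NALT,
}


def equivalent_uris(start, matches):
    """Return every URI reachable from `start` by following match links.

    The links are treated as symmetric and transitive, as skos:exactMatch is.
    """
    neighbours = defaultdict(set)
    for a, b in matches:
        neighbours[a].add(b)
        neighbours[b].add(a)
    seen, stack = {start}, [start]
    while stack:
        for nxt in neighbours[stack.pop()]:
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


def linked_records(concept_uri):
    """List the records indexed with the concept or any equivalent concept."""
    cluster = equivalent_uris(concept_uri, MATCHES)
    return sorted(rec for rec, uri in INDEXED.items() if uri in cluster)


# Starting from the NALT concept alone, all three repositories are reached.
for record in linked_records(NALT):
    print(record)
```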
http://lprapp14:8090/openagris/search.do?recordID=JP2010001379
Open AGRIS threads
The AIMS Community
Thank You!
Credits: John Ferreira, Imma Subirats, Yves Jaques, Valeria Pesce, Fabrizio Celli, Ahsan Morshed, Catarina Caracciolo, Dickson Lukose, Gudrun Johannsen, Stefano Anibaldi, Armando Stellato, Tom Baker and many others
Annexes
The VocBench
VocBench Features
Domain independent
Structure independent (e.g. thesauri, glossaries, etc.)
Supports RDF (SKOS, SKOS-XL), OWL
Supports collaborative editing
Supports editorial workflow, with user roles
Simple and advanced search
Supports data export: SKOS, Relational format (MySQL)
The AGROVOC concept scheme
[Figure: AGROVOC modelled with SKOS and SKOS-XL. Concepts (shown: 8171, 1474, 6211, 12332) have rdf:type SKOS Concept and are connected by skos:broader. They belong to the AGROVOC Concept Scheme through skos:inScheme and skos:topConceptOf, alongside other schemes in FAO. Concepts carry SKOS labels through skosxl:prefLabel and skosxl:altLabel; each label has a skos:literalForm (“maize”, “corn”, “maïs” (fr)), and labels are related to one another by has_synonym and has_translation.]
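The labelling pattern in the concept scheme figure can be sketched in a few lines: in SKOS-XL a label is a resource of its own, so relations such as has_synonym and has_translation can be stated between labels rather than between concepts. The classes below are hypothetical, and attaching “maize”, “corn” and “maïs” to concept 12332 is an assumption based on a reading of the figure, not something the slide states explicitly.

```python
from dataclasses import dataclass, field


@dataclass(eq=False)
class Label:
    """A SKOS-XL style label: a resource in its own right, not a bare string."""
    literal_form: str                                  # skos:literalForm
    lang: str
    synonyms: list = field(default_factory=list)       # has_synonym links
    translations: list = field(default_factory=list)   # has_translation links


@dataclass(eq=False)
class Concept:
    uri: str
    pref_label: Label                                  # skosxl:prefLabel
    alt_labels: list = field(default_factory=list)     # skosxl:altLabel
    broader: list = field(default_factory=list)        # skos:broader


def literal_forms(concept):
    """Collect (text, language) pairs from a concept's labels and their translations."""
    forms = []
    for label in [concept.pref_label, *concept.alt_labels]:
        forms.append((label.literal_form, label.lang))
        for other in label.translations:
            forms.append((other.literal_form, other.lang))
    return forms


maize = Label("maize", "en")
corn = Label("corn", "en")
mais = Label("maïs", "fr")
maize.synonyms.append(corn)       # label-to-label relation
maize.translations.append(mais)   # label-to-label relation

concept = Concept(
    uri="http://aims.fao.org/aos/agrovoc/c_12332",  # assumed pairing with these labels
    pref_label=maize,
    alt_labels=[corn],
)

print(literal_forms(concept))
```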
LOD Generator/Agrotagger
AgroTagger
• Identifies concepts in unstructured texts
• Uses AGROVOC as a controlled vocabulary
• Prototype under testing with excellent results (entire ICARDA repository indexed)
• Will in future produce structured RDF files that can be used to link data, like “OpenCalais”
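The slides do not describe how AgroTagger identifies concepts, so the following is only a toy sketch of the general idea of tagging free text against a controlled vocabulary, using plain whole-word string matching. The `tag` function is hypothetical and the three-entry label table is illustrative; it is not an extract of AGROVOC.

```python
import re

# Illustrative label-to-concept table (assumption), standing in for AGROVOC.
VOCABULARY = {
    "maize": "http://aims.fao.org/aos/agrovoc/c_12332",
    "corn": "http://aims.fao.org/aos/agrovoc/c_12332",
    "toxic substances": "http://aims.fao.org/aos/agrovoc/c_7825",
}


def tag(text, vocabulary=VOCABULARY):
    """Map each concept URI found in `text` to the labels that matched it."""
    lowered = text.lower()
    found = {}
    for label, uri in vocabulary.items():
        # Whole-word match, so "corn" does not fire inside "cornflower".
        if re.search(r"\b" + re.escape(label) + r"\b", lowered):
            found.setdefault(uri, []).append(label)
    return found


sample = "Corn and maize yields fell where toxic substances accumulated."
for uri, labels in tag(sample).items():
    print(uri, labels)
```

Because synonyms resolve to the same URI, documents that say "corn" and documents that say "maize" end up linked to one concept, which is what makes the output usable for linking data.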
AGRIS Journal disambiguation
2,644,818 AGRIS records
2,171,113 records are journal records (82.09%)
1,788,083 journal records have been covered by the disambiguation process (82.35%)
14,658 journals have been correctly disambiguated
~20,000 strings referring to journal titles remain to be examined
Triples have been generated:
Triplifying AGRIS (small example)
<?xml version="1.0" encoding="utf-8"?>
<rdf:RDF xmlns:ags="http://purl.org/agmes/1.1/"
         xmlns:foaf="http://xmlns.com/foaf/0.1/"
         xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
         xmlns:bibo="http://purl.org/ontology/bibo/"
         xmlns:dc="http://purl.org/dc/elements/1.1/"
         xmlns:dct="http://purl.org/dc/terms/">
  <bibo:Journal rdf:about="http://aims.fao.org/aos/journal/c_b6e4ca85">
    <bibo:ISSN>0101-9066</bibo:ISSN>
    <bibo:ISSN>0101-9066</bibo:ISSN>
    <dct:title><![CDATA[Circular técnica]]></dct:title>
    <dct:alternative><![CDATA[Circular técnica (Centro Nacional de Pesquisa de Seringueira e Dendê)]]></dct:alternative>
    <dct:alternative><![CDATA[Circular Tecnica - Centro Nacional de Pesquisa da Seringueira e Dende]]></dct:alternative>
    <dct:alternative><![CDATA[Circular técnica - CNPSD]]></dct:alternative>
    <dct:alternative><![CDATA[Circ. téc.]]></dct:alternative>
    <ags:publisherPlace rdf:resource="http://aims.fao.org/aos/geopolitical.owl#Brazil"/>
    <dct:publisher><![CDATA[Empresa Brasileira de Pesquisa Agropecuária, Centro Nacional de Pesquisa de Seringueira e Dendê]]></dct:publisher>
    <dct:language>por</dct:language>
    <dct:date>1980</dct:date>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_10795"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_4650"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_32372"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_332"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_3589"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_5556"/>
  </bibo:Journal>
</rdf:RDF>
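To show what a consumer can do with such a record, here is a small sketch that reads a shortened copy of the journal description and lists its statements as (subject, predicate, object) tuples using only the Python standard library. The `journal_triples` helper is hypothetical and handles just this flat shape of RDF/XML; a general-purpose RDF parser would be needed for arbitrary input.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
BIBO = "http://purl.org/ontology/bibo/"

# Shortened copy of the journal record, kept small for the example.
RDF_XML = """<?xml version="1.0" encoding="utf-8"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:bibo="http://purl.org/ontology/bibo/"
         xmlns:dct="http://purl.org/dc/terms/">
  <bibo:Journal rdf:about="http://aims.fao.org/aos/journal/c_b6e4ca85">
    <bibo:ISSN>0101-9066</bibo:ISSN>
    <dct:title><![CDATA[Circular técnica]]></dct:title>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_10795"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_4650"/>
  </bibo:Journal>
</rdf:RDF>"""


def journal_triples(xml_text):
    """Return (subject, predicate, object) tuples for each bibo:Journal element."""
    root = ET.fromstring(xml_text.encode("utf-8"))
    triples = []
    for journal in root.findall("{" + BIBO + "}Journal"):
        subject = journal.get("{" + RDF + "}about")
        for child in journal:
            # ElementTree tags look like "{namespace}local"; join them into one URI.
            predicate = child.tag[1:].replace("}", "", 1)
            resource = child.get("{" + RDF + "}resource")
            # Objects are either resource URIs or literal text; this sketch
            # returns both as plain strings without marking the difference.
            obj = resource if resource is not None else (child.text or "").strip()
            triples.append((subject, predicate, obj))
    return triples


for triple in journal_triples(RDF_XML):
    print(triple)
```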
The CIARD RING
• Routemap to information nodes and gateways
• Community switchboard to find data sources
• Not only a registry, but a dynamic instrument for data linking
RING - Charts and numbers
http://ring.ciard.net
RING – Numbers
Number of documents potentially reachable through the services registered in the RING.
Types of service considered: document repositories and bibliographic databases.
http://ring.ciard.net/totals
http://aims.fao.org
Agricultural Information Management Standards
Standards, Tools, Services, Advice
http://www.ciard.net
http://ring.ciard.net
http://aims.fao.org
http://agris.fao.org