Johannes Keizer, http://aims.fao.org
The Presenter
Johannes Keizer, PhD
• Background: molecular biology
• More than 25 years of experience in data management
• Team leader at a United Nations specialized agency (FAO)
• Expert in Linked Open Data for the EC Semantic Interoperability Center
Disappearing Data
These are the data from my PhD thesis. I started to look for them.
https://www.google.de/search?q=johannes+Keizer++Dissertation+Toxizitaet+und+Biotransformation+-+Unterschiede+in+der&aq=f&oq=johannes+Keizer++Dissertation+Toxizitaet+und+Biotransformation+-+Unterschiede+in+der&aqs=chrome.0.57.60915j0&sourceid=chrome&ie=UTF-8
2nd Google search
https://www.google.de/search?q=johannes+Keizer++Dissertation+Toxizitaet+und+Biotransformation+-+Unterschiede+in+der&aq=f&oq=johannes+Keizer++Dissertation+Toxizitaet+und+Biotransformation+-+Unterschiede+in+der&aqs=chrome.0.57.60915j0&sourceid=chrome&ie=UTF-8
http://agris.fao.org/agris-search/search/display.do?f=2012%2FOV%2FOV2012002800028.xml%3BDE19940007889
…in the end I found a record of my PhD thesis, and I was somewhat proud to have found it through AGRIS, our own service.
But the data themselves could not be found; they were practically lost in the few print copies and microfiche of the thesis.
More scientific data will be generated in the next 5 years than in the entire history of humankind
Data definition (from RDA 1)
Data: digitally recorded factual material commonly accepted in the scientific community as necessary to validate research findings
Widgets
Authoring services
Data Discovery Services
Analytics services
OntoServers
Cloud CMS
…to RDF APIs
Aggregators
Mash Ups
Structured text (bibliographies…)
Semi-structured text… CMS
Unstructured text/HTML…
Prepare data for meaningful services
Semantic Enrichment
Highly structured data (databases)
Semi-structured data
LOD – Triple Stores
Images
LOD – Infrastructure Services
Dataset Directories
Community
2004 - AIMS
• Community of practice, 2000 practitioners
2007 - CIARD
• Movement for opening access to agricultural knowledge
2013 - GODAN
• High-level advocacy for open data in agriculture and nutrition, influencing government, mobilizing resources
And
2012 - RDA, Research Data Alliance
• Interdisciplinary forum for all data-related issues
G8 conference (April 2013)
“How Open Data can be harnessed to help meet the challenge of sustainably feeding nine billion people by 2050”
Institutional Data Sharing Practice
Data Access and Distribution Policy
Data Discovery Tools
Common Metadata Standards
Digital Object Identifiers
Data Citation Standards
Data Analytics Algorithms
Data Preservation Practice
Data Scientists and Expert Support
Sustainable Economic Models
Curation Practice and Policy
Auditing, Certification and Reporting Practice
RDA: Many Infrastructure Building Blocks Needed to Accelerate Progress
Wheat Data Interoperability Example
[Figure: six bar-chart panels (SNPs, Phenotypes, Physical maps, Genomic annotations, Genetic maps, Germplasms), each with the categories: Files in a local drive, Files in a shared drive, Local databases, Shared databases]
(Counts are the number of answers)
The RDA Community Today: Over 1950 members from 80+ countries (July 14)
EU 49%, US 37%, AU 3%, Other 11%
Asia 4%, Austral-pacific 4%, Africa 2%, South America 1%
Map courtesy traveltip.org
Infrastructure elements
The RING
• http://ring.ciard.net
Vocabulary Server
• AGROVOC, CABT, NALT
• http://www.agrisemantics.org
The “VEST registry” (tools, metadata)
Customized content management on the cloud
• AgriDrupal, AgriOcean DSpace
Tools and methodologies to produce LOD
Infrastructure: Agrisemantics
Creating a common access point for Ontologies, taxonomies, vocabularies
Aligning the 3 agricultural thesauri (AGROVOC, NALT, CABT) into the Global Agricultural Concept Scheme (GACS)
Cutting edge editing environments (VocBench and Skosmos) have already been deployed
GACS prototype to be released in spring 2015
Applications
AGRIS
• Basic bibliographical database with more than 7 million records
• Uses AGROVOC and bibliographic metadata to link to other open datasets
AgriProfiles
• Based on the “VIVO” application
• Harvests expert data from different sources and enables semantic searches
AgriFeeds
• Aggregated news and events feeds, semantically organized
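The slides do not describe how AGRIS links records to other datasets, so the following is only a minimal sketch of one possible approach: matching on shared subject concepts. The concept URIs, record identifiers and the function name `link_by_subject` are all hypothetical and stand in for AGROVOC concepts and real records.

```python
from collections import defaultdict

# Hypothetical concept URIs standing in for AGROVOC concepts.
WHEAT = "http://example.org/concept/wheat"
RICE = "http://example.org/concept/rice"

# Invented sample data: each item lists the concepts it is indexed with.
bibliographic_records = [
    {"id": "record-1", "subjects": [WHEAT]},
    {"id": "record-2", "subjects": [RICE, WHEAT]},
]
external_datasets = [
    {"id": "dataset-A", "subjects": [WHEAT]},
    {"id": "dataset-B", "subjects": [RICE]},
]


def link_by_subject(records, datasets):
    """Map each record id to the dataset ids that share a subject concept."""
    # Index the datasets by concept URI for fast lookup.
    by_concept = defaultdict(list)
    for dataset in datasets:
        for concept in dataset["subjects"]:
            by_concept[concept].append(dataset["id"])

    links = {}
    for record in records:
        linked = []
        for concept in record["subjects"]:
            for dataset_id in by_concept.get(concept, []):
                if dataset_id not in linked:  # keep each dataset once
                    linked.append(dataset_id)
        links[record["id"]] = linked
    return links


links = link_by_subject(bibliographic_records, external_datasets)
print(links)
```

Because both sides point at the same concept identifier, no string matching on titles or keywords is needed; this is the general benefit of indexing with a shared vocabulary.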
Content Coverage
International System for Agricultural Science and Technology
Food science, forestry, aquaculture, fisheries
Grey literature, small and big publishers
7,802,156 multilingual bibliographic records
200,585,375 triples
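As a rough sense of scale, the two figures above can be combined into an average number of triples per record. This is a back-of-the-envelope sketch only; it assumes every triple in the store describes a bibliographic record, which the slide does not state.

```python
# Figures quoted on the slide.
records = 7_802_156
triples = 200_585_375

# Assumption: all triples belong to bibliographic records.
triples_per_record = triples / records
print(f"about {triples_per_record:.1f} triples per record")
```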
AGRIS’ users
Accessed from more than 200 countries and territories
Peaks of 250,000 visits/month (G.A.)
User categories:
• Researchers, professors, graduate students
• Librarians, cataloguers
• Small journal publishers, professional associations, conference organizers
• Government officers asking for reports on a certain topic
AGRIS RDF
bibo:Article
bibo:doi
bibo:isbn
dct:language
bibo:presentedAt -> bibo:Conference -> dct:title
bibo:uri
dct:alternative
dct:creator -> foaf:Organization -> foaf:name
dct:creator -> foaf:Person -> foaf:name
bibo:authorList -> rdf:Seq -> rdf:li
dct:dateSubmitted
dct:description
bibo:abstract
dct:extent
dct:identifier
dct:medium
dct:isPartOf
dct:issued
dct:publisher -> foaf:Organization -> foaf:name
dct:source
dct:subject
dc:subject
dct:title
bibo:volume
bibo:issue
dct:type
dct:rights
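To make the property list more concrete, here is a minimal sketch of how a handful of these properties could describe one record as triples, following the dct:creator -> foaf:Person -> foaf:name path. The record URI, title and author are invented, the helper names are hypothetical, and the output is a simplified N-Triples-style rendering rather than the serialization AGRIS actually publishes. The namespace URIs are the usual ones for the BIBO, DCMI Terms, FOAF and RDF vocabularies.

```python
# Namespace URIs for the prefixes that appear in the property list.
PREFIXES = {
    "bibo": "http://purl.org/ontology/bibo/",
    "dct": "http://purl.org/dc/terms/",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
}


def expand(curie):
    """Expand a prefixed name such as 'dct:title' into a full URI."""
    prefix, local = curie.split(":", 1)
    return PREFIXES[prefix] + local


def uri(value):
    """Format a URI as an N-Triples term."""
    return f"<{value}>"


def literal(text):
    """Format a plain string literal, escaping backslashes and quotes."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def describe_article(article_uri, title, language, author_name):
    """Return (subject, predicate, object) triples for one record."""
    article = uri(article_uri)
    author = "_:author1"  # blank node for the creator
    return [
        (article, uri(expand("rdf:type")), uri(expand("bibo:Article"))),
        (article, uri(expand("dct:title")), literal(title)),
        (article, uri(expand("dct:language")), literal(language)),
        (article, uri(expand("dct:creator")), author),
        (author, uri(expand("rdf:type")), uri(expand("foaf:Person"))),
        (author, uri(expand("foaf:name")), literal(author_name)),
    ]


def to_ntriples(triples):
    """Serialize triples as one 'subject predicate object .' line each."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


# Hypothetical record; the URI, title and author are invented for illustration.
triples = describe_article(
    "http://example.org/record/1", "A sample title", "en", "Jane Doe"
)
print(to_ntriples(triples))
```

The author is modelled as an intermediate node so that a name (and, in a fuller model, other attributes) can hang off the person rather than off the article itself.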
There is a lot to do!
Some problems need to be resolved, especially on interoperability standards
But there is an enormous potential in making linked open data available
From government, science and business…
..and more
http://rd-alliance.org/
http://www.aginfra.eu
http://agris.fao.org/
http://aims.fao.org
http://ring.ciard.net
http://www.ciard.net