Upload
mai
View
41
Download
2
Embed Size (px)
DESCRIPTION
The goal of FAOs WAICENT programme: “ Fighting Hunger with Information ” But how to get the information? The Web Search experience (Google etc.) : you get not what you are you are looking for you get a lot of irrelevant stuff. Number of Relevant Documents Identified. Precision. - PowerPoint PPT Presentation
Citation preview
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 1
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
The goal of FAOs WAICENT programme:“Fighting Hunger with Information”
But how to get the information?
The Web Search experience (Google etc.) :
• you get not what you are you are looking for• you get a lot of irrelevant stuff
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 2
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
The Search Problem
Both parameters are ranking low today!
RecallNumber of Relevant Documents in the Collection
Number of Relevant Documents Identified
Precision Number of Relevant Documents Identified
Total Number of Documents Identified
How to evaluate Search Results?
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 3
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
• The Role of Bibliographical Databases
• The State of the AGRIS network
• Enforcing Metadatastandards
• The Agricultural Ontology Service
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 4
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
The Role of Bibliographical databases
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 5
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Since 1999 the AGRIS database has
accumulated nearly 200.000 new records
But what is the real value of these records?
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 6
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
The new paradigm
Everyone wants to find everything on the WWW, But there are several constraints
The information must be available on the web The information must be identified The information must be accessible
Is AGRIS (bibliographical databases) helpful in overcoming these restrictions?
If so, how?
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 7
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
AvailabilitySomeone has to publish the information on a web
server
In research there exist cultural and economic reasons for the unavailability of knowledge:
– Not everyone likes to share (especially scientists)– The necessary infrastructure is lacking– Some publications are career relevant and only foreseen
for career relevant places to publish– Other publications might have a market value
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 8
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Identification and Accessibility
You need to know what you are looking for!It is estimated that more than 60 % of all searches are
not well defined! You need to define your range of knowledge
If you are looking for a very well defined information object then –
Google will generally do it for you
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 9
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
If, for example, we request :“all the information about the “Rinderpest” in Africa that has
been published in the year 2000 and has been presented at conferences”
No full text search engine would give a satisfactory answer!
AGRIS can!
But AGRIS will give you “only” bibliographical records…….
From catalogues you can extract knowledge
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 10
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
How to get the real stuff….• In pre-Web times, bibliographies gave you a call
number; AGRIS, the address of the centre• In Web times you can get a link to the URL• But the URL is the most unstable part of a
metadata record, physical locations will never be stable!
• To use the physical URL as identifier for a document is a sure path to future failure;
• PURL databases are no solution, because no one wants to maintain them;
• At the moment there is no solution agreed for an “Unique Identifier”
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 11
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
…better the bibliography than the link…..• A bibliographical record as entity defines an
information object very well;• (Title, author, Identifiers, subject keywords…)• If these metadata are of high quality, there is no
need for a direct link to the full text;• The document can be anywhere on the WWW … • The bibliographical Ontology identifies what you
are looking for;• Google (or any quality full text search) will do the
rest
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 12
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Example (title contains “diazinon” and ACROVOC contains “toxicity”, 17 records in Agris)
Accession Number: 97-004187
Title: Effects of diazinon on large outdoor pond microcosms.
Publication Year: 1996
Subject Category: Protection of plants - General aspects; Fisheries and aquaculture - General aspects; Veterinary science and hygiene - General aspects;
Author: Giddings, J.M.;Biever, R.C.;Annunziatio, M.F.;Hosmer, A.J.
ISSN: ISSN 0730-7268.
Availability: NAL, USDA, Beltsville, Md. 20705 - USA. E-mail: [email protected] (DNAL QH545.A1E58).
Bibliographic Source: references. Paper presented at the 15th Annual Meeting, Society of Environmental Toxicology and Chemistry, October 30-November 3, 1994, Denver, Colorado. Environmental toxicology and chemistry / (USA). (May 1996). v. 15(5) p. 618-629.
Identifiers: species abundance; ecotoxicology; LEPOMIS MACROCHI
AGROVOC keywords:
English: diazinon; toxicity; aquatic organisms; percoidei; biodiversity; aquatic environment; survival; side effects; toxicology;
French: diazinon; toxicite; organisme aquatique; percoidei; biodiversite; milieu aquatique; survie; effet secondaire; toxicologie;
Spanish: diazinon; toxicidad; organismos acuaticos; percoidei; biodiversidad; ambiente acuatico; supervivencia; efectos secundarios; toxicologia;
Abstract: An aquatic microcosm study (simulated aquatic field study) was conducted with the technical grade of the organophosphorus insecticide diazinon. Eighteen fiberglass tanks, each 3.2 m in diameter and 1.5 m in depth, were established with sediment and water (11.2 m3) from natural ponds and stocked with 40 juvenile bluegill sunfish (Lepomis macrochirus). Diazinon was applied in aqueous solution three times at 7-d intervals. Eight loading rates were used, with two microcosms at each level plus two controls. The amounts of diazinon added during each application corresponded to theoretical concentrations from 2.0 micrograms/L to 500 micrograms/L. The most sensitive ecological components of the microcosms were Cladocera (zooplankton), and Pentaneurini and Ceratopogonidae (insects), which were reduced at all treatment levels. Effects on many zooplankton and macroinvertebrate taxa occurred at diazinon concentrations (time-weighted averages) of 9.2 micrograms/L and higher. Total fish biomass was
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 13
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
“Effects of diazinon on large outdoor pond microcosms”
1997topics - [ Traduci questa pagina ]... Giddings, JM, et al. 1996. Effects of diazinon on large outdoor pond microcosms.Environmental Toxicology and Chemistry 15:618-629. 4/8 Computer models. ... www.cnr.colostate.edu/~danb/seminar/1997topics.htm - 6k - Copia cache - Pagine simili Volume 15, Number 5 - [ Traduci questa pagina ]... Multivariate Analysis (pp. 608-617) JL Shaw, JP Manning. Effects ofDiazinon on Large Outdoor Pond Microcosms (pp. 618-629) JM Giddings ... www.ruf.rice.edu/~etcj/155.html - 7k - Copia cache - Pagine simili TOP 5 CHEMICALS - [ Traduci questa pagina ]... Does it bioaccumulate? References. Giddings, JM, RC Biever, MF Annunziato andAJ Hosmer. 1996. Effects of Diazinon on Large Outdoor Pond Microcosms. ... www.science.mcmaster.ca/Biology/4S03/LB5.HTM - 9k - Copia cache - Pagine simili Methods in Aquatic Toxicology - [ Traduci questa pagina ]... Chem. 13(3): 453-460. Giddings, JM., RC Biever, MF Annunziato, and AJ Hosmer. (1996).Effects of diazinon on large outdoor pond microcosms. Env. Tox. Chem. ... www.science.mcmaster.ca/Biology/4S03/SB7.HTM - 14k - Copia cache - Pagine simili9 Reference List, Danish Environmental Protection Agency - [ Traduci questa pagina ]... Giddings J, Biever RC, Annunziatio MF & AJ Hosmer. 1996. Effects of diazinonon large outdoor pond microcosms. Environmental toxicology and chemistry. ... www.mst.dk/udgiv/Publications/2001/ 87-7944-634-5/html/kap09_eng.htm - 21k - Copia cache - Pagine simili
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 14
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
The main problem of the semantic web is not of technical nature
but is creating the semantic context
=
Creating metadata
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 15
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Conclusions
Bibliographical databases guarantee knowledge organization and especially retrieval;
The collection of “bibliographical” data is the single,
most important factor in the retrieval of electronic information
The better – and more versatile – our metadata, the easier the task of accessing knowledge objects on the web will become;
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 16
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002 The State of the AGRIS network
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 17
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Western Europe Asia Latin America Eastern Europe Africa International
NETHERLANDS JAPAN BRAZILCZECH REPUBLIC EGYPT CIAT
FRANCE CHINA COLOMBIA YUGOSLAVIA SOUTH AFRICA FAO
ITALY THAILAND CHILE POLAND ETHIOPIA ICARDA
DENMARK PHILIPPINES CUBA SLOVAKIA MOROCCO IRRI
SPAIN INDONESIA MEXICO HUNGARY ALGERIA ICIMOD
BELGIUM KOREA (South) PERU BULGARIA MOZAMBIQUE CAGRIS
NORWAY PAKISTAN VENEZUELA SLOVENIA ZIMBABWE ACSAD
GERMANY TURKEY PANAMA LATVIA KENYA CIMMYT
PORTUGAL VIET NAM ARGENTINA BELARUS UGANDA CIFOR
SWEDEN Syria TRINIDAD/Tob. GEORGIA SWAZILAND ICRAF
AUSTRIA INDIA COSTA RICA MOLDOVA ICRISAT
CYPRUS MALAYSIA SURINAME ESTONIA IIMI
GREECE KAZAKHSTAN PARAGUAY UKRAINE CARDI
SWITZERLAND IRAQ ECUADOR LITHUANIA IPGRI
LEBANON NICARAGUA ALBANIA CIHEAM
NEPAL ROMANIA ISNAR
BANGLADESH MACEDONIA
Korea (North)
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 18
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
AGRIS has unique features!
• Translated Metadata from otherwise inaccessible material (China, Japan, Thailand)
• AGRIS is the unique source for references from many national systems (China, India, Thailand, Japan)
• The AGRIS methodologies have assured quality in the capture of metadata for two decades
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 19
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
• About 70.000 bibliographical records/year are sentto FAO:
ca. 30 % with abstracts ca 1% with link to the full text
• The AGRIS website has about 9,000 users/month; 2,000 are regular users from institutions
AGRIS is used
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 20
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
AGRIS: the use of the website
Users of the AGRIS website
0100020003000400050006000700080009000
10000
1
Visi
tors
Nov-99
Nov-00
Nov-01
Apr-02
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 21
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
AGRIS: the comparative advantage• Research Material• Grey Literature• South – South Transfer• Promotion of Standards• Existing documentation centres
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 22
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
The Next Challenges
•New Standards (electronic publishing)
•Better coverage (more material)
•Better and easier access to the collected knowledge
•More participation of the information producers
•Better tools
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 23
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
The main problem of the semantic web is not of technical nature
but is creating the semantic context
=
Creating metadata
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 24
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002 Metadatastandards
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 25
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Why do we need common metadata?
• The AGRIS community had never doubts about the necessity of Metadata standards
• Now this is a hot issue in the development of the Semantic Web
• We are able to say that in a given application dc:title means “A name given to the resource”
– And for example, not…A title given to a person, such as “Sir” or “Ms.”
• Standardization ensures that we are talking about the same thing!
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 26
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Metadata – why we need them so urgently
• Common metadata allows us to :– Give lexical words a meaning– Facilitate easy exchange between systems– Facilitate resource discovery and request access for it– Recombine content to be used for different purposes
• inventory across databases• Ex. send an email using all <ags:email> fields
– Reduction of cost by using standardized tools• AgMES document• DC.Dot
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 27
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Example: Common Environment
• Parallel search queries to search across systems
arc:title my:titlebest:title
Query to search title element for “Shrimp Production in Thailand”
wrapper wrapper wrapper
DB-A DB-CDB-B
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 28
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Example: Ideal Environment
• Single search across systems
Query to search dc:title tag for “Shrimp Production in Thailand”
dc:title dc:titledc:title
DB-A DB-CDB-B
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 29
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Example: Achievable Compromise
• Single search across systems
Query to search dc:title tag for “Shrimp Production in Thailand”
arc:title my:titlebest:title
DB-A DB-CDB-B
AgMES compliant XML mapping
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 30
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
The AGStandards Initiative• The Agricultural Metadata Element set (AGMES)
http://www.fao.org/agris/agmes
• The new AGRIS application Profile draft
• Improving and Creating Semantic Standards participating from AGROVOC
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 31
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
The main problem of the semantic web is not of technical nature
but is creating the semantic context
=
Creating metadata
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 32
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002Agricultural Ontology Service
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 33
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Problems we want to solve (1)• No cross navigation between applications
• Full text search engines based on statistical text analysis are imprecise
Systems based only on “machine intelligence” do not show too promising results
• Web crawlers and harvesters do good jobs only on already structured information sources.
• Cataloging and indexing are labor-intensive processes, requiring special training. Tools for automating or semi-automating these processes are much in demand.
Recognition of meaning (semantic analysis) by machines is only possible by using using structured meta-information and formal knowledge description Agreed metadata schemas Controlled vocabularies, Taxonomies
• We need reusable parts of web searchlets and portlets
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 34
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Problems we want to solve (2) • Topic Trees from categorization schemes and thesauri are rigid and not very
expressive
• Machine produced clusters are “flexible”, but imprecise and at times out of context
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 35
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Knowledge Organization Systems: Vocabularies
AGROVOC
NAL Thesaurus
CABI Thesaurus
Dedicated KOSs
Non-dedicated KOSs
e.g., ASFA thesaurus
e.g., the Multilingual Forestry Thesaurus
e.g., the Sustainable Development
website classification
e.g., biological taxonomies such as NCBI and ITIS
GEMET
Other thematic thesauri
Existing Thesauri and Knowledge Organization Systems (KOSs)
Common concepts are not declared
No or very limited interoperability
Insufficient subject + language coverage
Severe maintenance problems
Very limited machine readability
Only very simple encoding of semantic relations
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 36
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
The solution we propose - Domain Ontologies
An ontology is a formal knowledge organization system
A formal description of the application knowledge
It contains concepts and their definitions
Relations between concepts
Possibility for machine processing
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 37
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
What is an Ontology
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 38
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Benefits from Ontologies– Semantic Organization of websites
Knowledge maps Guided discovery of knowledge Easy retrievability of information without using complicated Boolean logic
– Text processing by machines Text Mining on the Web (meaning-oriented access) Automatic indexing and text annotation tools Full text search engines that create meaningful classification (FAO-Schwartz
not related to FAO) (semantic clustering)
– Intelligent search of the Web Building dynamical catalogues from machine readable meta data Cross Domain Search
– Natural Language processing Better machine translation Queries using natural language
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 39
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
The Collaborative Approach We Want to Adopt• Only agreed semantic standards guarantee knowledge
discovery between different applications.
• Developing Knowledge Organization Systems is resource intensive and requires stakeholder’s agreement and participation.
• Hence, FAO started initiatives to bring interested partners together The AGStandards initiative was launched in October, 2000 to agree on
agricultural metadata standards The Agricultural Ontology Service (AOS) concept paper was publicized
in July 2001. 3 AOS workshops have been organized up to now A consortium of partners is to be established
Johannes Keizer
Food and Agriculture
Organization of the UN
Library and Documentation
Systems Division
Slide 40
AGRIS
the next challenges
13-08-2002
Katsetsart University
August 13
2002
Collaboration between FAO/GIL and Katsetsart University
• Enforcing National AGRIS activities Better coverage of Thai literature Better access to references important for Thai
Agriculture
• Participation in AOS consortium Agrovoc development Partner in a consortium AOS workshop in Thailand